arXiv's 1-Year Ban on Hallucinated References: Full Policy Guide

Update: This article is trending on Hacker News at #13 with 239 points as of May 15, 2026. The academic community is actively discussing the implications of this landmark policy.

Overview: What Happened

On May 14, 2026, arXiv — the premier open-access preprint repository for physics, mathematics, computer science, and related fields — announced a significant policy change: papers found to contain AI-hallucinated references will face a 1-year submission ban for all listed authors.

The policy directly targets the growing problem of LLM-generated fake citations in academic submissions. As large language models (LLMs) like GPT, Claude, Gemini, and DeepSeek become increasingly integral to the research writing process, arXiv has detected a rising tide of papers containing references that look authentic but are entirely fabricated by AI.

This guide explains the policy in detail, why arXiv felt compelled to act, how the ban is enforced, and what researchers need to do to avoid running afoul of the new rules.

The Scale of the Problem

AI hallucinated references — also known as "phantom citations" or "false references" — have been a known issue since LLMs entered mainstream use. When prompted to provide citations, language models frequently generate references that sound plausible but point to non-existent papers, incorrect volume numbers, or real authors matched with imaginary publications.

According to the arXiv team's internal analysis:

  • Hundreds of submissions have been flagged for containing hallucinated references in recent months
  • Some papers contain entirely fictional reference lists where every citation is fabricated
  • The problem has been accelerating as more researchers use LLMs for literature review and paper writing
  • Notably, the issue affects all disciplines — from computer science to physics to mathematics

The concern is not merely academic. Hallucinated references undermine the integrity of the scientific record. When other researchers attempt to build on cited work, they discover the cited paper does not exist, wasting time and propagating misinformation through the citation graph.

The New Policy: Key Details

What Constitutes a Violation

The policy defines a hallucinated reference as any citation in a submission that:

  • Points to a non-existent paper — a work that was never published
  • Attributes a real concept to a wrong author or wrong work
  • Contains invented metadata — fake DOIs, URLs, conference proceedings, or journal volume/issue numbers
  • Merges multiple real papers into a composite fabrication that does not exist in that form

Penalty: 1-Year Submission Ban

For violations found by arXiv moderation or reported by the community:

  • All listed co-authors on the offending paper receive a 1-year ban from submitting new papers to arXiv
  • The paper itself is removed or withdrawn from the repository
  • A public note is appended to the paper's record explaining the reason for removal
  • Repeat offenders face permanent suspension

The joint liability for all co-authors is a critical and slightly controversial aspect — it incentivizes every listed author to carefully review all references before submission, not just delegate the task to the lead author.

How Violations Are Detected

arXiv has implemented a multi-layered detection system:

  • Automated checks: arXiv's moderation system cross-references cited DOIs, arXiv IDs, and publication metadata against established databases (Crossref, DOI Foundation, arXiv's own index)
  • Community reporting: Readers and researchers can flag suspicious reference lists through a new reporting mechanism
  • Random audits: A sample of submissions undergo manual verification of citations
  • Post-publication review: Published papers remain subject to review, and violations found after publication still trigger the ban

Why arXiv Had to Act

The new policy represents a significant escalation in arXiv's moderation stance. Historically, arXiv has operated on a relatively light-touch moderation model — screening for topical relevance and basic formatting, but largely trusting authors to submit honest work.

Several factors pushed arXiv toward the ban:

1. Volume Acceleration

The rate of hallucinated reference submissions has grown sharply. Early cases in 2024-2025 were scattered; by early 2026, arXiv moderators reported a flood of papers with obviously fake references. The volume made manual screening unsustainable.

2. Erosion of Trust

arXiv serves as the primary rapid dissemination channel for cutting-edge research across multiple scientific disciplines. If the repository becomes known for containing unreliable citations, it damages the credibility of the entire preprint ecosystem.

3. Distinction from Genuine Errors

The policy explicitly distinguishes between honest citation errors and systematic hallucinated references. A citation format mistake or a slightly wrong page number from a real paper falls under a different category — it's the fabrication of non-existent papers that triggers the ban.

4. Protecting the Citation Graph

Modern research relies on citation graphs to discover related work, compute impact metrics (h-index, impact factor), and navigate the literature. Fake citations pollute this graph, creating dead ends and skewing metrics.

How LLMs Generate Fake References

Understanding why LLMs hallucinate references helps researchers avoid the problem:

The Underlying Mechanism

LLMs are next-token prediction engines — they generate text by predicting what word or token comes next based on statistical patterns in training data. They do not have a "database" of real papers they can query. When asked for a citation, the model:

  1. Recalls patterns from training data where citations appeared (author names, paper titles, journal names)
  2. Generates a plausible combination of these patterns
  3. Does not verify whether the resulting reference actually exists

The result is often convincing on first glance — real author names paired with plausible-sounding paper titles in real journal formats — but completely fabricated.

Common Hallucination Patterns

  • Real author + fake title: A known researcher "wrote" a paper that doesn't exist
  • Real concept + wrong year: A landmark paper from 2018 is cited as being from 2023
  • Fake DOI: A DOI pointing to a working resolver but referring to a completely different paper
  • Composite references: Combining the author of paper A, the title of paper B, and the journal of paper C
  • Fictional conference proceedings: Citing a paper in a conference that never held that session

How to Check References Before Submission

Every researcher should treat reference verification as a standard part of the submission workflow. Here's a practical checklist:

Step-by-Step Verification Process

  1. Start with DOIs: Every cited paper should have a DOI. Verify each DOI at Crossref or doi.org. If a DOI returns a 404 or redirects to a different paper, investigate.
  2. Check arXiv IDs: For arXiv references, verify each arXiv ID directly on arxiv.org. A reference to "arxiv:2305.12345" that returns "not found" is a red flag.
  3. Search for paper titles: Copy the exact paper title into Google Scholar or Semantic Scholar. If no results appear, the citation is likely hallucinated.
  4. Verify author-paper matching: Check that the cited author has actually published on the cited topic. A machine learning researcher being cited for a medieval history paper is suspicious.
  5. Cross-check with trusted references: Use reference management tools (Zotero, Mendeley, EndNote) that integrate with DOI databases.
  6. Ask an LLM to check its own citations: This is ironic but works — ask a different LLM to verify each reference. The verification LLM will hallucinate too, so use it as a signal not conclusive proof. If a verifier LLM also can't find the paper, that's a strong warning.

Tools for Reference Verification

  • Google Scholar: Free, fast, covers most academic fields
  • Crossref Simple Text Query: API that matches references against DOI records
  • OpenAlex API: Open bibliographic database for large-scale verification
  • Semantic Scholar API: AI-powered search with citation graph analysis
  • CZID (Core Metadata + Citation Integration): Validates references against multiple repositories

Implications for AI-Assisted Research Writing

The arXiv ban does not prohibit using AI tools in research. It targets a specific failure mode. Here's how researchers can responsibly integrate LLMs while staying compliant:

Best Practices

  • Never copy-paste LLM-generated references without independent verification. Treat every AI-suggested citation as a starting point for search, not a finished reference.
  • Use LLMs for reference discovery, not reference generation. Ask the model to suggest search queries or relevant topics, then find the actual papers yourself.
  • Keep a paper trail. Save your verification results. If arXiv questions a reference, you should be able to show how you confirmed it exists.
  • Use reference management software. Tools like Zotero automatically validate metadata against external databases.
  • Implement a reference review step in your lab's submission workflow. Designate one author to check all references before submission.

What Not to Do

  • Don't prompt LLMs to "write the reference section for my paper about X." This is the primary cause of hallucinated citations.
  • Don't assume a reference is real because it feels right. LLMs are excellent at creating plausible-sounding but entirely fictional citations.
  • Don't delegate reference checking to a junior author without oversight. The ban applies to all co-authors equally.

Community and Industry Reactions

The policy has generated significant discussion across academic and AI communities:

Supporters argue that the ban is necessary and overdue. The integrity of scientific publishing depends on accurate citations. A 1-year ban is proportionate — serious enough to deter carelessness, but not so harsh that a single mistake ends a career.

Critics raise concerns about enforcement fairness. How will arXiv distinguish between a hallucinated reference and an honest mistake? Will automated checks flag papers that cite rarely-indexed works (such as dissertations, technical reports, or non-English publications)? How do appeals work?

Publishers are watching closely. If arXiv's approach proves effective, journals and conferences may adopt similar policies. Some major publishers already screen for citation manipulation; this extends that principle to AI-generated content.

AI ethics researchers note the irony: the solution to AI-generated misinformation in citations is more careful human oversight — the exact skill LLMs are supposed to reduce. The arXiv policy reinforces the message that AI tools are assistants, not replacements for scholarly rigor.

Timeline and Next Steps

  • May 14, 2026: Policy announced. Hacker News frontpage (#13, 239 points)
  • June 1, 2026: Policy takes effect for all new submissions
  • July 1, 2026: Retroactive scanning begins for existing submissions
  • Ongoing: arXiv plans to refine detection algorithms and publish transparency reports on enforcement

arXiv has indicated it will publish a detailed FAQ in the coming weeks addressing edge cases, appeals processes, and clarifications on what constitutes a "hallucinated reference" in borderline cases.

What This Means for the Future of AI in Research

The arXiv hallucinated references ban is a landmark moment in the relationship between AI and academic publishing. It acknowledges both the usefulness of AI tools in research and their critical limitations.

Several broader trends are worth watching:

  • Citation-aware LLMs: Model providers are developing systems that retrieve real citations from databases rather than generating them from patterns. This could eventually eliminate the hallucination problem at the source.
  • Formal verification in writing workflows: Expect to see reference-checking plugins for Overleaf, Google Docs, and reference managers that automatically validate citations against DOIs and publication databases.
  • Increased researcher awareness: The HN discussion (239 points, #13) suggests widespread recognition of the problem. The cultural norm is shifting — researchers are now expected to check references, and failing to do so carries consequences.
  • Parallels with code: Just as researchers are being held accountable for AI-generated citations, the software world is grappling with LLM-generated security vulnerabilities and non-existent packages (supply chain attacks via hallucinated package names).

Conclusion

arXiv's 1-year ban on papers with hallucinated references is a necessary and proportionate response to a growing problem. For researchers, the message is clear: AI is a powerful research tool, but you remain responsible for every word in your submission — including the reference list.

The policy's success will depend on fair enforcement, clear guidelines, and a community commitment to upholding citation integrity. As AI continues to reshape how research is conducted and communicated, policies like this one will define the boundaries of responsible AI use in academia.

Stay updated: Bookmark the arXiv policy page for the latest updates on submission guidelines.