ArXiv, the open-access preprint repository that has served as the backbone of research distribution in computer science, mathematics, physics, and related fields for over three decades, has announced a decisive policy to combat the rising tide of AI-generated slop. Starting immediately, authors found to have submitted papers with incontrovertible evidence of unchecked AI generation will face a one-year ban from the platform. The announcement was made by Thomas Dietterich, chair of arXiv’s computer science section, who emphasized that such submissions undermine the trustworthiness of the entire research ecosystem.
The policy is not a blanket prohibition on using large language models (LLMs) for research writing. Many researchers legitimately use AI tools for drafting, editing, or data analysis. The trigger for the penalty is clear evidence that an author pasted LLM output into a paper without any human oversight. This includes hallucinated references that do not correspond to real publications, leftover meta-instructions from the chatbot (such as “here is a 200-word summary; would you like me to make any changes?”), and placeholder data tables with notes like “fill in with the real numbers from your experiments.” These are signs of carelessness that betray a complete lack of reading before submission.
Dietterich’s announcement described this as a “one-strike” rule, but decisions are subject to appeal and must be confirmed by a section chair before being enforced. Banned authors can return after one year, but only after having their subsequent work accepted by a peer-reviewed journal before posting to arXiv. This escalation reflects the severity of the problem and the need to protect the integrity of a platform that has become the de facto distribution channel for cutting-edge research, especially in machine learning and artificial intelligence.
The context for this policy is a dramatic surge in AI-generated slop across academia. A study published in The Lancet in May 2026 by researchers at Columbia University audited 2.5 million biomedical papers and 126 million references indexed on PubMed Central. The study found that fabricated citations have risen twelvefold since 2023. In 2023, roughly one in 2,828 papers contained at least one fake reference. By 2025, the rate had climbed to one in 458. In the first seven weeks of 2026, it was one in 277. The researchers attributed the surge to the proliferation of AI writing tools, with previous studies estimating that 30 to 69 per cent of LLM-generated references in biomedical contexts are fabricated.
ArXiv receives thousands of submissions each month, and its volunteer moderation system was never designed to screen for machine-generated content at scale. By targeting only the most egregious violations—cases where the author’s failure to exercise any oversight is provable from the text itself—arXiv avoids the thorny issue of detecting subtle AI-assisted writing. The platform’s existing policy already states that authors bear “full responsibility” for their content “irrespective of how the contents are generated.” The new penalty enforces that principle by making it clear that submitting unread, AI-produced nonsense will have consequences.
The problem of AI slop is not confined to arXiv. Major computer science conferences like NeurIPS and ICML have reported surges in submissions that appear to be minimally vetted LLM output. Nature published a feature in late 2025 describing how AI slop is creating a crisis in computer science, overwhelming reviewers and diluting the signal-to-noise ratio. Peer-reviewed journals are not immune: The Lancet study found fabricated citations in papers that had already passed peer review, suggesting that reviewers are either not checking references or cannot identify fabrications at the rate they are now appearing. Lead author Maxim Topaz warned that clinicians and guideline developers have no way of knowing when the evidence they rely on does not exist—a gap that ongoing efforts to reduce AI hallucinations have not yet closed.
ArXiv itself is undergoing structural changes to address these challenges. After more than 20 years as a project hosted by Cornell University, the platform is becoming an independent nonprofit. This move gives it greater autonomy over moderation policies and the ability to raise funds specifically for combating quality problems. It has also introduced a requirement for first-time submitters to obtain an endorsement from an established author—a gatekeeping measure to reduce the volume of submissions from accounts created solely to publish AI-generated material.
While the new rule will catch the most careless offenders—those who submit papers they have not read—it will not catch researchers who use LLMs to generate plausible but incorrect claims, fabricate data, or produce papers that are fluent but scientifically vacuous. Those problems require peer review, institutional oversight, and a willingness within the research community to treat AI-assisted misconduct with the same seriousness as traditional forms of fabrication. However, what arXiv’s policy establishes is a principle that has always been true in theory but is now enforced in practice: if you submit a paper, you are responsible for every word in it.
The difference now is that language models have made it trivially easy to produce text that reads like science but contains nothing of substance. ArXiv’s one-year ban is a modest penalty for a serious offense, but it is also the first formal acknowledgement by a major research platform that the problem is no longer one of occasional carelessness. It is structural, it is growing, and it requires dedicated infrastructure to combat. As the research community continues to grapple with the implications of AI-generated content, arXiv’s action sets a precedent that other repositories and journals may soon follow.