ArXiv raises the bar on AI-assisted research
ArXiv, the widely used preprint server, is implementing a stricter stance on papers that include unchecked outputs from large language models. Thomas Dietterich, chair of ArXiv's computer science section, said authors whose submissions contain "incontrovertible evidence that the authors did not check the results of LLM generation" — for example, hallucinated citations or leftover model "meta-comments" — will face a one-year ban.
This move is designed to curb what Dietterich and others have dubbed "AI slop": low-effort use of generative models that undermines scholarly rigor. By tying the ability to resubmit to acceptance at a reputable peer-reviewed venue, ArXiv is creating a clear incentive for authors to verify outputs, document their use of AI tools, and ensure reproducibility.
The policy delivers a positive signal to the research community: responsible use of AI can accelerate discovery, but it must be paired with human verification. Practical examples of the kinds of problematic content include invented references, incorrect experimental descriptions generated by models, or model-side annotations left in the manuscript. The new rules encourage transparency about how AI tools were used and verified.
How researchers can comply:
- Thoroughly check any model-generated text, data, or references before submission.
- Document AI assistance in the manuscript's methods or acknowledgments sections.
- Prefer submitting fully validated results and, when applicable, seek peer-reviewed acceptance before re-uploading to ArXiv.
Overall, ArXiv's policy helps protect the integrity and trustworthiness of preprints, ensuring that AI becomes an aid to rigorous research rather than a shortcut around it.