Incorrect Citation Association for Articles in Online-Only Springer Nature Journals
Tamás Kriváchy
TL;DR
The study identifies widespread distortions in citation metrics for online-only Springer Nature journals caused by a transition from page-based to article-number referencing and incomplete API metadata, with Article Number 1 disproportionately inflating its own citation counts. By analyzing Crossref, OpenCitations, Semantic Scholar, and journal sites, the authors demonstrate false attributions (I.1/I.2) and related miscitations (I.3) that have likely persisted since 2011, affecting millions of articles and authors. The work highlights substantial implications for literature search, impact metrics (e.g., SNIP, IF), and career evaluation, and calls for an urgent fix by Springer Nature along with coordinated updates by major data providers and a move toward unified citation metadata. The findings emphasize the need for transparent remediation, re-evaluation of affected metrics, and robust metadata standards to prevent recurrence in the scholarly ecosystem.
Abstract
We show that citation metrics of journal articles in many of the online-only Springer Nature journals and associated ones are distorted, going back to articles from 2001. We find that most likely due to an API response error, there are many incorrect references which typically lead to Article Number 1 of a given Volume. Among others, the issue affects journals such as Scientific Reports, Nature Communications, Communications journals, Cell Death & Disease, Light: Science & Applications, as well as many BMC, Discovery and npj journals. Beyond the negative effect of introducing incorrect reference information, this distorts the citation statistics of articles in these journals, with a few articles being massively over-cited compared to their peers, while many lose citations; e.g. both in Scientific Reports and in Nature Communications, 5 of the 10 top cited articles have article numbers of 1. We validate the distorted statistics by assessing data from multiple scientific literature databases: Crossref, OpenCitations, Semantic Scholar, and the journals' websites. The issue primarily arises from the inconsistent transition from page-based referencing of articles to article number-based referencing, as well as the improper handling of the change in the publisher's article metadata API. It seems that the most pressing problem has been present since approximately 2011, which we estimate affects the citation count of millions of authors.
