Investigating Document Type, Language, Publication Year, and Author Count Discrepancies Between OpenAlex and Web of Science
Preprent
Abstract
Bibliometrics, whether used for research or research evaluation, relies on large multidisciplinary databases of research outputs and citation indices. The Web of Science (WoS) was the main supporting infrastructure of the field for more than 30 years until several new competitors emerged. OpenAlex, a bibliographic database launched in 2022, has distinguished itself for its openness and extensive coverage. While OpenAlex may reduce or eliminate barriers to accessing bibliometric data, one of the concerns that hinders its broader adoption for research and research evaluation is the quality of its metadata. This study aims to assess metadata quality in OpenAlex and WoS, focusing on document type, publication year, language, and number of authors. By addressing discrepancies and misattributions in metadata, this research seeks to enhance awareness of data quality issues that could impact bibliometric research and evaluation outcomes.
Links
Preprint paper
Citation
@misc{mongeon2025,
author = {Mongeon, Phillipe and Hare, Madelaine and Riddle, Poppy and
Wilson, Summer and Krause, Geoff and Marjoram, Rebecca and Toupin,
Rémi},
title = {Investigating {Document} {Type,} {Language,} {Publication}
{Year,} and {Author} {Count} {Discrepancies} {Between} {OpenAlex}
and {Web} of {Science}},
date = {2025-08-26},
url = {https://doi.org/10.48550/arXiv.2508.18620},
doi = {10.48550/arXiv.2508.18620},
langid = {en}
}