Enhancing Peer Review in Astronomy: A Machine Learning and Optimization Approach to Reviewer Assignments for ALMA
John M. Carpenter, Andrea Corvillón, Nihar B. Shah
TL;DR
The paper tackles the scalability challenge of ALMA's peer review by integrating Latent Dirichlet Allocation–based topic modeling of proposals with reviewer expertise inferred from their proposal history, then solving a leximin fairness optimization (adapted from PeerReview4All) to assign reviews. Key findings show the Cycle 10 deployment yielded a median proposal–reviewer similarity of $0.71$ (up from $0.20$ in Cycle 9) and increased reviewers reporting expertise from $45\%$ to $65\%$, while eliminating reassignment due to mismatches and saving $3$–$5$ days of manual effort per cycle. The approach demonstrates robust automation for large-scale, equitable reviewer assignments, with manageable topology trade-offs and clear pathways for future enhancement, including addressing manipulation risks and advancing NLP with transformer-based models. This work has broad implications for modernizing scientific peer review in astronomy and other fields facing similar scalability pressures.
Abstract
The increasing volume of papers and proposals that undergo peer review emphasizes the pressing need for greater automation to effectively manage the growing scale. In this study, we present the deployment and evaluation of machine learning and optimization techniques to assign proposals to reviewers that were developed for the Atacama Large Millimeter/submillimeter Array (ALMA) during the Cycle 10 Call for Proposals issued in 2023. Using topic modeling algorithms, we identify the proposal topics and assess reviewers' expertise based on their previous ALMA proposal submissions. We then apply an adapted version of the assignment optimization algorithm from PeerReview4All (Stelmakh et al. 2021) to maximize the alignment between proposal topics and reviewer expertise. Our evaluation shows a significant improvement in matching reviewer expertise: the median similarity score between the proposal topic and reviewer expertise increased by 51 percentage points compared to the previous cycle, and the percentage of reviewers reporting expertise in their assigned proposals rose by 20 percentage points. Furthermore, the assignment process proved highly effective in that no proposals required reassignment due to significant mismatches, resulting in a savings of 3 to 5 days of manual effort.
