Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Paper Quality Assessment based on Individual Wisdom Metrics from Open Peer Review

Created by
  • Haebom

Author

Andrii Zahorodnii, Jasper JF van den Bosch, Ian Charest, Christopher Summerfield, Ila R. Fiete

Outline

We highlight the shortcomings of the traditional closed peer review system and study the efficacy and accuracy of an alternative peer review system based on an open, top-down process. Using data from two major scientific conferences (CCN2023 and ICLR2023), we highlight the high variability and low correlation in reviewer scores. Using a reviewer quality estimator, we reveal the lack of correlation between reviewer quality and author quality. Furthermore, we find that authors with moderate article scores are the best reviewers. Using a Bayesian method, we estimate article quality and demonstrate that reviewer evaluation in an open system can yield high-quality article scores. Finally, we propose an incentive structure to recognize high-quality reviewers and encourage broader review coverage.

Takeaways, Limitations

Takeaways:
We demonstrate that open, top-down peer review processes can be scalable, reliable, and fair.
Bayesian methodology can be used to improve paper quality assessment.
It sheds new light on the relationship between the qualities of a reviewer and the qualities of an author.
Enables reliable paper evaluation by utilizing reviewer ratings.
We offer an incentive structure that encourages high-quality reviewers and broadens the scope of their reviews.
Limitations:
Further research on generalizability is needed based on data from two specific conferences (CCN2023 and ICLR2023).
Further investigation is needed into potential challenges associated with the practical implementation of open peer review systems (e.g., spam, bias, etc.).
Further evaluation of the long-term effectiveness and sustainability of the proposed incentive structure is needed.
👍