Proj CJI Paper Reading: CycleResearcher: Improving Automated Research via Automated Review

Abstract

  • Background: automating the entire research process with open-source LLMs remains largely unexplored
  • Task: using open-source post-trained LLMs as agents of performing the full cycle of automated research and review
  • Tools
    • Automatic Researcher: CycleResearcher
      • Experiment:
        • 效果:
          1. a 26.89% improvement in mean absolute error (MAE) over individual human reviewers in predicting paper scores
          2. LLMs can surpass expert-level performance in research evaluation
    • Automatic Reviewer: CycleReviewer
      • Experiment
        • 效果
          • CycleResearcher model achieved a score of 5.36 in simulated peer reviews, surpassing the preprint level of 5.24 from human experts and approaching the accepted paper level of 5.69
  • Benchmarks:
    • Review-5k: real-world peer review dynamics
    • Research-14k: real-world machine learning research
  • Github: https://wengsyx.github.io/Researcher/
posted @ 2025-02-26 22:52  雪溯  阅读(38)  评论(0)    收藏  举报