Sign In

AIRA_2: Overcoming Bottlenecks in AI Research Agents

Created by
  • Haebom
Category
Empty

μ €μž

Karen Hambardzumyan, Nicolas Baldwin, Edan Toledo, Rishi Hazra, Michael Kuchnik, Bassel Al Omari, Thomas Simon Foster, Anton Protopopov, Jean-Christophe Gagnon-Audet, Ishita Mediratta, Kelvin Niu, Michael Shvartsman, Alisia Lupidi, Alexis Audran-Reiss, Parth Pathak, Tatiana Shavrina, Despoina Magka, Hela Momand, Derek Dunfield, Nicola Cancedda, Pontus Stenetorp, Carole-Jean Wu, Jakob Nicolaus Foerster, Yoram Bachrach, Martin Josifoski

πŸ’‘ κ°œμš”

λ³Έ 논문은 AI 연ꡬ μ—μ΄μ „νŠΈμ˜ μ„±λŠ₯을 μ €ν•΄ν•˜λŠ” 동기식 단일 GPU μ‹€ν–‰, 검증 기반 μ„ νƒμœΌλ‘œ μΈν•œ μΌλ°˜ν™” 격차, κ³ μ •λœ LLM μ—°μ‚°μžμ˜ ν•œκ³„λ₯Ό κ·Ήλ³΅ν•˜κΈ° μœ„ν•œ AIRA$_2$λ₯Ό μ œμ•ˆν•©λ‹ˆλ‹€. AIRA$_2$λŠ” 비동기 닀쀑 GPU μ›Œμ»€ ν’€, μˆ¨κ²¨μ§„ μΌκ΄€λœ 평가 ν”„λ‘œν† μ½œ, λ™μ μœΌλ‘œ μ•‘μ…˜μ„ λ²”μœ„ν™”ν•˜κ³  λŒ€ν™”μ‹μœΌλ‘œ λ””λ²„κΉ…ν•˜λŠ” ReAct μ—μ΄μ „νŠΈλ₯Ό 톡해 μ΄λŸ¬ν•œ 병λͺ© ν˜„μƒμ„ ν•΄κ²°ν•©λ‹ˆλ‹€.

πŸ”‘ μ‹œμ‚¬μ  및 ν•œκ³„

β€’
AIRA$_2$λŠ” 높은 μ‹€ν—˜ μ²˜λ¦¬λŸ‰ 증가와 μž₯기적인 κ²€μƒ‰μ—μ„œ μ•ˆμ •μ μΈ 평가 μ‹ ν˜Έλ₯Ό μ œκ³΅ν•˜μ—¬ κΈ°μ‘΄ 방법둠 λŒ€λΉ„ λ›°μ–΄λ‚œ μ„±λŠ₯을 λ‹¬μ„±ν–ˆμŠ΅λ‹ˆλ‹€.
β€’
μ œμ•ˆλœ μ•„ν‚€ν…μ²˜ ꡬ성 μš”μ†Œ 각각은 μ„±λŠ₯ ν–₯상에 ν•„μˆ˜μ μ΄λ©°, μ΄λŠ” LLM 백본에 걸쳐 μΌκ΄€λœ ν™•μž₯ 법칙을 λ”°λ¦…λ‹ˆλ‹€.
β€’
이전 μ—°κ΅¬μ—μ„œ 보고된 "과적합"은 데이터 μ•”κΈ°λ³΄λ‹€λŠ” 평가 λ…Έμ΄μ¦ˆμ—μ„œ λΉ„λ‘―λ˜μ—ˆμŒμ„ ν™•μΈν–ˆμŠ΅λ‹ˆλ‹€.
πŸ‘