ResearchGym: Evaluating Language Model Agents on Real-World AI Research

18 hours ago, via huggingface, 0 pts

AI Score: 26%, paper
