SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
6 hours ago | via huggingface | 30 pts | huggingface.co | AI Score: 23% | paper