UniT: Unified Multimodal Chain-of-Thought Test-time Scaling21 days agovia arxivarxiv.org(opens in new window)AI Score: 39%paper