UniT: Unified Multimodal Chain-of-Thought Test-time Scaling (paper, via arXiv, arxiv.org; 23 days ago)