Latent
Signal
AI-curated breakthroughs
#1
5m
via
hackernews
Show HN: Skill that lets Claude Code/Codex spin up VMs and GPUs
0
#2
2h
via
hackernews
GPT-5.2 derives a new result in theoretical physics
0
#3
1h
via
lobsters
I used a local LLM to analyze my journal entries
0
#4
6h
via
huggingface
Stemphonic: All-at-once Flexible Multi-stem Music Generation
0
#5
6h
via
hackernews
I asked Claude Code to remove jQuery. It failed miserably
0
#6
5h
via
lobsters
The future of software engineering - The future of software development retreat
0
#7
10h
via
huggingface
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning
0
#8
10h
via
huggingface
RISE: Self-Improving Robot Policy with Compositional World Model
0
#9
10h
via
huggingface
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
0
#10
10h
via
huggingface
χ₀: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies
0
#11
10h
via
huggingface
EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration
0
#12
5h
via
lobsters
microgpt
0
#13
10h
via
huggingface
ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images
0
#14
10h
via
huggingface
Adapting Vision-Language Models for E-commerce Understanding at Scale
0
#15
16h
via
arxiv
3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting
0
#16
14h
via
huggingface
Multimodal Fact-Level Attribution for Verifiable Reasoning
0
#17
14h
via
huggingface
Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision
0
#18
14h
via
huggingface
T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization
0
#19
14h
via
huggingface
NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control
0
#20
14h
via
huggingface
P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling
0
#21
16h
via
arxiv
Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education
0
#22
16h
via
arxiv
Energy-Aware Spike Budgeting for Continual Learning in Spiking Neural Networks for Neuromorphic Vision
0
#23
10h
via
huggingface
Thinking with Drafting: Optical Decompression via Logical Reconstruction
0
#24
10h
via
huggingface
Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation
0
#25
16h
via
arxiv
Bandit Learning in Matching Markets with Interviews
0
#26
16h
via
arxiv
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling
0
#27
16h
via
arxiv
ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images
0
#28
16h
via
arxiv
Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching
0
#29
18h
via
huggingface
Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity
0
#30
16h
via
arxiv
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
0