
SenseTime Open-Sources Omni-Modal Model That Thinks in Pixels and Words
SenseTime has open-sourced an omni-modal AI model that reasons in pixel-word space without a visual encoder or VAE, challenging the dominant multimodal architectures.