Computer Vision & Multimodal
Build systems that understand images, video, and multimodal inputs. Generation, perception, 3D.
4
Open Positions
Core Skills
Diffusion Models, Vision Transformers, CLIP, Stable Diffusion, NeRF, 3D Reconstruction, OpenCV, CUDA
Active Positions (4)
Applied Scientist / Research Engineer - Multimodal (Come to Singapore) · mid
Mistral AI·Paris
multimodal learning, Omni-models, Vision-Language Models (VLMs), audio-text models, video-text models, image generation models
Senior Optical Engineer - AI&T, Space Imaging · senior
Anduril·Boulder, Colorado, United States
spaceborne systems; optical assembly, integration, and test (AI&T); electro-optical systems; infrared systems; computer vision; perception
Research Scientist – Controlled 3D Generation · mid · Remote
Stability AI·Remote
flow matching, score-based generative models, 3D generation, Gaussians, NeRFs, signed-distance fields
Research Engineer / Research Scientist, Vision · mid
Anthropic·New York City, NY; San Francisco, CA; Seattle, WA
Computer Vision, Spatial Reasoning, Multimodal Capabilities, Agentic Infrastructure, Pretraining, Reinforcement Learning