Computer Vision & Multimodal

Build systems that understand images, video, and multimodal inputs. Generation, perception, 3D.

0
Open Positions

Core Skills

Diffusion ModelsVision TransformersCLIPStable DiffusionNeRF3D ReconstructionOpenCVCUDA

Active Positions (8)

Research Engineer / Research Scientist, Visionmid
Anthropic·New York City, NY; San Francisco, CA; Seattle, WA
Computer VisionSpatial ReasoningMultimodal CapabilitiesAgentic InfrastructurePretrainingReinforcement Learning
Senior Machine Learning Engineer, Computer Vision - Roboticssenior
Scale AI·San Francisco, CA
3D ReconstructionSLAMhand pose estimationgesture recognitionfull-body trackingObject Detection and Tracking (MOT/SOT)
Research Engineer, SLAM & Multi-View Geometrymid
OpenAI·San Francisco
SLAMmulti-view geometry3D reconstructionpoint trackingteleoperationmulti-camera sensor stacks
Research Engineer, Multimodal Generative AI (Image/Video)mid
Google DeepMind·Kirkland, Washington, US; Seattle, Washington, US
Multimodal Generative AIImage GenerationImage EditingNano BananaDeep Learning for VisionReinforcement Learning for Generative Models
Senior Optical Engineer AI&T, Space Imagingsenior
Anduril·Lexington, Massachusetts, United States
Spaceborne Optical SystemsElectro-Optical (EO) ImagingInfrared (IR) SystemsOptical Assembly, Integration, and Test (AI&T)
Senior Software Engineer, Intelligence Systems (Augmented Reality)senior
Anduril·Reston, Virginia, United States
Lattice OSaugmented realitymixed realitygame engine componentsrendering pipelinesphysics simulations
Software Engineer, AR/VR Calibrationmid
Anduril·Bellevue, Washington, United States
AR/VR calibrationdisplay pipeline softwareoptical testscalibration algorithmsEagle EyeWarfighter OS (WFOS)
Senior Software Engineer, Realtime Imagingsenior
Anduril·Lexington, Massachusetts, United States
Real-time image processingLow-Latency Software OptimizationImaging systems for defenseHigh-performance imaging softwareCross-functional imaging system development