Research Scientist
Push the frontier of AI. Publish papers, develop new architectures, advance capabilities.
92
Open Positions
Core Skills
Transformer ArchitecturesScaling LawsSelf-Supervised LearningMixture-of-ExpertsDiffusion ModelsWorld ModelsPyTorch
Active Positions (50)
Research Engineer (New Grad)mid
Genmo·San Francisco HQ
Diffusion ModelsPyTorchDistributed TrainingFoundation ModelsTransformer Architectures
Research Engineer – Benchmarking, Evals & Failure Analysismid
Mercor·San Francisco
Evaluation FrameworksReward ModelingPost-TrainingSynthetic Data GenerationAnnotation PipelinesData Curation
AI Researchermid
1X·San Carlos, CA
World ModelsPre-TrainingDistributed TrainingSynthetic Data GenerationEvaluation FrameworksData Curation
Senior Machine Learning Engineer, Simulation Evaluationsenior
Waymo·Mountain View, CA, USA; San Francisco, CA, USA
World ModelsVision-Language Models (VLMs)Diffusion ModelsEvaluation FrameworksSynthetic Data GenerationMultimodal AI
Senior Research Scientist, Foundation Model for Simulationsenior
Waymo·Mountain View, CA USA; San Francisco, CA USA
Foundation ModelsVision-Language Models (VLMs)JAXPost-TrainingDistillationEmbeddings
Senior Research Scientist, World Action Modelingsenior
Waymo·Mountain View, CA, USA; San Francisco, CA, USA; Kirkland, WA, USA; New York City, NY, USA
World ModelsDiffusion ModelsFoundation ModelsJAXDistributed TrainingReinforcement Learning
Senior Staff ML Engineer, Driver Understanding and Evaluationsenior
Waymo·Mountain View, CA, United States
Reinforcement Learning from Human Feedback (RLHF)Reward ModelingVision-Language Models (VLMs)Foundation ModelsEvaluation FrameworksEmbodied AI Systems
Staff Machine Learning Engineer – VLM/LLM Evaluationstaff
Waymo·Mountain View, CA, USA; San Francisco, CA, USA; Kirkland, WA, USA; New York City, NY, USA
Vision-Language Models (VLMs)Large Language Models (LLMs)Reinforcement LearningEvaluation FrameworksFoundation ModelsEmbodied AI
PhD Fall Machine Learning Intern (ATG — Visual, Multimodal, and Recommender Systems)intern
Pinterest·San Francisco, CA, US; Palo Alto, CA, US; Seattle, WA, US; New York, NY, US
Recommendation SystemsComputer VisionContrastive LearningMultimodal AIDiffusion ModelsSelf-Supervised Learning
[2026] Applied Scientist - PhD Internintern
Roblox·San Mateo, CA, United States
Recommendation SystemsMultimodal AIAgentic AILarge Language Models (LLMs)Diffusion ModelsVision-Language Models (VLMs)
Senior Machine Learning Scientistsenior
Roblox·San Mateo, CA, United States
Time-Series ForecastingFeature EngineeringA/B TestingExperiment DesignMLOps
AI Researcher (Early Talent)midRemote
Nebius·Amsterdam, Netherlands; Berlin, Germany; Remote - Europe; Remote - United States
Reinforcement LearningLarge Language Models (LLMs)Speculative DecodingDistillationInference OptimizationEvaluation Frameworks
Senior ML Engineer (AI Research)seniorRemote
Nebius·Amsterdam, Netherlands; Israel; Remote - Europe; United Kingdom
Reinforcement LearningAgentic AILong-Context ModelingDistillationReward ModelingPost-Training
Research Scientist, Wayve Labsmid
Wayve·London
World ModelsDiffusion ModelsReinforcement LearningSelf-Supervised LearningMultimodal AIEmbodied AI Systems
Senior Research Engineersenior
Decagon·San Francisco
Retrieval-Augmented Generation (RAG)Model Fine-TuningLong-Context ModelingEvaluation FrameworksAgent OrchestrationMulti-Agent Systems
Advanced Technology: AI/ML Research Scientistmid
Cerebras·Sunnyvale, CA; Toronto, Ontario, Canada; Vancouver, British Columbia, Canada
Scaling LawsDistributed TrainingFoundation ModelsPyTorch
Machine Learning Research Scientist, Behavior Planning and Prediction mid
Nuro·Mountain View, California (HQ)
Self-Supervised LearningReinforcement LearningWorld ModelsDiffusion ModelsEmbodied AIPath Planning
Machine Learning Research Scientist: Generative Modeling for Planningmid
Nuro·Mountain View, California (HQ)
Diffusion ModelsReward ModelingReinforcement LearningWorld ModelsFoundation ModelsVision-Language Models (VLMs)
ML Research Scientist, Prediction & Smart Agentsmid
Nuro·Mountain View, California (HQ)
Diffusion ModelsSelf-Supervised LearningWorld ModelsSynthetic Data GenerationReinforcement Learning
Senior ML Research Scientist, End-to-End Autonomous Drivingsenior
Nuro·Mountain View, California (HQ)
Foundation ModelsVision-Language Models (VLMs)Sensor FusionSelf-Supervised LearningObject DetectionLiDAR Processing
Senior/Staff Machine Learning Research Scientist: Generative Modeling for Planningsenior
Nuro·Mountain View, California (HQ)
Diffusion ModelsReward ModelingReinforcement LearningWorld ModelsFoundation ModelsSelf-Supervised Learning
Researcher, Post Trainingmid
Cartesia·*HQ - San Francisco, CA
Reinforcement Learning from Human Feedback (RLHF)Direct Preference Optimization (DPO)Post-TrainingReward ModelingAlignmentEvaluation Frameworks
Research Scientist Intern (PhD) - Model Team - Londonintern
H Company·Hybrid London
Reinforcement Learning from Human Feedback (RLHF)Post-TrainingSynthetic Data GenerationReward ModelingVision-Language Models (VLMs)World Models
Applied AI Researcher, System Discoverymid
Distyl AI·San Francisco
Multi-Agent SystemsRetrieval-Augmented Generation (RAG)Agentic AIAgent OrchestrationEvaluation FrameworksFoundation Models
Applied AI Researcher, System Self-Improvementmid
Distyl AI·San Francisco
Reward ModelingReinforcement Learning from Human Feedback (RLHF)Evaluation FrameworksMechanistic InterpretabilitySelf-Supervised LearningPost-Training
Applied AI Researcher, System Self-Constructionmid
Distyl AI·San Francisco
Agent OrchestrationMulti-Agent SystemsAgentic AIFoundation ModelsEvaluation Frameworks
Applied AI Researcher, AI Systemsmid
Distyl AI·San Francisco
Multi-Agent SystemsAgent OrchestrationRetrieval-Augmented Generation (RAG)Memory SystemsAgentic AIFoundation Models
Applied AI Researcher, Benchmarkingmid
Distyl AI·San Francisco
Evaluation FrameworksAdversarial TestingHuman-in-the-Loop SystemsExperiment DesignMulti-Agent Systems
Applied AI Researcher, Multi-Agent Systemsmid
Distyl AI·San Francisco
Multi-Agent SystemsAgent OrchestrationAgentic AIFoundation ModelsEvaluation Frameworks
Research Scientistmid
OpenEvidence·San Francisco
Evaluation FrameworksLarge Language Models (LLMs)Retrieval-Augmented Generation (RAG)Foundation Models
Research Scientist (post-training)mid
Genmo·San Francisco HQ
Reinforcement Learning from Human Feedback (RLHF)Direct Preference Optimization (DPO)Diffusion ModelsPost-TrainingPyTorchEvaluation Frameworks
Research Scientist, Life Sciencesmid
Anthropic·San Francisco, CA
Post-TrainingEvaluation FrameworksAgentic AIReinforcement Learning from Human Feedback (RLHF)Model Fine-TuningAnnotation Pipelines
Technical Program Manager, Researchmanager
Anthropic·San Francisco, CA | New York City, NY
Reinforcement LearningEvaluation FrameworksScaling LawsDistributed TrainingGPU Clusters
Research Engineer, Materials Sciencemid
Google DeepMind·Mountain View, California, US
Foundation ModelsLarge Language Models (LLMs)Self-Supervised LearningSynthetic Data GenerationMLOps
Research Scientist, Gemini Personal Intelligencemid
Google DeepMind·Mountain View, California, US
Post-TrainingReinforcement LearningAgentic AILong-Context ModelingReward ModelingAgent Orchestration
Member of Technical Staff, Data Analysis and Evaluationstaff
Cohere·London
Evaluation FrameworksData CurationAnnotation PipelinesModel Fine-TuningDistributed TrainingExperiment Design
Senior Research Engineer, Model Evaluationsenior
Cohere·Toronto
Evaluation FrameworksLarge Language Models (LLMs)Scaling LawsData CurationSynthetic Data Generation
Senior Research Scientist, Cohere Labssenior
Cohere·London
Foundation ModelsMulti-modal AIAgentic AIScaling LawsNatural Language Processing (NLP)Multilingual AI Capabilities
Senior Research Scientist, Model Evaluationsenior
Cohere·Toronto
Evaluation FrameworksLarge Language Models (LLMs)Synthetic Data GenerationData CurationScaling Laws
Research Internship Reinforcement Learning (Summer)intern
Cohere·Paris
Reinforcement LearningReinforcement Learning from Human Feedback (RLHF)DistillationLarge Language Models (LLMs)Long-Context ModelingReward Modeling
Sr. Staff AI Research TLM - AI Systemssenior
Databricks·Mountain View, California; San Francisco, California
Scaling LawsDistributed TrainingPost-TrainingReinforcement LearningInference OptimizationLarge Language Models (LLMs)
Research Scientistmid
Cursor·San Francisco
Reinforcement LearningReward ModelingScaling LawsEvaluation FrameworksData CurationGRPO (Group Relative Policy Optimization)
Data Scientist, Performance and Reliabilitymid
Cursor·San Francisco
Evaluation FrameworksA/B TestingExperiment DesignModel Monitoring & Observability
Multimodal LLM Researcher (MLLM)mid
Pika·Palo Alto HQ
Multimodal AIDiffusion ModelsVision-Language Models (VLMs)Agent OrchestrationAudio GenerationModel Fine-Tuning
Research Scientistmid
Anduril·Broomfield, Colorado, United States
Reinforcement LearningSignal ProcessingScaling LawsSynthetic Data Generation
Senior Advanced Research Scientist senior
Anduril·Broomfield, Colorado, United States; Fort Collins, Colorado, United States
Signal ProcessingSensor FusionObject DetectionComputer VisionAnomaly Detection
Machine Learning Scientist (All Levels)mid
Abridge·SF Office
Natural Language Processing (NLP)Foundation ModelsASRTransformer ArchitecturesPyTorchEvaluation Frameworks
Senior Data Scientistsenior
Abridge·SF Office
A/B TestingExperiment DesignEvaluation FrameworksModel Monitoring & ObservabilityAnomaly Detection
Director, Data Sciencedirector
Abridge·SF Office
Evaluation FrameworksA/B TestingExperiment DesignModel Monitoring & ObservabilityAnnotation Pipelines
Applied Scientist / Domain Expert, AI4Engineering - EMEAmid
Mistral AI·Paris
Synthetic Data GenerationDigital TwinsData CurationFoundation ModelsEvaluation Frameworks