Research Scientist
Push the frontier of AI. Publish papers, develop new architectures, advance capabilities.
0
Open Positions
Core Skills
Transformer ArchitecturesScaling LawsSelf-Supervised LearningMixture-of-ExpertsDiffusion ModelsWorld ModelsPyTorch
Active Positions (42)
Staff GenAI Research Scientiststaff
Databricks·New York City, New York
generative AI modelsLLMstext-to-image modelsfine-tuningRLHFLLM tool-use
Research Scientist, Post-AGI Researchmid
Google DeepMind·London, UK
post-AGI researchscaling laws (test-time)group agentssuperhuman regime benchmarkingAI progress forecastingAGI trajectory modeling
Research Scientist, Strategic Initiatives, Multimedia MLmid
Google DeepMind·London, UK
SynthIDmultimedia GenAI pre-trainingmultimedia GenAI post-trainingdata influence analysisdata trustworthinessmultimodal inference
Applied Research Science Lead, Reinforcement Learning seniorRemote
Runway·Remote
Reinforcement Learning (RL)Model alignmentLanguage generation alignmentImage generation alignmentVideo generation alignmentHuman-in-the-loop systems
Member of Technical Staff, Applied Research ScientiststaffRemote
Runway·Remote
World ModelsGenerative ModelsComputer VisionMultimodal AIMedia Generation
Member of Technical Staff, Research EngineerstaffRemote
Runway·Remote
Generative ModelsComputer VisionMedia GenerationMultimodal AI
[Expression of Interest] Research Engineer, Production Model Post-Training - Londonmid
Anthropic·London, UK
post-training stackproduction Claude modelsfrontier scale trainingpost-training techniquesproduction runs
[Expression of Interest] Research Manager, Interpretabilitymanager
Anthropic·San Francisco, CA
mechanistic interpretabilityneural network reverse engineeringtransformer circuitsinterpretability researchAI safety via interpretability
Research Engineer, Discoverymid
Anthropic·San Francisco, CA
AI ScientistScientific AGIVM/Sandboxing/Container DeploymentLarge Scale Data PipelinesEvaluation Frameworks
Research Engineer, Economic Researchmid
Anthropic·San Francisco, CA
Economic Impact ResearchPrivacy-Preserving Analysis ToolsClio Research ToolsAI Usage Pattern MonitoringScalable Data Systems for Research
Research Engineer, Machine Learning (Reinforcement Learning)mid
Anthropic·London, UK
Reinforcement LearningRLHFConstitutional AIScalable RL infrastructureModel reasoning capabilitiesAutonomy capabilities
Research Engineer, Science of Scalingmid
Anthropic·London, UK
Science of ScalingTraining Infrastructure OptimizationDev ToolingCompute EfficiencyExperimental DesignLarge Language Model Development
Research Scientist, Interpretabilitymid
Anthropic·San Francisco, CA
Mechanistic InterpretabilityNeural Network Reverse EngineeringCircuit Discovery in Neural NetworksModel Mechanistic UnderstandingInterpretability Tool Development
Machine Learning Research Engineer, GenAI Applied MLmid
Scale AI·San Francisco, CA; New York, NY
Multi-agent systemsAgentic reasoning validationAgentic LLMsAgent failure modesAI tools for prototypingData-driven evaluations
Staff Research Engineer, Discovery Teamstaff
Anthropic·San Francisco, CA
AI Scientist DevelopmentLong-horizon task completionScientific AGIModel Capability Evaluation Frameworks
Research Engineer/Scientist - Generative UI, Consumer Devicesmid
OpenAI·San Francisco
Generative ModelsUI Generation EvaluationFuture of Computing ResearchConsumer Devices AI ResearchModel Capability Evaluation Recipes
Research Engineer, AlphaEarth, Sciencemid
Google DeepMind·London, UK; New York City, New York, US
AlphaEarthGeospatial AIGeospatial Intelligence
Research Engineer, Educationmid
Google DeepMind·London, UK
Complex Instruction FollowingMultimodal UnderstandingReinforcement Learning for EducationAgentic AILearnLMGuided Learning
Research Engineer, GenMediamid
Google DeepMind·Mountain View, California, US
Imagen 4Nano BananaVeoGenerative ModelsModel Training OptimizationDataset Curation for Generative AI
Research Engineer, Multimodal Reinforcement Learning mid
Google DeepMind·Zurich, Switzerland
Multimodal Reinforcement LearningMeta Reinforcement LearningRetrieval-Augmented Generation (RAG)Conversational Learning EnvironmentsChain-of-Thought (CoT)Multimodal Reasoning
Research Engineer, Quantum Computingmid
Google DeepMind·London, UK
Quantum ComputingAI for Scientific DiscoveryQuantum Algorithm DevelopmentFault-Tolerant Quantum ComputingMachine Learning for Quantum Systems
Research Scientist, AI-powered Scientific Discoverymid
Google DeepMind·Montreal, Canada
AI for Scientific DiscoveryLLM Fine-Tuning with Reinforcement LearningCode Execution with LLMsRetrieval-Augmented Generation (RAG) for ScienceLarge Language Models (LLMs) for ExplorationOpen-ended Empirical Research with AI
Research Scientist, Science of Post-Training and Reinforcement Learningmid
Google DeepMind·London, UK
Post-TrainingReinforcement Learning for LLMsLLM-Based AgentsScaling Laws for Post-TrainingEvaluation Frameworks
Research Engineermid
Cohere·Toronto
Retrieval-Augmented Generation (RAG)Agentic AINatural Language Processing (NLP)
Research Scientist, Reinforcement Learningmid
Google DeepMind·London, UK
reinforcement learning algorithmsDQNAlphaGoRainbowAlphaZeroMuZero
AI Research Engineer, Enterprise Evaluationsmid
Scale AI·San Francisco, CA; New York, NY
GenAI Evaluation SuiteLLM-as-a-Judge autorater frameworksRLAIFmodel-judging-model setupsAI-assisted evaluation systemshuman-rated datasets
Machine Learning Fellow - Human Frontier Collective (Canada)mid
Scale AI·Canada
Human Frontier Collective (HFC) FellowshipGPU optimizationPyTorch model optimizationSciPredictPropensityBenchProfessional Reasoning Benchmark
Machine Learning Research Intern (Summer 2026)intern
Scale AI·San Francisco, CA
frontier modelsscalable oversightsynthetic data pipelinesred teamingevaluation sciencedangerous capabilities measurement
Machine Learning Research Scientist / Engineer, Reasoningmid
Scale AI·San Francisco, CA; Seattle, WA; New York, NY
LLM reasoningbrowser agentssoftware engineering (SWE) agentsplanning algorithmsagentic reasoningdata generation for LLMs
Research Engineer, Core MLmid
Together AI·San Francisco
RL algorithmsGRPO-style objectivesSGLangvLLMspeculative decodingATLAS
Senior Research Scientistsenior
Anduril·Broomfield, Colorado, United States
Lattice OSSensor FusionAutonomyComputer VisionInformation ScienceHigh-performance Software
Staff Research Engineering Leadsenior
Anduril·Costa Mesa, California, United States
reinforcement learningsensor perceptionpredictiondecision-makingAgentic Reasoningintegrated agents
Research Scientist (Measurement and Evaluation)mid
Abridge·NYC Office
Ambient AI evaluationClinical conversation data analysisQuasi-Experimental MethodsMeasurement Frameworks for AI ImpactProvider Experience MeasurementClinical decision-making evaluation
Applied Scientistmid
Wolt·Berlin, Germany; Helsinki, Finland; Stockholm, Sweden; Tallinn, Estonia
phishing-resistant MFAJust-In-Time accessCIS Critical Security ControlsNIST Cybersecurity Framework (CSF)Endpoint Detection and Response (EDR)secure SaaS enablement
Senior+ Software Engineer, Research Toolssenior
Anthropic·San Francisco, CA | New York City, NY
Human feedback interfaces for model evaluationExperiment Orchestration PlatformsModel Behavior Visualization ToolsResearch workflow optimizationFull-stack applications for AI researchFeedback Collection Systems for AI Experiments
Senior Research Scientist, Reward ModelsseniorRemote
Anthropic·Remote-Friendly (Travel Required) | San Francisco, CA
Reward ModelingRLHFLLM-Based EvaluationRubric-Based Grading MethodsReward Hacking MitigationPreference Learning at Scale
Research Scientist, Gemini Diffusionmid
Google DeepMind·London, UK
text diffusion modelsGemini Diffusiongenerative AI latencyparadigm-shifting AI researchfrontier model capabilitiesspeculative AI research
Deputy Chief Scientistmid
Anduril·McHenry, Mississippi, United States
Lattice OSautonomous dronessolid rocket motorssensor fusioncomputer visionTactical Recon & Strike (TRS)
Senior Applied Scientistsenior
Datadog·Paris, France
Anomaly DetectionError Outlier DetectionFaulty Deployment AnalysisStreaming data analysisMachine Learning Model MonitoringAcademic research paper review (journal club)
Research Scientist, AQUAmid
Google DeepMind·Bangalore, India
Autonomous AgentsReinforcement Learning for AgentsML Optimization Methods for AgentsEmergent Agentic BehaviorsGemini ModelsLarge Language Models (LLMs) for Agents
Research Scientist: Multilingual, Multicultural and Multimodal LLMmid
Google DeepMind·Tokyo, Japan
multilingual LLMsmulticultural LLMsmultimodal LLMsAPAC region AIspeech-vision-text integrationGemini multimodal research
Technical Advisor Specialist - GenAImid
Scale AI·San Francisco, CA
competitive codinggenerative AImodel failure modesAI reasoning tasks