Data & Synthetic Data Engineer
Build data pipelines, curation systems, and synthetic data generation for training AI models.
0
Open Positions
Core Skills
Synthetic Data GenerationData Curation PipelinesSparkAirflowdbtData QualityAnnotation PipelinesBigQuery
Active Positions (8)
Software Engineer, Human Data Interfacemid
Anthropic·San Francisco, CA | New York City, NY
Human Data Interfacesdata collection pipelinescrowdworker experiencevendor toolingdata quality at scalerapid iteration systems
Data Scientist, Integritymid
OpenAI·San Francisco
AI Fraud Detection SystemsPlatform Abuse MitigationGPT-5 for Fraud DetectionScaled Abuse PreventionAdversarial AI DetectionTrust & Safety Operations Analytics
Director of Research Engineering, DatasetsdirectorRemote
Runway·Remote
dataset engineeringdata acquisition strategydata partnership managementdata cost optimizationdata ecosystem analysis
Senior Software Engineer, Agentic Data Productssenior
Scale AI·San Francisco, CA
agent-powered toolsLLM integrationsvector databasesagentic frameworksfull-stack product developmentReact + TypeScript frontends
Senior Data Engineer - AI Focused (x/f/m)senior
Doctolib·Paris, Paris, France
Large Language Models (LLM)Vision-Language Models (VLMs)Retrieval-Augmented Generation (RAG)AI Medical CompanionVector DatabasesGoogle Cloud Platform (GCP)
Staff AI Data Engineer (x/f/m)staff
Doctolib·Paris, Paris, France
DagsterBigQueryDBT (Data Build Tool)Data Anonymization for Privacy ComplianceGDPR adherence in data pipelinesOnline metrics pipelines for AI monitoring
Data Quality Analystmid
Figure AI·San Jose, CA
Proprietary Annotation SoftwareRobot data annotation workflows
Helix Data Creatormid
Figure AI·Spartanburg, SC
Humanoid robot motion data collectionSensor-Guided Motion CaptureAI for Robotics