Data & Analytics Engineer
Build data pipelines and analytics systems for AI training and operational insights
320
Open Positions
Core Skills
SQLSparkAirflowBigQuerydbtETL Pipelines
Active Positions (50)
Senior Product Engineer, Growth & Lifecycle Infrastructure - Music & AudioseniorRemote
Stability AI·Los Angeles, CA or Remote (United States)
A/B TestingExperiment DesignETL PipelinesEmbeddings
Manager, Field Engineering - Financial Services Industrymanager
Databricks·Toronto, Canada
Apache SparkDatabricks Data Intelligence PlatformDelta LakeMLflow
Data Engineer, People Innovation Labsmid
OpenAI·San Francisco
Databricks Data Intelligence PlatformETL PipelinesApache AirflowFeature Engineering
Sales Systems Engineer, Enterprise Operationsmid
Perplexity AI·San Francisco
SnowflakedbtETL Pipelines
Delivery Solutions Architect - Communications, Media, Entertainment & Gamesmid
Databricks·United States
Databricks Data Intelligence PlatformApache SparkDelta LakeRecommendation SystemsMLOps
Lakebase Sales Specialist - Retailmid
Databricks·United States
Databricks Data Intelligence PlatformVector DatabasesAgentic AI
Quantitative UX Researchermid
OpenAI·San Francisco
A/B TestingExperiment DesignUser Research
Data Scientist, Core Experimentation mid
OpenAI·Seattle
Experiment DesignA/B TestingEvaluation FrameworksAnomaly DetectionFeature Engineering
Applied Data Science & Insights Leader - GTM Intelligence Solutions and Technical Successsenior
OpenAI·San Francisco
Recommendation SystemsA/B TestingAnomaly DetectionFeature EngineeringExperiment DesignTime-Series Forecasting
Data Engineer, Data Foundationsmid
Cohere·New York
Apache SparkApache AirflowBigQuerydbtETL PipelinesData Quality
Data & AI Platform Architect (Professional Services)mid
Databricks·Paris, France
Apache SparkDatabricks Data Intelligence PlatformDelta LakeMLflowETL PipelinesMLOps
Delivery Solutions Architectmid
Databricks·Amsterdam, Netherlands
Databricks Data Intelligence PlatformApache SparkDelta LakeMLflowMLOps
Data Scientist, Infrastructuremid
OpenAI·San Francisco
GPU ClustersExperiment DesignTime-Series ForecastingFeature Engineering
Director, Lakebase Sales Specialists - Retaildirector
Databricks·United States
Databricks Data Intelligence PlatformApache SparkDelta Lake
IT Controls Data Engineermid
OpenAI·San Francisco
ETL PipelinesdbtData QualityApache Airflow
Data Scientist, Financial Engineering mid
OpenAI·San Francisco
A/B TestingExperiment DesignAnomaly DetectionTime-Series Forecasting
Lead Data Scientist, Platform Productsenior
Anthropic·New York City, NY | Seattle, WA; San Francisco, CA
A/B TestingExperiment DesignModel Context Protocol (MCP)Agent OrchestrationFeature EngineeringModel Monitoring & Observability
Data Full Stackmid
Alan·Paris, France; Marseille, France; Bordeaux, France; Biarritz, France; Brussels, Belgium
Apache AirflowSnowflakeExperiment DesignFeature Engineering
Data Scientist, Platform and B2B Productsmid
OpenAI·San Francisco
A/B TestingExperiment DesignAgentic AILarge Language Models (LLMs)Model Monitoring & Observability
Senior Data Engineer, Core Experimentationsenior
OpenAI·Seattle
ETL PipelinesApache AirflowDatabricks Data Intelligence PlatformFeature EngineeringExperiment Design
Software Engineer, Research Data Platformmid
Anthropic·San Francisco, CA | New York City, NY
ETL PipelinesData CurationFeature StoresModel Monitoring & ObservabilityReinforcement LearningData Quality
Data Analyst - Physical Infrastructuremid
xAI·Memphis, TN
Time-Series ForecastingAnomaly DetectionETL Pipelines
Staff Product Analytics EngineerstaffRemote
Runway·Remote
ETL PipelinesdbtA/B TestingModel Monitoring & ObservabilityDatabricks Data Intelligence Platform
AI Solutions Architect (Pre-sales) - Strategic Accountsmid
Databricks·Amsterdam, Netherlands
Databricks Data Intelligence PlatformApache SparkDelta LakeMLflowFeature Engineering
Data Engineermid
Abridge·SF Office
ETL PipelinesData CurationApache AirflowAnnotation PipelinesMLOps
Senior Software Engineer, Mapping Field Responsesenior
Waymo·Mountain View, CA, USA
ETL PipelinesData CurationReal-time Systems
Data EngineermidRemote
Dataiku·Germany, Berlin - Remote; Germany, Remote; Netherlands, Amsterdam; Netherlands, Remote; Spain, Remote; United Kingdom, London; United Kingdom, Remote
SnowflakeETL PipelinesdbtApache SparkData Quality
Data Platform Engineermid
Cursor·San Francisco
Databricks Data Intelligence PlatformETL PipelinesApache SparkData QualityData CurationVector Databases
Technical Lead Manager, Data Engineering, Trust & Safetysenior
OpenAI·San Francisco
ETL PipelinesApache SparkData QualityAnomaly DetectionMLOpsData Curation
Sr. Staff Software Engineer, Data Product PlatformseniorRemote
Pinterest·San Francisco, CA, US; Remote, US
ETL PipelinesData QualityData CurationApache SparkFeature StoresAI Governance
Software Engineer, Logs Infrastructuremid
Waymo·Mountain View, CA, USA; San Francisco, CA, USA
ETL PipelinesData CurationAutonomous Driving
Tech Lead Manager, Data Engineersenior
Waymo·Mountain View, CA
ETL PipelinesApache SparkApache KafkaBigQueryData QualityData Curation
Senior Software Engineer - Data Infrastructure, Safetysenior
Roblox·San Mateo, CA, United States
MLOpsApache KafkaAnomaly DetectionData Curation
Senior Data Platform Engineersenior
Pinecone·New York City
BigQuerydbtApache AirflowApache KafkaSnowflakeETL Pipelines
Machine Learning Data Systemsmid
Cursor·San Francisco
Apache SparkDatabricks Data Intelligence PlatformETL PipelinesData CurationRayData Quality
Sr. Software Engineer, Big Data, tvScientificseniorRemote
Pinterest·San Francisco, CA, US; Remote, US
Apache SparkETL PipelinesApache KafkaData Curation
Engineering Manager, HADRmanager
Stripe·Seattle
ElasticsearchApache SparkStreaming Data (Flink)Vector Databases
Analytics Engineer, Safety Systemsmid
OpenAI·San Francisco
Data QualityEvaluation FrameworksModel Monitoring & ObservabilitydbtAnomaly DetectionETL Pipelines
Software Engineer, Data Infrastructuremid
Cohere·New York
Apache SparkApache AirflowBigQuerydbtETL PipelinesData Curation
Staff Software Engineer, Big Data StoragestaffRemote
Pinterest·Palo Alto, CA, US; Remote, US
Apache SparkETL PipelinesData CurationVector Databases
Data/Infrastructure Advocate Engineer - EMEA RemotemidRemote
Hugging Face·Paris, France
Data CurationEmbeddingsVector DatabasesApache SparkData Quality
Senior Software Engineer - Analytics Data Platform Lakehousesenior
Datadog·New York, New York, USA
Apache SparkETL PipelinesApache KafkaDatabricks Data Intelligence Platform
Software Engineermid
Waymo·PERM - N/A
ETL PipelinesData CurationAnomaly DetectionMLOps
Software Engineer III, Data Platformmid
Agility Robotics·Hybrid- Any Office (Fremont, CA, Salem, OR, or Pittsburgh, PA)
Apache SparkApache KafkaStreaming Data (Flink)Data CurationMLOpsModel Monitoring & Observability
Data/Infrastructure Advocate Engineer - US RemotemidRemote
Hugging Face·New York, United States
Data CurationEmbeddingsVector DatabasesApache SparkData Quality
Data Engineer, Scaling Analyticsmid
OpenAI·San Francisco
ETL PipelinesData QualityApache SparkMLOpsBigQuerySnowflake
Data Scientist, Codexmid
OpenAI·San Francisco
A/B TestingEvaluation FrameworksLarge Language Models (LLMs)AI-Assisted Code GenerationExperiment Design
Backend Software Engineer, Growthmid
OpenAI·San Francisco
A/B TestingExperiment DesignLLM Integration
Data Engineer IImidRemote
Dataiku·Netherlands, Amsterdam; Netherlands, Remote; Spain, Remote; United Kingdom, London; United Kingdom, Remote
SnowflakeETL PipelinesdbtApache SparkData Quality
Manager, Field Engineering France - Specialist Solutions Architectsmanager
Databricks·Paris, France
Apache SparkDatabricks Data Intelligence PlatformDelta LakeMLflow