AI Compiler & Kernel Engineer
Write CUDA kernels, ML compilers, and low-level optimizations for AI workloads.
0
Open Positions
Core Skills
CUDA KernelsTritonXLAMLIRC++GPU ProgrammingCompiler DesignFlashAttention
Active Positions (6)
Software Engineer, Encoding Librariesmid
Anthropic·San Francisco, CA | New York City, NY
Encoding libraries for multimodal dataText Encoding for LLMs
Staff Software Engineer - GenAI Performance and Kernelstaff
Databricks·San Francisco, California
GPU OptimizationAttention Kernel OptimizationMLP Kernel OptimizationKernel FusionMixed Precision TrainingQuantization Techniques
Member of Engineering (Pre-training / CUDA)midRemote
Poolside AI·Remote (EMEA/East Coast)
CUDAPre-training of AI ModelsAgentic AICoding Assistants
Staff Software Engineer - Linux/Kernelstaff
Datadog·Tel Aviv, Israel
eBPF (Extended Berkeley Packet Filter)Zero-instrumentation ObservabilityLayer 7 Protocol ClassificationTLS-Encrypted Traffic DecodingRED Metrics (Requests, Errors, Duration)Automatic Service Discovery
TPU Kernel Engineermid
Anthropic·San Francisco, CA | New York City, NY | Seattle, WA
TPU kernel optimizationlow-precision inferencequantizationkernel designML framework internalstransformer language modeling
Senior GenAI Research Engineer - Optimization and Kernelssenior
Databricks·San Francisco, California
Kernel FusionMixed Precision TrainingMemory Layout OptimizationTiling StrategiesTensorizationGPU Optimization