Speech & Audio AI
Build speech recognition, text-to-speech, audio generation, and voice AI systems.
Core Skills
Text-to-Speech, ASR, Audio Transformers, Whisper, Voice Cloning, Audio Generation, Signal Processing
Active Positions (5)
Research Engineer/Research Scientist, Audio·mid
Anthropic·San Francisco, CA
Audio ML, Audio codecs, Speech language models, Audio diffusion models, Speech-to-speech, Speech translation
Research Engineer / Machine Learning Engineer - Applied Voice·mid
OpenAI·San Francisco
speech-to-speech, transcribing, text to speech, gpt-realtime, ASR, TTS
Applied Research Science Lead, Generative Audio·senior·Remote
Runway·Remote
generative audio, multimodal AI, applied ML research, computer vision, research-to-production transition
Senior Machine Learning Engineer - Voice Model (ASR/STT) - AI Teams (x/f/m)·senior
Doctolib·Paris, Paris, France
ASR, STT, WER, medical term error rate, diarization, domain adaptation
Research Scientist, Audio·mid
Google DeepMind·New York City, New York, US
audio tokenizers, audio-visual understanding, audio generation modeling, audio dubbing, acoustic representations, audio pre-training