logo inner

Data Scientist

DeepgramOnsite
This job is no longer open

Company Overview


Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI platform including access to models for speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

The Opportunity


At Deepgram, we believe that data is the key to unlock the future of voice-enabled experiences. But building with audio data is hard -- audio poses incredibly rich scientific, engineering, and infrastructure challenges that are orders of magnitude harder than working with text. As a Data Scientist at Deepgram, you will tackle conversational audio at scale, establishing automated data streams that will power the next generation of Voice AI foundation models. The models we build will go beyond basic transcription and comprehension to capture nuanced meanings in complex conversations, adapt robustly to diverse speech patterns, and generate empathic responses with human-like, contextualized speech.  Domain-specific expertise in speech or language AI is not required.

Rather we’re looking for seasoned scientists who have a track record of solving hard data problems while exploring research frontiers. Our start-up environment offers a stunning growth trajectory for adventure-seeking individuals, providing a level of project ownership and on-ground connection with end-customers that larger research labs simply cannot provide. 

What You’ll Do



  • Build high performance data acquisition, preparation and synthesis pipelines and drive them to generate data for training foundational voice models across modalities and tasks

  • Develop advanced characterizations of complex conversational audio utilizing a diverse toolkit of signals processing techniques and deep learning methods

  • Collaborate with DataOps and Engineering to create automated systems which scale the ability of human annotators to label high value data and provide feedback on model outputs

  • Build advanced benchmarking methodologies for evaluating  interactive, conversational agent systems

It’s Important To Us That You Have



  • Experience building data processing pipelines from a blank page and owning the entire data stack including acquisition, characterization, cleaning, serving and transformation

  • Experience applying statistical methods or deep learning models to understand complex data

  • Ability to design and carry out research programs independently and with minimal oversight

  • Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with Pytorch

  • Strong communication skills and the ability to translate complex concepts in simple terms, depending on the target audience

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing.

We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.We are happy to provide accommodations for applicants who need them.Compensation Range: $150K - $220K

This job is no longer open

Life at Deepgram

Deepgram builds Speech Recognition for Enterprise. Powered by our own Deep Learning models, our scalable API reliably translates high-value audio into parsable data with industry leading accuracy. Deepgram is an NVIDIA partner and a Y Combinator company.
Thrive Here & What We Value* Equal Opportunity Employer* Collaboration and Doing the Right Thing* CustomerCentric Approach* GrowthOriented Company* CuttingEdge Technology
Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2024