Description
PolyAI is at the forefront of automating customer service through voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction. We are seeking a dedicated and innovative Director of Data and Model Optimisation to join our team and elevate our machine learning models to new heights.
As the Director of Data and Model Optimisation, you will be responsible for shaping and implementing the strategy for data collection, data pipeline development, and model optimisation. Your primary focus will be on Automatic Speech Recognition (ASR) and Large Language Models (LLMs). You will play a critical role in ensuring the delivery of high-quality data and annotations, iterating on model improvements, and driving the overall performance of our production-grade machine learning models.
Requirements
- PhD in Computer Science, Machine Learning, or a related field, or equivalent industry experience.
- Minimum of 5 years of experience working with deep learning and statistical models, with at least 3 years of industry experience.
- Experience in training and optimising large language models (LLMs).
- Strong expertise in developing and managing data pipelines for machine learning applications.
- In-depth knowledge of data quality standards and annotation processes.
- Proficiency in Python and familiarity with relevant ML frameworks and libraries.
- Experience with cloud services such as AWS, GCP, or Azure.
- Demonstrated ability to lead and mentor technical teams, promoting a collaborative and innovative work environment.
- Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences.
- A passion for tackling technical challenges and driving practical solutions.
Responsibilities
- Develop and manage data pipelines to efficiently and compliantly transport data into our machine learning data lake.
- Define the structure and quality standards for data collection to ensure high-quality annotations and robust datasets.
- Lead initiatives to improve data quality and quantity, focusing on the specific needs of ASR models and LLMs.
- Iterate on model quality improvements, employing cutting-edge techniques and methodologies to enhance performance.
- Collaborate closely with cross-functional teams, including data engineers, machine learning engineers, and product teams, to align data and model optimisation strategies with business goals.
- Oversee the end-to-end lifecycle of model development, from data collection and preprocessing to model training, evaluation, and deployment.
- Stay abreast of the latest advancements in machine learning, ASR, and LLMs to ensure the continuous evolution of our technologies.
- Mentor and guide a team of data scientists and machine learning engineers, fostering a culture of innovation and excellence.
Benefits
- 💰 Participation in the company’s employee share options plan
- 🏝 25 days holiday, plus bank holidays
- 🏡 Flexible working from home policy, plus a one-off WFH allowance when you join
- 🌎 Work from outside of the UK for up to 6 months each year
- 🧡 Enhanced parental leave
- 📚 Yearly learning budget
- 🚲 Bike2Work scheme
- 📚 Annual learning and development allowance
- 👨‍👩‍👧 Company-funded fertility and family-forming programmes
- 🌸 Menopause care programme with Maven
- 🏥 Private healthcare and dental cover, discounts on gym memberships and relaxation apps, and access to a range of mental health programmes
Equal Opportunity Statement
PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All employment decisions at PolyAI will be based on business needs, without regard to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.