The Role
The Lead Machine Learning Engineer is an individual contributor and a technical lead who will build, monitor, and maintain Tala’s core machine learning and causal inference services and tooling. You will own customer-facing real-time streaming feature extraction and model inference, model-related batch compute platforms and jobs, service level objective definition and measurement, root cause analysis, software and architecture design, enterprise technical maturity assessment, highly effective cross-functional collaboration, and mentorship.
What You'll Do
- Develop Data Scientist and Analyst-friendly self-service tooling and frameworks to explore new data sources, extract new features, and train, test, deploy, and monitor models
- Optimize the model development and software development life cycles
- Maximize quality of models, services, and tooling with unit testing, integration testing, dry run and blue-green deployment, infrastructure-as-code, automation, observability, and fault tolerance
- Write and review design documents, perform code reviews, and weigh in on implementation choices from other technical teams
- Collaborate with and support cross-functional teams (Product, Data Platform, Credit, and Business Development)
What You'll Need
- 6+ years backend software experience in consumer scale applications, at least 4 of them with Python
- 2+ of those years in real-time streaming data (Kafka, Kinesis, Beam, Flink, Spark Streaming)
- 2+ of those years in a tech lead role
- Proficiency with machine learning tools and tech (Jupyter, Pandas, Scikit-Learn, Xgboost, Tensorflow, Pytorch, Hugging Face
- Strong database experience, both relational and non-relational (MySQL, PostgreSQL, Cassandra, HDFS, Snowflake, Druid)
- Strong hands-on experience in cloud computing (AWS, GCP, Azure, Kubernetes)
- Experience with batch processing platforms (Airflow, Metaflow)
- Experience autonomously building machine learning or causal inference models to solve business problems
Apply for this job