AI Researcher

LabelboxSan Francisco BayRemote, Onsite

Job FunctionsConduct cutting-edge research on advanced methods for integrating human feedback into AI systemsDesign and develop sophisticated systems to measure, enhance, and leverage the quality of human-generated data for AI trainingCreate AI-assisted tools that incorporate active learning and adaptive sampling techniques to increase the efficiency and effectiveness of the human data labeling processInvestigate the impact of different types of human feedback (e.g., demonstrations, preferences, critiques) on model performance and alignmentDevelop and implement novel algorithms for learning from human preferences and for optimizing the human feedback collection process

Job RequirementsPh.D. or Master's degree in Computer Science, Machine Learning, AI, Human-Computer Interaction, or a related fieldExperience with frontier models (e.g., large language models, multimodal models) and their human data requirementsFamiliarity with advanced AI alignment techniques and their practical applications

SkillsStrong background in machine learning, deep learning, and natural language processingExtensive experience with advanced human-AI interaction techniques, particularly RLHF and other human-in-the-loop learning methodsExpertise in designing and implementing data quality measurement systems for human-generated dataDeep understanding of frontier models (e.g., large language models, multimodal models) and their human data requirementsProficiency in programming languages such as Python, and experience with deep learning frameworks (e.g., PyTorch, TensorFlow)Familiarity with advanced AI alignment techniques and their practical applicationsExperience with human subject research, experimental design for data collection, and ethical considerations in AIExcellent research skills with a track record of publications in top-tier AI conferences or journals related to human-AI interactionStrong analytical and problem-solving abilitiesExcellent communication skills and ability to collaborate in a multidisciplinary team

Labelbox is the data factory for generative AI, providing the highest quality training data for frontier and task-specific models. Labelbox’s comprehensive platform combines on-demand labeling services with the industry-leading data labeling platform. The Boost labeling service is powered by the Alignerr community of highly-educated experts, who span all major languages and a diverse range of advanced subjects. They are available on-demand to rapidly generate new data for supervised fine-tuning, RLHF, and more. Labelbox’s software-first approach delivers unmatched control and transparency into the labeling process, leading to the generation of high-quality, consistent data at scale.

Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs.

About the Role

As a Human Data AI Researcher at Labelbox Labs, you will be at the forefront of developing cutting-edge methods to create, analyze, and leverage high-quality human-generated data for frontier model developers. Your role will involve designing and implementing advanced systems that incorporate human feedback into AI training processes, such as Reinforcement Learning from Human Feedback (RLHF). You will also work on innovative techniques to measure and improve human data quality, and develop AI-assisted tools to enhance the data labeling process.

Your expertise in machine learning, human-AI interaction, and advanced human data integration techniques will be crucial in pushing the boundaries of AI capabilities and delivering state-of-the-art solutions to meet the evolving needs of our customers.

About You

Ph.D. or Master's degree in Computer Science, Machine Learning, AI, Human-Computer Interaction, or a related field
Strong background in machine learning, deep learning, and natural language processing
Extensive experience with advanced human-AI interaction techniques, particularly RLHF and other human-in-the-loop learning methods
Expertise in designing and implementing data quality measurement systems for human-generated data
Deep understanding of frontier models (e.g., large language models, multimodal models) and their human data requirements
Proficiency in programming languages such as Python, and experience with deep learning frameworks (e.g., PyTorch, TensorFlow)
Familiarity with advanced AI alignment techniques and their practical applications
Experience with human subject research, experimental design for data collection, and ethical considerations in AI
Strong background in developing and implementing human preference learning algorithms
Excellent research skills with a track record of publications in top-tier AI conferences or journals related to human-AI interaction
Strong analytical and problem-solving abilities
Excellent communication skills and ability to collaborate in a multidisciplinary team

Your Day to Day

Conduct cutting-edge research on advanced methods for integrating human feedback into AI systems, including RLHF and other novel approaches
Design and develop sophisticated systems to measure, enhance, and leverage the quality of human-generated data for AI training
Create AI-assisted tools that incorporate active learning and adaptive sampling techniques to increase the efficiency and effectiveness of the human data labeling process
Investigate the impact of different types of human feedback (e.g., demonstrations, preferences, critiques) on model performance and alignment
Develop and implement novel algorithms for learning from human preferences and for optimizing the human feedback collection process
Collaborate with engineering and product teams to integrate research findings into Labelbox's product suite, focusing on scalable and practical applications of human-AI interaction techniques
Engage with customers and the AI community to understand evolving human data needs for frontier models and to share insights on best practices
Publish research findings in top-tier academic journals and present at leading AI conferences
Stay at the forefront of advancements in AI, particularly in areas related to human data quality, human-AI collaboration, and AI alignment
Contribute to technical documentation, blog posts, and educational content to establish Labelbox as a thought leader in human-centric AI development

Research at Labelbox Labs

At Labelbox Labs, we're committed to pushing the boundaries of AI and data-centric machine learning, with a particular focus on advanced human-AI interaction techniques. We believe that high-quality human data and sophisticated human feedback integration methods are key to unlocking the next generation of AI capabilities. Our research team works at the intersection of machine learning, human-computer interaction, and AI ethics to develop innovative solutions that can be practically applied in real-world scenarios.We foster an environment of intellectual curiosity, collaboration, and innovation.

We encourage our researchers to explore new ideas, engage in open discussions, and contribute to the wider AI community through publications and conference presentations. Our goal is to be at the forefront of human-centric AI development, setting new standards for how AI systems learn from and interact with humans.Labelbox strives to ensure pay parity across the organization and discuss compensation transparently. The expected annual base salary range for United States-based candidatesis below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.Annual base salary range$220,000 - $300,000USD

Excel in a remote-friendly hybrid model.

We are dedicated to achieving excellence and recognize the importance of bringing our talented team together. While we continue to embrace remote work, we have transitioned to a hybrid model with a focus on nurturing collaboration and connection within our dedicated tech hubs in the San Francisco Bay Area, New York City Metro Area, and Wrocław, Poland. We encourage asynchronous communication, autonomy, and ownership of tasks, with the added convenience of hub-based gatherings.

Your Personal Data Privacy:

Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox’s

Job Applicant Privacy notice.Any emails from Labelbox team members will originate from a @labelbox.com email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.

Life at Labelbox

Labelbox is the simplest platform to train and operate machine intelligence. Labelbox solves the problem of taking artificial intelligence and machine learning initiatives from research and development into production. In addition to working directly with their customers, Labelbox's main product is a platform that makes it easy to create and manage labeled data, enabling rapid deployment of artificial intelligence applications. Visit us, see what organizations are building with us, and explore Labelbox for free at www.labelbox.com.

Thrive Here & What We Value1. Remote-friendly hybrid model2. Nurturing collaboration and connection within dedicated tech hubs (San Francisco Bay Area, New York City Metro Area, Wrocław)3. Asynchronous communication, autonomy, task ownership4. Collaborative excellence with shared responsibility in decision-making5. Recognizing team talent and achieving excellence6. Striving for a great developer experience through continuous fine-tuning7. Learning by pushing boundaries and engaging in open debate8. Commitment to execution of creative solutions9. Customer success as the North Star10. Embracing remote work with hybrid model focus on tech hubs