Research Inference, Tech Lead
San Francisco, CA
About the Team
The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference.
Our priorities are to maximize training throughput (how quickly we can train a new model) and researcher throughput (how quickly we can develop new models) with the goal of accelerating progress towards AGI. We frequently collaborate with other teams to speed up the development of new capabilities.
About the Role
We are looking for an experienced Technical Lead to lead critical work on our shared internal inference stack and grow the team. Our inference stack is primarily built by the Applied AI engineering team and we will improve and extend it for research use cases.
In this role, you will:
Get SOTA throughput for our most important research models.
Reduce the time it takes to get efficient inference for new model architectures.
Collaborate closely with Applied AI engineering to maximize the benefits of our shared internal inference stack.
Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think.
You might thrive in this role if you:
Have experience with ML systems, particularly high scale distributed training or inference for modern LLMs.
Have familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.
Have lead large scale engineering projects end-to-end.
Are an expert in core HPC technologies: InfiniBand, MPI, CUDA, OpenAI Triton.
Deep understanding of GPU/hardware accelerators, networking performance and how to maximize multi-device inference throughput (including overlap of compute and communication).
Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to help the team succeed.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Tags: AGI Architecture CUDA Engineering GPU HPC InfiniBand LLMs Machine Learning OpenAI Privacy Research
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Science Intern jobs
- Open Junior Data Scientist jobs
- Open Lead Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Data Science Manager jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Marketing Data Analyst jobs
- Open Principal Data Scientist jobs
- Open Research Scientist jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open MLOps Engineer jobs
- Open Data Analyst Intern jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Engineering Manager jobs
- Open Junior Data Engineer jobs
- Open ETL Developer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Business Intelligence-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open NLP-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open Hadoop-related jobs
- Open CI/CD-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Data warehouse-related jobs