Software Engineer, Machine Learning Infrastructure, Tesla Bot – Palo Alto, California

Tesla

What to Expect
As a Software Engineer for the Tesla Bot team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data, assist with exporting and deploying neural networks to the bot, and evaluate experimental results. You will help us automate the entire workflow of training, validation, and production of the Tesla Bot. Most importantly, you will see your work repeatedly shipped to and utilized by thousands of humanoid robots in real-world applications. You should have practical experience programming in Python and C++.
What You’ll Do
Build and improve our AI infrastructure for faster, more stable training
Build the tooling and infrastructure for reporting and visualizing model metrics and performance 
Build the pipelines to run and validate our PyTorch models 
Manage, analyze, and visualize our training and test datasets 
Coordinate with the team managing the hardware cluster to maintain high availability and job throughput for machine learning
Build and improve tooling to deploy trained neural nets to Tesla hardware 
What You’ll Bring
Proficiency in system-level software, particularly hardware-software interactions and resource utilization
Understanding of modern machine learning concepts and state-of-the-art deep learning
Experience working with training frameworks, ideally PyTorch
Demonstrated experience scaling neural network training jobs across a cluster of GPUs
Experience in deep learning deployment or profiling/optimizing CPU-GPU interactions preferred

Palo Alto, California

Full time
