Cloud Ops / SRE, Applied Machine Learning – 200545286 -Sunnyvale, California, United States

Apple

Apple’s Applied Machine Learning team has built systems for a number of large-scale data science applications. We work on many high-impact projects that serve various Apple lines of business. We use the latest in open source technology and as committers on some of these projects, we are opening up the boundaries. Working with multiple lines of business we manage many streams of Apple-scale data. We bring it all together and spark business value. We do all this with an exceptional group of software engineers, data scientists, dev-ops engineers and managers. We are looking for a hardworking and dedicated engineers to join our team to bring passion for infrastructure and distributed systems, to build world-class platforms/products at a very large scale across cloud environments.

Join Apple’s Applied Machine Learning Team, as a Senior Software Engineer, to build & support innovative software applications. Candidates should have strong background in setting up and supporting the infrastructure for large scale big data applications in public cloud like AWS or GCP.

THE MAIN RESPONSIBILITIES FOR THIS POSITION INCLUDE:
Build & Support CI/CD tools to port & manage applications on AWS/GCP & Kubernetes

Ability to understand the application requirements (Performance, Security, Scalability etc.) and assess the right services/topology on AWS/GCP & Kubernetes

Deploy & Support applications onto Kubernetes based environments – On-prim K8s/AWS EKS/GCP GKE.

Build automation to enable self-healing systems

Build tools to monitor high performance & alert the low latency applications on AWS/GCP

Ability to troubleshoot application specific, core network, system & performance issues.

Involvement in challenging and fast paced projects supporting Apple’s business by delivering innovative solutions.

Monitor production, staging, test and development environments for a myriad of applications in an agile and dynamic organization.

The candidate is expected to be self-motivated, proactive, and a solution-oriented individual.

5+ years of experience in SRE/DevOpsBachelors with 4+ years or MS plus 2+ years experience or related experience.Strong programming skills in Unix & PythonExtensive experience in managing the applications on AWS/GCP & KubernetesStrong Experience in Infrastructure templating tools like CloudFormation, TerraformStrong proficiency with Helm or Kustomize for managing Kubernetes applications and configurations.

 

Job Overview