Data Engineer II, RISC – USA, WA, Seattle

Amazon

  • Full Time

DESCRIPTION

Amazon Regulatory Intelligence Safety and Risk (RISC) team mission is to protect customers from products that are unsafe, illegal, illegally marketed, controversial or otherwise in violation of Amazon’s policies while enabling our Selling Partners to offer their broadest selection of safe and compliant products. We achieve these objectives worldwide by: (1) taking a science-first approach to offer trustworthy listings to our customers, (2) inventing intuitive and precise tools to simplify our selling partners’ compliance journey and (3) innovating to reduce our cost to serve.

The RISC Data Engineering team is seeking an experienced Data Engineer with solid engineering skills and machine learning background (MLOps) to join our team. In this role, you will be responsible for designing, building, and maintaining large scale robust data pipelines and infrastructure to empower our machine learning, data science and analytics initiatives. You will collaborate closely with Applied Scientists, Machine Learning Scientists, and business stakeholders to understand their requirements and support cutting-edge AI/ML solutions. Join our expert team to build scalable data solutions, improving Amazon business efficiency and simplifying our selling partners’ compliance journey.

Key job responsibilities
1. Design, build, and maintain scalable, fault-tolerant, and efficient data pipelines and infrastructure for machine learning operations (MLOps) leveraging AWS technologies such as Lambda, Glue, EMR/Spark, Step Functions, Airflow, DynamoDB and AWS Batch.
2. Automate infrastructure deployment, maintenance processes, and incorporate CI/CD principles to streamline the MLOps ecosystem, using AWS services and scripting languages like Python or Scala.
3. Develop optimized data models, ETL/ELT processes, data transformations, and data warehouse to ensure high-quality, well-structured data for ML and analytics, using S3, Redshift, Glue, Athena and Lake Formation.
4. Collaborate closely with Applied Scientists, Machine Learning Scientists, and analytics teams to understand data requirements, and provide scalable data solutions.
5. Continuously monitor, optimize, and enhance data pipelines, processes, and infrastructure to support ML and analytics.
6. Implement and enforce rigorous data governance, security, and compliance standards for our data, including data validation, cleansing, and lineage tracking.
7. Mentor junior engineers, promoting best practices and knowledge sharing in data engineering and MLOps.
8. Stay updated with emerging MLOps technologies, tools, and trends, incorporating them into the existing ecosystem for continuous improvement.

About the team
Who Are We
We are a team of scientists and engineers building AI/ML and data solutions to improve Amazon business efficiency and simplify our selling partners’ compliance journey.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

BASIC QUALIFICATIONS

– Bachelor’s degree in computer science, engineering, mathematics, statistics or a related field
– 3+ years of data engineering experience
– Experience with ML
– Experience with data modeling, warehousing and building ETL pipelines
– Knowledge of distributed systems
– Knowledge of professional software engineering & best practices for full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence

PREFERRED QUALIFICATIONS

– Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, Step Functions, Airflow, DynamoDB and AWS Batch, SageMaker, IAM roles and permissions
– Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
– Experience with advanced ML system design, implementation and maintenance
– Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
– Strong problem-solving and engineering skills, with the ability to translate business requirements into technical solutions

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $118,900/year in our lowest geographic market up to $205,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.

 

USA, WA, Seattle