Job Description
We're seeking an exceptional Data Engineer to join our Risk and Compliance Solutions (RCS) Data Engineering team, where you'll architect and build data systems that power Amazon's compliance operations. This role combines technical expertise with business acumen to transform complex data into actionable insights that protect Amazon's ecosystem of buyers, brands, and sellers.
Your work will directly influence Amazon's compliance framework and risk management capabilities, ensuring the company's continued growth while maintaining regulatory compliance. You'll be instrumental in building the next generation of data-driven compliance tools that protect Amazon's global marketplace. Experience with real-time data processing, high-throughput systems, and end-to-end platform development, along with knowledge of modern data engineering tools and technologies, is essential. The ideal candidate combines technical excellence with strategic thinking, bringing both the ability to architect complex systems and the vision to drive innovation in compliance technology.

Core Responsibilities:
Contribute to the architecture, design, and implementation of next-generation BI solutions, including streaming data applications
Manage AWS resources, including EC2, RDS, Redshift, Kinesis, EMR, and Lambda
Collaborate with data scientists, BIEs, and BAs to deliver high-quality data architecture and pipelines
Interface with other technology teams to extract, transform, and load data from a wide variety of data sources
Continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for customers

Basic Qualifications:
Bachelor's degree in computer science, engineering, mathematics, or a related technical discipline
Industry experience in software development, data engineering, business intelligence, data science, or a related field, with a track record of manipulating, processing, and extracting value from large datasets
Experience using big data technologies (Hadoop, Hive, HBase, Spark, EMR, etc.)
Experience working with AWS big data technologies (EMR, Redshift, S3, AWS Glue, Kinesis, and Lambda for serverless ETL)
Knowledge of data management fundamentals and data storage principles
Knowledge of distributed systems as they pertain to data storage and computing
Hands-on experience with and advanced knowledge of SQL
Basic scripting skills using Python and Scala
Basic understanding of machine learning

Design and implement scalable data infrastructure supporting RCS's compliance and risk management initiatives
Develop robust data pipelines and analytics processes that enable real-time decision-making
Collaborate with compliance officers, software engineers, and product managers to deliver reliable data solutions
Lead technical initiatives and mentor team members in best practices for data engineering
Create automated systems to replace manual processes and support Amazon's global expansion

3+ years of data engineering experience
Experience with data modeling, warehousing, and building ETL pipelines
Experience with SQL
Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)