remote
Data Scientist, Traffic Quality - Amazon.com
Data Scientist
Data Scientist focused on detecting and mitigating invalid traffic at petabyte scale, leveraging Python, Machine Learning, and AWS to protect advertiser trust across Amazon’s multi‑surface advertising ecosystem.
About the role
Key Responsibilities
- Design and implement scalable ML models to identify sophisticated invalid traffic patterns across desktop, mobile, and connected TV platforms.
- Develop and maintain data pipelines in AWS (S3, Redshift, Glue) to ingest, process, and analyze petabyte‑scale clickstream data.
- Collaborate with cross‑functional teams to define metrics, dashboards, and reporting tools that monitor traffic quality in real time.
- Experiment with novel feature engineering techniques and model architectures to improve detection accuracy and reduce false positives.
- Document methodologies, model performance, and operational insights for internal stakeholders and external partners.
Requirements
- 5+ years of experience in data science or machine learning, with a strong focus on large‑scale data processing.
- Proficiency in Python, SQL, and AWS services (S3, Redshift, Glue, SageMaker).
- Hands‑on experience building production‑grade ML pipelines and deploying models at scale.
- Deep understanding of traffic analytics, fraud detection, and anomaly detection techniques.
- Excellent communication skills and ability to translate complex technical concepts to non‑technical audiences.
Skills
pythonmachine learningawssql