Remote
Senior AI Data Engineer - Accertify, Inc.
Data Engineer
Lead the design and implementation of scalable AI‑driven data pipelines, leveraging Python, Spark, and AWS to process billions of transaction records for fraud detection and predictive analytics.
About the role
Key Responsibilities
- Architect, build, and maintain high‑performance data pipelines that ingest, transform, and store petabyte‑scale transaction data.
- Develop and operationalize machine‑learning models for real‑time fraud detection and risk scoring.
- Collaborate with data scientists, product managers, and engineering teams to translate business requirements into robust data solutions.
- Implement data quality, monitoring, and alerting frameworks to ensure reliability and compliance.
- Optimize workloads on AWS using services such as S3, Redshift, EMR, and Lambda.
Requirements
- 5+ years of experience in data engineering, with strong proficiency in Python and SQL.
- Hands‑on expertise with Apache Spark (or similar distributed processing frameworks) and building ETL pipelines.
- Deep understanding of cloud infrastructure, particularly AWS services for storage, compute, and orchestration.
- Experience deploying and scaling machine‑learning models in production environments.
- Solid grasp of data modeling, schema design, and performance tuning for large‑scale analytical workloads.
Skills
Python, SQL, Apache Spark, AWS, Machine Learning