remote
Senior Software Engineer Data Platform - Maropost
Software Engineer
Senior Software Engineer driving the design and implementation of a scalable data platform using Python, Java, Spark, and AWS to enable real‑time analytics and robust data pipelines for a fast‑growing commerce ecosystem.
About the role
Key Responsibilities
- Architect, develop, and maintain high‑performance data pipelines and services that ingest, process, and store petabyte‑scale ecommerce and marketing data.
- Collaborate with product, analytics, and ML teams to define data models, APIs, and schema evolution strategies.
- Implement real‑time streaming solutions using Kafka and Spark Structured Streaming to support low‑latency analytics and personalization.
- Optimize data storage and query performance on AWS services (S3, Redshift, Athena) and relational databases.
- Lead code reviews, mentor junior engineers, and champion best practices for testing, CI/CD, and observability.
Requirements
- 5+ years of professional software engineering experience building data platforms or large‑scale backend systems.
- Strong proficiency in Python and Java, with hands‑on experience in Apache Spark or similar distributed processing frameworks.
- Deep understanding of AWS ecosystem (S3, EC2, Lambda, Redshift, Glue) and infrastructure‑as‑code concepts.
- Experience designing and operating streaming pipelines using Kafka, Kinesis, or comparable technologies.
- Solid SQL skills and ability to design efficient data models for analytical workloads.
Skills
pythonjavaapache sparkawskafkasql