Remote
Senior Software Engineer, Data Infrastructure - Roblox
Software Engineer
Lead the design and implementation of large‑scale data pipelines and infrastructure, leveraging Python, Scala, Spark, Hadoop, and AWS to support millions of daily users on a global platform.
About the role
Key Responsibilities
- Architect, develop, and maintain high‑throughput data pipelines that ingest, transform, and store terabytes of data daily.
- Collaborate with data scientists and product teams to define data models, schemas, and performance requirements.
- Optimize existing workflows for scalability, reliability, and cost efficiency across AWS services (EMR, S3, Redshift).
- Implement monitoring, alerting, and automated testing to ensure data quality and system resilience.
- Mentor junior engineers and drive best practices in coding, documentation, and DevOps.
Requirements
- 5+ years of experience building production data pipelines in a cloud environment.
- Proficiency in Python and Scala, with deep knowledge of the Apache Spark and Hadoop ecosystems.
- Hands‑on experience with AWS data services (EMR, S3, Redshift, Glue).
- Strong understanding of distributed systems, fault tolerance, and performance tuning.
- Excellent communication skills and a collaborative mindset.
Skills
Python, Scala, Apache Spark, Hadoop, AWS