remote
Sr. Developer - Thermo Fisher Scientific
Software Engineer
Senior developer leading the design and implementation of scalable, distributed data pipelines on AWS, leveraging PySpark, Python, and SQL, while architecting robust security and governance for Databricks environments.
About the role
Key Responsibilities
- Design, build, and maintain scalable, distributed data pipelines using AWS services such as S3, Redshift, Glue, Lambda, EMR, Athena, and Kinesis.
- Develop transformation logic in PySpark, Python, and SQL to support data ingestion, processing, and analytics.
- Architect and lead the implementation of a comprehensive security framework for the Databricks platform, covering IAM, data governance, network security, encryption, and audit controls.
- Collaborate with cross‑functional teams to define data architecture, performance tuning, and best practices for data engineering.
- Mentor junior engineers, conduct code reviews, and promote a culture of continuous improvement and high quality.
Requirements
- 5+ years of experience in data engineering and cloud architecture.
- Proficiency with AWS services (S3, Redshift, Glue, Lambda, EMR, Athena, Kinesis) and Databricks.
- Strong programming skills in Python and PySpark, with solid SQL knowledge.
- Hands‑on experience designing security and governance frameworks, including IAM and data encryption.
- Excellent communication skills and ability to work collaboratively in a fast‑paced environment.
Skills
awspythonsqldatabricksiam