onsite
Software Engineer, Data Infrastructure & Acquisition - Miami, FL, USA - Speechify
Software Engineer
Lead the design and implementation of scalable data pipelines and infrastructure to support Speechify’s text‑to‑speech services, leveraging Python, AWS, and modern data engineering tools to ingest, transform, and serve large volumes of content efficiently.
About the role
Key Responsibilities
- Design, build, and maintain robust data pipelines that ingest content from diverse sources (PDFs, web pages, documents) into the Speechify ecosystem.
- Implement scalable ETL processes using Python, Apache Spark, and AWS services (S3, Glue, Redshift).
- Collaborate with product and engineering teams to define data models, optimize query performance, and ensure data quality.
- Automate workflow orchestration with Airflow, monitoring pipeline health and troubleshooting failures.
- Develop and maintain documentation for data architecture, pipeline logic, and best practices.
Requirements
- 3+ years of experience in data engineering or related roles.
- Hands‑on experience with Airflow or equivalent workflow orchestration tools.
- Excellent problem‑solving skills and a passion for building reliable, high‑performance data systems.
Skills
Python, AWS, SQL, Apache Spark, Airflow