onsite
Data Acquisition Specialist - aaru
Software Engineer
Lead end‑to‑end data acquisition for large‑scale AI simulations, building robust pipelines, cleaning datasets, and integrating diverse data sources to fuel predictive models.
About the role
Key Responsibilities
- Design, develop, and maintain scalable data ingestion pipelines to support AI agent simulations.
- Extract, transform, and load data from varied sources (APIs, databases, scraped web pages), ensuring high quality and consistency.
- Implement data validation, cleansing, and enrichment processes to meet strict accuracy standards.
- Collaborate with data science and engineering teams to define data requirements and optimize data flow.
- Monitor pipeline performance, troubleshoot issues, and continuously improve efficiency.
Requirements
- Proven experience with Python and SQL for data manipulation and pipeline development.
- Hands‑on knowledge of ETL tools and frameworks (e.g., Airflow or dbt).
- Strong understanding of data cleaning, validation, and enrichment techniques.
- Experience integrating APIs and handling large datasets.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.