remote
Quality Assurance Engineer - Anika Systems
QA Engineer
Quality Assurance Engineer focused on automated testing of ETL pipelines and data platforms, leveraging Python, SQL, and cloud technologies to validate Apache Iceberg and XBRL datasets, ensuring data integrity and performance across enterprise architectures.
About the role
- Design, develop, and maintain automated QA frameworks for data pipelines, APIs, and analytics platforms using Python and SQL.
- Build reusable testing utilities for data validation, regression testing, and pipeline certification.
- Integrate automated tests into CI/CD pipelines to support continuous testing and deployment.
- Develop unit, integration, and end-to-end test cases for complex data workflows.
- Leverage AI-assisted testing tools to generate test cases, identify edge cases, and improve test coverage.
- Validate ETL/ELT pipelines to ensure accurate ingestion, transformation, and delivery of data.
- Create automated checks for data completeness, consistency, accuracy, and timeliness.
- Test ingestion and transformation of complex datasets, including XBRL financial data.
- Implement reconciliation and audit mechanisms across source-to-target mappings.
- Apply AI-driven anomaly detection to identify data quality issues and pipeline failures.
- Schema evolution validation
- Time travel and versioning accuracy
- Partitioning and performance behavior
- Query performance benchmarking
- Data freshness and latency
- Storage efficiency and maintenance overhead
- Ensure alignment between precomputed datasets (materialized views) and underlying source data.
- Implement automated validation for data quality rules, lineage, and metadata accuracy.
- Support context engineering by validating that datasets include proper business context, definitions, and relationships.
- Integrate QA processes with enterprise data catalogs and metadata systems to ensure discoverability and trust.
- Validate AI-generated metadata, lineage, and transformations for accuracy and traceability.
- Apply AI/ML and generative AI tools to enhance QA processes, including intelligent test generation, defect prediction, and automated root cause analysis.
- Validate data readiness for AI/ML and generative AI use cases, ensuring datasets meet quality, completeness, and governance standards.
- Collaborate with data and AI teams to test data pipelines supporting RAG, analytics, and machine learning workflows.
- Ensure alignment with responsible AI practices, including traceability, explainability, and data integrity.
- Support enterprise data management programs and OCDO initiatives by ensuring data quality and reliability across systems.
- Contribute to data maturity assessments by evaluating data quality, testing coverage, and governance adherence.
- Align QA processes with Federal Data Strategy and Evidence Act requirements.
- Work closely with data engineers, data architects, and analysts to define test strategies and acceptance criteria.
- Participate in stakeholder engagement sessions and listening campaigns to understand data quality expectations and pain points.
- Document test results, defect