onsite
Data Architect - CMP Techsseract LLP
Software Engineer
Senior Data Architect with 5‑8 years of experience designing scalable data solutions using Python, PySpark, and Apache Spark. Expert in relational and NoSQL databases, data governance, and CI/CD pipelines within Agile environments.
About the role
Key Responsibilities
- Design, develop, and maintain enterprise data architectures that support analytics, reporting, and data science initiatives.
- Implement data pipelines using PySpark and Apache Spark, ensuring high performance and fault tolerance.
- Integrate and optimize relational and NoSQL databases, managing schema design, indexing, and data modeling.
- Define and enforce data governance, security, and compliance policies across the data ecosystem.
- Collaborate with DevOps to build CI/CD workflows for data pipelines and infrastructure as code.
- Lead Agile ceremonies, mentor junior team members, and drive continuous improvement in data processes.
Requirements
- 5–8 years of professional experience in data architecture and engineering.
- Hands‑on experience with CI/CD tools, Git, and Agile methodologies.
- Excellent problem‑solving skills and ability to communicate complex concepts to cross‑functional teams.
Skills
pythonapache sparkcicd