remote
Principal Engineer - AI Platform - Target
Software Engineer
Lead the design and scaling of a cloud‑native AI platform, driving end‑to‑end ML pipelines, model serving, and data infrastructure using Python, AWS, and Kubernetes.
About the role
Key Responsibilities
- Architect and evolve a production‑grade AI platform that supports data ingestion, model training, and real‑time inference at scale.
- Collaborate with data scientists and product teams to translate ML research into robust, deployable services.
- Design and maintain CI/CD pipelines, monitoring, and observability for AI workloads on AWS and Kubernetes.
- Lead technical mentorship, code reviews, and knowledge sharing across engineering and data science teams.
- Drive performance optimization, cost management, and security best practices for cloud‑based AI services.
Requirements
- 10+ years of software engineering experience with a focus on AI/ML systems.
- Proficiency in Python, AWS services (SageMaker, Lambda, ECS/EKS), and container orchestration with Kubernetes.
- Strong background in data engineering, pipeline design, and large‑scale distributed computing.
- Experience with MLOps tooling, model versioning, and automated testing frameworks.
- Excellent communication skills and a proven ability to lead cross‑functional technical initiatives.
Skills
pythonmachine learningawskubernetes