Remote
Staff AI Engineer - Grafana AI/ML
AI Engineer
Lead the design and implementation of AI/ML-driven agent workflows on AWS and Azure, integrating with Grafana for advanced alerting and observability. Drive end-to-end solutions that scale across cloud environments.
About the role
Key Responsibilities
- Architect and develop AI/ML agent workflows using AWS Agent Frameworks and Azure services, ensuring high reliability and scalability.
- Integrate AI insights into Grafana dashboards, enabling real‑time alerting and proactive incident response.
- Collaborate with cross‑functional teams to define data, model-training, and deployment pipelines.
- Optimize model performance and resource utilization across multi‑cloud environments.
- Mentor junior engineers and lead technical discussions on AI/ML best practices.
Requirements
- 10+ years of software engineering experience with a focus on AI/ML and cloud platforms.
- Deep expertise in AWS (SageMaker, Lambda, Step Functions) and Azure (Machine Learning, Functions, Logic Apps).
- Proven track record of building agent-based systems and integrating them with Grafana for observability.
- Strong knowledge of Python, Docker, Kubernetes, and CI/CD pipelines.
- Excellent communication skills and the ability to explain complex technical concepts to stakeholders.