onsite
Platform Engineer, APIs & Observability - StackAI
DevOps Engineer
As a Platform Engineer, you will build and maintain the public API and observability stack for a no-code AI workflow platform, using AWS, Python, and OpenTelemetry to deliver reliable, scalable services.
About the role
Key Responsibilities
- Design, develop, and maintain the public REST/GraphQL API that powers the no-code AI workflow platform.
- Implement and evolve observability solutions (metrics, logs, traces) using OpenTelemetry and AWS CloudWatch to provide deep insight into API performance and customer usage.
- Collaborate with product and engineering teams to define API contracts, versioning strategies, and backward-compatibility guarantees.
- Automate deployment pipelines, monitor health, and troubleshoot incidents to ensure high availability and low latency.
- Drive continuous improvement of API security, rate limiting, and documentation for internal and external developers.
Requirements
- 3+ years of experience building production-grade APIs in Python or a comparable language.
- Hands-on experience with observability tools (OpenTelemetry, Prometheus, Grafana, AWS CloudWatch).
- Strong understanding of REST/GraphQL design principles and API versioning.
- Proficiency with AWS services (Lambda, API Gateway, DynamoDB, CloudWatch).
- Excellent problem-solving skills and a collaborative mindset.