remote
Senior Software Engineer - Infrastructure Storage - Lambda
Software Engineer
Senior engineer responsible for designing, building, and scaling high‑performance storage infrastructure for AI workloads, leveraging Linux, C++, Go, Kubernetes and cloud services.
About the role
Key Responsibilities
- Design and implement scalable, low‑latency storage solutions that support GPU‑intensive AI training and inference workloads.
- Develop and maintain core storage services in C++ and Go, ensuring reliability and performance at petabyte scale.
- Integrate storage stacks with Kubernetes and other orchestration tools to provide seamless provisioning for compute clusters.
- Collaborate with networking, hardware, and cloud teams to optimize data paths and reduce I/O bottlenecks.
- Implement monitoring, alerting, and automated remediation for storage health and capacity planning.
Requirements
- 5+ years of professional experience building distributed storage or file‑system services on Linux.
- Strong proficiency in C++ and Go, with a solid understanding of systems programming concepts.
- Hands‑on experience with Kubernetes, container runtimes, and cloud platforms such as AWS.
- Deep knowledge of networking protocols, NVMe, RDMA, and high‑throughput I/O architectures.
- Proven ability to troubleshoot complex performance issues and deliver production‑grade solutions.
Skills
cgolinuxkubernetesaws