onsite
Principal Software Engineer, Agent Policy Fabric - NVIDIA
Software Engineer
Lead the design and implementation of a cloud‑native governance platform for autonomous agents, delivering signed policies, runtime verification, and audit capabilities using C++, Python, Kubernetes, and gRPC.
About the role
Key Responsibilities
- Architect and build the core Agent Policy Fabric platform, enabling signed policy enforcement, runtime verification, and audit across heterogeneous runtime environments.
- Design and implement high‑performance services in C++ and Python, leveraging gRPC for inter‑service communication and Kubernetes for scalable deployment.
- Integrate security primitives such as credential mediation, detector verdict handling, and policy projection to ensure robust governance of agentic systems.
- Collaborate with cross‑functional teams to define APIs, data models, and observability standards, driving consistency across enterprise integrations.
- Establish CI/CD pipelines, automated testing, and monitoring frameworks to maintain platform reliability and rapid iteration.
Requirements
- 10+ years of software engineering experience with strong expertise in C++ and Python.
- Deep knowledge of cloud‑native technologies, including Kubernetes, container orchestration, and micro‑service architectures.
- Proven experience building secure, distributed systems with gRPC or similar RPC frameworks.
- Solid understanding of security concepts such as credential management, policy signing, and runtime verification.
- Track record of leading complex, large‑scale platform projects from proof‑of‑concept to production readiness.
Skills
cpythonkubernetesgrpccicd