onsite
Senior Staff Engineer, Platform Infrastructure
Epic Games is seeking a Senior Staff Engineer, Platform Infrastructure to design, build, and maintain the core, scalable infrastructure powering Epic's games and services. This role involves leading architecture and development using cloud-native technologies, driving best practices for observability and reliability, and mentoring other engineers.
About the Role
Epic Games is looking for a passionate and experienced Senior Staff Engineer, Platform Infrastructure to join our team. In this role, you will be instrumental in designing, building, and maintaining the core infrastructure that powers Epic's games and services. You will work on highly scalable, fault-tolerant, and secure systems, directly impacting the experience of millions of players worldwide.
What You'll Do
- Design and implement highly scalable, reliable, and secure cloud-native infrastructure solutions.
- Lead the architecture and development of core platform services using technologies like Kubernetes, Docker, and various cloud platforms (AWS, Azure, GCP).
- Collaborate with engineering teams across Epic to understand their infrastructure needs and provide robust, scalable solutions.
- Drive best practices for observability (metrics, logging, tracing), incident response, and post-mortems.
- Mentor junior and mid-level engineers, fostering a culture of technical excellence and continuous improvement.
- Identify and address performance bottlenecks and operational challenges in a complex distributed environment.
- Participate in on-call rotations and contribute to the overall reliability and stability of our platform.
- Research and evaluate new technologies to improve our infrastructure and development workflows.
What We're Looking For
- Extensive, Senior Staff-level experience designing, building, and operating large-scale distributed systems.
- Deep expertise with cloud platforms such as AWS, Azure, or GCP.
- Strong proficiency in containerization technologies like Docker and orchestration with Kubernetes.
- Solid programming skills in at least one of the following: Go, Python, C++, Java, or C#.
- Experience with infrastructure-as-code tools and CI/CD pipelines.
- Familiarity with Linux operating systems and command-line tools.
- Proven track record of driving significant technical projects from conception to deployment.
- Experience with observability tools for metrics, logging, and tracing.
- Excellent communication and collaboration skills, with the ability to influence and lead technical discussions.
- A passion for mentoring and guiding other engineers.
- Strong problem-solving abilities and a proactive approach to identifying and resolving issues.
- Experience with chaos engineering and security best practices for cloud infrastructure is a plus.