remote
IT Operations Support Engineer - Vivienne Court
Software Engineer
Support and maintain high‑availability trading infrastructure using Linux, AWS, and monitoring tools, ensuring rapid incident resolution and continuous improvement of operational processes.
About the role
Key Responsibilities
- Maintain and troubleshoot production Linux servers and AWS infrastructure supporting real‑time trading systems.
- Implement and manage monitoring solutions (Prometheus, Grafana, CloudWatch) to detect and resolve performance issues proactively.
- Develop and maintain automation scripts in Python and Bash for deployment, configuration, and incident response.
- Participate in on‑call rotations, diagnosing and resolving incidents with minimal downtime.
- Collaborate with engineering teams to design and implement infrastructure improvements and best practices.
Requirements
- 3+ years of experience in IT operations or system administration in a high‑frequency trading or financial services environment.
- Strong proficiency with Linux (Ubuntu/RHEL) and AWS services (EC2, S3, CloudWatch).
- Hands‑on experience with monitoring and alerting tools such as Prometheus, Grafana, or Datadog.
- Solid scripting skills in Python and Bash for automation and troubleshooting.
- Excellent problem‑solving skills, ability to work under pressure, and strong communication abilities.