Remote
Systems Operations Manager - Data Platforms Teradata & Hadoop - Wells Fargo
Systems Engineer
About the role
Lead end‑to‑end operations for a large‑scale, multi‑tenant Teradata and Hadoop environment, ensuring platform stability, reliability, and continuous improvement through SRE practices and advanced monitoring.
Key Responsibilities
- Oversee 24x7 operations of Teradata and Hadoop clusters, ensuring high availability and performance for 100+ tenants.
- Lead a cross‑functional SRE team responsible for automation, incident response, and capacity planning.
- Design and enforce monitoring, alerting, and logging strategies using industry‑standard tools.
- Collaborate with data engineering, security, and compliance teams to maintain data integrity and regulatory adherence.
- Drive continuous improvement initiatives, including performance tuning, cost optimization, and process standardization.
Requirements
- 5+ years of experience managing enterprise Teradata and Hadoop environments.
- Strong background in Linux system administration, SQL, and SRE principles.
- Proficiency with monitoring/alerting platforms (e.g., Prometheus, Grafana, Splunk).
- Excellent communication skills and ability to lead a distributed team.
- Experience with cloud platforms (AWS, Azure, or GCP) is a plus.