onsite
Hadoop Platform Support Engineer - InfiCare Technologies
Software Engineer
Provide L3 support and maintenance for a large‑scale Hadoop environment, ensuring high availability, performance tuning, and issue resolution for HDFS and related ecosystem components.
About the role
Key Responsibilities
- Monitor, troubleshoot, and resolve production incidents across the Hadoop cluster, focusing on HDFS health and data integrity.
- Perform performance tuning, capacity planning, and routine maintenance to keep the platform running at optimal efficiency.
- Collaborate with data engineering and analytics teams to support job execution, data pipelines, and ecosystem tools such as Hive, Impala, and Spark.
- Develop and maintain automation scripts (Bash/Python) for routine operational tasks, backups, and recovery procedures.
- Document incidents, root‑cause analyses, and standard operating procedures to improve knowledge sharing and reduce mean time to resolution.
Requirements
- 3+ years of hands‑on experience supporting Hadoop clusters (HDFS, YARN) in a production environment.
- Strong Linux system administration skills and proficiency in shell scripting.
- Solid understanding of Hadoop ecosystem components (e.g., Hive, Impala, Spark) and data ingestion frameworks.
- Experience with Cloudera or Hortonworks distributions and related management tools.
- Excellent problem‑solving abilities, with a track record of diagnosing complex platform issues and implementing effective fixes.