Overview
Group 42 is an Abu Dhabi based artificial intelligence (AI) and cloud computing company, uniquely positioned in the national ecosystem to develop and deploy holistic and scalable AI solutions. G42 Healthcare is committed to developing a world-class, sustainable healthcare sector in the UAE and wider region. At the forefront in the battle against the pandemic, G42 Healthcare partnered with Abu Dhabi authorities to develop a massive throughput laboratory in 14 days and spearheaded the world’s first Phase 3 clinical trial of COVID-19 inactivated vaccine. Beyond Covid-19, G42 Healthcare is also developing a program of activities to support the health of future generations – ranging from genomics, imaging and diagnostics to digitization programs, manufacturing and cutting-edge research.
Responsibilities
- Define the architecture, scope and deliver various Big Data solutions.
- Build and maintain large scale deployment of data lakes & power analytics
- Build and maintain large scale deployments of elastic search
- Should be able to make healthcare data conversion to multiple formats with compliances in place
- Support other teams by providing guidance on data modelling, data usage, processing and how they can best leverage the platform
- Build scalable data pipelines to ingest data from a variety of data sources, identify critical data elements and define data quality rules.
- Leverage Spark/Hadoop ecosystem knowledge to design and develop capabilities to deliver innovative and improved data solutions.
- Provide insights on area of improvements including Data Governance, best practices, large scale processing
- Support the bug fixing and performance analysis along the data pipeline
- Collaborate, coach, and mentor colleagues in an energetic, growing team
- Complete end-to-end ownership from requirements to go-live
- Bracing ambiguity and prioritizing right items
- Managing stakeholders and driving business goals
- Willing to take challenges and step out of comfort zone
Qualifications
- 4+ years of experience as software engineer, with strong skills in at least one programming language is mandatory, preferably Scala or Java or Python
- 1+ years of experience with MLOps
- 1+ year of experience as a Big Data Engineer or similar role
- 1+ year of experience with Hadoop and/or Spark
- Expertise with distributed systems and design/implementation for reliability, availability, scalability and performance
- Proven experience with Cloud technologies like Object Blob storage, Map reduce, Infrastructure as code.
- Creative and innovative approach to problem-solving
- Experience with CICD using Jenkins, Terraform or other related technologies