About the Role
As an Associate AI Data Engineer / AI Data Engineer at SAP Labs Singapore, you will join a dedicated team of AI scientists, AI developers, DevOps engineers, and architects. Your primary goal will be to build best-in-class AI products by pushing the boundaries of artificial intelligence and machine learning. You will be responsible for handling multi-terabyte datasets, developing scalable data processing pipelines, and ensuring the quality and integrity of the data used in AI systems.
Your Future Team
The Artificial Intelligence team within SAP Labs Singapore consists of experienced professionals tackling challenges for SAP customers. Based in Singapore, you will work on breakthrough projects addressing real-world problems and collaborate with global experts as part of the product engineering team. SAP Labs Singapore provides an open, modern working environment and fosters global community engagement.
Expectations and Tasks
- Push the frontiers of what is possible in the area of artificial intelligence and machine learning to create new solutions.
- Explore, understand, and implement the most recent technologies and approaches for data pre- and post-processing pipelines.
- Comfortably handle multi-terabyte datasets in scale-up and scale-out environments.
- Support machine learning research and project teams in all data-related matters.
- Build scalable and reusable data processing pipelines.
- Work on data generators to enrich datasets.
- Set up and manage annotation projects with annotation partner companies.
- Work with domain experts and AI scientists to define annotation guidelines and drive annotation efforts towards high-quality data.
- Coordinate and support data auditing efforts.
- Coordinate and support data acquisition activities.
- Integrate, validate, organize, improve, and curate datasets, including metadata (e.g., data versioning, cleaning, and storage).
- Work closely with the team, customers, and partners to holistically understand business and user requirements and derive appropriate application development concepts.
- Work in global teams across different time zones.
- Immerse yourself quickly in new topics, terminology, and development tasks.
Required Education and Experience
- Degree in Computer Science or a related field.
- 0 to 3 years of working experience as a software developer or data engineer in AI-focused application development.
- A solid foundation in computer science, with strong competencies in algorithms, data structures, object-oriented programming, design patterns, multi-threaded programming, and software design principles.
- Hands-on knowledge of at least one server-side programming language, such as Java, Scala, Go, Python, C#, or C++.
- Basic knowledge of cloud platforms and their configurations (AWS, Azure, and GCP).
- Familiarity with Agile/Scrum methodologies.
- Strong desire to overcome obstacles and make your work benefit SAP's customers.
- Excellent written and verbal communication skills in English.
- Able to commence in January 2024.
Optional Skills
- Operational knowledge of Linux distributions such as Red Hat, Ubuntu, or SUSE.
- Knowledge of container technologies such as Docker and Kubernetes.
- Knowledge of Machine Learning fundamentals.