Software Engineer
Data Processing Engineer focused on building a high‑performance engine for large‑scale analytics and GenAI preprocessing, leveraging Spark, Flink, Ray, distributed systems, query optimization, and hardware‑accelerated computing in a cloud‑native environment.
Data Processing Engineer - I/O
Mountain View, CA / Hyderabad, IN / Remote
About DataPelago:
DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. We are building an innovative data processing engine that is transforming how Apache Spark, Apache Flink, Ray, and others operate on diverse, large-scale data. Our team of engineers drives and adopts advances in hardware-accelerated computing, parallel processing of large-scale data, query optimization, distributed systems, compilers, machine learning, and cloud-native computing. We are looking for specialists to join our engineering team and shape the future of accelerated data processing.
The Opportunity:
As a Data Processing Engineer - I/O, you will be a key individual contributor in advancing the data read and write capabilities of DataPelago's data processing engine. You will enhance the functional breadth, performance, scale, and reliability of the DataPelago engine in reading and writing large-scale data of various types across diverse data sources and data sinks. This is a unique opportunity to make a significant impact on a category-defining product and work with a talented team of engineers.
What You'll Do:
Apache Parquet, Apache Iceberg) and identify opportunities for our engine to enhance technology and product leadership.
What You'll Bring:
experience.
ORC, Apache Iceberg, Apache Spark, and similar technol
Posted June 25, 2026