remote
Senior Software Engineer - Semantic Data Lake - WEX
Software Engineer
Lead the design and implementation of a semantic data lake, transforming raw enterprise data into reusable, trusted assets using Python, Java, Spark, and AWS services.
About the role
Key Responsibilities
- Architect and develop scalable data pipelines that ingest, transform, and enrich raw data into semantically rich assets.
- Implement data modeling and ontology design to enable semantic search and analytics across the platform.
- Collaborate with data scientists and business stakeholders to define data quality, governance, and metadata standards.
- Optimize performance of Spark jobs and AWS data services for large‑scale processing.
- Mentor junior engineers and conduct code reviews to maintain high code quality.
Requirements
- 5+ years of software engineering experience with a focus on data platforms.
- Proficiency in Python, Java, and SQL for data processing and API development.
- Hands‑on experience with Apache Spark, AWS Glue, S3, Redshift, and related services.
- Strong understanding of semantic web technologies, ontologies, and metadata management.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
pythonjavasqlawsapache spark