onsite
Senior Performance Database Engineer - Caris Life Sciences
Software Engineer
Lead the design, optimization, and scaling of high‑performance PostgreSQL databases for precision medicine data, ensuring fast query response, robust data integrity, and seamless integration with AI pipelines.
About the role
Key Responsibilities
- Architect and maintain large‑scale PostgreSQL databases, focusing on performance, reliability, and scalability for genomics and clinical data.
- Design and implement efficient data models, indexing strategies, and partitioning schemes to support complex analytical workloads.
- Develop and optimize ETL pipelines using Python and SQL, ensuring data quality and timely ingestion from diverse sources.
- Collaborate with data scientists and ML engineers to provide low‑latency access to training and inference data.
- Monitor database health, analyze query performance, and implement tuning solutions (e.g., vacuum, autovacuum, query rewrite).
- Document architecture, best practices, and troubleshooting guides for internal teams.
Requirements
- 5+ years of experience with PostgreSQL in a production environment.
- Deep understanding of database internals, query optimization, and performance monitoring tools.
- Experience with large genomic or clinical datasets and related data pipelines.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
sqlpostgresqlpythonaws