onsite
Data Scientist IV - Kaiser Permanente
Data Scientist
Individual contributor designing and building data pipelines, cleansing and transforming raw data, and developing machine‑learning models to generate actionable insights for client‑focused analytics.
About the role
Key Responsibilities
- Design, develop, and maintain scalable data pipelines that ingest, cleanse, and store raw data from diverse sources and formats.
- Transform and engineer features for use in statistical and machine‑learning models, ensuring data quality and reproducibility.
- Formulate problem statements, define hypotheses, and conduct exploratory analysis on complex datasets to uncover key patterns.
- Train, evaluate, and deploy predictive models, monitoring performance and iterating to improve accuracy and reliability.
- Collaborate with cross‑functional teams to translate analytical findings into actionable recommendations for target customers.
Requirements
- Strong proficiency in Python and SQL for data manipulation, analysis, and pipeline development.
- Experience building ETL processes and data‑engineering solutions in a cloud or on‑prem environment.
- Hands‑on expertise with machine‑learning frameworks and statistical modeling techniques.
- Ability to communicate complex analytical results clearly to both technical and non‑technical stakeholders.
- Bachelor’s or higher degree in Computer Science, Statistics, Engineering, or a related quantitative field.
Skills
pythonsqlmachine learning