remote
Data Scientist - Bright Star Solutions LLC
Data Scientist role focused on advancing data management and governance for the Department of State’s Bureau of Diplomatic Technology, leveraging Python, Databricks, and machine learning to deliver actionable insights and robust data solutions.
About the role
Key Responsibilities
- Design, develop, and maintain scalable data pipelines using Databricks and Python to support analytics and reporting initiatives.
- Implement machine learning models to extract insights from large, complex datasets, ensuring reproducibility and performance.
- Collaborate with cross‑functional teams to define data governance standards, data quality metrics, and metadata management practices.
- Analyze and document data flows, perform root‑cause analysis for data anomalies, and recommend remediation strategies.
- Provide technical guidance and mentorship to junior analysts and data engineers on best practices in data science and engineering.
Requirements
- Proven experience with Python and its data science libraries (pandas, scikit‑learn, PySpark).
- Hands‑on expertise with Databricks, Delta Lake, and Spark SQL.
- Strong foundation in machine learning concepts, model deployment, and evaluation.
- Solid understanding of data governance principles, data quality frameworks, and metadata management.
- Excellent analytical, problem‑solving, and communication skills.
Skills
Python, Databricks, machine learning, SQL