onsite
Data Scientist - University of California, San Francisco
Data Scientist focused on computational biology, building machine learning models, managing large-scale datasets, and maintaining cloud and on‑prem infrastructure for research studies.
About the role
Key Responsibilities
- Design, develop, and deploy machine learning and statistical models to analyze biological data.
- Build and maintain scalable databases, ensuring efficient data retrieval and integration.
- Collaborate with researchers to design experiments and translate end‑user requirements into computational solutions.
- Manage cloud (AWS) and on‑premises computational resources, optimizing performance and cost.
- Integrate public and proprietary bioinformatics databases, ensuring data quality and accessibility.
Requirements
- Proficiency in Python and R for data analysis and model development.
- Experience with machine learning frameworks (scikit‑learn, TensorFlow, PyTorch) and statistical methods.
- Strong SQL skills and familiarity with database design and optimization.
- Hands‑on experience with AWS services (EC2, S3, EMR) and containerization (Docker).
- Background in bioinformatics or computational biology preferred.
Skills
Python, machine learning, SQL, AWS, Docker