remote
Data Scientist / Data Architect - Data Direct Networks
Data Scientist
Lead the design and deployment of AI‑driven data pipelines, leveraging Python, Machine Learning, and Big Data technologies to power high‑performance storage solutions for demanding AI workloads.
About the role
Key Responsibilities
- Design, develop, and maintain scalable data pipelines and architectures that support AI and machine learning workloads.
- Implement advanced analytics and ML models using Python, Spark, and SQL to extract actionable insights from large datasets.
- Collaborate with cross‑functional teams to define data requirements, optimize storage solutions, and ensure data quality and governance.
- Deploy and monitor models in production environments on AWS, ensuring high availability and performance.
- Continuously evaluate emerging technologies and propose improvements to enhance data processing efficiency and scalability.
Requirements
- Proven experience as a Data Scientist or Data Architect in a high‑performance computing environment.
- Strong proficiency in Python, SQL, and Big Data frameworks such as Spark.
- Hands‑on experience with AWS services (S3, EMR, Redshift, SageMaker) and model deployment.
- Deep understanding of machine learning concepts, model training, and evaluation.
- Excellent problem‑solving skills and ability to communicate complex technical concepts to non‑technical stakeholders.
Skills
pythonmachine learningsqlaws