remote

Data Scientist / Data Architect - Data Direct Networks

Data Scientist

Lead the design and deployment of AI‑driven data pipelines, leveraging Python, Machine Learning, and Big Data technologies to power high‑performance storage solutions for demanding AI workloads.

About the role

Key Responsibilities

Design, develop, and maintain scalable data pipelines and architectures that support AI and machine learning workloads.
Implement advanced analytics and ML models using Python, Spark, and SQL to extract actionable insights from large datasets.
Collaborate with cross‑functional teams to define data requirements, optimize storage solutions, and ensure data quality and governance.
Deploy and monitor models in production environments on AWS, ensuring high availability and performance.
Continuously evaluate emerging technologies and propose improvements to enhance data processing efficiency and scalability.

Requirements

Proven experience as a Data Scientist or Data Architect in a high‑performance computing environment.
Strong proficiency in Python, SQL, and Big Data frameworks such as Spark.
Hands‑on experience with AWS services (S3, EMR, Redshift, SageMaker) and model deployment.
Deep understanding of machine learning concepts, model training, and evaluation.
Excellent problem‑solving skills and ability to communicate complex technical concepts to non‑technical stakeholders.

Skills

pythonmachine learningsqlaws

CompanyData Direct Networks

DepartmentResearch

LocationUnited States

Experience7+ years

Tenurefull-time

LevelLead

Posted June 23, 2026