AI ML Data Scientist position — see original posting for full details.
About the role
Job Family :
Travel Required :
Clearance Required :
What You Will Do :
Partner with stakeholders to define and deliver AI/analytics use cases, translating business needs into scalable data science solutions.
Design and develop machine learning models and analytical approaches to support search, discovery, and insight generation across structured and unstructured data.
Build and implement NLP, semantic search, and entity resolution capabilities to enable advanced information retrieval and relationship analysis.
Leverage document-based data (e.g., OCR/ICR outputs, metadata, and free text) to extract insights and support downstream analytics and search solutions.
Collaborate with data engineers to integrate models into production environments, including Palantir Foundry, Databricks, and AWS-based platforms.
Develop model evaluation frameworks, confidence scoring, and explainability approaches to ensure transparency and usability of AI outputs.
Support development of analytics, reporting, and dashboards to drive operational insights and decision-making.
Operate within an Agile delivery model, contributing to sprint planning, experimentation, and iterative solution delivery.
Communicate findings and recommendations clearly to both technical and non-technical audiences, including client stakeholders.
Contribute to solution design, proposal support, and thought leadership in AI/analytics capabilities.
What You Will Need :
Bachelor’s or Master’s degree in Data Science, Computer Science, Statistics, Mathematics, or related field.
A minimum of 4 years of experience in data science, machine learning, or applied analytics roles
U.S. Citizenship required and ability to obtain and maintain a Public Trust clearance.
Natural Language Processing (NLP)
Semantic search or information retrieval
Entity resolution or relationship modeling
Experience working with large-scale structured and unstructured data, particularly document-based datasets (e.g., text, PDFs, images).
Experience leveraging metadata and extracted features to support analytics and modeling.
Strong proficiency in Python for data science and machine learning (e.g., Pandas, Scikit-learn, PyTorch or TensorFlow) and solid SQL skills.
Experience working with Databricks and/or Spark-based environments for scalable data processing.
Familiarity with AWS cloud services for data access, processing, or model deployment.
Experience working with data lake or lakehouse architectures (e.g., AWS S3, Databricks), including querying and transforming large-scale datasets.
Experience integrating models into production environments (e.g., APIs, batch pipelines, or embedded analytics platforms).
Understanding of
Skills
machine learningnatural language processingnlptensorflowpytorchscikit learn