remote

AI Applied Scientist - PhD Intern, Evaluation Systems and Metrics

Develop cutting-edge evaluation methodologies for AI systems, creating robust, scalable metrics and frameworks to assess AI quality, consistency, and performance.

About the role

We are seeking remote PhD interns for Summer 2026!

As an intern, you will help develop cutting-edge evaluation methodologies for AI systems. Your research will focus on creating robust, scalable metrics and frameworks to assess the quality, consistency, and performance of generative models across multiple modalities. You may contribute in one or more of the following areas:

Novel Evaluation Metrics: Develop innovative assessment methodologies for emerging AI capabilities, focusing on consistency and quality across complex multi-modal outputs

Self-Improving Assessment: Design evaluation systems that learn and adapt from feedback, automatically discovering new evaluation criteria and improving assessment quality over time

Privacy-Preserving Evaluation: Design frameworks that incorporate domain-specific implementations of differential privacy to protect sensitive user information while maintaining utility for model training and assessment

Ethical Fair Housing Evaluation: Develop scalable methodologies for assessing agentic systems, ensuring compliance with fair housing standards and promoting ethical, responsible AI deployment

Who you are

Currently enrolled as a PhD student in computer science, machine learning, computer vision, or a related field, with a strong publication record

Candidates should have a background in one or more of the following areas:

Evaluation methodologies for AI/ML systems

Computer vision metrics and 3D consistency assessment

Generative model evaluation (text, image, video, 3D)

Multi-modal assessment and automated feedback systems

Knowledge of data privacy methods (e.g., differential privacy, federated learning, secure ML) and their application

Single-agent or multi-agent system evaluations

Familiarity with modern deep learning frameworks (e.g., PyTorch, Hugging Face Transformers)

Strong research mindset, with motivation to publish

Interest in applying AI to complex, multi-stakeholder domains

A record of publication in conferences, workshops, or journals is a plus

Here at Zillow, we value the experience and perspective of candidates with non-traditional backgrounds. We encourage you to apply if you have transferable skills or related experiences.

Get to know us

At Zillow, we’re reimagining how people move, both through the real estate market and through their careers. As the most-visited real estate platform in the U.S., we help customers navigate buying, selling, financing and renting with greater ease and confidence. Whether you're working in tech, sales, operations, or design, you’ll be part of a company that's reshaping an industry and helping more people make home a reality.

Zillow is honored to be recognized among the best workplaces in the country. Zillow was named one of FORTU
