Harsh Parikh
CEADS Lab

Harsh Parikh

Assistant Professor of Biostatistics
Yale School of Public Health
Fellow, Benjamin Franklin College
Affiliated Faculty, Yale Institute for Global Health
Building rigorous causal inference methods for high-stakes decisions in healthcare, public health, and social sciences.

About
Causal Evidence & Decisions Studio

I lead the Causal Evidence and Decisions Studio (CEADS) at Yale, where we develop machine learning–aided causal inference methods for high-stakes decision-making. Our work sits at the intersection of nonparametric and semiparametric statistics, interpretable ML, and data science—grounded in first-principles thinking and process-driven science.


We partner with domain experts across a vast spectrum of critical challenges—from estimating disease prevalence and understanding deforestation in Madagascar, to quantifying the socio-economic burden of health shocks. Our collaboration partners span North America, Europe, Africa, and Asia, working in public health, medicine, environmental science, ecology, and supply chain management.

Assistant Professor of Biostatistics
Yale School of Public Health · 2025–present
Guest Researcher
Danish Centre for Health Economics, U. of Southern Denmark
Affiliate, Biostatistics
Johns Hopkins Bloomberg School of Public Health
Applied Scientist III
Amazon SCOT (Supply Chain Optimization Technologies)
PhD, Computer Science
Duke University · 2023
Research
Research Themes

Our tools are flexible, robust, and trustworthy, designed for scenarios where standard causal assumptions are challenged. We commit to delivering solutions that are accurate, trustworthy, and domain-conscious.

Data Fusion for Causal Inference

Combining experimental and observational data, partial identification, integrating disparate outcome measures, and characterizing underrepresented populations.

data integrationgeneralizabilityexternal validity

Interpretable ML for Causal Inference

Matching methods (MALTS, VIM, AME), the Rashomon effect, distributional treatment effects, and regularized extrapolation.

matchinginterpretabilityRashomon sets

AI Sandbox for Causal Inference

Validating and benchmarking causal methods through controllable generative sandboxes and power analysis for trial design.

validationsimulationtrial design

Causal Inference for Networks

Causal relational learning on structured data, graph ML estimators for network effects, and transporting effects across social networks.

networksgraph MLrelational data

Experienced Burden of Diseases

Quantifying heterogeneous socio-economic impacts of health shocks, including breast cancer income loss and lasting costs of illness.

health economicsheterogeneous effects

Anthropogenic Threats to Biodiversity

Causal analysis of deforestation drivers in Madagascar and the relationship between vanilla farming and wildlife hunting pressures.

ecologydeforestationnatural experiments

Data Equity in Public Health

Developing frameworks for ensuring equitable data representation in public health data science and evidence-based policymaking.

data equitypublic healthhealth disparities

Publications
Selected Papers
2026
JAMA HF
Ten Core Concepts for Ensuring Data Equity in Public Health
Y Wang, AE Boyd, L Rountree, Y Ren, K Nyhan, R Nagar, J Higginbottom, ML Ranney, H Parikh, B Mukherjee
arXiv
Why Are There Many Equally Good Models? An Anatomy of the Rashomon Effect
H Parikh
arXiv
Demystifying Prediction Powered Inference
Y Song, DM Kluger, H Parikh, T Gu
2025
JASA
Who Are We Missing? A Principled Approach to Characterizing the Underrepresented Population
H Parikh, RK Ross, EA Stuart, KE Rudolph
Nat. Commun.
Breast Cancer and Income Loss in Denmark: Heterogeneous Outcomes and Longitudinal Effects
EK Johnson, H Parikh, KR Olsen, AY Chang, L Sopina
JRSS-C
Precision Mental Health: Predicting Heterogeneous Treatment Effects for Depression
CL Brantner, TQ Nguyen, H Parikh, C Zhao, H Hong, EA Stuart
Obs. Studies
A Double Machine Learning Approach for Combining Experimental and Observational Studies
H Parikh, M Morucci, V Orlandi, S Roy, C Rudin, A Volfovsky
IJROBP
Representativeness and Generalizability of NCI-Funded Multi-Modality RCTs
CFPM de Sousa, H Parikh, JD Bradley, E Stuart, C Hu
arXiv
Regularizing Extrapolation in Causal Inference
D Arbour, H Parikh, B Niknam, EA Stuart, KE Rudolph, A Feller
arXiv
A Cautionary Tale on Integrating Studies with Disparate Outcome Measures
H Parikh, TQ Nguyen, EA Stuart, KE Rudolph, CH Miles
arXiv
Data Fusion for Partial Identification of Causal Effects
Q Lanners, C Rudin, A Volfovsky, H Parikh
arXiv
Demystifying Proximal Causal Inference
GV Ringlein, TQ Nguyen, PP Zandi, EA Stuart, H Parikh
arXiv
Towards Enhancing Data Equity in Public Health Data Science
Y Wang, AE Boyd, L Rountree, Y Ren, K Nyhan, R Nagar, J Higginbottom, ML Ranney, H Parikh, B Mukherjee
JMM
Almost-Exact Matching for Interpretable and Trustworthy Causal Inference
Q Lanners, H Parikh, S Katta, D Page, C Rudin, A Volfovsky
2024
ACTN
How Many Patients Do You Need? Trial Designs for Anti-Seizure Treatment
H Parikh, H Sun, R Amerineni, ES Rosenthal, A Volfovsky, C Rudin, MB Westover, SF Zafar
AISTATS
Safe and Interpretable Estimation of Optimal Treatment Regimes
H Parikh, Q Lanners, Z Akras, S Zafar, MB Westover, C Rudin, A Volfovsky
AISTATS
Interpretable Causal Inference for Wearable, Sensor, and Distributional Data
S Katta, H Parikh, C Rudin, A Volfovsky
HDSR
Towards Generalizing Inferences from Trials to Target Populations
MY Huang, H Parikh
arXiv
Graph ML-based Doubly Robust Estimator for Network Causal Effects
SB Khatami, H Parikh, H Chen, S Roy, B Salimi
2023
Lancet DH
Effects of Epileptiform Activity on Discharge Outcome in Critically Ill Patients
H Parikh, K Hoffman, H Sun, SF Zafar, W Ge, J Jing, L Liu, J Sun, A Struck, A Volfovsky, C Rudin, MB Westover
ICLR
Synthetic Control as Balancing Scores
H Parikh
UAI
Variable Importance Matching for Causal Inference
Q Lanners, H Parikh, A Volfovsky, C Rudin, D Page
2022
JMLR
MALTS: Matching After Learning to Stretch
H Parikh, A Volfovsky, C Rudin
ICML
Validating Causal Inference Methods
H Parikh, C Varjao, L Xu, EJ Tchetgen Tchetgen
NeurIPS
AME: Interpretable Almost Exact Matching for Causal Inference
H Jiang, T Howell, NR Gupta, V Orlandi, M Morucci, H Parikh, S Roy, C Rudin, A Volfovsky
2020–2021
SIGMOD
Causal Relational Learning
B Salimi, H Parikh, M Kayali, L Getoor, S Roy, D Suciu
arXiv
Heterogeneous Treatment Effects in Social Networks
A Gilad, H Parikh, S Roy, B Salimi

Teaching
Courses
Fall 2026

Introduction to Data Science

Master the art of algorithmic thinking, prediction modeling, and data visualization within the context of public health. Learn to translate 'messy' real-world problems into precise mathematical solutions using Python. Course website →

Tutorial

Interpretable ML Tutorial

An overview of recent research in interpretable machine learning, covering fundamental principles and hands-on activities with real-world data for effective analysis and responsible decision-making. Tutorial site →


Contact
Get in Touch

Office

Department of Biostatistics
Yale School of Public Health
New Haven, CT


harsh.parikh@yale.edu

Prospective Students

I am always looking for motivated students interested in causal inference, machine learning, and public health. If you're interested in joining CEADS as a PhD student or postdoc, please reach out with your CV and a brief description of your research interests.


People
CEADS Team

Members and affiliates of the Causal Evidence and Decisions Studio.

Srikar Katta
Srikar Katta
Camille Desisto
Camille Desisto
Peter Liu
Peter Liu
Yiren Hou
Yiren Hou
Ana Karina Raygoza Cortez
Ana Karina Raygoza Cortez
Quinn Lanners
Quinn Lanners
Ge Ge
Ge Ge
Qi Zhang
Qi Zhang