Harsh Parikh

About

Causal Evidence & Decisions Studio

I lead the Causal Evidence and Decisions Studio (CEADS) at Yale, where we develop machine learning–aided causal inference methods for high-stakes decision-making. Our work sits at the intersection of nonparametric and semiparametric statistics, interpretable ML, and data science—grounded in first-principles thinking and process-driven science.

We partner with domain experts across a vast spectrum of critical challenges—from estimating disease prevalence and understanding deforestation in Madagascar, to quantifying the socio-economic burden of health shocks. Our collaboration partners span North America, Europe, Africa, and Asia, working in public health, medicine, environmental science, ecology, and supply chain management.

Assistant Professor of Biostatistics

Yale School of Public Health · 2025–present

Assistant Professor (Secondary)

Statistics and Data Science, Yale University

Affiliated Faculty

Foundations of Data Science · ISPS, Yale University

Guest Researcher

Danish Centre for Health Economics, U. of Southern Denmark

Affiliate, Biostatistics

Johns Hopkins Bloomberg School of Public Health

Applied Scientist III

Amazon SCOT (Supply Chain Optimization Technologies)

PhD, Computer Science

Duke University · 2023

Research

Research Themes

Our tools are flexible, robust, and trustworthy, designed for scenarios where standard causal assumptions are challenged. We commit to delivering solutions that are accurate, trustworthy, and domain-conscious.

⊕

Data Fusion for Causal Inference

Combining experimental and observational data, partial identification, integrating disparate outcome measures, and characterizing underrepresented populations.

data integrationgeneralizabilityexternal validity

◈

Interpretable ML for Causal Inference

Matching methods (MALTS, VIM, AME), the Rashomon effect, distributional treatment effects, and regularized extrapolation.

matchinginterpretabilityRashomon sets

⬡

AI Sandbox for Causal Inference

Validating and benchmarking causal methods through controllable generative sandboxes and power analysis for trial design.

validationsimulationtrial design

⊞

Causal Inference for Networks

Causal relational learning on structured data, graph ML estimators for network effects, and transporting effects across social networks.

networksgraph MLrelational data

♺

Experienced Burden of Diseases

Quantifying heterogeneous socio-economic impacts of health shocks, including breast cancer income loss and lasting costs of illness.

health economicsheterogeneous effects

⌬

Anthropogenic Threats to Biodiversity

Causal analysis of deforestation drivers in Madagascar and the relationship between vanilla farming and wildlife hunting pressures.

ecologydeforestationnatural experiments

≡

Data Equity in Public Health

Developing frameworks for ensuring equitable data representation in public health data science and evidence-based policymaking.

data equitypublic healthhealth disparities

Publications

Selected Papers

2026

Obs. Studies

A Double Machine Learning Approach for Combining Experimental and Observational Studies

H Parikh, M Morucci, V Orlandi, S Roy, C Rudin, A Volfovsky

JAMA HF

Lasting Income Costs of Mental and Physical Illness

EK Johnson, H Parikh, CK Ettman, G Ge, L Sopina, AY Chang

JAMA HF

Ten Core Concepts for Ensuring Data Equity in Public Health

Y Wang, AE Boyd, L Rountree, Y Ren, K Nyhan, R Nagar, J Higginbottom, ML Ranney, H Parikh, B Mukherjee

AISTATS

Regularizing Extrapolation in Causal Inference

D Arbour, H Parikh, B Niknam, EA Stuart, KE Rudolph, A Feller

arXiv

TEA-Time: Transporting Effects Across Time

H Parikh, G Levin-Konigsberg, D Perrault-Joncas, A Volfovsky

arXiv

Controllable Generative Sandbox for Causal Inference

Q Zhang, H Parikh, A Naimi, R Nabi, C Kim, T Lash

arXiv

When Does Agroforestry Income Reduce Deforestation? Evidence from a Natural Experiment in Madagascar

C DeSisto, R Rasolofoson, M Foley, H Parikh

arXiv

The Epidemiology of Artificial Intelligence

H Parikh, T McCormick, E Johnson, L Hickey, M Ranney, B Mukherjee

arXiv

Why Are There Many Equally Good Models? An Anatomy of the Rashomon Effect

H Parikh

arXiv

Demystifying Prediction Powered Inference

Y Song, DM Kluger, H Parikh, T Gu

2025

JASA

Who Are We Missing? A Principled Approach to Characterizing the Underrepresented Population

H Parikh, RK Ross, EA Stuart, KE Rudolph

Nat. Commun.

Breast Cancer and Income Loss in Denmark: Heterogeneous Outcomes and Longitudinal Effects

EK Johnson, H Parikh, KR Olsen, AY Chang, L Sopina

JRSS-C

Precision Mental Health: Predicting Heterogeneous Treatment Effects for Depression

CL Brantner, TQ Nguyen, H Parikh, C Zhao, H Hong, EA Stuart

IJROBP

Representativeness and Generalizability of NCI-Funded Multi-Modality RCTs

CFPM de Sousa, H Parikh, JD Bradley, E Stuart, C Hu

NeurIPS

A Cautionary Tale on Integrating Studies with Disparate Outcome Measures

H Parikh, TQ Nguyen, EA Stuart, KE Rudolph, CH Miles

NeurIPS

Data Fusion for Partial Identification of Causal Effects

Q Lanners, C Rudin, A Volfovsky, H Parikh

arXiv

Demystifying Proximal Causal Inference

GV Ringlein, TQ Nguyen, PP Zandi, EA Stuart, H Parikh

arXiv

Towards Enhancing Data Equity in Public Health Data Science

Y Wang, AE Boyd, L Rountree, Y Ren, K Nyhan, R Nagar, J Higginbottom, ML Ranney, H Parikh, B Mukherjee

JMM

Almost-Exact Matching for Interpretable and Trustworthy Causal Inference

Q Lanners, H Parikh, S Katta, D Page, C Rudin, A Volfovsky

2024

ACTN

How Many Patients Do You Need? Trial Designs for Anti-Seizure Treatment

H Parikh, H Sun, R Amerineni, ES Rosenthal, A Volfovsky, C Rudin, MB Westover, SF Zafar

AISTATS

Safe and Interpretable Estimation of Optimal Treatment Regimes

H Parikh, Q Lanners, Z Akras, S Zafar, MB Westover, C Rudin, A Volfovsky

AISTATS

Interpretable Causal Inference for Wearable, Sensor, and Distributional Data

S Katta, H Parikh, C Rudin, A Volfovsky

HDSR

Towards Generalizing Inferences from Trials to Target Populations

MY Huang, H Parikh

arXiv

Graph ML-based Doubly Robust Estimator for Network Causal Effects

SB Khatami, H Parikh, H Chen, S Roy, B Salimi

2023

Lancet DH

Effects of Epileptiform Activity on Discharge Outcome in Critically Ill Patients

H Parikh, K Hoffman, H Sun, SF Zafar, W Ge, J Jing, L Liu, J Sun, A Struck, A Volfovsky, C Rudin, MB Westover

ICLR

Synthetic Control as Balancing Scores

H Parikh

UAI

Variable Importance Matching for Causal Inference

Q Lanners, H Parikh, A Volfovsky, C Rudin, D Page

2022

JMLR

MALTS: Matching After Learning to Stretch

H Parikh, A Volfovsky, C Rudin

ICML

Validating Causal Inference Methods

H Parikh, C Varjao, L Xu, EJ Tchetgen Tchetgen

NeurIPS

AME: Interpretable Almost Exact Matching for Causal Inference

H Jiang, T Howell, NR Gupta, V Orlandi, M Morucci, H Parikh, S Roy, C Rudin, A Volfovsky

2020–2021

SIGMOD

Causal Relational Learning

B Salimi, H Parikh, M Kayali, L Getoor, S Roy, D Suciu

arXiv

Heterogeneous Treatment Effects in Social Networks

A Gilad, H Parikh, S Roy, B Salimi

Data Fusion for Causal Inference

Interpretable ML for Causal Inference

AI Sandbox for Causal Inference

Causal Inference for Networks

Experienced Burden of Diseases

Anthropogenic Threats to Biodiversity

Data Equity in Public Health

Introduction to Data Science

Interpretable ML Tutorial

Office

Prospective Students