Harsh Parikh, PhD
Assistant Professor of Biostatistics
I lead the Causal Evidence and Decisions Studio (CEADS) at Yale, where we develop machine learning–aided causal inference methods for high-stakes decision-making. Our work sits at the intersection of nonparametric and semiparametric statistics, interpretable ML, and data science—grounded in first-principles thinking and process-driven science.
What do you do with Data Science?
I use data science to design machine learning–aided causal inference methods that help answer high-stakes questions in health, policy, and other applied domains. My work focuses on creating accurate approaches—able to estimate heterogeneous treatment effects in complex, small-data settings; trustworthy—transparent enough for domain experts to interpret, validate assumptions, and ensure safety; and domain-conscious—incorporating contextual knowledge to bridge the gap between research and practice. I have developed methods for integrating experimental and observational data, generalizing trial findings to new populations, and learning optimal treatment regimes, with applications ranging from "evaluating anti-seizure treatments in acute brain injury patients" to "characterizing underrepresented groups in clinical trials". My publications appear in venues such as the Journal of the American Statistical Association, Harvard Data Science Review, Journal of Machine Learning Research, Nature Communications and The Lancet Digital Health. Going forward, I aim to expand these methods to settings with complex data structures—such as networks, longitudinal records, and multi-modal health data—to enable safer and trustworthy decision-making.
- Social Algorithms
- AI
- Applications to Social Science
- Causal Inference
- Computer Science
- Econometrics
- Environmental Data Science
- Healthcare
- Mathematical Statistics
- Networks
- Observational Data
- Operations Research
- Probability and Statistics
- Public Health
- Randomized Controlled Trials
- Randomized Experiments
- Reinforcement Learning
- Selection Bias
- Semi-parametric Models
- Semiparametrics
- Social AI
- Social Networks
- Social Science
- Social Sciences
- statistical inference
- Statistics of Networks
- Statsitical Learning Theory
- Uncertainty Estimation
