I have broad research interests spanning causal inference, machine learning, and multimodal large language models. My work includes projects on individualized optimal decision-making, variable importance for heterogeneous treatment effects, mediation analysis, and generalizability for causal inference, as well as recent contributions to LLM applications, such as multimodal figure-to-code generation and clustering with LLM-guided constraints during my internship.
Large language models (LLMs) generate rationales that improve reasoning and interpretability, but existing evaluations using binary human or LLM preferences are limited and opaque. We introduce an attribute-based evaluation framework that defines key rationale qualities, explains human preferences, and enables more nuanced model comparisons through attribute-specific analysis.
We proposed a framework for using placebo samples in observational studies to detect and correct for unmeasured confounding bias. It develops identification assumptions and estimation methods—including regression, weighting, and doubly robust approaches—and validates them through simulations and an applied case study on tax credits and infant health.
We developed a potential outcomes framework to estimate the ITT effect of an active control versus placebo in active-controlled trials, using historical placebo-controlled data. Our method enables ITT estimation when the placebo arm is unavailable and accounts for unmeasured confounders using instrumental variables.
We introduced a new criterion for individualized treatment rules based on the adjusted probability of longer survival, offering a clear and clinically relevant objective. Our method, optimal adjusted probability learning, constructs the best treatment rule by maximizing this nonparametric survival benefit.
Presentations
Generalizing the Intention-to-Treat Effect of an Active Control from Historical Placebo-Controlled Trials
- The Translational Data Science Integrated Research Center Retreat. Kirkland, WA, 2023.
- 20th Annual STI & HIV Research Symposium. Seattle, WA, 2023.
- American Causal Inference Conference. Seattle, WA, 2024
- Joint Statistical Meetings. Portland, OR, 2024.
Approximate Bayesian Computation (ABC)-Calibrated Microsimulation Model for Predicting HIV-1 Prevention
Efficacy of Broadly Neutralizing Antibodies
- HVTN Africa Regional Meeting. Cape Town, South Africa, 2024.
Industry Experience
PhD Intern (Summer 2025)
Ads Platform, Apple
Teaching Experience
Department of Statistics, University of Washington
Teaching Assistant in STAT 311 Elements of Statistical Methods (Winter 2024, Autumn 2025)
Academic tutoring center, School of Mathematics, Sun Yat-sen University
Tutor in Mathematical analysis (Fall 2018)
TAL Education Group
Teaching Assistant in primary-school Olympiad Mathematics (2017-2018)