Recent & Upcoming Talks

2026

Miss(ing) Congeniality: The Cost of Incompatible Imputation Models

Doubly robust estimators are often viewed as protected against model misspecification, but this protection can fail in the presence of missing data. When multiple imputation is used, lack of congeniality between the imputation and analysis models can induce bias even when both the propensity score and outcome models are correctly specified. Valid inference requires imputation models to include all variables from both the propensity score and outcome models, specified in compatible functional forms. Violations of these conditions can lead to substantial bias in treatment effect estimates. A general framework for combining multiple imputation with doubly robust estimation is presented, the conditions required for congeniality are characterized, and the consequences of misspecification are illustrated. The talk concludes with practical recommendations for specifying imputation models to preserve the validity of doubly robust methods in applied causal analyses.

The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation

This talk provides clear and practical guidance on the specification of imputation models when multiple imputation is used in conjunction with doubly robust estimation methods for causal inference. Through theoretical arguments and targeted simulations, we demonstrate that if a confounder has missing data, the corresponding imputation model must include all variables appearing in either the propensity score model or the outcome model, in addition to both the exposure and the outcome, and that these variables must enter the imputation model in the same functional form as in the final analysis. Violating these conditions can lead to biased treatment effect estimates, even when both components of the doubly robust estimator are correctly specified. We present a mathematical framework for doubly robust estimation combined with multiple imputation, establish the theoretical requirements for proper imputation in this setting, and demonstrate the consequences of misspecification through simulation. Based on these findings, we offer concrete recommendations to ensure valid inference when using multiple imputation with doubly robust methods in applied causal analyses.

April 20, 2026

9:00 AM – 10:00 AM

Distinguished Lecture at Innovations in Design, Analysis, and Dissemination 2026


By Lucy D'Agostino McGowan in Invited Oral Presentation

slides

Causal Inference in R

This workshop provides a structured introduction to causal inference, guiding participants from formulating causal questions to estimating and communicating causal effects using R. Topics include the transition from associational to causal thinking, the role of counterfactuals, and the use of causal diagrams to formalize assumptions. Participants will learn to define causal estimands, implement and diagnose propensity score models, and build outcome models. The workshop also covers methods for continuous exposures, including g-computation, and concludes with approaches to sensitivity analysis. Hands-on exercises in R reinforce each concept, enabling participants to apply modern causal inference techniques in practice.

April 9, 2026

8:30 AM – 5:30 PM

Workshop on Experiments at NEOMA Business School Reims, France 2026


By Lucy D'Agostino McGowan in Invited Workshop

details

2025

The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation

This talk provides clear and practical guidance on the specification of imputation models when multiple imputation is used in conjunction with doubly robust estimation methods for causal inference. Through theoretical arguments and targeted simulations, we show that when a confounder has missing data the corresponding imputation model must include all variables used in either the propensity score model or the outcome model, and that these variables must appear in the same functional form as in the final analysis. Violating these conditions can lead to biased treatment effect estimates, even when both components of the doubly robust estimator are correctly specified. We present a mathematical framework for doubly robust estimation combined with multiple imputation, establish the theoretical requirements for proper imputation in this setting, and demonstrate the consequences of misspecification through simulation. Based on these findings, we offer concrete recommendations to ensure valid inference when using multiple imputation with doubly robust methods in applied causal analyses.

The Art of Data Refinement: Severance Analyses

This talk demonstrates data extraction from multiple sources using the popular television series Severance as an example. For example, we collected and analyzed elevator sounds predict narrative events, voice recordings underwent cepstral analysis to estimate fundamental frequencies and characterize speaker- specific distributions, with k-nearest neighbors used for classification, and text mining was performed on episode scripts to quantify dialogue patterns. These analyses illustrate how statistical methods can be applied to unconventional data sources from entertainment media.

Exploring the Potential of Large Language Models in Generating Saturated DAGs for Causal Inference

This talk investigates whether large language models (LLMs) could potentially assist in the creation of “saturated DAGs”, graphical representations that exhaustively map all possible causal pathways in a system. We’ll critically examine if and how LLMs might help identify the full space of plausible causal relationships that traditional approaches may overlook. The presentation will assess the strengths and limitations of prompting LLMs to generate comprehensive causal structures, identify backdoor paths, and navigate complex causal systems.

Causal Inference Is Not Just A Statistics Problem

In this talk we will discuss four datasets, similar to Anscombe’s quartet, that aim to highlight the challenges involved when estimating causal effects. Each of the four datasets is generated based on a distinct causal mechanism: the first involves a collider, the second involves a confounder, the third involves a mediator, and the fourth involves the induction of M-Bias by an included factor. Despite the fact that the statistical summaries and visualizations for each dataset are identical, the true causal effect differs, and estimating it correctly requires knowledge of the data-generating mechanism. These example datasets can help practitioners gain a better understanding of the assumptions underlying causal inference methods and emphasize the importance of gathering more information beyond what can be obtained from statistical tools alone.

Causal Inference in R

In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. In both data science and academic research, prediction modeling is often not enough; to answer many questions, we need to approach them causally. In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. We’ll also show that by distinguishing predictive models from causal models, we can better take advantage of both tools. You’ll be able to use the tools you already know–the tidyverse, regression models, and more–to answer the questions that are important to your work. This workshop is for you if you: know how to fit a linear regression model in R, have a basic understanding of data manipulation and visualization using tidyverse tools, and are interested in understanding the fundamentals behind how to move from estimating correlations to causal relationships.

September 16, 2025

9:00 AM – 4:00 PM

posit::conf 2025


By Lucy D'Agostino McGowan and Malcolm Barrett in Invited Workshop

details

Immaculate vibes, questionable code

Inspired by the practical problem of chairing a useR conference session, this talk walks through building a Slido replacement in pure “vibe coding” mode — no specs, no tests, just a vision and a stubborn refusal to stop until it worked (sort of). We’ll explore the chaotic joy, the accidental brilliance, and the inevitable questionable code that got us there.

Building Strong ASA Student Chapters: Strategies for Growth, Inclusivity, and Maximizing Value

This invited panel session offers guidance for statistics and data science faculty and students seeking to establish or enhance their ASA student chapter. The panel will feature a diverse group of faculty advisors and chapter officers who will share their experiences starting and running a successful student chapter. The panelists will discuss the benefits of ASA affiliation, strategies for maintaining an active and inclusive chapter, and resources available to support students and faculty. Attendees will gain practical tips and strategies for transitioning from a statistics club to an ASA student chapter, overcoming challenges, and fostering a strong community of statistics and data science learners.

August 3, 2025

2:00 PM – 4:00 PM

ASA Joint Statistical Meeting 2025


By Lucy D'Agostino McGowan in Invited Panel