Mapping the Mind

Junior computer science and mathematics double major Brooke Guo analyzes neural connections to understand the causes of complex brain conditions like schizophrenia. When Brooke Guo arrived at the University of Maryland as a freshman in 2022, she knew she wanted to help people and work in a health-related field someday.… Read More
Four Science Terps Awarded 2025 Goldwater Scholarships

Four undergraduates in the University of Maryland’s College of Computer, Mathematical, and Natural Sciences (CMNS) have been awarded 2025 scholarships by the Barry Goldwater Scholarship and Excellence in Education Foundation, which encourages students to pursue advanced study and research careers in the sciences, engineering and mathematics. Over the last 16 years, UMD’s nominations… Read More
Announcing the Winners of the Frontiers of Science Awards

Congratulations to our colleagues who won the 2025 Frontiers of Science Award: - Dan Cristofaro-Gardiner, for his join paper with Humbler and Seyfaddini: “Proof of the simplicity conjecture”, Annals of Mathematics 2024. - Dima Dolgopyat & Adam Kanigowski, for their joint paper with Federico Rodriguez Hertz: “Exponential mixing implies Bernoulli”, Annals of Mathematics… Read More
2024 Putnam Results

We are very excited to report that our MAryland Putnam team ranked 7th among 477 institutions that participated in the 2024 Putnam math competition. Our team members this year were Daniel Yuan, Isaac Mammel, and Clarence Lam. Daniel Yuan ranked 26th among 3,988 participants. Clarence Lam and Isaac Mammel were recognized for… Read More
From Math Olympiads to Diplomacy: Meet Visiting Math Professor Qendrim Gashi

Maryland Global, published a great interview with our visiting professor (and diplomat), Qendrim Gashi. The interview is available at https://marylandglobal.umd.edu/about/news/math-olympiads-diplomacy-meet-visiting-math-professor-qendrim-gashi Read More

Schedule

9:00 AM - Information Bias and its Remedies - Jae-Kwang Kim (Iowa State University)
10:00 AM - Selection Bias and its Remedies - Jae-Kwang Kim (Iowa State University)
11:00 AM - Two Key Ideas in Survey Nonresponse: Response Propsensity and Missing at Random - Roderick Little (University of Michigan)
12 Noon - Lunch
1:00 PM - Applying Non-Ignorable Missing Data Methods - Rebecca Andridge (Ohio State University)
2:00 PM - Model Based Imputation Methods for Small Area Estimation - Partha Lahiri (University of Maryland)
3:00 PM Information Projection Approach to Propensity Estimation for Handling Missing Data - Jae-Kwang Kim (Iowa State University)

Speakers

Jae-Kwang Kim, Iowa State University, Ames
Roderick Little, University of Michigan, Ann Arbor
Rebecca Andridge, Ohio State University, Columbus
Partha Lahiri, University of Maryland, College Park

Missing Data: Titles and Abstracts

Overview Lecture for Missing Data

Jae-Kwang Kim
LAS Dean's professor (2020-2022)
Iowa State University, Ames, IA

Part 1. Information bias and its remedies
Part 2. Selection bias and its remedies
Information bias and selection biases are the main features of incomplete data. In the overview lecture, we cover these problems in the statistical framework and introduce statistical tools for handling incomplete data. Topics includes measurement error models, observed likelihood, denoising, mean score theorem, EM algorithm, and sample likelihood. The overview lecture does not assume strong background in statistics. Minimum knowledge in undergraduate mathematical statistics should be enough to understand the basic ideas in this overview lecture.

Two key ideas in survey nonresponse: response propensity and missing
at random

Roderick Little, University of Michigan, Ann Arbor
Richard D. Remington Distinguished University Professor of Biostatistics

I present recent work concerning two key ideas in survey nonresponse, namely response propensity and missing at random. I propose a specific
definition of the response propensity that clarifies the conditioning, and weakened sufficient conditions for missing at random for asymptotic
frequentist maximum likelihood inference. Finally I show how an explicit modeling approach allows certain missing not at random mechanisms to be identified when there is post-stratification information.

Applying Non-Ignorable Missing Data to U.S. Election Polling Data

Rebecca Andridge
Associate Professor, Biostatistics
The Ohio State University

Among the numerous explanations that have been offered for recent polling errors in U.S. pre-election surveys, selection bias due to non- gnorable partisan nonresponse bias, where the probability of responding to a poll is a function of the candidate preference that a poll is attempting to measure (even after conditioning on other relevant covariates used for weighting adjustments), has received relatively less focus in the academic literature. Under this type of selection mechanism, estimates of candidate preferences based on individual or aggregated polls may be subject to significant bias, even after standard weighting adjustments. Until recently, methods for measuring and adjusting for this type of non-ignorable nonresponse or selection bias have been unavailable. This talk describes a simple model-based index of the potential bias in estimates of population proportions (e.g., candidate preference) due to non-ignorable nonresponse/selection mechanisms. The index depends in an inestimable parameter that captures the amount of deviation from missingness at random; this parameter ranges from 0 to 1 and naturally lends itself tto a sensitivity analysis. We analyze publicly available data from seven different pre-election polls conducted in seven different \swing" states by ABC and the Washington Post in 2020, and evaluate the ability of these new measures to detect bias in estimates of the proportion of likely voters in each state that will vote for President Trump. Using official election outcomes in each state as benchmarks and alternative data sources for estimating key characteristics of the likely voter populations in each state, we evaluate the ability of the
new measure to 1) detect potential selection bias in these estimates, and 2) adjust for that bias when official pre-election polling estimates are produced.

Model-based Imputation Methods For Small Area Estimation

Partha Lahiri
Director, Joint Program in Survey Methodology (JPSM),
Professor, JPSM and Department of Mathematics,
University of Maryland, College Park

There is a growing demand to produce reliable estimates of different characteristics of interest for small geographical areas (e.g.., states) or domains obtained by a cross-classification of different demographic factors such as age, sex, race/ethnicity. The information on the outcome variable(s) of interest often comes from a sample survey that targets reliable estimation for large areas (e.g., national level). In this talk, I will discuss how model-based imputation methods can be used to improve inferences about different small area or domain parameters. The proposed method essentially uses suitable statistical models that can be used to extract information from multiple data sources. We illustrate the proposed methodology in the context of election projection for small areas. The talk is based on collaborative research with UMD students Aditi Sen and Zhenyu Yue.

Information Projection Approach to Propensity Estimation for Handling Missing Data

Jae-Kwang Kim
LAS Dean's professor (2020-2022)
Iowa State University, Ames, IA

Missing data is frequently encountered in practice. Propensity score estimation is a popular tool for handling such missingness. The propensity score is often developed using the model for the response probability, which can be subject to model misspecification. In this talk, we consider
an alternative approach of estimating the inverse of the propensity scores using the density ratio function. By partitioning the sample into two groups based on the response status of the elements, we can apply the density ratio function estimation method and obtain the inverse propensity scores for nonresponse adjustment. Density ratio estimation can be obtained by applying the so-called maximum entropy method, which uses the Kullback-Leibler divergence measure under calibration constraints. By including the covariates for the outcome regression models only into the density ratio model, we can achieve efficient propensity score estimation. We further extend the proposed approach to the multivariate missing case. Some limited simulation studies are presented to compare with the existing methods.

About the Workshop

Missing data is an important area of statistics. The workshop will feature well known experts who will give several lectures including an introduction and research work.

Meet Brin Mathematics Research Center Coordinator Natalie Kinnear
She grew up in a Terp family and found her way from Los Angeles to College Park in 2018. … Read More
Keeping an ‘Open Mind’ in Math
As director of UMD’s applied mathematics & statistics, and scientific computation graduate program, Radu Balan oversees many branches of math—and… Read More
From Math Olympiad to Diplomacy
Math Professor Reimagining Learning: Meet Qendrim Gashi Let’s start by talking about your role in the field of mathematics. Can… Read More
Visiting Professor Pays Homage to WWII’s Unsung Mathematicians
For more than two decades, Zbigniew Błocki has been sharing the story of the Polish codebreakers who helped crack Nazi… Read More
Professor Abba Gumel Answers Questions About the Math Behind Infectious Disease
The College of Computer, Mathematical, and Natural Sciences hosted a Reddit Ask-Me-Anything spotlighting epidemic modeling research. University of Maryland Mathematics… Read More
Brin Mathematics Research Center Hosted Seven Workshops and Three Distinguished Lectures This Spring
Three workshops, a summer school and a high school camp are planned for this summer. Summer 2025 signifies the completion of three… Read More