Name: Rocio Titiunik, University of Michigan
Start: 2019-01-16T12:00:00-08:00
End: 2019-01-16T13:30:00-08:00
Location: CCPR Seminar Room

November 2015

Reproducibility of Statistical Results

November 13, 2015 @ 12:00 pm - 1:30 pm PST

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

March 2016

Betsy Sinclair, Washington University in St Louis

March 29, 2016 @ 2:30 pm - 4:00 pm PDT

314 Royce Hall 340 Royce Dr, Los Angeles, CA, United States

Rick Dale, University of California, Merced

March 31, 2016 @ 2:30 pm - 4:00 pm PDT

314 Royce Hall 340 Royce Dr, Los Angeles, CA, United States

May 2016

Ilan H. Meyer & Mark S. Handcock, UCLA

May 27, 2016 @ 12:00 pm - 2:00 pm PDT

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

April 2017

West Coast Experiments Conference, UCLA 2017

April 23, 2017 @ 8:00 am - April 25, 2017 @ 5:00 pm PDT

Covel Commons UCLA

May 2017

Shahryar Minhas, Duke University

May 24, 2017 @ 12:00 pm - 1:30 pm PDT

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

June 2017

Fragile Families Challenge: Getting Started Workshop

June 2, 2017 @ 12:00 pm - 4:00 pm PDT

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

James Robins, Harvard University

June 9, 2017 @ 12:00 pm - 1:00 pm PDT

Room 33-105 CHS Building 650 Charles E Young Drive South, Los Angeles, CA, United States

October 2017

Daniel Benjamin, USC Dornsife Center for Economic and Social Research

October 17, 2017 @ 1:00 pm - 2:00 pm PDT

Sander Greenland, UCLA Department of Epidemiology

October 24, 2017 @ 2:00 pm PDT

November 2017

Hadley Wickham, RStudio

November 8, 2017 @ 3:30 pm PST

December 2017

Nathaniel Osgood, University of Saskatchewan, “Dynamic modeling for health in the age of big data”

December 12, 2017 @ 2:00 pm - 3:30 pm PST

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

Nathaniel Osgood, University of Saskatchewan, “Using Smartphones and Wearables for Public Health Insight: A Hands-On Introduction”

December 13, 2017 @ 12:00 pm - 1:30 pm PST

CHS 61-269

January 2018

Rob Warren, University of Minnesota

January 24, 2018 @ 12:00 pm - 1:30 pm PST

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

February 2018

Per Block, ETH Zurich (Swiss Federal Institute of Technology in Zurich)

February 6, 2018 @ 2:00 pm PST

Yu Xie, Princeton

February 21, 2018 @ 12:00 pm - 1:30 pm PST

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

March 2018

Jake Bowers, University of Illinois at Urbana-Champaign

March 13, 2018 @ 2:00 pm - 3:15 pm PDT

Franz Hall 2258A

October 2018

Erin Hartman, University of California Los Angeles

October 17, 2018 @ 12:00 pm - 1:30 pm PDT

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

November 2018

Adrian Raftery, University of Washington

November 14, 2018 @ 12:00 pm - 1:30 pm PST

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

January 2019

Rocio Titiunik, University of Michigan

January 16, 2019 @ 12:00 pm - 1:30 pm PST

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

Internal vs. external validity in studies with incomplete populations

Researchers working with administrative data rarely have access to the entire universe of units they need to estimate effects and make statistical inferences. Examples span several disciplines. In social program evaluation, it is common to have data on all households that received the program, but only partial information on the universe of households that applied or could have applied for it. In studies of voter turnout, information on the total number of citizens who voted is usually complete, but data on the total number of voting-eligible citizens are unavailable at low levels of aggregation. In criminology, information on arrests by race is available, but the overall population that could potentially have been arrested is typically unavailable. And in studies of drug overdose deaths, we lack complete information about the full population of drug users.

In all these cases, a reasonable strategy is to study treatment effects and descriptive statistics using the information that is available. This strategy may lack the generality of a full-population study, but it may nonetheless yield valuable information about the included units if it has sufficient internal validity. However, the distinction between internal and external validity becomes complex when the subpopulation of units for which information is available is not defined according to a reproducible criterion and/or when this subpopulation is itself defined by the treatment of interest. When this happens, a useful approach is to consider the full range of conclusions that would be obtained under different possible scenarios regarding the missing information. I discuss a general strategy based on partial identification ideas that may help assess the sensitivity of the partial-population study under weak (non-parametric) assumptions, when the outcome variable is known with certainty for a subset of the units. I also discuss extensions such as the inclusion of covariates in the estimation model and different strategies for statistical inference.
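To illustrate the idea of considering every possible scenario for the missing information, here is a minimal sketch of worst-case bounds on a population mean when the outcome is known for only a subset of units. It is a textbook-style illustration, not the speaker's method: the function name `worst_case_mean_bounds`, the assumption that the outcome is bounded between `y_min` and `y_max`, the assumption that the population size `n_total` is known, and the data are all hypothetical.

```python
def worst_case_mean_bounds(observed, n_total, y_min=0.0, y_max=1.0):
    """Bound the full-population mean when outcomes are known for a subset only.

    Assumptions (hypothetical, for illustration): every outcome lies in
    [y_min, y_max], and the population size n_total is known.
    """
    n_obs = len(observed)
    if n_obs > n_total:
        raise ValueError("n_total must be at least the number of observed units")
    n_missing = n_total - n_obs
    observed_sum = sum(observed)
    # Lower bound: every missing outcome takes the smallest possible value.
    lower = (observed_sum + n_missing * y_min) / n_total
    # Upper bound: every missing outcome takes the largest possible value.
    upper = (observed_sum + n_missing * y_max) / n_total
    return lower, upper


# Hypothetical binary outcomes for 6 of 10 units; the other 4 are unobserved.
observed = [1, 1, 0, 1, 0, 1]
lower, upper = worst_case_mean_bounds(observed, n_total=10)
print(f"Population mean lies in [{lower:.2f}, {upper:.2f}]")
```

The width of the interval equals the share of units with missing outcomes times the outcome range, so the bounds tighten only as coverage improves or as additional assumptions are imposed.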

Co-sponsored with the Political Science Department, Statistics Department and the Center for Social Statistics