Kosuke Imai, Harvard University

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

Matching Methods for Causal Inference with Time-Series Cross-Section Data

Matching methods aim to improve the validity of causal inference in observational studies by reducing model dependence and offering intuitive diagnostics. While they have become a part of standard tool kit for empirical researchers across disciplines, matching methods are rarely used when analyzing time-series cross-section (TSCS) data, which consist of a relatively large number of repeated measurements on the same units.

We develop a methodological framework that enables the application of matching methods to TSCS data. In the proposed approach, we first match each treated observation with control observations from other units in the same time period that have an identical treatment history up to the pre-specified number of lags. We use standard matching and weighting methods to further refine this matched set so that the treated observation has outcome and covariate histories similar to those of its matched control observations. Assessing the quality of matches is done by examining covariate balance. After the refinement, we estimate both short-term and long-term average treatment effects using the difference-in-differences estimator, accounting for a time trend. We also show that the proposed matching estimator can be written as a weighted linear regression estimator with unit and time fixed effects, providing model-based standard errors. We illustrate the proposed methodology by estimating the causal effects of democracy on economic growth, as well as the impact of inter-state war on inheritance tax. The open-source software is available for implementing the proposed matching methods.

Co-sponsored with the Political Science Department, Statistics Department and the Center for Social Statistics

Workshop: Merging Entities – Deterministic, Approximate, & Probabilistic

4240 Public Affairs Bldg

Instructor: Michael Tzen Title: Merging Entities: Deterministic, Approximate, & Probabilistic Location: January 31, 2019, 2:00-3:00 PM 4240 Public Affairs Building CCPR Seminar Room Content: Combining information from different groups is […]

Adeline Lo, Princeton University

1434A Physics and Astronomy Building

Covariate screening in high dimensional data: applications to forecasting and text data

High dimensional (HD) data, where the number of covariates and/or meaningful covariate interactions might exceed the number of observations, is increasing used in prediction in the social sciences. An important question for the researcher is how to select the most predictive covariates among all the available covariates. Common covariate selection approaches use ad hoc rules to remove noise covariates, or select covariates through the criterion of statistical significance or by using machine learning techniques. These can suffer from lack of objectivity, choosing some but not all predictive covariates, and failing reasonable standards of consistency that are expected to hold in most high-dimensional social science data. The literature is scarce in statistics that can be used to directly evaluate covariate predictivity. We address these issues by proposing a variable screening step prior to traditional statistical modeling, in which we screen covariates for their predictivity. We propose the influence (I) statistic to evaluate covariates in the screening stage, showing that the statistic is directly related to predictivity and can help screen out noisy covariates and discover meaningful covariate interactions. We illustrate how our screening approach can removing noisy phrases from U.S. Congressional speeches and rank important ones to measure partisanship. We also show improvements to out-of-sample forecasting in a state failure application. Our approach is applicable via an open-source software package.

Workshop: Grad Student Panel Discussing the Causal Toolkit

4240 Public Affairs Bldg

Title: Grad Student Panel Discussing the Causal Toolkit Location: February 27, 2019, 2:00-3:30 PM 4240 Public Affairs Building CCPR Seminar Room Content: Focusing on the uses of the causal toolkit, […]

Lan Liu, University of Minnesota at Twin Cities

Lan Liu, University of Minnesota at Twin Cities "Parsimonious Regressions for Repeated Measure Analysis"  Abstract: Longitudinal data with repeated measures frequently arises in various disciplines. The standard methods typically impose […]

Eloise Kaizar, Ohio State University

Eloise Kaizar, Ohio State University Randomized controlled trials are often thought to provide definitive evidence on the magnitude of treatment effects. But because treatment modifiers may have a different distribution […]

Susan Athey, Stanford University

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

Estimating Heterogeneous Treatment Effects and Optimal Treatment Assignment Policies

This talk will review recently developed methods for estimating conditional average treatment effects and optimal treatment assignment policies in experimental and observational studies, including settings with unconfoundedness or instrumental variables. Multi-armed bandits for learning treatment assignment policies will also be considered.

Co-sponsored with the Center for Social Statistics

Brandon Stewart, Princeton University

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

How to Make Causal Inferences Using Texts

Texts are increasingly used to make causal inferences: either with the document serving as the treatment or the outcome. We introduce a new conceptual framework to understand all text-based causal inferences, demonstrate fundamental problems that arise when using manual or computational approaches applied to text for causal inference, and provide solutions to the problems we raise.  We demonstrate that all text-based causal inferences depend upon a latent representation of the text and we provide a framework to learn the latent representation.  Estimating this latent representation, however, creates new risks: we may unintentionally create a dependency across observations or create opportunities to fish for large effects.  To address these risks, we introduce a train/test split framework and apply it to estimate causal effects from an experiment on immigration attitudes and a study on bureaucratic responsiveness.  Our work provides a rigorous foundation for text-based causal inferences, connecting two previously disparate literatures. (Joint Work with Egami, Fong, Grimmer and Roberts)

Co-sponsored with the Center for Social Statistics

Summer Institute in Computational Social Science

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

The purpose of the Summer Institute is to bring together graduate students, postdoctoral researchers, and early career faculty interested in computational social science. The Summer Institute is open to both […]

Summer Institute in Computational Social Science Panel Presentation

Luskin Conference Center Laureate Room

Summer Institute in Computational Social Science Panel Presentation

Friday June 21, 2019 2:00pm – 5:00pm
Reception 5:00pm – 6:00pm
Luskin Conference Center Laureate Room
• 2:00pm – 3:15pm Digital Demography
Prof. Dennis Feehan, UC Berkeley and Prof. Ka-Yuet Liu, UCLA
• 3:30pm – 4:45pm Computational Causal Inference
Prof. Judea Pearl, UCLA and Prof. Sam Pimentel, UC Berkeley

Big Data for Big Social Issues

UCLA Neuroscience Research Building Auditorium (NRB 132)

Big Data for Big Social Issues Summer Institute in Computational Social Science Panel: 1:00pm - 2:45pm Prof. John Friedman, Brown University: "Income Inequality and Social Mobility: What Can We Learn […]

Summer Institute in Computational Social Science

4240 Public Affairs Bldg

CCPR June 15 – 26, 2020 4240 Public Affairs Building The purpose of the Summer Institute is to bring together graduate students, postdoctoral researchers, and early career faculty interested in […]

SICSS Conference 2023

From June 20 to June 30, 2023, the University of California, Los Angeles (UCLA) Division of Social Sciences and the California Center for Population Research will sponsor the Summer Institute […]

The Summer Institutes in Computational Social Science (SICSS) 2024

CCPR Seminar Room 4240 Public Affairs Building, Los Angeles, CA, United States

From June 24 to July 3, 2024 the University of California, Los Angeles (UCLA) Division of Social Sciences and the California Center for Population Research will sponsor the Summer Institute […]