Name: Adeline Lo, Princeton University
Start: 2019-02-26T14:00:00-08:00
End: 2019-02-26T15:00:00-08:00
Location: 1434A Physics and Astronomy Building

Adeline Lo, Princeton University

February 26, 2019 @ 2:00 pm - 3:00 pm PST

Title: Covariate screening in high dimensional data: applications to forecasting and text data

Abstract: High-dimensional (HD) data, where the number of covariates and/or meaningful covariate interactions might exceed the number of observations, is increasingly used in prediction in the social sciences. An important question for the researcher is how to select the most predictive covariates among all the available covariates. Common covariate selection approaches use ad hoc rules to remove noise covariates, or select covariates through the criterion of statistical significance or by using machine learning techniques. These approaches can lack objectivity, select some but not all predictive covariates, and fail reasonable standards of consistency that are expected to hold in most high-dimensional social science data. The literature offers few statistics that can be used to directly evaluate covariate predictivity. We address these issues by proposing a variable screening step prior to traditional statistical modeling, in which we screen covariates for their predictivity. We propose the influence (I) statistic to evaluate covariates in the screening stage, showing that the statistic is directly related to predictivity and can help screen out noisy covariates and discover meaningful covariate interactions. We illustrate how our screening approach can remove noisy phrases from U.S. Congressional speeches and rank important ones to measure partisanship. We also show improvements to out-of-sample forecasting in a state failure application. Our approach is available via an open-source software package.

Hosted by the Center for Social Statistics

Details

Date: February 26, 2019
Time: 2:00 pm - 3:00 pm PST
Event Categories: CSS Events, Divisional Publish

Venue

1434A Physics and Astronomy Building

Organizer

Center for Social Statistics
