R Packages:

redist: Computational Algorithms for Redistricting Simulation (with Christopher Kenny, Cory McCartan, and Kosuke Imai)
redist is a publicly available R package that enables researchers to statistically simulate congressional redistricting plans using Markov chain Monte Carlo and Sequential Monte Carlo techniques. This includes the implementation of substantive constraints in the redistricting process, such as geographic compactness and population parity requirements, as well as efficient simulation methods such as simulated tempering algorithms. Analysis tools, including inverse probability reweighting and plotting functionality, are also provided. The package implements methods found in Fifield, Higgins, Imai, and Tarr (JCGS, 2020), "Automated Redistricting Simulation Using Markov Chain Monte Carlo"; Fifield, Imai, Kawahara, and Kenny (SPP, 2020), "The Essential Role of Empirical Validation in Legislative Redistricting Simulation"; and McCartan and Imai, "Sequential Monte Carlo for Sampling Balanced and Compact Redistricting Plans."

Type install.packages("redist") to install in R. Please also see the redist GitHub page for the most stable development release.
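
As a rough illustration of the population parity requirement mentioned above, the sketch below computes the largest relative deviation from equal-population districts for a single plan. It uses base R only and is not redist's own API; the data and the function name parity_deviation are hypothetical.

```r
# Hypothetical toy data: 6 precincts assigned to 3 districts.
precinct_pop <- c(100, 120, 90, 110, 95, 105)
district_id  <- c(1, 1, 2, 2, 3, 3)

# Largest absolute deviation of any district's population from the
# equal-population target, expressed as a fraction of that target.
parity_deviation <- function(pop, district) {
  district_pop <- tapply(pop, district, sum)
  target <- sum(pop) / length(district_pop)
  max(abs(district_pop - target) / target)
}

dev <- parity_deviation(precinct_pop, district_id)
print(dev)
```

A simulation algorithm would typically reject or down-weight plans whose deviation exceeds a chosen tolerance.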


fastLink: Fast Probabilistic Record Linkage (with Ted Enamorado and Kosuke Imai)
fastLink is a publicly available R package that enables researchers to merge two data sets using a fast implementation of the Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two data sets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods developed in Enamorado, Fifield, and Imai (APSR, 2019), "Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records."

Type install.packages("fastLink") to install in R. Please also see the fastLink GitHub page for the most stable development release.
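
To give a sense of the underlying model, here is a minimal base R sketch of the Expectation-Maximization algorithm for a two-class Fellegi-Sunter mixture with binary agreement indicators and conditional independence across fields. It is a simplified illustration on simulated data, not fastLink's implementation: it ignores missing data and auxiliary information, and the names fs_em, gamma, m, u, and lambda are chosen here for illustration.

```r
set.seed(1)

# Simulated agreement indicators (1 = the field agrees) for 3 hypothetical
# fields across 2000 record pairs, about 10% of which are true matches.
n <- 2000
is_match <- rbinom(n, 1, 0.1)
m_true <- c(0.95, 0.90, 0.85)  # P(agree | match)
u_true <- c(0.10, 0.05, 0.20)  # P(agree | non-match)
gamma <- sapply(1:3, function(k) {
  rbinom(n, 1, ifelse(is_match == 1, m_true[k], u_true[k]))
})

# Keep probabilities away from 0 and 1 so the logs stay finite.
clamp <- function(p) pmin(pmax(p, 1e-6), 1 - 1e-6)

fs_em <- function(gamma, n_iter = 200) {
  K <- ncol(gamma)
  lambda <- 0.2        # starting value for P(match)
  m <- rep(0.8, K)     # starting values for P(agree | match)
  u <- rep(0.2, K)     # starting values for P(agree | non-match)
  for (it in seq_len(n_iter)) {
    # E-step: posterior match probability for each record pair
    log_m <- gamma %*% log(m) + (1 - gamma) %*% log(1 - m)
    log_u <- gamma %*% log(u) + (1 - gamma) %*% log(1 - u)
    num <- lambda * exp(log_m)
    post <- as.vector(num / (num + (1 - lambda) * exp(log_u)))
    # M-step: update parameters using the posterior weights
    lambda <- mean(post)
    m <- clamp(colSums(post * gamma) / sum(post))
    u <- clamp(colSums((1 - post) * gamma) / sum(1 - post))
  }
  list(lambda = lambda, m = m, u = u, posterior = post)
}

fit <- fs_em(gamma)
print(round(c(lambda = fit$lambda), 3))
```

Pairs whose posterior match probability exceeds a chosen threshold would then be declared matches.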

fastLink received the 2021 Statistical Software award from the Society for Political Methodology. The award citation can be found here.

The core statistical model behind fastLink has also been ported over to the Apache Spark environment, as implemented in splink.


hettx: Detecting and Measuring Treatment Effect Variation (with Peng Ding, Avi Feller, and Luke Miratrix)
hettx is a publicly available R package that implements methods developed by Ding, Feller, and Miratrix (JRSS-B, 2016), "Randomization Inference for Treatment Effect Variation," and Ding, Feller, and Miratrix (JASA, 2019), "Decomposing Treatment Effect Variation," for testing whether there is unexplained variation in treatment effects across observations. The package includes wrapper functions implementing the proposed methods, as well as helper functions for analyzing and visualizing the results of the test.

Type install.packages("hettx") to install in R. Please also see the hettx GitHub page for the most stable development release.
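
The general idea of a randomization test for treatment effect variation can be sketched in base R as follows. This is a simplified plug-in version on simulated data, assuming a completely randomized experiment: it imputes outcomes under a constant effect equal to the estimated average effect and compares a shifted Kolmogorov-Smirnov statistic to its permutation distribution. It does not use hettx's functions and omits the adjustments the package makes for the unknown average effect; sks_stat and plugin_frt are hypothetical names.

```r
set.seed(2)

# Simulated experiment with hypothetical heterogeneous effects.
n <- 200
z <- sample(rep(c(0, 1), each = n / 2))   # treatment assignment
y0 <- rnorm(n)                            # control potential outcomes
tau_i <- 1 + 2 * (y0 > 0)                 # unit-level effects (vary by unit)
y <- y0 + z * tau_i                       # observed outcomes

# Shifted KS statistic: center each arm at its mean, then take the
# largest gap between the two empirical CDFs.
sks_stat <- function(y, z) {
  y1c <- y[z == 1] - mean(y[z == 1])
  y0c <- y[z == 0] - mean(y[z == 0])
  grid <- sort(c(y1c, y0c))
  max(abs(ecdf(y1c)(grid) - ecdf(y0c)(grid)))
}

plugin_frt <- function(y, z, n_perm = 500) {
  tau_hat <- mean(y[z == 1]) - mean(y[z == 0])
  y0_imp <- y - z * tau_hat     # imputed control outcomes under a constant effect
  y1_imp <- y0_imp + tau_hat    # imputed treated outcomes under a constant effect
  obs <- sks_stat(y, z)
  null <- replicate(n_perm, {
    z_star <- sample(z)         # re-randomize treatment
    sks_stat(ifelse(z_star == 1, y1_imp, y0_imp), z_star)
  })
  list(statistic = obs, p_value = mean(null >= obs))
}

res <- plugin_frt(y, z)
print(res)
```

A small p-value suggests that a single constant treatment effect does not describe the data well.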


Shiny Apps:

randomizeAuthor (with Yang-Yang Zhou and Evan Lieberman)
randomizeAuthor is an online Shiny App utility for transparently randomizing author order for academic articles while creating a common, replicable record of the randomization. To make the result replicable and hard to manipulate, the app queries the API for a random seed, and it automatically sends an email record of the input names and their order along with the random seed value and its source.
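
The replicability idea can be illustrated with a short base R sketch: given the same input names and the same seed, the shuffled order is reproduced exactly. This is not the app's code; the function randomize_order, the author names, and the seed value are hypothetical, and the seed is hard-coded here rather than retrieved from an external source.

```r
# Shuffle author names under a recorded seed and return a record of the draw.
randomize_order <- function(authors, seed) {
  set.seed(seed)
  list(seed = seed, input = authors, order = sample(authors))
}

authors <- c("Author A", "Author B", "Author C")
first  <- randomize_order(authors, seed = 20240501)
second <- randomize_order(authors, seed = 20240501)

# Re-running with the recorded seed reproduces the same order.
print(identical(first$order, second$order))
```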