I mostly write about statistics, computing, and academia. Subscribe to my atom feed!


Recent

Update/Benchmark of edarf, Nov 2016
A performance increase for partial dependence in edarf
Peer-Review is a Weak Signal of Paper Quality, Aug 2016
peer-review is a high variance estimator of paper quality
Academic Technical Debt, Aug 2016
academic code has technical debt
Generalization Error for Model Validation, Jul 2016
generalization error should be used for model evaluation more frequently
Salsa Vaya Review, Jul 2016
me talking about my bike

Popular

Setting up OSX for Data Analysis, Jun 2013
set up Emacs, R, Python, and Julia with Homebrew
An Analysis of Polity IV and its Components, Oct 2013
Gleditsch and Ward (1997) aren't quite correct and Polity is mis/over-used
GNU Make for Reproducible Data Analysis, Nov 2013
how to use GNU Make to make your research more reproducible
Analyzing Data from the Titanic, Jan 2013
predicting who survived the Titanic using a GLM and a GAM
Identification, Public Policy, and Marijuana Legalization, Dec 2013
causal inference (necessary for policy recommendations imo) with observational data is often hard or impossible

Categories


Travel

Kenya, Jan 2016
safari and hiking/climbing in Kenya
Washington, May 2015
climbing in the Cascades
Ecuador, Jan 2015
hiking and mountaineering in Ecuador
Boulder, Colorado, Jun 2014
hiking and climbing in and around Boulder

Statistics

Generalization Error for Model Validation, Jul 2016
generalization error should be used for model evaluation more frequently
Monte Carlo for Teaching Applied Statistics, Dec 2014
real data is overused in applied statistics classes in political science
Data Contingent Analysis of Historical Data, Oct 2014
analysis of historical data is contingent on the data (duh!)
On the Epistemology of Statistics, Sep 2014
articles that changed my thinking on what/how we can learn from data
Real-World Examples in Methods Papers, Jul 2014
real-world examples in methods papers should be examples for applied researchers to follow
Statistical Education in Political Science, May 2014
research design, probability, and mathematical statistics are necessary to a good statistical education
Data Mining as Exploratory Data Analysis, Feb 2014
statistical learning should often be used in observational studies w/o (as-if) randomization
Identification, Public Policy, and Marijuana Legalization, Dec 2013
causal inference (necessary for policy recommendations imo) with observational data is often hard or impossible
Analyzing Data from the Titanic, Jan 2013
predicting who survived the Titanic using a GLM and a GAM

Cycling

Salsa Vaya Review, Jul 2016
me talking about my bike

Education

Monte Carlo for Teaching Applied Statistics, Dec 2014
real data is overused in applied statistics classes in political science
Statistical Education in Political Science, May 2014
research design, probability, and mathematical statistics are necessary to a good statistical education
Math Education and Political Science, Nov 2012
math camp is no substitute for years of undergraduate math

Politics

Defense Cuts and the "Revolution in Military Affairs", Jul 2012
why doesn't the Powell Doctrine work? (it's politicians)

Computing

Update/Benchmark of edarf, Nov 2016
A performance increase for partial dependence in edarf
Academic Technical Debt, Aug 2016
academic code has technical debt
Git and GitHub Tutorial, Jan 2015
the basics of how to set up and use git (from the cli) and github for scientists
GNU Make for Reproducible Data Analysis, Nov 2013
how to use GNU Make to make your research more reproducible
Setting up OSX for Data Analysis, Jun 2013
set up Emacs, R, Python, and Julia with Homebrew
Git/GitHub, Transparency, and Legitimacy in Quantitative Research, Mar 2013
revision control (git) can help make research more transparent and reproducible
Analyzing Data from the Titanic, Jan 2013
predicting who survived the Titanic using a GLM and a GAM

Research

Update/Benchmark of edarf, Nov 2016
A performance increase for partial dependence in edarf
Peer-Review is a Weak Signal of Paper Quality, Aug 2016
peer-review is a high variance estimator of paper quality
Academic Technical Debt, Aug 2016
academic code has technical debt
Generalization Error for Model Validation, Jul 2016
generalization error should be used for model evaluation more frequently
Redux of An Empirical Evaluation of Explanations for State Repression, Oct 2014
analysis of additional (relative to eeesr) predictors of state repression
An Empirical Evaluation of Explanations for State Repression, Apr 2014
null-hypothesis significance testing is over/mis-used, predictive validity is important, and democracy/civil conflict partially measure state repression
Data Mining as Exploratory Data Analysis, Feb 2014
statistical learning should often be used in observational studies w/o (as-if) randomization
Identification, Public Policy, and Marijuana Legalization, Dec 2013
causal inference (necessary for policy recommendations imo) with observational data is often hard or impossible
An Analysis of Polity IV and its Components, Oct 2013
Gleditsch and Ward (1997) aren't quite correct and Polity is mis/over-used

Academia

Peer-Review is a Weak Signal of Paper Quality, Aug 2016
peer-review is a high variance estimator of paper quality
On Academic Publishers, Dec 2013
publishing could be nearly free and for-profit publishers don't add value to academic research
Stress Physiology and Graduate School, Jan 2012
a summary of "Why Zebra's Don't Get Ulcers" (a book on stress) adapted for graduate students