John Geer

I build things with statistics, code, and prose.

Just a dog with a mustache

Photo by ginnerobot

How to Tell When You Have Enough Data to Act

Statistical principles for decision making

6 Minute Read

How to Analyze Your Predictable Data: Anomaly Detection

Noticing unexpected observations among an ocean of expected ones.
Featured on and @analyticbridge.

5 Minute Read

Customer Retention with Statistical Learning

Predicting if customers plan to leave and finding the best intervention

5 Minute Read

Using Data Science to Estimate Annual Cycles in Housing

Even in the housing market, some things are predictable.
This post was featured on

4 Minute Read

> More articles

Probability Distribution Family Tree

How probability distributions are related

US Federal Outlays & Receipts

It’s how they say spending and income.

How the US Budget Changes

Received the “Deep Thought Badge” and Honorable Mention in Google & Eyebeam’s Data Visualization Challenge.

TED Views Predictor

A predictive model that uses the title, topic, and publish date of TED talks to estimate the number of times they will be viewed.

Genetic Prediction Algorithm

A program that evolves a prediction model for a given data set.

Bayesian Link Aggregator

A website that ranks links based on votes. It aims to learn the most from each vote by selecting which articles visitors vote on and using a Bayesian estimator.

Interesting hack for following a topic HN-style […]

Hilary Mason (@hmason), Data Scientist in Residence at Accel

I have a masters in Applied Statistics and have been excitedly coding for over a decade. My specialties include predictive analytics, machine learning, and data mining. I presently work as a remote data scientist.

Feel free to email me or connect on LinkedIn