REFLECTIONS ON THE START OF MY DATA FELLOWSHIP WITH UPTAKE

2018-03-21

I can’t believe it’s been more than a week since I returned from my trip to Chicago. This was an incredible opportunity and before I start this post a few thank yous are in order: I really can’t thank Uptake enough for being the kind of company that does more than just ‘have an mission statement’–having the motivation to start a philanthropic arm…

Full Post

ADVENT OF CODE IN R: DAY 1

2017-12-01

Over on the rOpenSci Slack, Sam asked if anyone was doing the (Advent of Code)[http://adventofcode.com/] challenges in R. A few others said they were interested and I decided to go for it as well! My solutions…

Full Post

RECREATING THE VACCINE HEATMAP IN PLOTLY WITH R

2016-10-14

In order to get the data for this visualization you will need to create a free account over at Project Tyco. You can download all the datasets if you like but I’m just going to use the Polio data set. These are the packages that I’m using:…

Full Post


BUILDING THIS SITE WITH RSTUDIO AND RMARKDOWN

2016-09-01

There may be a special place in hell for blog posts like this that document how someone put together their static site website and hosted it on GitHub. Especially when there is really great documentation availible that has been put together by really smart people. You can even look at Rmarkdown’s website source code on GitHub to see…

Full Post


THE MOVE TO R

2016-08-29

This is not a language wars type post. I do not think there is some Mordor forged language to rule them all. I debated whether or not to even write a post like this. Nevertheless, I had a chance to meet and talk to Jake Powray from DataKind at the DoGoodData confrence and I…

Full Post


P-HACKING AND ONTOLOGY

2015-08-25

In a recent FiveTirtyEight post by Christie Aschwanden about researcher bias and P-Hacking, there is a lovely interactive example of what variables a researcher would need include/exclude in the analysis in order to obtain a result that is statistically significant, i.e., p≤.05; thus worthy of publishing. The article brought many thoughts to mind…

Full Post


BAR CHART ANNOTATIONS WITH PANDAS AND MATPLOTLIB

2015-06-15

When I first started using Pandas, I loved how much easier it was to stick a plot method on a DataFrame or Series to get a better sense of what was going on. However, I was not very impressed with what the plots looked like. Any time I wanted to do something slightly different from the “Plotting” documentation on the pydata site, I found myself arm…

Full Post


PANDAS SNIPPETS (EVOLVING)

2015-04-03

The goal of this post is to keep me from googling pandas questions that I’ve forgotten. I don’t know how many times I’ve looked at the results and seen five or more StackOverflow links that have clearly already been clicked on; I feel like Sisyphus when this happens! So, here is what I’m currently committing…

Full Post


FIRST KAGGLE SUBMISSION—RANDOM FOREST CLASSIFIER

2015-01-32

I have seen kaggle mentioned on twitter a lot; mostly by the data scientists and researchers I look up to, but there’s never been much confidence that the site was for me in any way—mostly because I was a long way from my dream data science job with yet so much to learn. Notwithstanding, I cannot help but try and hack my way to my destination! I think it’s a part of my learning process…

Full Post


THE PYTHON JOURNEY—ONE SEMESTER WITH PYTHON 3.4

2014-12-05

This was quite a journey for me. I started the same way everyone else has; with my very first “Hello World” program written in Python 3…This was the first assignment for my Programming in Python course. I was not content to have it only print “Hello World”—no, I need to personalize it in some small way…

Full Post


FOUCAULT’S CHALLENGE TO MODERNIST CLASSIFICATION

2013-09-04

In Foucault’s Les Mots et les choses (The Order of Things), he notes a passage in Borges that, for him, demonstrates the limitations of taxonomic assertions in the face of exotic systems of thought—via Borges, he quotes a ‘certain Chinese encyclopedia’ in which it is written…

Full Post