Displaying posts categorized under

Data Science

Docker and R – Talk for the Greater Cleveland R Group

I was able to give a talk recently to the Greater Cleveland R Group about using Docker and R together. Docker is a virtualization platform that uses OS-level virtualization to provide containers. It’s pretty neat, so I wanted to share the talk here. Big thanks to Tim Hoolihan for both organizing the R User Group […]

My R Coding Convention

It seems like many R programmers (probably, many programmers in general) end up writing a post of this type, so I decided to jump on the bandwagon. I recently switched jobs, so I am at a nice point to make a “fresh start” with my coding conventions: I am not facing the need to refactor […]

Intro to Shiny – Talk for the Greater Cleveland R Group

I was able to give a talk recently to the Greater Cleveland R Group about Shiny, the R package that lets you build web apps for data visualization and analysis using R. It’s a pretty neat package, and I wanted to share my slides, code, and presentation here. My thanks to Gaurav Narain Saxena for recording […]

Write your own! On having better habits as an R programmer

I contribute responses to Stack Overflow pretty frequently. I like answering well-written questions and enjoy that it keeps my skills sharp. However, one area of annoyance for me on Stack Overflow is that many answers start with “You can do this using the <insert package name here> package” – even when the task at hand can […]

This One Chart Perfectly Sums Up Why Most Posts That Start This Way Are Total Lies!!!

How many times have you heard this line before? “This one graph perfectly sums up the current plight of Millenials!” “This one chart shows everything about global warming in a nutshell!” It’s one of the more common clickbait articles, but as a data science professional and Edward Tufte fan, I just can’t take it anymore. I […]

Public Health Informatics and the Future of Public Health

I wrote this essay almost six years ago, while I was in graduate school working on my MPH. I stumbled across it recently and thought to share it, especially given the direction in which my career has moved. Public health informatics represents an exciting area of future growth in public health, where many different disciplines […]

What’s YOUR PIN?

This article combines a number of my loves: data visualization, information security, and large-scale data analysis. Enjoy!

The Paradox of Information Support: How more data can mean less clarity, and how we might fix it

I think that we can all agree that proper information support is a useful thing for managers and leaders, no matter the organization. Each organization is positively awash in data, but effective organizations take that data and transform it into information – useful, cohesive nuggets that help leaders and managers to make better decisions. Still, proper information […]