How to Make High Quality Data Visualizations for Websites With R and ggplot2
If you’ve been following my blog, I like to use R and ggplot2 for data visualization. A lot. One of my older blog posts, An Introduction on How to Make Beautiful Charts With R and ggplot2, is still one...
Making Magic: the GIFening
After working at BuzzFeed for a few months, I’m now an expert in the proper usage of GIFs. My favorite GIF tool is the /giphy command in Slack, which puts a random GIF according to a given phrase into...
Benchmarking Modern GPUs for Maximum Cloud Cost Efficiency in Deep Learning
A few months ago, I performed benchmarks of deep learning frameworks in the cloud, with a followup focusing on the cost difference between using GPUs and CPUs. And just a few months later, the...
A Visual Overview of Stack Overflow's Question Tags
Stack Overflow is the most popular contemporary knowledge base for programming questions. But most interact with the site by Googling a programming question and getting a top result that links to SO....
Visualizing One Million NCAA Basketball Shots
So March Madness is happening right now. In celebration, Google uploaded massive basketball datasets from the NCAA and Sportradar to BigQuery for anyone to query and experiment. After learning that the...
How to Quickly Train a Text-Generating Neural Network for Free
One of the more interesting applications of the neural network revolution is text generation. Most popular approaches are based off of Andrej Karpathy’s char-rnn architecture/blog post, which teaches a...
Analyzing IMDb Data The Intended Way, with R and ggplot2
IMDb, the Internet Movie Database, has been a popular source for data analysis and visualizations over the years. The combination of user ratings for movies and detailed movie metadata has always been...
Problems with Predicting Post Performance on Reddit and Other Link Aggregators
Reddit, “the front page of the internet,” is a link aggregator where anyone can submit links to cool happenings. Over the years, Reddit has expanded from just being a link aggregator to allowing image...
Things About Real-World Data Science Not Discussed In MOOCs and Thought Pieces
Data science has been sweeping the tech world. With a large variety of powerful free open-sourced tools and now the computing power to utilize them to their full potential, data science is more...
Run Any Scheduled Task/Cron Super-Cheap on Google Cloud Platform
Let’s say you want to make a Twitter bot to tweet out a custom message every few hours or so, and the free-tier VMs offered by cloud services with fractional virtual CPUs and little RAM aren’t...