Exploring Open Data Sets

It’s always fascinating to take a look at the data visualizations and in-depth reports widely available on the web. As an aspiring (or active) data scientist, however, one of the best things you can do to learn about a particular field is to get your own hands dirty. Here, we’ve compiled a list of sites that host open and free data sets for anyone to use.


A Resurgence of Neural Networks in Machine Learning

Guest author Dan Gillick of Google explains the recent resurgence of neural networks in the field of machine learning. In just under three years, these models have helped to substantially reduce Word Error Rate (WER) for automatic speech recognition systems.
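
For readers new to the metric, WER is commonly defined as the number of word substitutions, deletions, and insertions needed to turn a system's transcript into the reference transcript, divided by the number of words in the reference. The sketch below is our own illustration, not code from the guest post. The function name and the sample sentences are made up for the example, and it assumes simple whitespace tokenization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative helper (hypothetical name); assumes whitespace tokenization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Made-up example: one substitution ("sat" -> "sit") and one deletion ("the").
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

A lower value is better, and because insertions count as errors the result can exceed 1.0 when the hypothesis is much longer than the reference.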

Data Size Matters [Infographic]

KB, MB, GB…these are all measurements we understand. But how much does a terabyte hold? How many DVDs would it take to reach a zettabyte? And what on earth is a yottabyte? To better understand the scope of data size, check out our infographic for real-life examples, a timeline of hard drive innovation, and a glimpse at where the data storage industry is heading.
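
If you'd like to try the DVD arithmetic yourself, here is a rough sketch. It rests on two assumptions of ours that may differ from the figures used in the infographic: units are decimal (1 KB = 1,000 bytes), and a DVD is a single-layer disc holding 4.7 GB. The names `UNIT_BYTES` and `dvds_needed` are just for this example.

```python
# Assumption: decimal (SI) units, where each step up is a factor of 1,000.
UNIT_BYTES = {
    "KB": 10**3,
    "MB": 10**6,
    "GB": 10**9,
    "TB": 10**12,
    "PB": 10**15,
    "EB": 10**18,
    "ZB": 10**21,
    "YB": 10**24,
}

# Assumption: a single-layer DVD with a nominal capacity of 4.7 GB.
DVD_BYTES = 4_700_000_000


def dvds_needed(unit: str) -> int:
    """Whole DVDs required to hold one of the given unit (rounded up)."""
    total_bytes = UNIT_BYTES[unit]
    # Integer ceiling division avoids floating-point rounding on huge values.
    return -(-total_bytes // DVD_BYTES)


if __name__ == "__main__":
    for unit in ("TB", "ZB", "YB"):
        print(f"1 {unit} needs about {dvds_needed(unit):,} DVDs")
```

Under these assumptions a terabyte fits on 213 discs, while a zettabyte needs roughly 213 billion.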


Exploring Political Bias with the Bitly Media Map

Here, Kris Harbold explores the Bitly Real-Time Media Map, an interactive data visualization that illustrates media consumption across the United States. By using the tools of data science to analyze the source and format of media links being shared, this map attempts to pinpoint correlations between the political affiliation of readers and the news they consume.


Book Review: Freakonomics by Steven D. Levitt and Stephen J. Dubner

If you’re at all interested in economics, data science, or even just popular books, it’s a good bet that you’ve heard of the 2005 New York Times bestseller Freakonomics: A Rogue Economist Explores the Hidden Side of Everything. Eight years might seem like a long shelf life for a book about statistics, particularly one dealing with current events, but because it was one of the first books to push economics and data science into popular consciousness, we think it’s definitely worth a second look.

David McCandless: The Beauty of Data Visualization

David McCandless, author of the popular Information is Beautiful blog, uses his 2010 TED Talk to describe how combining the language of the eye with the language of the mind enhances our understanding of complex data. He believes that the Web has turned us all into visualizers for whom processing graphic information is effortless.

Book Review: “Big Data: A Revolution That Will Transform How We Live, Work, and Think”

We hear the term “big data” all the time, but what does it really mean? Viktor Mayer-Schönberger and Kenneth Cukier’s 2013 bestseller, “Big Data: A Revolution That Will Transform How We Live, Work, and Think,” attempts to answer this question with a solid overview of the promises, advancements, issues, and implications of the big data revolution.