Caitlin Dewey

Cities from spreadsheets

My first Kiplinger project of the new year: “The 12 Best Cities for High-Paying Jobs,” with guest appearances by Trenton, Cedar Rapids and a few other places you might not expect.

It’s probably not evident from the final product, but a huge amount of data goes into these city slideshows — many-thousand-row spreadsheets from the Census Bureau, BLS, etc. I’ve done about a dozen and gotten “really good at spreadsheets,” in the bemused words of my assigning editor. The process intrigues me more and more as I refine it. You start with a mess of statistics and, many formulas and filters later, pull out a concrete list. Voila!
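For the curious, here is roughly what that filter-then-rank idea looks like outside a spreadsheet. This is only a minimal sketch in Python, not the actual workflow behind the slideshows, which lives in spreadsheet formulas. The city names, column names, numbers and the `top_cities` helper are all invented for illustration.

```python
import csv
import io

# Hypothetical sample data: these rows, column names, and numbers are invented
# for illustration and are not drawn from Census Bureau or BLS releases.
SAMPLE_CSV = """city,population,median_income
Alphaville,250000,61000
Betatown,40000,75000
Gammaburg,900000,58000
Delta City,120000,67000
"""


def top_cities(csv_text, min_population, limit):
    """Filter out small cities, then rank the rest by median income."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # The "filter" step: keep only cities at or above a population threshold.
    big_enough = [r for r in rows if int(r["population"]) >= min_population]
    # The "formula" step: sort by the metric of interest, highest first.
    ranked = sorted(
        big_enough, key=lambda r: int(r["median_income"]), reverse=True
    )
    # The "concrete list": just the names of the top few cities.
    return [r["city"] for r in ranked[:limit]]


print(top_cities(SAMPLE_CSV, min_population=100000, limit=2))
```

A real ranking would presumably weigh several columns at once, but the shape is the same: load the rows, drop what doesn't qualify, sort by what matters, and keep the top of the list.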

Next step: Learning to scrape from websites and PDFs, so I can expand past released data sets. I’ll refer to ProPublica’s exhaustive guide for that one.

One Comment on “Cities from spreadsheets”

  1. Jeff
    January 5, 2012

    MS Excel 2010 (Office 14) deals better with data imports from text (which is what you get when you strip data from a PDF) than prior versions!


This entry was posted on January 4, 2012 in Digital journalism, My work.