Extracting data from docs, mapping
30 Sep 2015
Links for class
- What to do with the docs
- Extracting data from a PDF
- Tutorial on Getting started with the Github desktop app
Readings for Oct. 7
Read: Distrust Your Data
Read: When Maps Shouldn’t Be Maps
Read: Connecting with the Dots
Homework 1
Answer these questions about the readings.
Questions about the Class 5 reading assignments
Homework 2
- Extract data from this PDF.
- Don't just convert it to a spreadsheet. Clean it up nice.
- Save it as a CSV and upload it to the private Github repo.
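For the cleanup step, here is a minimal Python sketch of what "clean it up" might involve once the PDF has been converted to rows. It assumes the usual leftovers from a conversion: stray whitespace, blank rows, and a header row repeated on every page. The function names and the sample rows are hypothetical and not taken from the assignment PDF; your file may need different fixes.

```python
import csv
import io


def clean_rows(rows):
    """Normalize whitespace, drop blank rows, and drop repeated header rows."""
    cleaned = []
    header = None
    for row in rows:
        # Collapse runs of whitespace and trim each cell.
        cells = [" ".join(cell.split()) for cell in row]
        if not any(cells):
            continue  # skip rows that are entirely empty
        if header is None:
            header = cells  # treat the first non-empty row as the header
            cleaned.append(cells)
        elif cells != header:
            cleaned.append(cells)  # skip headers repeated on later pages
    return cleaned


def to_csv_text(rows):
    """Return the rows as CSV text."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Hypothetical rows, as they might look right after a PDF conversion.
raw = [
    ["Town ", " Count"],
    ["", ""],
    ["  Hartford", "12 "],
    ["Town", "Count"],
    ["New   Haven", " 7"],
]

cleaned = clean_rows(raw)
csv_text = to_csv_text(cleaned)
print(csv_text)
```

To save the result as a file instead of printing it, open the output with `open("cleaned.csv", "w", newline="")` and pass that file to `csv.writer`. The file name here is only a placeholder.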
Homework 3
Critique a piece of data journalism. Find a story that purports to use data, read it, and answer the following questions:
- What data was used in the story?
- Where'd the data come from?
- Why was the data collected in the first place?
- What conclusions does the story come to based on the data?
- What are the limits to the data?
- Is the original data somewhere online? If so, post the link.
Write the answers to these questions in Markdown format and post them to your Github repo.
Here are some stories if you don't want to find any on your own:
- Do lobster prices have anything to do with the McDonald’s lobster roll? | TrendCT
- The growth of Connecticut sectors — and the shrinking of a few | TrendCT
- The Republican Candidates Donald Trump Has Hurt the Most - The New York Times
- Coal, Gas, Nuclear, Hydro? How Your State Generates Power : NPR
- Which Connecticut towns prefer Dunkin’ Donuts or Starbucks? | TrendCT
- The Tiger Mom Tax: Asians Are Nearly Twice as Likely to Get a Higher Price from Princeton Review - ProPublica
- Perilous highway | longform.theday.com