NYU — Spring 2014 / Week 3

Class Projects

In general, we are flexible about the format of your projects, so we don’t want to have too many official requirements. Our only real goal is that you make something publishable. Below are some loose characteristics of a successful project.

Kinds of projects you could do

Blog style: 1 2 3

News Article: 1

T-shape: 1

Big Interactive: Fixed-width Liquid

Big Viz: 1 2 3 4 5 6 7 8

Reactive Documents: 1 2 3

Goofy in kind of a bad way: 1

Slideshow: 1 2 3

Examples of published NYT projects that could be (supercharged) class projects:

-Money on the Bench Forget the silly timer, the bar charts and the icons. A table that updates every day and answers a (semi-) journalistically relevant question would be totally doable.

-Strikeouts on the Rise Your project could be a mix of analytical copy and charts. R charts would do just fine here if you didn’t want to make them in a browser.

-This medium-sized chart is basically the result of merging two data sets, some basic data cleaning and asking interesting questions. It made this A1 NYT story much more comprehensible.

-Lautenberg’s Death Continues Sharp Decrease in Military Veterans in Senate One of the most analytical popular blogs on the Internet has no fancy charts; instead, fact-based ledes and the data to back them up, sometimes accompanied by an Excel chart. Your project could be a more traditional mix of charts and prose, like this blog post.

-Where 50,000 Guns Recovered in Chicago Came From This could easily be a Google map or an R map instead of a JavaScript one.

-College Admissions Gap Take away the interactive chart and the 4-step walkthrough and this is just 4 answers to 4 data questions.

-Seeking Flexibility During Storms This doesn’t have to be just a map; your project could focus on interviews based on analysis drawn from a map. (For, say, earthquakes instead of hurricanes.)

-The fastest-gentrifying neighborhoods in the United States Here’s another example where very simple data work might lead you somewhere interesting.

-Capitol South, Station Domination, and Lobbying Effectiveness Some of The Godfather’s favorite data work is not hugely complicated.
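
Several of the examples above come down to merging two data sets, doing some basic cleaning and asking a question of the result. Here is a minimal sketch of that workflow in Python. The county names, numbers and column names are all invented placeholders, not real data, and your own files will need their own cleaning rules.

```python
import csv
import io

# Invented placeholder data; the column names are assumptions too.
POPULATION_CSV = """county,population
 Alpha ,1000
Beta,2500
"""

INCIDENTS_CSV = """county,incidents
alpha,20
BETA,25
Gamma,5
"""


def clean_key(name):
    """Normalize a county name so the two files can be matched."""
    return name.strip().lower()


def load(text, value_column):
    """Read CSV text into {cleaned county name: integer value}."""
    reader = csv.DictReader(io.StringIO(text))
    return {clean_key(row["county"]): int(row[value_column]) for row in reader}


def merge_rates(population, incidents):
    """Join on county and compute incidents per 1,000 residents."""
    merged = {}
    for county in sorted(population.keys() & incidents.keys()):
        merged[county] = 1000 * incidents[county] / population[county]
    return merged


population = load(POPULATION_CSV, "population")
incidents = load(INCIDENTS_CSV, "incidents")
rates = merge_rates(population, incidents)

# Rows that fail to match are often where the cleaning problems hide.
unmatched = sorted(incidents.keys() - population.keys())
```

Looking at `unmatched` before trusting `rates` is a cheap way to catch misspelled or missing keys.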

Other news organizations' work that might give you ideas

-Pro Publica

-NPR apps

-LAT data desk

-WNYC data news

-NICAR stories

Project schedule

By class time on Feb 26, we’d like you to post a link on your GitHub page to an HTML page with three pitches on it.

Each pitch should include:

• A summary of your proposal, which includes why your idea is newsworthy

• Links to potential data sources

• Links to potential human sources

• A list of things you would have to learn to complete the project.

• An example of a publication you could pitch your completed project to.

Here’s an example of a couple pitches of projects we’ve worked on in class already. It doesn’t need to be in exactly this format, but this is the general idea we’re looking for. (Again, we’re looking for three.)

Project proposal example

Last year, on average, Major League Baseball teams struck out more often than at any time in the league’s 112-year history. This subject has been written about before, but no news organization has visualized the distribution or, to my satisfaction, offered a significant answer to why it has been rising so steadily, even in the post-steroids era. This could be published soon, with a focus on how teams currently in the playoffs have been affected, or it could be published at the beginning of next year, as a preview to the next season.

My project would be a chart that shows the average strikeouts per year of every team in the Major Leagues for every year their franchise has been in existence. Ideally, this chart would be interactive, allowing readers to see the chart in the context of their favorite team. All the historical data is available on baseball-reference.com.

I’d like to talk to a couple hitting coaches to see if they have recognized this difference and if they are telling their batters to hit differently than they did in the past. Chili Davis, the batting coach of the Oakland Athletics, would be ideal, but I’ve identified some minor league coaches as well.

I would probably need to learn how to scrape the data from baseball-reference, since I don’t want to download 5,000 web pages by hand, and I would also need to learn how to build more effective user interaction into a D3 chart than I can right now.

There are a number of stat-based sports sites that I could pitch this to, including the Baseball Prospectus blog, FanGraphs or more local sites, like the sports desks of the San Francisco Chronicle or the Oakland Tribune.
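
To give a sense of the scraping work the pitch above mentions, here is a minimal sketch in Python that pulls rows out of an HTML table and computes strikeouts per game for each team and year. The sample markup, the column names (`Team`, `Year`, `G`, `SO`) and every number in it are invented for illustration; the real pages on baseball-reference.com will be laid out differently, so treat this as a starting point and not a working scraper for that site.

```python
from html.parser import HTMLParser

# Hypothetical markup with made-up numbers; a real page will differ.
SAMPLE_HTML = """
<table>
  <tr><th>Team</th><th>Year</th><th>G</th><th>SO</th></tr>
  <tr><td>Team A</td><td>2012</td><td>150</td><td>1050</td></tr>
  <tr><td>Team A</td><td>2013</td><td>150</td><td>1200</td></tr>
  <tr><td>Team B</td><td>2013</td><td>150</td><td>900</td></tr>
</table>
"""


class TableParser(HTMLParser):
    """Collect the text of every <th>/<td> cell, one list per <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text chunks of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_table(html):
    """Return one dict per body row, keyed by the header row's labels."""
    parser = TableParser()
    parser.feed(html)
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


def strikeouts_per_game(records):
    """Return {(team, year): strikeouts divided by games played}."""
    return {
        (r["Team"], int(r["Year"])): int(r["SO"]) / int(r["G"])
        for r in records
    }


records = parse_table(SAMPLE_HTML)
rates = strikeouts_per_game(records)
```

In a real project you would fetch each page once, save it to disk and run a parser like this over the saved files, so you are not hitting the site repeatedly while you debug.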

Email kemanda with any questions.

Timeline for project success

We’ll be making sure your projects get done with this general outline.

Feb 26

By class time on Feb 26, post a link to three pitches (like the one above) to your github page. We’ll discuss them with you over the course of the week and help you settle on one.

March 5

By class time on March 5, post a link to your final project pitch that takes into account our feedback.

March 12

Your final pitch has been decided. Your project should be approved.

March 26

You have collected and cleaned all your data. You’ve started to analyze your data in Excel and have a vague sense of what you want to make. You should be starting to contact your, er, contacts and setting up any interviews if needed. You should have identified any secondary datasets you’ll need and should be collecting them this week (like geo shapefiles, or census demographic info, etc…).

April 2

All your data should be collected and organized in your repo. You should have made first contact with any experts you’ll need to help explain your findings.

April 9

You should have a start on your final HTML document, even if it’s just some text and Excel sketch screenshots. You should have a very good sense of what you’re building and what technologies you will need.

April 16

Project help session in class. We’ll be focusing on helping you with your data analysis. By this date you should have a good idea of exactly what it is you’re building.

April 23

Project help session in class. We’ll be focusing on editing your writing and ideas. Your HTML document should be taking shape. If you’re using Excel, you should have at least screenshots of sketches for all your final charts (very rough is totally fine). If you’re using another technology, you should also have made substantial coding progress: datasets are loaded and in the correct format, and topojson files are generated if needed. If you’re trying to do more than one chart on a page, now is a good time to refactor so things work well together.

April 30

This will be a last-minute sanity check. Copy should be written, everything should be mostly in place, and any interactivity should be mostly working. This last week should be reserved for design tweaks, text rewrites and polishing.

May 7

Presentations + projects due. Your project must be published with a final URL by the beginning of class. You will give a brief overview of your project and are required to be gracious during your classmates’ thunderous applause.