The document discusses crafting data from various New York State education sources. It mentions extracting data from the NYSED Directory, School Report Card, and other sources using tools like Excel, Microsoft Access, MySQL, Postgresql, and data formats like comma-separated values. The document emphasizes crafting data in a repeatable, traceable, and simple way to maintain data integrity. It also covers filtering, missing data, and ranking.
In terms of storytelling, DDJ is usually about the middle parts of the story.
My interest in HHROC
Our last team project
3,045 schools. Lots of data, good data. No aggregates, just counts.
Clean data, BEDS code, enrollment #’s, relevant to me
What other data can I find that will add value, interest, weight, impact?
2011 data
What if I find another dataset? What is the journalistic process for DDJ?
F-33 is from 2010, and mostly didn’t match up because it’s by district.
average of per student is bad statistics and unnecessarily watery
I realized that I was avoiding proving that the different kinds of data was related, when an average tax paper is doesn’t care about the statistics, they’d hold with the assumption that dollars should relate to everything.
Almost meaningless
Low is good
This would be better if I had a few degrees in math and statistics. Then again, anything I can create that also seems meaningful might also be worth reporting.