2. Data → Story
●
Find data
●
Wrangle/Cleanup the data
●
Merge data with others (if any)
●
Filter and sort the data
●
Analyze data
●
Visualize data (story)
3. CA Election 2070
●
What is data?
–
The candidates (age, gender, party)
–
The constituencies (vdc, ward, party)
–
The results (with votes, winner)
–
…..
4. Where to find it?
●
http://election.gov.np
●
The following FPTP results data in XML
5. Not lucky every time finding data
●
Scrapping (requires programming knowledge)
–
Using google scraper
●
PDF conversion
●
PDF manual transcribe
13. XML to CSV?
●
Online services are available
●
Might need help from technologist
●
In linux (there might be several ways, e.g)
xml2 < FPTP-CA70.xml | 2csv FPTP
DISTNAME CONST CANDIDATE AGE SEX
PARTYNAME SYMBOLNAME TOTALVOTE
STATUS > FPTP-CA70.csv
14. OpenNepal
●
Repository of datasets
–
●
●
●
data in csv, xml or json format
Request for dataset
Request for help in conversion from one format
to another, scrapping data, ...
OpenNepal Community (GoogleGroup) is very
vibrant
15. CA Results CSV data
●
Converted from XML
http://dev.yipl.com.np/data-training/data/FPTP-CA70.csv
20. Some exercise
●
●
●
●
●
Are there people who didn't receive a single vote?
What is the highest and lowest number of votes of
candidate who didn't win?
Find the percentage of female and male
candidates, percentage of winning female
candidates?
Try the above exercise in one district of your
interest?
Think of other things you can do with this basic
skills
21. More questions
●
●
●
How many parties have candidates in all 240
constituencies?
How many male and female candidates are
there in Nepali Congress? Ratio of male-female
in far-west districts?
Which party has the highest number of female
candidates?
28. Geocoding
●
Geo-coding
–
–
●
the conversion of a human-readable location name
into a numeric (or other machine-processable)
location such as a longitude and latitude
Kathmandu => [geocoding] => {latitude: 27.70169,
longitude: 85.3206}
Online tools available for geocoding
–
Google fusion table
–
cartodb