SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Downloaden Sie, um offline zu lesen
Hands-on Training

Data – what and how?
A case of CA Election 70

YoungInnovations
OpenNepal
Data → Story
●

Find data

●

Wrangle/Cleanup the data

●

Merge data with others (if any)

●

Filter and sort the data

●

Analyze data

●

Visualize data (story)
CA Election 2070
●

What is data?
–

The candidates (age, gender, party)

–

The constituencies (vdc, ward, party)

–

The results (with votes, winner)

–

…..
Where to find it?
●

http://election.gov.np

●

The following FPTP results data in XML
Not lucky every time finding data
●

Scrapping (requires programming knowledge)
–

Using google scraper

●

PDF conversion

●

PDF manual transcribe
Chrome Scraper Extension
●

Search for “Chrome extension Scraper” from
Chrome browser to install
Scraper in Action
PDF to Text
●

Online tools available

●

Linux has different set of utilities

●

PDF is still a big nuisance (though something is
better than nothing)
PDF to Text
http://www.election.gov.np/election/uploads/fil
es/ecn_report/constwisecandidatecount.pdf
PDF to Text
●

Linux utility - pdftotext
CSV
●
●

●

CSV - Comma Separated Value
Opens in MS Excel, Open Office, Google
Spreadsheet
Easy to work with
CA XML Data to CSV
XML to CSV?
●

Online services are available

●

Might need help from technologist

●

In linux (there might be several ways, e.g)
xml2 < FPTP-CA70.xml | 2csv FPTP
DISTNAME CONST CANDIDATE AGE SEX
PARTYNAME SYMBOLNAME TOTALVOTE
STATUS > FPTP-CA70.csv
OpenNepal
●

Repository of datasets
–

●
●

●

data in csv, xml or json format

Request for dataset
Request for help in conversion from one format
to another, scrapping data, ...
OpenNepal Community (GoogleGroup) is very
vibrant
CA Results CSV data
●

Converted from XML

http://dev.yipl.com.np/data-training/data/FPTP-CA70.csv
Processing/Cleaning CSV – Basics
●

Add header

●

Sorting (by different fields)

●

Filter

●

Simple formulas
Add headers
●

Insert row at the top

●

Add header for each column
Sorting
●

Sorting by Age – Ascending, Descending

●

Find out youngest winning candidate age
Filtering
●

Filter the list of winning female candidates
Some exercise
●

●

●

●

●

Are there people who didn't receive a single vote?
What is the highest and lowest number of votes of
candidate who didn't win?
Find the percentage of female and male
candidates, percentage of winning female
candidates?
Try the above exercise in one district of your
interest?
Think of other things you can do with this basic
skills
More questions
●

●

●

How many parties have candidates in all 240
constituencies?
How many male and female candidates are
there in Nepali Congress? Ratio of male-female
in far-west districts?
Which party has the highest number of female
candidates?
Data Processing - Pivottable
PivotTable - more
●

Breakdown of
independent
candidates
Lets again see numbers
●

Sorted by total
number of
candidates
Visualization
●

Bar graph of male-female candidates of top few
districts
What else visualizations are
possible?
●

https://github.com/mbostock/d3/wiki/Gallery
What else visualizations are
possible?
●

https://github.com/mbostock/d3/wiki/Gallery
Geocoding
●

Geo-coding
–

–

●

the conversion of a human-readable location name
into a numeric (or other machine-processable)
location such as a longitude and latitude
Kathmandu => [geocoding] => {latitude: 27.70169,
longitude: 85.3206}

Online tools available for geocoding
–

Google fusion table

–

cartodb
Lat-long in maps.google.com
●

Put the lat long (27.70169 85.3206) in google
map search box
Services available for geocoding
http://open.mapquestapi.com/nominatim/v1/sea
rch?format=xml&q=Kathmandu,Nepal
Problems with this CSV
●

Unicode in districts name

●

Can't geocode (currently only english)
Adding english district name

http://dev.yipl.com.np/data-training/data/FPTP-CA70-eng.csv
Google Fusion Table
●

tables.googlelabs.com (need @gmail account)
Imported data
Geocoding
Using filter in the map
Use of heatmap based on votes
Thank you

Weitere ähnliche Inhalte

Ähnlich wie Data Literacy Training - case of CA Election 70

A Tale of Two Case Studies: Using LLMs in Production
A Tale of Two Case Studies: Using LLMs in ProductionA Tale of Two Case Studies: Using LLMs in Production
A Tale of Two Case Studies: Using LLMs in ProductionAggregage
 
Machine Learning - Startup weekend UCSB 2018
Machine Learning - Startup weekend UCSB 2018Machine Learning - Startup weekend UCSB 2018
Machine Learning - Startup weekend UCSB 2018Raul Eulogio
 
Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...
Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...
Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...Authoritas
 
Agile SEO: Building a Continuous Improvement Machine for SEO
Agile SEO: Building a Continuous Improvement Machine for SEOAgile SEO: Building a Continuous Improvement Machine for SEO
Agile SEO: Building a Continuous Improvement Machine for SEOTory Gray
 
Open data in Egyptian Elections
Open data in Egyptian ElectionsOpen data in Egyptian Elections
Open data in Egyptian ElectionsHaitham Nabil
 
Data science: use cases and tools
Data science: use cases and toolsData science: use cases and tools
Data science: use cases and toolsAlexey Grigorev
 
Analyze this
Analyze thisAnalyze this
Analyze thisAjay Ohri
 
Endouble Kennissessie analytics 2.0
Endouble Kennissessie analytics 2.0Endouble Kennissessie analytics 2.0
Endouble Kennissessie analytics 2.0Endouble
 
Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018
Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018
Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018Margaret Roth
 
Pivotal Tracker - Research Findings
Pivotal Tracker - Research FindingsPivotal Tracker - Research Findings
Pivotal Tracker - Research FindingsPaulina Galindo
 
Find my tea [sync ipswich] a technical journey through new product development
Find my tea [sync ipswich] a technical journey through new product developmentFind my tea [sync ipswich] a technical journey through new product development
Find my tea [sync ipswich] a technical journey through new product developmentPaulGrenyer1
 
Digital landscape
Digital landscapeDigital landscape
Digital landscapeMike Jessop
 
Formula for Creating Custom Advocacy Reports with Maps
Formula for Creating Custom Advocacy Reports with MapsFormula for Creating Custom Advocacy Reports with Maps
Formula for Creating Custom Advocacy Reports with MapsDan Ford
 
Generating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWO
Generating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWOGenerating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWO
Generating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWOVWO
 
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)Rosenfeld Media
 
The State of RunSignup and the State of the Industry
The State of RunSignup and the State of the IndustryThe State of RunSignup and the State of the Industry
The State of RunSignup and the State of the Industryrunsignup
 

Ähnlich wie Data Literacy Training - case of CA Election 70 (20)

A Tale of Two Case Studies: Using LLMs in Production
A Tale of Two Case Studies: Using LLMs in ProductionA Tale of Two Case Studies: Using LLMs in Production
A Tale of Two Case Studies: Using LLMs in Production
 
Machine Learning - Startup weekend UCSB 2018
Machine Learning - Startup weekend UCSB 2018Machine Learning - Startup weekend UCSB 2018
Machine Learning - Startup weekend UCSB 2018
 
Explore Outdoor
Explore OutdoorExplore Outdoor
Explore Outdoor
 
Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...
Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...
Creating Meaningful in-house and Agency SEO Dashboards for the Enterprise - T...
 
Agile SEO: Building a Continuous Improvement Machine for SEO
Agile SEO: Building a Continuous Improvement Machine for SEOAgile SEO: Building a Continuous Improvement Machine for SEO
Agile SEO: Building a Continuous Improvement Machine for SEO
 
Open data in Egyptian Elections
Open data in Egyptian ElectionsOpen data in Egyptian Elections
Open data in Egyptian Elections
 
Analytics and-insight
Analytics and-insightAnalytics and-insight
Analytics and-insight
 
Data science: use cases and tools
Data science: use cases and toolsData science: use cases and tools
Data science: use cases and tools
 
Analyze this
Analyze thisAnalyze this
Analyze this
 
Endouble Kennissessie analytics 2.0
Endouble Kennissessie analytics 2.0Endouble Kennissessie analytics 2.0
Endouble Kennissessie analytics 2.0
 
Research Beyond
Research BeyondResearch Beyond
Research Beyond
 
Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018
Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018
Creating a Data Driven L&D Team - an xAPI Case Study - DevLearn 2018
 
Pivotal Tracker - Research Findings
Pivotal Tracker - Research FindingsPivotal Tracker - Research Findings
Pivotal Tracker - Research Findings
 
Data mining
Data miningData mining
Data mining
 
Find my tea [sync ipswich] a technical journey through new product development
Find my tea [sync ipswich] a technical journey through new product developmentFind my tea [sync ipswich] a technical journey through new product development
Find my tea [sync ipswich] a technical journey through new product development
 
Digital landscape
Digital landscapeDigital landscape
Digital landscape
 
Formula for Creating Custom Advocacy Reports with Maps
Formula for Creating Custom Advocacy Reports with MapsFormula for Creating Custom Advocacy Reports with Maps
Formula for Creating Custom Advocacy Reports with Maps
 
Generating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWO
Generating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWOGenerating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWO
Generating Quality Hypotheses For Higher Uplifts | Masters of Conversion by VWO
 
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
Global Scale in Research (Nancy Douyon at Enterprise UX 2018)
 
The State of RunSignup and the State of the Industry
The State of RunSignup and the State of the IndustryThe State of RunSignup and the State of the Industry
The State of RunSignup and the State of the Industry
 

Kürzlich hochgeladen

Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAshyamraj55
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaborationbruanjhuli
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding TeamAdam Moalla
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6DianaGray10
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopBachir Benyammi
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxUdaiappa Ramachandran
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.YounusS2
 
Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024D Cloud Solutions
 
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDEADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDELiveplex
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxGDSC PJATK
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
Computer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsComputer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsSeth Reyes
 
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdfMachine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdfAijun Zhang
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXTarek Kalaji
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsSafe Software
 
20230202 - Introduction to tis-py
20230202 - Introduction to tis-py20230202 - Introduction to tis-py
20230202 - Introduction to tis-pyJamie (Taka) Wang
 
Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Adtran
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostMatt Ray
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdfPedro Manuel
 

Kürzlich hochgeladen (20)

Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptx
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.
 
Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024
 
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDEADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
Computer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsComputer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and Hazards
 
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdfMachine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdf
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBX
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
20230202 - Introduction to tis-py
20230202 - Introduction to tis-py20230202 - Introduction to tis-py
20230202 - Introduction to tis-py
 
Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
 
201610817 - edge part1
201610817 - edge part1201610817 - edge part1
201610817 - edge part1
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdf
 

Data Literacy Training - case of CA Election 70