SlideShare ist ein Scribd-Unternehmen logo
1 von 44
Visualising Activity Data Tony Hirst Dept of Communication and Systems, The Open University Scattered puzzle pieces next to solved fragment by HoriaVarlan
Today’s link shortener is bit.ly Read:		 [ jlKwGq ] as:		 http://bit.ly/jlKwGq
Visual Analysisvs.Presentation Graphics
This is NOT a presentation about: ,[object Object]
 data preparation
 data cleansing,[object Object]
ScraperWiki [ aGhJtK ]
Search and replace… …add regular expressions and you have search and replace “on steroids”
Google Refine [ aq1jUE ] Example: walkthrough (@jenit) [ awGQPT ] Example: merging two tables by column [ pWK3C0 ]
DataWrangler [ gmE3yz ]
Data has shape and structure
Hierarchical Data
Many Eyes [ qY5786 ] Treemaps
plot srcfile using ($1):(column(focusCar) -$2) with lines title "VET", srcfileusing ($1):(column(focusCar) -$3) with lines title "WEB", srcfileusing ($1):(column(focusCar) -$4) with lines title "HAM", srcfileusing ($1):(column(focusCar) -$5) with lines title "BUT", srcfileusing ($1):(column(focusCar) -$6) with lines title "ALO", srcfileusing ($1):(column(focusCar) -$7) with lines title "MAS", srcfileusing ($1):(column(focusCar) -$8) with lines title "SCH", srcfileusing ($1):(column(focusCar) -$9) with lines title "ROS", …
Or heatmaps in R: [ qXmPgs ]
Text processing with Unix tools[ m5tz63 ] [ lOVySX ] Count number of lines in a file: wc-l L2sample.csv View first few lines in a file: head L2sample.csv or head -n 4 L2sample.csv  View last few lines in a file: tail L2sample.csv or tail -n 15 L2sample.csv Sample contiguous rows from start or end of file: head -n 1 L2sample.csv > headers.csv 	tail -n 20 L2sample.csv > subSample.csv 	cat headers.csvsubSample.csv > subSampleWithHeaders.csv Sample contiguous rows from middle of file: head -n 15 L2sample.csv | tail -n 6 > middleSample.csv Split large file into smaller files: split -l 15 L2sample.csv subSamples Search for lines containing a term: grepmendeley L2sample.csv grepEBSCO L2sample.csv > rowsContainingEBSCO.csv
More text processing tricks Extract columns: cut -f 3 L2sample.csv 	cut -f 1,2,14,17 L2sample.csv > columnSample.csv Sort data in a column: 	cut -f 40 L2sample.csv | sort Identify distinct entries in a column: 	cut -f 40 L2sample.csv | sort | uniq Count how many times each distinct term appears in a column: 	cut -f 40 L2sample.csv | sort | uniq –c Sort can also sort by column (-k), reverse order (-r): cut -f 40 L2_2011-04.csv | sort | uniq -c | sort -k 1 -r > uniqueSID.csv
[ dAdIo3 ]
Time series data
aka “seasonal subseries” [ j3HODr ]
matplotlib Trends [ qSIcrV ] #time series data in d #first difference fd=np.diff(d) Autocorrelation
Graphs and Networks
Graphviz digraph test { CSV [shape=box] KML [shape=box] JSON [shape=box] XML [shape=box] RDF [shape=box] HTML [shape=box] GoogleSpreadsheet[shape=Msquare] RDFTripleStore [shape=Msquare] "[SPARQL]" [shape=diamond] "[YQL]" [shape=diamond] "[GoogleVizDataAPI]" [shape=diamond] "<GoogleGadgets>" [shape=doubleoctagon] "<GoogleVizDataCharts>" [shape=doubleoctagon] "<GoogleMaps>" [shape=doubleoctagon] "<GoogleEarth>" [shape=doubleoctagon] "<JQueryCharts_etc>" [shape=doubleoctagon] "[SPARQL]"->RDF; "[SPARQL]"->XML; "[SPARQL]"->CSV; "[SPARQL]"->JSON; JSON-> "<JQueryCharts_etc>"; CSV->"{GoogleRefine}" CSV->ScraperWiki JSON->ScraperWiki "[YQL]"->ScraperWiki ScraperWiki->CSV HTML->ScraperWiki HTML->"[YQL]" "[SPARQL]"->"[YQL]" "{GoogleRefine}"->CSV [style=dashed] CSV->"<Gephi>" [style=dashed] "<Gephi>"->CSV [style=dashed] RDF->"[YQL]” }
Gephi
[ nKoB4b]
[ nKoB4b]
Statistical Graphs
R
Graphics Libraries
Protovis
Processing

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (11)

The Very ^ 2 Basics of R
The Very ^ 2 Basics of RThe Very ^ 2 Basics of R
The Very ^ 2 Basics of R
 
Inside database
Inside databaseInside database
Inside database
 
Do something in 5 with gas 3-simple invoicing app
Do something in 5 with gas 3-simple invoicing appDo something in 5 with gas 3-simple invoicing app
Do something in 5 with gas 3-simple invoicing app
 
Gratest
GratestGratest
Gratest
 
Introduction to data.table in R
Introduction to data.table in RIntroduction to data.table in R
Introduction to data.table in R
 
Data visualization in python/Django
Data visualization in python/DjangoData visualization in python/Django
Data visualization in python/Django
 
Sql can be cool again
Sql can be cool againSql can be cool again
Sql can be cool again
 
Visdjango presentation django_boston_oct_2014
Visdjango presentation django_boston_oct_2014Visdjango presentation django_boston_oct_2014
Visdjango presentation django_boston_oct_2014
 
Hacking the Internet of Things for Fun & Profit
Hacking the Internet of Things for Fun & ProfitHacking the Internet of Things for Fun & Profit
Hacking the Internet of Things for Fun & Profit
 
Data warehouse or conventional database: Which is right for you?
Data warehouse or conventional database: Which is right for you?Data warehouse or conventional database: Which is right for you?
Data warehouse or conventional database: Which is right for you?
 
Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...
Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...
Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...
 

Ähnlich wie Jiscad viz

PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12
Andrew Dunstan
 
SQL Server - Introduction to TSQL
SQL Server - Introduction to TSQLSQL Server - Introduction to TSQL
SQL Server - Introduction to TSQL
Peter Gfader
 

Ähnlich wie Jiscad viz (20)

Hands on Mahout!
Hands on Mahout!Hands on Mahout!
Hands on Mahout!
 
Meetup cassandra for_java_cql
Meetup cassandra for_java_cqlMeetup cassandra for_java_cql
Meetup cassandra for_java_cql
 
Skills Portfolio
Skills PortfolioSkills Portfolio
Skills Portfolio
 
The best ETL questions in a nut shell
The best ETL questions in a nut shellThe best ETL questions in a nut shell
The best ETL questions in a nut shell
 
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
 
PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12
 
ICDM2019 table tutorial
ICDM2019 table tutorialICDM2019 table tutorial
ICDM2019 table tutorial
 
Chapter15
Chapter15Chapter15
Chapter15
 
ElasticSearch.pptx
ElasticSearch.pptxElasticSearch.pptx
ElasticSearch.pptx
 
MongoDB Aggregation MongoSF May 2011
MongoDB Aggregation MongoSF May 2011MongoDB Aggregation MongoSF May 2011
MongoDB Aggregation MongoSF May 2011
 
Practical catalyst
Practical catalystPractical catalyst
Practical catalyst
 
SQL Server - Introduction to TSQL
SQL Server - Introduction to TSQLSQL Server - Introduction to TSQL
SQL Server - Introduction to TSQL
 
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table packageJanuary 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
 
MIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome MeasuresMIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome Measures
 
Lobos Introduction
Lobos IntroductionLobos Introduction
Lobos Introduction
 
Simplifying SQL with CTE's and windowing functions
Simplifying SQL with CTE's and windowing functionsSimplifying SQL with CTE's and windowing functions
Simplifying SQL with CTE's and windowing functions
 
Google cloud Dataflow & Apache Flink
Google cloud Dataflow & Apache FlinkGoogle cloud Dataflow & Apache Flink
Google cloud Dataflow & Apache Flink
 
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptxfINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
 
How We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website Development
How We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website DevelopmentHow We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website Development
How We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website Development
 
Bo4301369372
Bo4301369372Bo4301369372
Bo4301369372
 

Mehr von Tony Hirst

Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Tony Hirst
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
Tony Hirst
 

Mehr von Tony Hirst (20)

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiesta
 
Dev8d jupyter
Dev8d jupyterDev8d jupyter
Dev8d jupyter
 
Ili 16 robot
Ili 16 robotIli 16 robot
Ili 16 robot
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptx
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptx
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacks
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyter
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 Workshop
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wire
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interest
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKX
 
Week4
Week4Week4
Week4
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefine
 
Conversations with data
Conversations with dataConversations with data
Conversations with data
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingo
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
 

Kürzlich hochgeladen

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Kürzlich hochgeladen (20)

How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 

Jiscad viz

  • 1. Visualising Activity Data Tony Hirst Dept of Communication and Systems, The Open University Scattered puzzle pieces next to solved fragment by HoriaVarlan
  • 2. Today’s link shortener is bit.ly Read: [ jlKwGq ] as: http://bit.ly/jlKwGq
  • 4.
  • 5.
  • 7.
  • 9. Search and replace… …add regular expressions and you have search and replace “on steroids”
  • 10. Google Refine [ aq1jUE ] Example: walkthrough (@jenit) [ awGQPT ] Example: merging two tables by column [ pWK3C0 ]
  • 12. Data has shape and structure
  • 14. Many Eyes [ qY5786 ] Treemaps
  • 15.
  • 16.
  • 17.
  • 18.
  • 19. plot srcfile using ($1):(column(focusCar) -$2) with lines title "VET", srcfileusing ($1):(column(focusCar) -$3) with lines title "WEB", srcfileusing ($1):(column(focusCar) -$4) with lines title "HAM", srcfileusing ($1):(column(focusCar) -$5) with lines title "BUT", srcfileusing ($1):(column(focusCar) -$6) with lines title "ALO", srcfileusing ($1):(column(focusCar) -$7) with lines title "MAS", srcfileusing ($1):(column(focusCar) -$8) with lines title "SCH", srcfileusing ($1):(column(focusCar) -$9) with lines title "ROS", …
  • 20. Or heatmaps in R: [ qXmPgs ]
  • 21. Text processing with Unix tools[ m5tz63 ] [ lOVySX ] Count number of lines in a file: wc-l L2sample.csv View first few lines in a file: head L2sample.csv or head -n 4 L2sample.csv View last few lines in a file: tail L2sample.csv or tail -n 15 L2sample.csv Sample contiguous rows from start or end of file: head -n 1 L2sample.csv > headers.csv tail -n 20 L2sample.csv > subSample.csv cat headers.csvsubSample.csv > subSampleWithHeaders.csv Sample contiguous rows from middle of file: head -n 15 L2sample.csv | tail -n 6 > middleSample.csv Split large file into smaller files: split -l 15 L2sample.csv subSamples Search for lines containing a term: grepmendeley L2sample.csv grepEBSCO L2sample.csv > rowsContainingEBSCO.csv
  • 22. More text processing tricks Extract columns: cut -f 3 L2sample.csv cut -f 1,2,14,17 L2sample.csv > columnSample.csv Sort data in a column: cut -f 40 L2sample.csv | sort Identify distinct entries in a column: cut -f 40 L2sample.csv | sort | uniq Count how many times each distinct term appears in a column: cut -f 40 L2sample.csv | sort | uniq –c Sort can also sort by column (-k), reverse order (-r): cut -f 40 L2_2011-04.csv | sort | uniq -c | sort -k 1 -r > uniqueSID.csv
  • 24.
  • 27.
  • 28. matplotlib Trends [ qSIcrV ] #time series data in d #first difference fd=np.diff(d) Autocorrelation
  • 30. Graphviz digraph test { CSV [shape=box] KML [shape=box] JSON [shape=box] XML [shape=box] RDF [shape=box] HTML [shape=box] GoogleSpreadsheet[shape=Msquare] RDFTripleStore [shape=Msquare] "[SPARQL]" [shape=diamond] "[YQL]" [shape=diamond] "[GoogleVizDataAPI]" [shape=diamond] "<GoogleGadgets>" [shape=doubleoctagon] "<GoogleVizDataCharts>" [shape=doubleoctagon] "<GoogleMaps>" [shape=doubleoctagon] "<GoogleEarth>" [shape=doubleoctagon] "<JQueryCharts_etc>" [shape=doubleoctagon] "[SPARQL]"->RDF; "[SPARQL]"->XML; "[SPARQL]"->CSV; "[SPARQL]"->JSON; JSON-> "<JQueryCharts_etc>"; CSV->"{GoogleRefine}" CSV->ScraperWiki JSON->ScraperWiki "[YQL]"->ScraperWiki ScraperWiki->CSV HTML->ScraperWiki HTML->"[YQL]" "[SPARQL]"->"[YQL]" "{GoogleRefine}"->CSV [style=dashed] CSV->"<Gephi>" [style=dashed] "<Gephi>"->CSV [style=dashed] RDF->"[YQL]” }
  • 31. Gephi
  • 34.
  • 35.
  • 36.
  • 37.
  • 39. R
  • 40.
  • 44.
  • 45. I hope that’s beenouseful.info….?

Hinweis der Redaktion

  1. Change the basis… eg in OU, might consider different presentations (“years”) of the same course (“month”).