MOOCdb:
Developing Data Standards for MOOCs
Srećko Joksimović
s.joksimovic@ed.ac.uk
@s_joksimovic
Kalyan Veeramachaneni
kalyan@csail.mit.edu
Dragan Gašević
dragan.gasevic@ed.ac.uk
FutureLearn Academic Network Conference
15 June 2015
Data sources
SQL_anonymized_forum.sql
SQL_anonymized_general.sql
SQL_hash_mapping.sql
standard extract.csv
SQL_unanonymizable.sql.gz
clickstream_export.gz
Personal data
Demographic data
…
Data sources
Weekly data packages
auth_user-{site}-analytics.sql
auth_userprofile-{site}-analytics.sql
certificates_generatedcertificate-{site}-analytics.sql
Daily data packages
course_structure-{site}-analytics.json
courseware_studentmodule-{site}-analytics.sql
email_opt_in-{site}-analytics.csv
student_courseenrollment-{site}-analytics.sql
user_api_usercoursetag-{site}-analytics.sql
user_id_map-{site}-analytics.sql
{org}-{course}-{date}-{site}.mongo
wiki_article-{site}-analytics.sql
wiki_articlerevision-{site}-analytics.sql
{org}-{site}-events-{date}.log.gz.gpg
Challenges
• Analytics across several courses
• Analytics across different platforms
• Analytics across different institutions
• Sharing data
Solution?
• Collaborative data science platform
– Standardize data storage
– Generalizable across courses and data providers
(currently OpenEdX, edX and Coursera)
– “Data being shared without data being
exchanged”
– Sharing and reproducing the results
MOOC data science commons
MOOCdb
Observing mode
- Observed Events table
- Resources table
- Resource Types table
- URLs table
- Resource URLs table
Submitting mode
- Problem Types table
- Problems table
- Submissions table
- Assessments table
Collaborating mode
- Collaborations table
- Collaboration Types table
Feedback mode
- Feedbacks table
- Questions table
- Answers table
- Surveys table
User information
- User PII table
- Global User table
- Course User table
http://moocdb.csail.mit.edu/wiki/index.php?title=MOOCdb
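The observing mode can be pictured as a small relational schema. The following is a minimal sketch in Python with SQLite, not the official MOOCdb definition: only the table names (Observed Events, Resources, Resource Types) come from the slide, while every column name, the demo rows, and the helper functions `build_demo_db` and `seconds_per_user` are assumptions made for illustration.

```python
import sqlite3

# Table names follow the slide; all column names are hypothetical.
SCHEMA = """
CREATE TABLE resource_types (
    resource_type_id   INTEGER PRIMARY KEY,
    resource_type_name TEXT NOT NULL
);
CREATE TABLE resources (
    resource_id      INTEGER PRIMARY KEY,
    resource_name    TEXT NOT NULL,
    resource_type_id INTEGER NOT NULL REFERENCES resource_types(resource_type_id)
);
CREATE TABLE observed_events (
    observed_event_id INTEGER PRIMARY KEY,
    user_id           TEXT NOT NULL,
    resource_id       INTEGER NOT NULL REFERENCES resources(resource_id),
    event_timestamp   TEXT NOT NULL,
    duration_seconds  INTEGER NOT NULL
);
"""


def build_demo_db():
    """Create an in-memory database with a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO resource_types VALUES (1, 'video')")
    conn.execute("INSERT INTO resources VALUES (10, 'Week 1 lecture', 1)")
    conn.executemany(
        "INSERT INTO observed_events "
        "(user_id, resource_id, event_timestamp, duration_seconds) "
        "VALUES (?, ?, ?, ?)",
        [
            ("u1", 10, "2015-06-01T10:00:00", 300),
            ("u1", 10, "2015-06-02T09:00:00", 120),
            ("u2", 10, "2015-06-01T11:00:00", 60),
        ],
    )
    return conn


def seconds_per_user(conn, type_name):
    """Total observed time per user for one resource type."""
    rows = conn.execute(
        """
        SELECT e.user_id, SUM(e.duration_seconds)
        FROM observed_events AS e
        JOIN resources AS r ON r.resource_id = e.resource_id
        JOIN resource_types AS t ON t.resource_type_id = r.resource_type_id
        WHERE t.resource_type_name = ?
        GROUP BY e.user_id
        ORDER BY e.user_id
        """,
        (type_name,),
    ).fetchall()
    return dict(rows)


if __name__ == "__main__":
    print(seconds_per_user(build_demo_db(), "video"))
```

Because the query refers only to schema tables and not to a platform's raw export files, the same function could in principle run unchanged on data translated from any provider.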
Collaborative platform and
applications
edX Coursera MOOCdb doc Github repo
Feature factory LabelMe Digital learner quantified Problem analytics
My MOOCViz Social network analysis Forum analysis Dropout prediction
Current state
• Established network of institutions
– MIT, Stanford, University of Michigan, University
of Edinburgh, University of Queensland, University
of Texas (Austin)
• Release of open source software
• Development and release of the first data
analytics framework
Next steps
Digital Learner Quantified
Discussion forum analysis
LabelMe
Problem analytics
Dropout prediction
Social network analysis
Collaboration
• If you are interested in…
– Development
– Feature modeling
– Translating your data
– Testing
kalyan@csail.mit.edu
s.joksimovic@ed.ac.uk
Q&A


Editor's notes

  1. MOOCdb is our solution for centralizing and generalizing MOOC data organization and for providing general-purpose analytics for MOOC education research.
  2. “How does the amount of time spent on videos during a given week correlate with performance on the homework?”
  3. CAN WE HAVE STANDARDIZED DATA STORAGE? Sharing and reproducing the results: When they publish research, analysts can share the scripts by depositing them into a public archive where they are retrievable and cross-referenced to their donor and publication.
  4. The MOOCdb project aims to bring together educational researchers, computer science researchers, machine learning researchers, technologists, and database and big data experts to advance MOOC data science. The project, founded at MIT, includes a platform-agnostic functional data model for data exhaust from MOOCs, a collaborative, open-source, open-access data visualization framework, a crowdsourced knowledge discovery framework, and a privacy-preserving software framework. The team is currently working to release a number of these tools and frameworks as open source. WHAT DOES MOOCdb PROVIDE? Concise data storage: MOOCdb's proposed schema is “lossless” with respect to research-relevant information, i.e. no information is lost in translating raw data to it. Access control levels for anonymized data: The data schema offers an organized means of structuring anonymized user identities to safeguard them further.
  5. Sharing of data extraction scripts: Scripts for data extraction and descriptive statistics extraction can be open source and shared by everyone because they reference data organized according to the schema. Crowdsourcing potential: Machine learning frequently involves humans identifying explanatory variables that could drive a response. Enabling the crowd to help propose variables could greatly scale the community's progress in mining MOOC data.
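
A shareable analysis script for the question in note 2 might look like the sketch below. It assumes that per-user video time and homework grades have already been extracted from schema-conformant tables into plain dictionaries. The function names `pearson` and `video_time_vs_grade` and the sample numbers are hypothetical, and Pearson correlation is only one possible way to read "correlate"; the notes do not prescribe a statistic.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def video_time_vs_grade(video_seconds, homework_grades):
    """Correlate weekly video time with homework grade.

    Both arguments map a user id to a number; only users present in
    both mappings are compared.
    """
    users = sorted(set(video_seconds) & set(homework_grades))
    return pearson(
        [video_seconds[u] for u in users],
        [homework_grades[u] for u in users],
    )


if __name__ == "__main__":
    # Made-up values for one course week, for illustration only.
    seconds = {"u1": 100, "u2": 200, "u3": 300}
    grades = {"u1": 0.5, "u2": 0.7, "u3": 0.9, "u4": 1.0}
    print(video_time_vs_grade(seconds, grades))
```

Since the inputs are keyed by user id rather than by any platform-specific field, a script of this kind could be deposited once and rerun by other institutions on their own translated data.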