SlideShare ist ein Scribd-Unternehmen logo
1 von 12
MOOCdb:
Developing Data Standards for MOOCs
Srećko Joksimović
s.joksimovic@ed.ac.uk
@s_joksimovic
Kalyan Veeramachaneni
kalyan@csail.mit.edu
Dragan Gašević
dragan.gasevic@ed.ac.uk
FutureLearn Academic Network Conference
15 June 2015
Data sources
SQL_anonymized_forum.sql
SQL_anonymized_general.sql
SQL_hash_mapping.sql
standard extract.csvSQL_unanonymizable.sql.gz
clickstream_export.gz
Personal data
Demographic data
…
Data sources
Weekly data packages
auth_user-{site}-analytics.sql
auth_userprofile-{site}-analytics.sql
certificates_generatedcertificate-{site}-analytics.sql
Daily data packages
course_structure-{site}-analytics.json
courseware_studentmodule-{site}-analytics.sql
email_opt_in-{site}-analytics.csv
student_courseenrollment-{site}-analytics.sql
user_api_usercoursetag-{site}-analytics.sql
user_id_map-{site}-analytics.sql
{org}-{course}-{date}-{site}.mongo
wiki_article-{site}-analytics.sql
wiki_articlerevision-{site}-analytics.sql
{org}-{site}-events-{date}.log.gz.gpg
Challenges
• Analytics across several courses
• Analytics across different platforms
• Analytics across different institutions
• Sharing data
Solution?
• Collaborative data science platform
– Standardize data storage
– Generalizable across courses and data providers
(currently OpenEdX, edX and Coursera)
– “Data being shared without data being
exchanged”
– Sharing and reproducing the results
MOOC data science commons
MOOCdb
Observing mode
- Observed Events table
- Resources table
- Resources Types table
- URLs table
- Resource URLs table
Submitting mode
- Problem Types table
- Problems table
- Submissions table
- Assessments table
Collaborating mode
- Collaborations table
- Collaboration Types table
Feedback mode
- Feedbacks table
- Questions table
- Answers table
- Surveys table
User information
- User PII table
- Global User table
- Course User table
http://moocdb.csail.mit.edu/wiki/index.php?title=MOOCdb
Collaborative platform and
applications
edX Coursera MOOCdb doc Github repo
Feature factory LabelMe Digital learner quantified Problem analytics
My MOOCViz Social network analysis Forum analysis Dropout prediction
Current state
• Established network of institutions
– MIT, Stanford, University of Michigan, University
of Edinburgh, University of Queensland, University
of Texas (Austin)
• Release of open source software
• Development and release of the first data
analytics framework
Next steps
Digital Learner Quantified
Discussion forum analysis
LabelMe
Problem analytics
Dropout prediction
Social network analysis
Collaboration
• If you are interested in…
– Development
– Feature modeling
– Translating your data
– Testing
kalyan@csail.mit.edu
s.Joksimovic@ed.ac.uk
Q&A
MOOCdb:
Developing Data Standards for MOOCs
Srećko Joksimović
s.joksimovic@ed.ac.uk
@s_joksimovic
Kalyan Veeramachaneni
kalyan@csail.mit.edu
Dragan Gašević
dragan.gasevic@ed.ac.uk
FutureLearn Academic Network Conference
15 June 2015

Weitere ähnliche Inhalte

Ähnlich wie MOOCdb: Developing Data Standards for MOOCs

Building PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTIBuilding PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTI
Charles Severance
 
Manoj(Java Developer)_Resume
Manoj(Java Developer)_ResumeManoj(Java Developer)_Resume
Manoj(Java Developer)_Resume
Vamsi Manoj
 
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
Chengjen Lee
 
Web crawler with seo analysis
Web crawler with seo analysis Web crawler with seo analysis
Web crawler with seo analysis
Vikram Parmar
 

Ähnlich wie MOOCdb: Developing Data Standards for MOOCs (20)

Step by Step to learn Azure and get MCSE
Step by Step to learn Azure and get MCSEStep by Step to learn Azure and get MCSE
Step by Step to learn Azure and get MCSE
 
SCORE REPORT
SCORE REPORTSCORE REPORT
SCORE REPORT
 
Silverlight & WCF RIA
Silverlight & WCF RIASilverlight & WCF RIA
Silverlight & WCF RIA
 
Jisc learning analytics service core slides
Jisc learning analytics service core slidesJisc learning analytics service core slides
Jisc learning analytics service core slides
 
Data Science in the Elastic Stack
Data Science in the Elastic StackData Science in the Elastic Stack
Data Science in the Elastic Stack
 
Exploring learning analytics: Jisc learning analytics service
Exploring learning analytics: Jisc learning analytics serviceExploring learning analytics: Jisc learning analytics service
Exploring learning analytics: Jisc learning analytics service
 
Building PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTIBuilding PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTI
 
CS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdfCS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdf
 
Library Management Python, MySQL
Library Management Python, MySQLLibrary Management Python, MySQL
Library Management Python, MySQL
 
Resume_Susmita
Resume_SusmitaResume_Susmita
Resume_Susmita
 
Telecom datascience master_public
Telecom datascience master_publicTelecom datascience master_public
Telecom datascience master_public
 
Models in Minutes using AutoML
Models in Minutes using AutoMLModels in Minutes using AutoML
Models in Minutes using AutoML
 
Online examination documentation
Online examination documentationOnline examination documentation
Online examination documentation
 
Start Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptxStart Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptx
 
20180701 - 1st Meeting - Data Science Orientation
20180701 - 1st Meeting - Data Science Orientation20180701 - 1st Meeting - Data Science Orientation
20180701 - 1st Meeting - Data Science Orientation
 
Manoj(Java Developer)_Resume
Manoj(Java Developer)_ResumeManoj(Java Developer)_Resume
Manoj(Java Developer)_Resume
 
Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017
 
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
 
Web crawler with seo analysis
Web crawler with seo analysis Web crawler with seo analysis
Web crawler with seo analysis
 
Resume
ResumeResume
Resume
 

Kürzlich hochgeladen

Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 

Kürzlich hochgeladen (20)

Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 

MOOCdb: Developing Data Standards for MOOCs

  • 1. MOOCdb: Developing Data Standards for MOOCs Srećko Joksimović s.joksimovic@ed.ac.uk @s_joksimovic Kalyan Veeramachaneni kalyan@csail.mit.edu Dragan Gašević dragan.gasevic@ed.ac.uk FutureLearn Academic Network Conference 15 June 2015
  • 3. Data sources Weekly data packages auth_user-{site}-analytics.sql auth_userprofile-{site}-analytics.sql certificates_generatedcertificate-{site}-analytics.sql Daily data packages course_structure-{site}-analytics.json courseware_studentmodule-{site}-analytics.sql email_opt_in-{site}-analytics.csv student_courseenrollment-{site}-analytics.sql user_api_usercoursetag-{site}-analytics.sql user_id_map-{site}-analytics.sql {org}-{course}-{date}-{site}.mongo wiki_article-{site}-analytics.sql wiki_articlerevision-{site}-analytics.sql {org}-{site}-events-{date}.log.gz.gpg
  • 4. Challenges • Analytics across several courses • Analytics across different platforms • Analytics across different institutions • Sharing data
  • 5. Solution? • Collaborative data science platform – Standardize data storage – Generalizable across courses and data providers (currently OpenEdX, edX and Coursera) – “Data being shared without data being exchanged” – Sharing and reproducing the results
  • 7. MOOCdb Observing mode - Observed Events table - Resources table - Resources Types table - URLs table - Resource URLs table Submitting mode - Problem Types table - Problems table - Submissions table - Assessments table Collaborating mode - Collaborations table - Collaboration Types table Feedback mode - Feedbacks table - Questions table - Answers table - Surveys table User information - User PII table - Global User table - Course User table http://moocdb.csail.mit.edu/wiki/index.php?title=MOOCdb
  • 8. Collaborative platform and applications edX Coursera MOOCdb doc Github repo Feature factory LabelMe Digital learner quantified Problem analytics My MOOCViz Social network analysis Forum analysis Dropout prediction
  • 9. Current state • Established network of institutions – MIT, Stanford, University of Michigan, University of Edinburgh, University of Queensland, University of Texas (Austin) • Release of open source software • Development and release of the first data analytics framework
  • 10. Next steps Digital Learner Quantified Discussion forum analysis LabelMe Problem analytics Dropout prediction Social network analysis
  • 11. Collaboration • If you are interested in… – Development – Feature modeling – Translating your data – Testing kalyan@csail.mit.edu s.Joksimovic@ed.ac.uk
  • 12. Q&A MOOCdb: Developing Data Standards for MOOCs Srećko Joksimović s.joksimovic@ed.ac.uk @s_joksimovic Kalyan Veeramachaneni kalyan@csail.mit.edu Dragan Gašević dragan.gasevic@ed.ac.uk FutureLearn Academic Network Conference 15 June 2015

Hinweis der Redaktion

  1. MOOCdb which is our solution to centralizing and generalizing MOOC data organization and providing general purpose analytics for MOOC education research.
  2. “How does amount of time spent on the videos during a certain week correlate to performance on the homework?”
  3. CAN WE HAVE STANDARDIZED DATA STORAGE? Sharing and reproducing the results: When they publish research, analysts can share the scripts by depositing them into a public archive where they are retrievable and cross-referenced to their donor and publication.
  4. The MOOCdb project aims to brings together educational researchers, computer science researchers, machine learning researchers, technologists, database and big data experts to advance MOOC data science. The project founded at MIT includes a platform agnostic functional data model for data exhaust from MOOCs, a collaborative-open source-open access data visualization framework, a crowd sourced knowledge discovery framework and a privacy preserving software framework. The team is currently working to release a number of these tools and frameworks as open source. WHAT MOOCdb PROVIDES? Concise data storage: MOOCdb's proposed schema is \loss-less" with respect to research relevant information, i.e. no information is lost in translating raw data to it. Access Control Levels for Anonymized Data: The data schema offers an organized means of structuring anonymized user identities safeguard them further.
  5. Sharing of data extraction scripts: Scripts for data extraction and descriptive statistics extraction can be open source and shared by everyone because they reference data organized according to the schema. Crowd source potential: Machine learning frequently involves humans identifying explanatory variables that could drive a response. Enabling the crowd to help propose variables could greatly scale the com- munity's progress in mining MOOC data.