SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Preprocessing of Academic Data
for Mining Association Rule
Overview
Main Objective of Our Research
Concept of KDD
Methods of Preprocessing Academic Data for Mining
Data Analysis
Relational Database
Universal Database
Synthetic Data Population
Data Transformation
Interested Association Rules for Academic Data
Preprocessing of Academic Data for Mining Association Rule 2
Main Objective of Our Research
To get knowledge and find the correlation of several explicit & implicit
factors related to:
Students’ academic progress
Potentiality decay of students
Abandonment
Why do students drop out before Graduation ?
Retention
Why does students’ extended continuation prevail ?
Preprocessing of Academic Data for Mining Association Rule 3
Concept of KDD
Knowledge Discovery and
Data mining Process
Data
Target
Data
Preprocessed Data
Transformed Data
Patterns/ Models
Knowledge
Selection
Preprocessing
Transformation
Data mining
Interpretation
Evaluation
4
Why Preprocessing before Data
Mining ?
Reasons for proposing a preprocessing technique before
applying mining association rules in academic data :
Proper interpretation of the results of mining is essential
to ensure that useful knowledge is derived from the data.
Blind application of data-mining methods can be a
dangerous activity, easily leading to the discovery of
meaningless and invalid patterns.
Preprocessing of Academic Data for Mining Association Rule 5
Methods of Preprocessing Academic
Data for Mining
Data Analysis of BIIS
Database
Personal Information Academic Information
Age SSC or equivalent GPA, Board
Gender HSC or equivalent GPA, Board
Origin Area(Birth Place)
Admission Year / Batch
Present Address Department
Hall Resident/Attached
Current Level/Term
Current CGPA
Term wise CGPA
Subject wise detailed Grade
Credit Hour Completed
Preprocessing of Academic Data for Mining Association Rule 6
Methods of Preprocessing Academic Data for Mining
(Contd.)
Data Analysis (contd.)
Age
Origin Area
Record of Taken Courses
Experience of Teachers
Hall Resident/Attached
Term Duration
SSC & HSC
GPA/Board
Gender
CGPA
Factors related to Academic Performance of Student
Academic
Performance
Preprocessing of Academic Data for Mining Association Rule 7
Methods of Preprocessing Academic Data for Mining
(Contd.)
Data Analysis (contd.)
Age
Origin Area
Credit Hour Ratio
Session Jam
Hall Resident/Attached
Term Duration
SSC & HSC
GPA/Board
Gender
Current
CGPA
Abandonment/
Retention of student
Stay Duration
Factors related to Abandonment/Retention of student
Preprocessing of Academic Data for Mining Association Rule 8
Methods of Preprocessing Academic Data for Mining
(Contd.)
Data Analysis (contd.)
Factors related to Condition of Academic Institution
Rate of Student
Retention
Average CGPA of all Students
Experience of Teachers
Rate of Student
Abandonment
Research & Publications
Condition of
Academic Institution
Preprocessing of Academic Data for Mining Association Rule 9
Methods of Preprocessing Academic Data for Mining
(Contd.)
Relational Database
Student Course
Grade
Sheet
representsachieves
Finding Correlation between performance of different courses
Preprocessing of Academic Data for Mining Association Rule 10
Methods of Preprocessing Academic Data for Mining
(Contd.)
How we have populated data in universal database?
Let us consider a 3 credit course CSE 303
Now we assume 5 possible scenarios:
Universal Database &
Synthetic Data Population
Preprocessing of Academic Data for Mining Association Rule 11
A student appears class tests(CT) having attendance more than 60%,
appeared term final examinations.
A student appeared CT but attendance is less than 60% and appeared term
final examination.
Class test and attendance are carried over and appeared term final
examination.
A student appeared CT and attendance is more than 60% but not appeared
term final examination.
A student attended less than 60% of classes and did not appear both in CT
and term final examination.
Methods of Preprocessing Academic Data for Mining
(Contd.)
Two algorithm have been developed to populate the
universal table :
Synthetic_Generation ( )
Generate_Grade ()
Universal Database &
Synthetic Data Population (contd.)
Student_Id CSE303_secA CSE303_secB CSE303_CT CSE303_
Attendance
CSE303_
Total
CSE303_Grade
…0805001 90 75 55 30 250 A+
0805002 85 70 45 25 225 A
… … … … … … …
Records of all taken courses of corresponding student ID are generated synthetically in a single
row of the universal table.
Preprocessing of Academic Data for Mining Association Rule 12
Methods of Preprocessing Academic Data for Mining
(Contd.)
Data Transformation
Definition Credit Hour Range
SecA_high or SecB_high 3 >=75 && <=105
SecA_avg or SecB_avg 3 >=60 && <75
SecA_low or SecB_low 3 < 60
CT_high 3 >=48 && <=60
CT_average 3 >=36 && <=48
CT_low 3 < 36
Grade_high 3 >=225 && <=300
Grade_average 3 >=180 && < 225
Grade_low 3 < 180
Transformation rule table for 3.0 credit course
Student_
ID
SecA_
high
SecA
_average
SecA_
low
SecB
_high
SecB
_average
SecB_
low
CT_
high
CT
_average
CT
_low
Grade_
high
Grade_
average
Grade_
low
0805001 1 0 0 1 0 0 1 0 0 1 0 0
0805002 1 0 0 0 1 0 0 1 0 1 0 0
… … … … … … … … … … … … …
Transformed table from universal table
Preprocessing of Academic Data for Mining Association Rule 13
Association Rules for Academic Data
No. Interested Association Rule Purpose
1. Course_No => CGPA_high Performance of Individual Course
2. Course_No => CGPA_low
3. Sec_A_high => CGPA_high Impact of Section of Answer Script
4. Sec_B_high => CGPA_high
5. CT_high =>CGPA_high Impact of Class test
6. CT_low => CGPA_low
7. Hall_Resident =>CGPA_low Impact of Residence
8. Attached =>CGPA_high
9. Course_No_1=> Course_No_2 Correlation of
courses
10. (Course_No_1,Course_No_2) => Course_No_3
11. Permanent_Address_City =>CGPA_high
Impact of locality
12. Permanent_Address_Rural =>CGPA_low
Preprocessing of Academic Data for Mining Association Rule 14
Future Work
Academic Performance
Family Background
Previous Academic Record
Seat Allotment in Hall
Offering Scholarship
Abandonment/Retention
Stay Duration
Session Jam
Unwanted leaves
Long term break
Condition of Institution
Average CGPA of all students
Term completion rate
Abandonment/retention rate
Research & Publications
Developing new mining algorithm which will be tested
using the synthetic dataset
Collecting real data from BIIS and using without disclosing
privacy to discover the Knowledge
Preprocessing of Academic Data for Mining Association Rule 15
Conclusions
Applies association rule mining algorithms to transform continuous
valued attribute into resemble the required educational knowledge
Guides to discover the required knowledge using the realistic
dataset and apply them in real life scenario
Developing Model using BIIS data but can be generalized
for application to any higher educational institution
Preprocessing of Academic Data for Mining Association Rule 16
Any Question or Suggestion is
Welcome
Preprocessing of Academic Data for Mining Association Rule 17
Contact :
asmlatifulhoque@cse.buet.ac.bd,
raaz.cse08@gmail.com,
shibbirahmedtanvin@gmail.com

Weitere ähnliche Inhalte

Ähnlich wie Presentation Slide_Preprocessing of Academic Data for Mining Association Rule [WADM 2013]

Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...
Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...
Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...tanvin
 
University Recommendation Support System using ML Algorithms
University Recommendation Support System using ML AlgorithmsUniversity Recommendation Support System using ML Algorithms
University Recommendation Support System using ML AlgorithmsIRJET Journal
 
A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...
A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...
A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...IRJET Journal
 
Principles_of_distributed_database_syste.pdf
Principles_of_distributed_database_syste.pdfPrinciples_of_distributed_database_syste.pdf
Principles_of_distributed_database_syste.pdfbonbon93
 
Data Clustering in Education for Students
Data Clustering in Education for StudentsData Clustering in Education for Students
Data Clustering in Education for StudentsIRJET Journal
 
IRJET- Using Data Mining to Predict Students Performance
IRJET-  	  Using Data Mining to Predict Students PerformanceIRJET-  	  Using Data Mining to Predict Students Performance
IRJET- Using Data Mining to Predict Students PerformanceIRJET Journal
 
IRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET Journal
 
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...Editor IJCATR
 
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...Editor IJCATR
 
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...IRJET Journal
 
Paper planes short ver linkedin
Paper planes  short ver   linkedinPaper planes  short ver   linkedin
Paper planes short ver linkedinHimanshu Agarwal
 
placement management system.pptx
placement management system.pptxplacement management system.pptx
placement management system.pptxPriyansuPradhan2
 
Using ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation ProcessUsing ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation ProcessExamSoft
 
UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...
UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...
UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...IRJET Journal
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction SystemIRJET Journal
 
Educational Data Mining to Analyze Students Performance – Concept Plan
Educational Data Mining to Analyze Students Performance – Concept PlanEducational Data Mining to Analyze Students Performance – Concept Plan
Educational Data Mining to Analyze Students Performance – Concept PlanIRJET Journal
 
Data mining to predict academic performance.
Data mining to predict academic performance. Data mining to predict academic performance.
Data mining to predict academic performance. Ranjith Gowda
 

Ähnlich wie Presentation Slide_Preprocessing of Academic Data for Mining Association Rule [WADM 2013] (20)

Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...
Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...
Discovering Knowledge regarding Academic Profile of Students Pursuing Graduat...
 
University Recommendation Support System using ML Algorithms
University Recommendation Support System using ML AlgorithmsUniversity Recommendation Support System using ML Algorithms
University Recommendation Support System using ML Algorithms
 
A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...
A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...
A WEB BASED APPLICATION FOR TUTORING SUPPORT IN HIGHER EDUCATION USING EDUCAT...
 
Principles_of_distributed_database_syste.pdf
Principles_of_distributed_database_syste.pdfPrinciples_of_distributed_database_syste.pdf
Principles_of_distributed_database_syste.pdf
 
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATIONRESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
 
Data Clustering in Education for Students
Data Clustering in Education for StudentsData Clustering in Education for Students
Data Clustering in Education for Students
 
IRJET- Using Data Mining to Predict Students Performance
IRJET-  	  Using Data Mining to Predict Students PerformanceIRJET-  	  Using Data Mining to Predict Students Performance
IRJET- Using Data Mining to Predict Students Performance
 
IRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning Techniques
 
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
 
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
 
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
 
Paper planes short ver linkedin
Paper planes  short ver   linkedinPaper planes  short ver   linkedin
Paper planes short ver linkedin
 
placement management system.pptx
placement management system.pptxplacement management system.pptx
placement management system.pptx
 
Student information system
Student information systemStudent information system
Student information system
 
Using ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation ProcessUsing ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation Process
 
UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...
UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...
UNIVERSITY ADMISSION SYSTEMS USING DATA MINING TECHNIQUES TO PREDICT STUDENT ...
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction System
 
Educational Data Mining to Analyze Students Performance – Concept Plan
Educational Data Mining to Analyze Students Performance – Concept PlanEducational Data Mining to Analyze Students Performance – Concept Plan
Educational Data Mining to Analyze Students Performance – Concept Plan
 
Paper Presentation
Paper PresentationPaper Presentation
Paper Presentation
 
Data mining to predict academic performance.
Data mining to predict academic performance. Data mining to predict academic performance.
Data mining to predict academic performance.
 

Kürzlich hochgeladen

Internship Presentation | PPT | CSE | SE
Internship Presentation | PPT | CSE | SEInternship Presentation | PPT | CSE | SE
Internship Presentation | PPT | CSE | SESaleh Ibne Omar
 
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRRINDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRRsarwankumar4524
 
A Guide to Choosing the Ideal Air Cooler
A Guide to Choosing the Ideal Air CoolerA Guide to Choosing the Ideal Air Cooler
A Guide to Choosing the Ideal Air Coolerenquirieskenstar
 
05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx
05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx
05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptxerickamwana1
 
GESCO SE Press and Analyst Conference on Financial Results 2024
GESCO SE Press and Analyst Conference on Financial Results 2024GESCO SE Press and Analyst Conference on Financial Results 2024
GESCO SE Press and Analyst Conference on Financial Results 2024GESCO SE
 
Engaging Eid Ul Fitr Presentation for Kindergartners.pptx
Engaging Eid Ul Fitr Presentation for Kindergartners.pptxEngaging Eid Ul Fitr Presentation for Kindergartners.pptx
Engaging Eid Ul Fitr Presentation for Kindergartners.pptxAsifArshad8
 
Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...
Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...
Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...Sebastiano Panichella
 
Don't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunity
Don't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunityDon't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunity
Don't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunityApp Ethena
 
Testing with Fewer Resources: Toward Adaptive Approaches for Cost-effective ...
Testing with Fewer Resources:  Toward Adaptive Approaches for Cost-effective ...Testing with Fewer Resources:  Toward Adaptive Approaches for Cost-effective ...
Testing with Fewer Resources: Toward Adaptive Approaches for Cost-effective ...Sebastiano Panichella
 
cse-csp batch4 review-1.1.pptx cyber security
cse-csp batch4 review-1.1.pptx cyber securitycse-csp batch4 review-1.1.pptx cyber security
cse-csp batch4 review-1.1.pptx cyber securitysandeepnani2260
 
proposal kumeneger edited.docx A kumeeger
proposal kumeneger edited.docx A kumeegerproposal kumeneger edited.docx A kumeeger
proposal kumeneger edited.docx A kumeegerkumenegertelayegrama
 
Application of GIS in Landslide Disaster Response.pptx
Application of GIS in Landslide Disaster Response.pptxApplication of GIS in Landslide Disaster Response.pptx
Application of GIS in Landslide Disaster Response.pptxRoquia Salam
 
Quality by design.. ppt for RA (1ST SEM
Quality by design.. ppt for  RA (1ST SEMQuality by design.. ppt for  RA (1ST SEM
Quality by design.. ppt for RA (1ST SEMCharmi13
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptxogubuikealex
 
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...漢銘 謝
 
General Elections Final Press Noteas per M
General Elections Final Press Noteas per MGeneral Elections Final Press Noteas per M
General Elections Final Press Noteas per MVidyaAdsule1
 
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRachelAnnTenibroAmaz
 

Kürzlich hochgeladen (17)

Internship Presentation | PPT | CSE | SE
Internship Presentation | PPT | CSE | SEInternship Presentation | PPT | CSE | SE
Internship Presentation | PPT | CSE | SE
 
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRRINDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
 
A Guide to Choosing the Ideal Air Cooler
A Guide to Choosing the Ideal Air CoolerA Guide to Choosing the Ideal Air Cooler
A Guide to Choosing the Ideal Air Cooler
 
05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx
05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx
05.02 MMC - Assignment 4 - Image Attribution Lovepreet.pptx
 
GESCO SE Press and Analyst Conference on Financial Results 2024
GESCO SE Press and Analyst Conference on Financial Results 2024GESCO SE Press and Analyst Conference on Financial Results 2024
GESCO SE Press and Analyst Conference on Financial Results 2024
 
Engaging Eid Ul Fitr Presentation for Kindergartners.pptx
Engaging Eid Ul Fitr Presentation for Kindergartners.pptxEngaging Eid Ul Fitr Presentation for Kindergartners.pptx
Engaging Eid Ul Fitr Presentation for Kindergartners.pptx
 
Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...
Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...
Testing and Development Challenges for Complex Cyber-Physical Systems: Insigh...
 
Don't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunity
Don't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunityDon't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunity
Don't Miss Out: Strategies for Making the Most of the Ethena DigitalOpportunity
 
Testing with Fewer Resources: Toward Adaptive Approaches for Cost-effective ...
Testing with Fewer Resources:  Toward Adaptive Approaches for Cost-effective ...Testing with Fewer Resources:  Toward Adaptive Approaches for Cost-effective ...
Testing with Fewer Resources: Toward Adaptive Approaches for Cost-effective ...
 
cse-csp batch4 review-1.1.pptx cyber security
cse-csp batch4 review-1.1.pptx cyber securitycse-csp batch4 review-1.1.pptx cyber security
cse-csp batch4 review-1.1.pptx cyber security
 
proposal kumeneger edited.docx A kumeeger
proposal kumeneger edited.docx A kumeegerproposal kumeneger edited.docx A kumeeger
proposal kumeneger edited.docx A kumeeger
 
Application of GIS in Landslide Disaster Response.pptx
Application of GIS in Landslide Disaster Response.pptxApplication of GIS in Landslide Disaster Response.pptx
Application of GIS in Landslide Disaster Response.pptx
 
Quality by design.. ppt for RA (1ST SEM
Quality by design.. ppt for  RA (1ST SEMQuality by design.. ppt for  RA (1ST SEM
Quality by design.. ppt for RA (1ST SEM
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptx
 
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
 
General Elections Final Press Noteas per M
General Elections Final Press Noteas per MGeneral Elections Final Press Noteas per M
General Elections Final Press Noteas per M
 
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
 

Presentation Slide_Preprocessing of Academic Data for Mining Association Rule [WADM 2013]

  • 1. Preprocessing of Academic Data for Mining Association Rule
  • 2. Overview Main Objective of Our Research Concept of KDD Methods of Preprocessing Academic Data for Mining Data Analysis Relational Database Universal Database Synthetic Data Population Data Transformation Interested Association Rules for Academic Data Preprocessing of Academic Data for Mining Association Rule 2
  • 3. Main Objective of Our Research To get knowledge and find the correlation of several explicit & implicit factors related to: Students’ academic progress Potentiality decay of students Abandonment Why do students drop out before Graduation ? Retention Why does students’ extended continuation prevail ? Preprocessing of Academic Data for Mining Association Rule 3
  • 4. Concept of KDD Knowledge Discovery and Data mining Process Data Target Data Preprocessed Data Transformed Data Patterns/ Models Knowledge Selection Preprocessing Transformation Data mining Interpretation Evaluation 4
  • 5. Why Preprocessing before Data Mining ? Reasons for proposing a preprocessing technique before applying mining association rules in academic data : Proper interpretation of the results of mining is essential to ensure that useful knowledge is derived from the data. Blind application of data-mining methods can be a dangerous activity, easily leading to the discovery of meaningless and invalid patterns. Preprocessing of Academic Data for Mining Association Rule 5
  • 6. Methods of Preprocessing Academic Data for Mining Data Analysis of BIIS Database Personal Information Academic Information Age SSC or equivalent GPA, Board Gender HSC or equivalent GPA, Board Origin Area(Birth Place) Admission Year / Batch Present Address Department Hall Resident/Attached Current Level/Term Current CGPA Term wise CGPA Subject wise detailed Grade Credit Hour Completed Preprocessing of Academic Data for Mining Association Rule 6
  • 7. Methods of Preprocessing Academic Data for Mining (Contd.) Data Analysis (contd.) Age Origin Area Record of Taken Courses Experience of Teachers Hall Resident/Attached Term Duration SSC & HSC GPA/Board Gender CGPA Factors related to Academic Performance of Student Academic Performance Preprocessing of Academic Data for Mining Association Rule 7
  • 8. Methods of Preprocessing Academic Data for Mining (Contd.) Data Analysis (contd.) Age Origin Area Credit Hour Ratio Session Jam Hall Resident/Attached Term Duration SSC & HSC GPA/Board Gender Current CGPA Abandonment/ Retention of student Stay Duration Factors related to Abandonment/Retention of student Preprocessing of Academic Data for Mining Association Rule 8
  • 9. Methods of Preprocessing Academic Data for Mining (Contd.) Data Analysis (contd.) Factors related to Condition of Academic Institution Rate of Student Retention Average CGPA of all Students Experience of Teachers Rate of Student Abandonment Research & Publications Condition of Academic Institution Preprocessing of Academic Data for Mining Association Rule 9
  • 10. Methods of Preprocessing Academic Data for Mining (Contd.) Relational Database Student Course Grade Sheet representsachieves Finding Correlation between performance of different courses Preprocessing of Academic Data for Mining Association Rule 10
  • 11. Methods of Preprocessing Academic Data for Mining (Contd.) How we have populated data in universal database? Let us consider a 3 credit course CSE 303 Now we assume 5 possible scenarios: Universal Database & Synthetic Data Population Preprocessing of Academic Data for Mining Association Rule 11 A student appears class tests(CT) having attendance more than 60%, appeared term final examinations. A student appeared CT but attendance is less than 60% and appeared term final examination. Class test and attendance are carried over and appeared term final examination. A student appeared CT and attendance is more than 60% but not appeared term final examination. A student attended less than 60% of classes and did not appear both in CT and term final examination.
  • 12. Methods of Preprocessing Academic Data for Mining (Contd.) Two algorithm have been developed to populate the universal table : Synthetic_Generation ( ) Generate_Grade () Universal Database & Synthetic Data Population (contd.) Student_Id CSE303_secA CSE303_secB CSE303_CT CSE303_ Attendance CSE303_ Total CSE303_Grade …0805001 90 75 55 30 250 A+ 0805002 85 70 45 25 225 A … … … … … … … Records of all taken courses of corresponding student ID are generated synthetically in a single row of the universal table. Preprocessing of Academic Data for Mining Association Rule 12
  • 13. Methods of Preprocessing Academic Data for Mining (Contd.) Data Transformation Definition Credit Hour Range SecA_high or SecB_high 3 >=75 && <=105 SecA_avg or SecB_avg 3 >=60 && <75 SecA_low or SecB_low 3 < 60 CT_high 3 >=48 && <=60 CT_average 3 >=36 && <=48 CT_low 3 < 36 Grade_high 3 >=225 && <=300 Grade_average 3 >=180 && < 225 Grade_low 3 < 180 Transformation rule table for 3.0 credit course Student_ ID SecA_ high SecA _average SecA_ low SecB _high SecB _average SecB_ low CT_ high CT _average CT _low Grade_ high Grade_ average Grade_ low 0805001 1 0 0 1 0 0 1 0 0 1 0 0 0805002 1 0 0 0 1 0 0 1 0 1 0 0 … … … … … … … … … … … … … Transformed table from universal table Preprocessing of Academic Data for Mining Association Rule 13
  • 14. Association Rules for Academic Data No. Interested Association Rule Purpose 1. Course_No => CGPA_high Performance of Individual Course 2. Course_No => CGPA_low 3. Sec_A_high => CGPA_high Impact of Section of Answer Script 4. Sec_B_high => CGPA_high 5. CT_high =>CGPA_high Impact of Class test 6. CT_low => CGPA_low 7. Hall_Resident =>CGPA_low Impact of Residence 8. Attached =>CGPA_high 9. Course_No_1=> Course_No_2 Correlation of courses 10. (Course_No_1,Course_No_2) => Course_No_3 11. Permanent_Address_City =>CGPA_high Impact of locality 12. Permanent_Address_Rural =>CGPA_low Preprocessing of Academic Data for Mining Association Rule 14
  • 15. Future Work Academic Performance Family Background Previous Academic Record Seat Allotment in Hall Offering Scholarship Abandonment/Retention Stay Duration Session Jam Unwanted leaves Long term break Condition of Institution Average CGPA of all students Term completion rate Abandonment/retention rate Research & Publications Developing new mining algorithm which will be tested using the synthetic dataset Collecting real data from BIIS and using without disclosing privacy to discover the Knowledge Preprocessing of Academic Data for Mining Association Rule 15
  • 16. Conclusions Applies association rule mining algorithms to transform continuous valued attribute into resemble the required educational knowledge Guides to discover the required knowledge using the realistic dataset and apply them in real life scenario Developing Model using BIIS data but can be generalized for application to any higher educational institution Preprocessing of Academic Data for Mining Association Rule 16
  • 17. Any Question or Suggestion is Welcome Preprocessing of Academic Data for Mining Association Rule 17