SlideShare ist ein Scribd-Unternehmen logo
1 von 31
Students Academic
Performance
Knowledge Discovery from Data
Introduction..
 Our project aim is to find students academic performance
and find out whether there is any general pattern in their
marks and performance.
 So here ,We are analyzing both internal and external
marks of a student.
 We did the following KDD preprocessing steps to mine
our data.
Learning the application domain
 Learning the application domain is the first step in KDD
process .
 Need to have a clear understanding about the application
domain and our objectives.
 The institution considered for mining is MCA batch of Rajagiri
College of Social Sciences.
 We collected all previous year academic record from the
department of computer science
Create a target data set:
data selection
 We selected 2007-2010 batch marks for analysing the
pattern.
 There were around 45 records(45 students).
 Both the internal and external marks of each student were
selected, in order to find out the performance pattern.
Internal & External Dataset
Data cleaning & preprocessing
 Data cleaning is the step where noise and irrelevant data are
removed from the large data set.
 This is a very important pre-processing step because our
outcome would be dependent on the quality of selected data.
 Remove duplicate records, enter logically correct values for
missing records(absent students), remove unnecessary data
fields and standardize data format.
 There was no much duplicate data or unnecessary data in the
collected record . The dataset was partially cleaned.
 Student internal mark and external mark were stored in
different records.
 By applying data integration these records were integrated
into one record.
 The new dataset consist of internal mark details and external
mark details of each student in one record.
Data reduction & transformation
 Data is transformed into appropriate form for making it ready for
data mining step.
 The dataset contains marks of 5 theory paper and 2 lab paper of
all 5 semesters.
 These marks are transformed into sum of internal marks and sum
of external marks of each student for the easiness of analysing
the pattern.
Cluster Analysis
 The data mining technique we used here is clustering.
 A cluster is a collection of data objects that are similar to
one another within same cluster and are dissimilar to
objects in other cluster.
 We first partitioned the set of data into groups based on
data similarity and then assign labels
Choosing functions of data mining
K-MEANS Partitioning
 The K-means algorithm takes input parameter k and
partitions the set of n objects into k clusters.
 Here we selected no: of cluster as 4
 Objects are distributed to a cluster based on cluster
center to which it is nearest.
 For each semester we found out the clusters separately
and labeled them as students Excellent, Good, Fair and
Poor
Choosing mining algorithms
The Tool used for pattern evaluation is ORANGE
Orange Cluster Analysis
No of cluster selected is 4
Semester 1
poor
Fair
Good
Excellent
Semester 2
Semester 3
Semester 4
Semester 5
Centroid Analysis
Semester 1
Semester 2
Semester 3
Semester 4
Semester 5
Combined Centroid Analysis
Data mining search for patterns of
interest
 From the mining process we found that “All the 5 semester
clusters followed the same pattern of performance”.
 A student with high internal mark has higher external
marks and a student with less internal marks has less
external marks.
 There is a direct relation between the internal and the
external marks.
 At some case this evaluation is not valid, cases like
 Being absent for internal exam and scoring high marks for
the externals (vice versa)
CONCLUSION
 A students performance in his university exam can be
predicted with the help of his internal marks. There is
a direct relation between the internal and the external
marks.
 A student with low internals will get low marks for
externals too
Use of discovered knowledge
representation
Thank You

Weitere ähnliche Inhalte

Was ist angesagt?

408372362-Student-Result-management-System-project-report-docx.docx
408372362-Student-Result-management-System-project-report-docx.docx408372362-Student-Result-management-System-project-report-docx.docx
408372362-Student-Result-management-System-project-report-docx.docx
santhoshyadav23
 
Training and placement
Training and placementTraining and placement
Training and placement
Bhavesh Parmar
 

Was ist angesagt? (20)

School fee-management-system
School fee-management-systemSchool fee-management-system
School fee-management-system
 
Decision tree
Decision treeDecision tree
Decision tree
 
408372362-Student-Result-management-System-project-report-docx.docx
408372362-Student-Result-management-System-project-report-docx.docx408372362-Student-Result-management-System-project-report-docx.docx
408372362-Student-Result-management-System-project-report-docx.docx
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 
Advance Java Programming( CM5I) 4. Networking Basics
Advance Java Programming( CM5I) 4. Networking BasicsAdvance Java Programming( CM5I) 4. Networking Basics
Advance Java Programming( CM5I) 4. Networking Basics
 
Data types in java
Data types in javaData types in java
Data types in java
 
Inheritance in java
Inheritance in javaInheritance in java
Inheritance in java
 
OOPS with C++ | Concepts of OOPS | Introduction
OOPS with C++ | Concepts of OOPS | IntroductionOOPS with C++ | Concepts of OOPS | Introduction
OOPS with C++ | Concepts of OOPS | Introduction
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation
 
ER DIAGRAM & ER MODELING IN DBMS
ER DIAGRAM & ER MODELING IN DBMSER DIAGRAM & ER MODELING IN DBMS
ER DIAGRAM & ER MODELING IN DBMS
 
COVID - 19 DATA ANALYSIS USING PYTHON and Introduction to Data Science
COVID - 19 DATA ANALYSIS USING PYTHON and Introduction to Data ScienceCOVID - 19 DATA ANALYSIS USING PYTHON and Introduction to Data Science
COVID - 19 DATA ANALYSIS USING PYTHON and Introduction to Data Science
 
Heart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine LearningHeart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine Learning
 
Student Result Management System
Student Result  Management System Student Result  Management System
Student Result Management System
 
MS Office and Oracle Lab Manual
MS Office and Oracle Lab Manual MS Office and Oracle Lab Manual
MS Office and Oracle Lab Manual
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
 
Placement management system
Placement management systemPlacement management system
Placement management system
 
Training and placement
Training and placementTraining and placement
Training and placement
 
SQL - RDBMS Concepts
SQL - RDBMS ConceptsSQL - RDBMS Concepts
SQL - RDBMS Concepts
 
CDMS-PPT-fzq94g.pptx
CDMS-PPT-fzq94g.pptxCDMS-PPT-fzq94g.pptx
CDMS-PPT-fzq94g.pptx
 
A Literature Survey on Student Profile Management System
A Literature Survey on Student Profile Management SystemA Literature Survey on Student Profile Management System
A Literature Survey on Student Profile Management System
 

Andere mochten auch

A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
Editor IJCATR
 
Attendance and student performance arp (1)
Attendance and student performance arp (1)Attendance and student performance arp (1)
Attendance and student performance arp (1)
Cindy Paynter
 
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabusSocial Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
Jakub Ruzicka
 
The effects of skipping breakfast on the academic performance
The effects of skipping breakfast on the academic performance The effects of skipping breakfast on the academic performance
The effects of skipping breakfast on the academic performance
Hafizah R
 

Andere mochten auch (20)

Factors affecting the academic performance of college students (1)
Factors affecting the academic performance of college students (1)Factors affecting the academic performance of college students (1)
Factors affecting the academic performance of college students (1)
 
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
 
LinkedIn Summer Sales Guide - B2B Sales Influencers #LISummerGuide
LinkedIn Summer Sales Guide - B2B Sales Influencers #LISummerGuideLinkedIn Summer Sales Guide - B2B Sales Influencers #LISummerGuide
LinkedIn Summer Sales Guide - B2B Sales Influencers #LISummerGuide
 
Sania rtp
Sania rtpSania rtp
Sania rtp
 
Smartcards and Authentication Tokens
Smartcards and Authentication TokensSmartcards and Authentication Tokens
Smartcards and Authentication Tokens
 
Data Mining _ Weka
Data Mining _ WekaData Mining _ Weka
Data Mining _ Weka
 
Attendance and student performance arp (1)
Attendance and student performance arp (1)Attendance and student performance arp (1)
Attendance and student performance arp (1)
 
Some Thoughts on Learning Analytics and Educational Data Mining
Some Thoughts on Learning Analytics and Educational Data MiningSome Thoughts on Learning Analytics and Educational Data Mining
Some Thoughts on Learning Analytics and Educational Data Mining
 
Data Mining Project for student academic specialization and performance
Data Mining Project for student academic specialization and performanceData Mining Project for student academic specialization and performance
Data Mining Project for student academic specialization and performance
 
Mining Student Data LIVE_EUR_v2
Mining Student Data LIVE_EUR_v2Mining Student Data LIVE_EUR_v2
Mining Student Data LIVE_EUR_v2
 
Grand challenges for the Educational Data Mining and Learning Sciences Commun...
Grand challenges for the Educational Data Mining and Learning Sciences Commun...Grand challenges for the Educational Data Mining and Learning Sciences Commun...
Grand challenges for the Educational Data Mining and Learning Sciences Commun...
 
Provision and management of school plant as a correlate of science students a...
Provision and management of school plant as a correlate of science students a...Provision and management of school plant as a correlate of science students a...
Provision and management of school plant as a correlate of science students a...
 
Predicting Student Performance in Solving Parameterized Exercises
Predicting Student Performance in Solving Parameterized ExercisesPredicting Student Performance in Solving Parameterized Exercises
Predicting Student Performance in Solving Parameterized Exercises
 
Ethical Hacking
Ethical HackingEthical Hacking
Ethical Hacking
 
Solar and wind power forecasting
Solar and wind power forecastingSolar and wind power forecasting
Solar and wind power forecasting
 
USING LEARNING ANALYTICS TO PREDICT STUDENTS’ PERFORMANCE IN MOODLE LMS
USING LEARNING ANALYTICS TO PREDICT STUDENTS’ PERFORMANCE IN MOODLE LMSUSING LEARNING ANALYTICS TO PREDICT STUDENTS’ PERFORMANCE IN MOODLE LMS
USING LEARNING ANALYTICS TO PREDICT STUDENTS’ PERFORMANCE IN MOODLE LMS
 
My First Data Science Project (using Rapid Miner)
My First Data Science Project (using Rapid Miner)My First Data Science Project (using Rapid Miner)
My First Data Science Project (using Rapid Miner)
 
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabusSocial Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
 
The effects of skipping breakfast on the academic performance
The effects of skipping breakfast on the academic performance The effects of skipping breakfast on the academic performance
The effects of skipping breakfast on the academic performance
 
Big Data in Education
Big Data in EducationBig Data in Education
Big Data in Education
 

Ähnlich wie Students academic performance using clustering technique

A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
Editor IJCATR
 
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
IIRindia
 

Ähnlich wie Students academic performance using clustering technique (20)

EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT’S ACADEMIC PERFORMANCE
EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT’S ACADEMIC PERFORMANCE EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT’S ACADEMIC PERFORMANCE
EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT’S ACADEMIC PERFORMANCE
 
IRJET- Academic Performance Analysis System
IRJET- Academic Performance Analysis SystemIRJET- Academic Performance Analysis System
IRJET- Academic Performance Analysis System
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 
Data Clustering in Education for Students
Data Clustering in Education for StudentsData Clustering in Education for Students
Data Clustering in Education for Students
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithms
 
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
 
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
DATA MINING METHODOLOGIES TO  STUDY STUDENT'S ACADEMIC  PERFORMANCE USING THE...DATA MINING METHODOLOGIES TO  STUDY STUDENT'S ACADEMIC  PERFORMANCE USING THE...
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
 
Big data project
Big data projectBig data project
Big data project
 
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
 
A Survey on the Classification Techniques In Educational Data Mining
A Survey on the Classification Techniques In Educational Data MiningA Survey on the Classification Techniques In Educational Data Mining
A Survey on the Classification Techniques In Educational Data Mining
 
Clustering Students of Computer in Terms of Level of Programming
Clustering Students of Computer in Terms of Level of ProgrammingClustering Students of Computer in Terms of Level of Programming
Clustering Students of Computer in Terms of Level of Programming
 
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
 
IRJET- Using Data Mining to Predict Students Performance
IRJET-  	  Using Data Mining to Predict Students PerformanceIRJET-  	  Using Data Mining to Predict Students Performance
IRJET- Using Data Mining to Predict Students Performance
 
Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
 
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
Fuzzy Association Rule Mining based Model to Predict Students’ Performance Fuzzy Association Rule Mining based Model to Predict Students’ Performance
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
 
Brown, chapter 4 By Savaedi
Brown, chapter 4 By SavaediBrown, chapter 4 By Savaedi
Brown, chapter 4 By Savaedi
 
Correlation based feature selection (cfs) technique to predict student perfro...
Correlation based feature selection (cfs) technique to predict student perfro...Correlation based feature selection (cfs) technique to predict student perfro...
Correlation based feature selection (cfs) technique to predict student perfro...
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
 

Mehr von saniacorreya (6)

PROJECT REPORT ON CRYPTOGRAPHIC ALGORITHM
PROJECT REPORT ON CRYPTOGRAPHIC ALGORITHMPROJECT REPORT ON CRYPTOGRAPHIC ALGORITHM
PROJECT REPORT ON CRYPTOGRAPHIC ALGORITHM
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Color and human vision
Color and human visionColor and human vision
Color and human vision
 
Manipulator robot for crack detection and welding
Manipulator robot for crack detection and weldingManipulator robot for crack detection and welding
Manipulator robot for crack detection and welding
 
Windows 10 ppt
Windows 10 pptWindows 10 ppt
Windows 10 ppt
 
Li fi
Li fiLi fi
Li fi
 

Kürzlich hochgeladen

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Kürzlich hochgeladen (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 

Students academic performance using clustering technique

  • 2. Introduction..  Our project aim is to find students academic performance and find out whether there is any general pattern in their marks and performance.  So here ,We are analyzing both internal and external marks of a student.  We did the following KDD preprocessing steps to mine our data.
  • 3. Learning the application domain  Learning the application domain is the first step in KDD process .  Need to have a clear understanding about the application domain and our objectives.  The institution considered for mining is MCA batch of Rajagiri College of Social Sciences.  We collected all previous year academic record from the department of computer science
  • 4. Create a target data set: data selection  We selected 2007-2010 batch marks for analysing the pattern.  There were around 45 records(45 students).  Both the internal and external marks of each student were selected, in order to find out the performance pattern.
  • 6. Data cleaning & preprocessing  Data cleaning is the step where noise and irrelevant data are removed from the large data set.  This is a very important pre-processing step because our outcome would be dependent on the quality of selected data.  Remove duplicate records, enter logically correct values for missing records(absent students), remove unnecessary data fields and standardize data format.
  • 7.  There was no much duplicate data or unnecessary data in the collected record . The dataset was partially cleaned.  Student internal mark and external mark were stored in different records.  By applying data integration these records were integrated into one record.  The new dataset consist of internal mark details and external mark details of each student in one record.
  • 8.
  • 9. Data reduction & transformation  Data is transformed into appropriate form for making it ready for data mining step.  The dataset contains marks of 5 theory paper and 2 lab paper of all 5 semesters.  These marks are transformed into sum of internal marks and sum of external marks of each student for the easiness of analysing the pattern.
  • 10.
  • 11. Cluster Analysis  The data mining technique we used here is clustering.  A cluster is a collection of data objects that are similar to one another within same cluster and are dissimilar to objects in other cluster.  We first partitioned the set of data into groups based on data similarity and then assign labels Choosing functions of data mining
  • 12. K-MEANS Partitioning  The K-means algorithm takes input parameter k and partitions the set of n objects into k clusters.  Here we selected no: of cluster as 4  Objects are distributed to a cluster based on cluster center to which it is nearest.  For each semester we found out the clusters separately and labeled them as students Excellent, Good, Fair and Poor Choosing mining algorithms
  • 13. The Tool used for pattern evaluation is ORANGE
  • 15. No of cluster selected is 4
  • 28. Data mining search for patterns of interest  From the mining process we found that “All the 5 semester clusters followed the same pattern of performance”.  A student with high internal mark has higher external marks and a student with less internal marks has less external marks.  There is a direct relation between the internal and the external marks.  At some case this evaluation is not valid, cases like  Being absent for internal exam and scoring high marks for the externals (vice versa)
  • 29. CONCLUSION  A students performance in his university exam can be predicted with the help of his internal marks. There is a direct relation between the internal and the external marks.  A student with low internals will get low marks for externals too
  • 30. Use of discovered knowledge representation