SlideShare a Scribd company logo
1 of 18
Weka: An Introduction 1
Weka – An Introduction
Weka: An Introduction 2
WEKA
 Waikato Environment for Knowledge Analysis (WEKA)
 Developed by the Department of Computer Science, University
of Waikato, New Zealand
 Machine learning/data mining software written in Java
(distributed under the GNU Public License)
 Used for research, education, and applications
 http://www.cs.waikato.ac.nz/ml/weka/
Weka: An Introduction 3
Weka Interfaces
 Explorer
 Preprocessing, attribute selection, learning, visualization
 Knowledge Flow
 Visual design of KDD process
 Experimenter
 testing and evaluating machine learning algorithms
 Command-line
Weka: An Introduction 4
Data Formats
 Uses flat text files to describe the data
 Can work with a wide variety of data files including its own “.arff”
format and C4.5 file formats
 Data can be imported from a file in various formats: ARFF,
CSV, C4.5 etc.
Weka: An Introduction 5
Data Formats (Contd.)
 ARFF (Attribute Relation File Format)
@relation person
@attribute age numeric
@attribute name string
@attribute education {College, Masters, Doctorate}
@attribute class {>50K,<=50K}
@data
 Supported Data types
 Numeric
 String
 Nominal
 Date
 Relational
Weka: An Introduction 6
Explorer
 Supports Exploratory Data Analysis
 Preprocess: Choose and modify the data being acted on.
 Classify: Train and test learning schemes that classify or
perform regression.
 Cluster: Learn clusters for the data.
 Associate: Learn association rules for the data.
 Select attributes: Select the most relevant attributes in the
data.
 Visualize: View an interactive 2D plot of the data.
Weka: An Introduction 7
Explorer - Preprocessing
 Loading Data
 Open file
 Open URL
 Open DB
 Generate
 Native format – ARFF
 Supports file Conversions
Weka: An Introduction 8
Explorer – Applying Filters
 Supervised Vs Unsupervised Filters
 Attribute Vs Instance Filters
 Unsupervised Attribute Filters
Add-Adds a new attribute
Normalize-Scales all numeric values
Remove-Remove Attributes (RemoveType / RemoveUseless)
 Unsupervised Instance Filters
Randomize- Randomize order of instance in a dataset
RemoveWithValues- Filter out instances with certain attribute values
 Supervised Attribute Filters
AttributeSelection- Attribute Selection Methods
Discretize- Convert Numeric attributes to nominal
 Supervised Instance Filters
Resample- Produce a random sub sample of a dataset
Weka: An Introduction 9
Classifiers
 Bayes - BayesNet, NaiveBayes
 Trees - ID3, J48
 Rules - OneR, Conjunctive Rule
 Functions - Linear Regression,
RBFNetwork,
Multilayer Perceptron
 Lazy - KStar, IBk
 Miscellaneous- VFI
Weka: An Introduction 10
Clusterers
 OPTICS
 DBScan
 SimpleKMeans
 Cobweb
Weka: An Introduction 11
Associations
 Apriori
 Predictive Apriori
 Filtered Associator
Weka: An Introduction 12
Attribute Selection
 Attribute Evaluators
 CfsSubsetEval
 ClassifierSubsetEval
 GainRatioAttributeEval
 InfoGainAttributeEval
 Search Method
 Best First
 Exhaustive Search
 Genetic Search
 Rank Search
Weka: An Introduction 13
Knowledge Flow Interface
 Data-flow inspired interface to WEKA
 process data in batches or incrementally
 process multiple batches or streams in parallel
(each separate flow executes in its own thread)
 chain filters together
 visualize performance of incremental classifiers
during processing
Weka: An Introduction 14
Experimenter Interface
 Enables the user to create, run, modify, and analyse
experiments in a more convenient manner
 Modes of Operation
 Simple
 Advanced
 Local / Remote Experiments are supported
Weka: An Introduction 15
Command Line Interface
 Plain text panel from where commands can be
entered
 java <classname> [<args>] invokes a java class with
the given arguments (if any)
 break stops the current thread, e.g., a running classifier,
in a friendly manner
 kill stops the current thread in an unfriendly fashion
 cls clears the output area
 exit exits the Simple CLI
 help [<command>]
Weka: An Introduction 16
Weka Operation
 The Operating System’s command line interface can also be
used after setting the CLASSPATH accordingly.
 All the functionality supported by Weka can also be invoked
from one’s own source code.
Weka: An Introduction 17
Weka Extensions
 BioWeka - Extension library for knowledge discovery in
biology
 WekaMetal - Meta learning extension to WEKA
 Weka-Parallel - Parallel processing for WEKA
 Grid Weka - Grid computing using WEKA
Weka: An Introduction 18
References
 Witten, I.H. and Frank, E. (2005) Data Mining: Practical machine
learning tools and techniques. 2nd edition Morgan Kaufmann,
San Francisco
 Weka Knowledge Flow Tutorial, Mark Hall Peter Reutemann
http://www.inf.fh-
dortmund.de/personen/professoren/engels/dm/praktikum/WEKA-
KnowledgeFlowTutorial-3-5-7.pdf
 WEKA Manual for Version 3-6-2 - Remco R. Bouckaert, Eibe
Frank et.al, January 11, 2010

More Related Content

What's hot

Weka a tool_for_exploratory_data_mining
Weka a tool_for_exploratory_data_miningWeka a tool_for_exploratory_data_mining
Weka a tool_for_exploratory_data_miningTony Frame
 
data mining with weka application
data mining with weka applicationdata mining with weka application
data mining with weka applicationRezapourabbas
 
Data Mining with WEKA WEKA
Data Mining with WEKA WEKAData Mining with WEKA WEKA
Data Mining with WEKA WEKAbutest
 
WEKA Tutorial
WEKA TutorialWEKA Tutorial
WEKA Tutorialbutest
 
Machine Learning with WEKA
Machine Learning with WEKAMachine Learning with WEKA
Machine Learning with WEKAbutest
 
Weka presentation
Weka presentationWeka presentation
Weka presentationSaeed Iqbal
 
Analytics machine learning in weka
Analytics machine learning in wekaAnalytics machine learning in weka
Analytics machine learning in wekaSudhakar Chavan
 
Data mining with Weka
Data mining with WekaData mining with Weka
Data mining with WekaAlbanLevy
 
A simple introduction to weka
A simple introduction to wekaA simple introduction to weka
A simple introduction to wekaPamoda Vajiramali
 
Weka presentation
Weka presentationWeka presentation
Weka presentationAbrar ali
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using wekarathorenitin87
 
Association Rule Mining Using WEKA
Association Rule Mining Using WEKAAssociation Rule Mining Using WEKA
Association Rule Mining Using WEKAProthoma Diteeya
 
WEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And AttributesWEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And AttributesDataminingTools Inc
 

What's hot (17)

Weka a tool_for_exploratory_data_mining
Weka a tool_for_exploratory_data_miningWeka a tool_for_exploratory_data_mining
Weka a tool_for_exploratory_data_mining
 
data mining with weka application
data mining with weka applicationdata mining with weka application
data mining with weka application
 
Data Mining with WEKA WEKA
Data Mining with WEKA WEKAData Mining with WEKA WEKA
Data Mining with WEKA WEKA
 
WEKA Tutorial
WEKA TutorialWEKA Tutorial
WEKA Tutorial
 
Machine Learning with WEKA
Machine Learning with WEKAMachine Learning with WEKA
Machine Learning with WEKA
 
Wek1
Wek1Wek1
Wek1
 
Weka presentation
Weka presentationWeka presentation
Weka presentation
 
Analytics machine learning in weka
Analytics machine learning in wekaAnalytics machine learning in weka
Analytics machine learning in weka
 
weka data mining
weka data mining weka data mining
weka data mining
 
Data mining with Weka
Data mining with WekaData mining with Weka
Data mining with Weka
 
Weka bike rental
Weka bike rentalWeka bike rental
Weka bike rental
 
A simple introduction to weka
A simple introduction to wekaA simple introduction to weka
A simple introduction to weka
 
Weka presentation
Weka presentationWeka presentation
Weka presentation
 
Weka library, JAVA
Weka library, JAVAWeka library, JAVA
Weka library, JAVA
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using weka
 
Association Rule Mining Using WEKA
Association Rule Mining Using WEKAAssociation Rule Mining Using WEKA
Association Rule Mining Using WEKA
 
WEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And AttributesWEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And Attributes
 

Viewers also liked

2.3 shortest path dijkstra’s
2.3 shortest path dijkstra’s 2.3 shortest path dijkstra’s
2.3 shortest path dijkstra’s Krish_ver2
 
5.5 back track
5.5 back track5.5 back track
5.5 back trackKrish_ver2
 
4.4 external hashing
4.4 external hashing4.4 external hashing
4.4 external hashingKrish_ver2
 
2.4 rule based classification
2.4 rule based classification2.4 rule based classification
2.4 rule based classificationKrish_ver2
 

Viewers also liked (6)

2.5 graph dfs
2.5 graph dfs2.5 graph dfs
2.5 graph dfs
 
2.3 shortest path dijkstra’s
2.3 shortest path dijkstra’s 2.3 shortest path dijkstra’s
2.3 shortest path dijkstra’s
 
2.5 dfs & bfs
2.5 dfs & bfs2.5 dfs & bfs
2.5 dfs & bfs
 
5.5 back track
5.5 back track5.5 back track
5.5 back track
 
4.4 external hashing
4.4 external hashing4.4 external hashing
4.4 external hashing
 
2.4 rule based classification
2.4 rule based classification2.4 rule based classification
2.4 rule based classification
 

Similar to 1.5 weka an intoduction

An Introduction To Weka
An Introduction To WekaAn Introduction To Weka
An Introduction To Wekaweka Content
 
Weka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data miningWeka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data miningKeshab Kumar Gaurav
 
Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013
Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013
Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013anpawlik
 
Weka toolkit introduction
Weka toolkit introductionWeka toolkit introduction
Weka toolkit introductionbutest
 
Java se 8 fundamentals
Java se 8 fundamentalsJava se 8 fundamentals
Java se 8 fundamentalsmegharajk
 
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...Timothy McPhillips
 
WEKA:Introduction To Weka
WEKA:Introduction To WekaWEKA:Introduction To Weka
WEKA:Introduction To Wekaweka Content
 
Development of Java tools using SWT and WALA af Hans Søndergaard, ViaUC
Development of Java tools using SWT and WALA af Hans Søndergaard, ViaUCDevelopment of Java tools using SWT and WALA af Hans Søndergaard, ViaUC
Development of Java tools using SWT and WALA af Hans Søndergaard, ViaUCInfinIT - Innovationsnetværket for it
 
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...Helena Edelson
 
JavaBasicsCore1.ppt
JavaBasicsCore1.pptJavaBasicsCore1.ppt
JavaBasicsCore1.pptbuvanabala
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Ben Busby
 
Manuel - SPR - Intro to Java Language_2016
Manuel - SPR - Intro to Java Language_2016Manuel - SPR - Intro to Java Language_2016
Manuel - SPR - Intro to Java Language_2016Manuel Fomitescu
 
Dissecting Open Source Cloud Evolution: An OpenStack Case Study
Dissecting Open Source Cloud Evolution: An OpenStack Case StudyDissecting Open Source Cloud Evolution: An OpenStack Case Study
Dissecting Open Source Cloud Evolution: An OpenStack Case StudySalman Baset
 
Terraform modules restructured
Terraform modules restructuredTerraform modules restructured
Terraform modules restructuredAmi Mahloof
 
Terraform Modules Restructured
Terraform Modules RestructuredTerraform Modules Restructured
Terraform Modules RestructuredDoiT International
 

Similar to 1.5 weka an intoduction (20)

An Introduction To Weka
An Introduction To WekaAn Introduction To Weka
An Introduction To Weka
 
Weka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data miningWeka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data mining
 
Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013
Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013
Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013
 
Unit 1 notes.pdf
Unit 1 notes.pdfUnit 1 notes.pdf
Unit 1 notes.pdf
 
Weka toolkit introduction
Weka toolkit introductionWeka toolkit introduction
Weka toolkit introduction
 
Java se 8 fundamentals
Java se 8 fundamentalsJava se 8 fundamentals
Java se 8 fundamentals
 
BioWeka
BioWekaBioWeka
BioWeka
 
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
 
WEKA:Introduction To Weka
WEKA:Introduction To WekaWEKA:Introduction To Weka
WEKA:Introduction To Weka
 
Weka
WekaWeka
Weka
 
Java se7 features
Java se7 featuresJava se7 features
Java se7 features
 
Development of Java tools using SWT and WALA af Hans Søndergaard, ViaUC
Development of Java tools using SWT and WALA af Hans Søndergaard, ViaUCDevelopment of Java tools using SWT and WALA af Hans Søndergaard, ViaUC
Development of Java tools using SWT and WALA af Hans Søndergaard, ViaUC
 
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
 
JavaBasicsCore1.ppt
JavaBasicsCore1.pptJavaBasicsCore1.ppt
JavaBasicsCore1.ppt
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
 
Manuel - SPR - Intro to Java Language_2016
Manuel - SPR - Intro to Java Language_2016Manuel - SPR - Intro to Java Language_2016
Manuel - SPR - Intro to Java Language_2016
 
Dissecting Open Source Cloud Evolution: An OpenStack Case Study
Dissecting Open Source Cloud Evolution: An OpenStack Case StudyDissecting Open Source Cloud Evolution: An OpenStack Case Study
Dissecting Open Source Cloud Evolution: An OpenStack Case Study
 
IMPACT/myGrid Hackathon - Taverna Roadmap
IMPACT/myGrid Hackathon - Taverna RoadmapIMPACT/myGrid Hackathon - Taverna Roadmap
IMPACT/myGrid Hackathon - Taverna Roadmap
 
Terraform modules restructured
Terraform modules restructuredTerraform modules restructured
Terraform modules restructured
 
Terraform Modules Restructured
Terraform Modules RestructuredTerraform Modules Restructured
Terraform Modules Restructured
 

More from Krish_ver2

5.5 back tracking
5.5 back tracking5.5 back tracking
5.5 back trackingKrish_ver2
 
5.5 back tracking 02
5.5 back tracking 025.5 back tracking 02
5.5 back tracking 02Krish_ver2
 
5.4 randomized datastructures
5.4 randomized datastructures5.4 randomized datastructures
5.4 randomized datastructuresKrish_ver2
 
5.4 randomized datastructures
5.4 randomized datastructures5.4 randomized datastructures
5.4 randomized datastructuresKrish_ver2
 
5.4 randamized algorithm
5.4 randamized algorithm5.4 randamized algorithm
5.4 randamized algorithmKrish_ver2
 
5.3 dynamic programming 03
5.3 dynamic programming 035.3 dynamic programming 03
5.3 dynamic programming 03Krish_ver2
 
5.3 dynamic programming
5.3 dynamic programming5.3 dynamic programming
5.3 dynamic programmingKrish_ver2
 
5.3 dyn algo-i
5.3 dyn algo-i5.3 dyn algo-i
5.3 dyn algo-iKrish_ver2
 
5.2 divede and conquer 03
5.2 divede and conquer 035.2 divede and conquer 03
5.2 divede and conquer 03Krish_ver2
 
5.2 divide and conquer
5.2 divide and conquer5.2 divide and conquer
5.2 divide and conquerKrish_ver2
 
5.2 divede and conquer 03
5.2 divede and conquer 035.2 divede and conquer 03
5.2 divede and conquer 03Krish_ver2
 
5.1 greedyyy 02
5.1 greedyyy 025.1 greedyyy 02
5.1 greedyyy 02Krish_ver2
 
4.4 hashing ext
4.4 hashing  ext4.4 hashing  ext
4.4 hashing extKrish_ver2
 

More from Krish_ver2 (20)

5.5 back tracking
5.5 back tracking5.5 back tracking
5.5 back tracking
 
5.5 back tracking 02
5.5 back tracking 025.5 back tracking 02
5.5 back tracking 02
 
5.4 randomized datastructures
5.4 randomized datastructures5.4 randomized datastructures
5.4 randomized datastructures
 
5.4 randomized datastructures
5.4 randomized datastructures5.4 randomized datastructures
5.4 randomized datastructures
 
5.4 randamized algorithm
5.4 randamized algorithm5.4 randamized algorithm
5.4 randamized algorithm
 
5.3 dynamic programming 03
5.3 dynamic programming 035.3 dynamic programming 03
5.3 dynamic programming 03
 
5.3 dynamic programming
5.3 dynamic programming5.3 dynamic programming
5.3 dynamic programming
 
5.3 dyn algo-i
5.3 dyn algo-i5.3 dyn algo-i
5.3 dyn algo-i
 
5.2 divede and conquer 03
5.2 divede and conquer 035.2 divede and conquer 03
5.2 divede and conquer 03
 
5.2 divide and conquer
5.2 divide and conquer5.2 divide and conquer
5.2 divide and conquer
 
5.2 divede and conquer 03
5.2 divede and conquer 035.2 divede and conquer 03
5.2 divede and conquer 03
 
5.1 greedyyy 02
5.1 greedyyy 025.1 greedyyy 02
5.1 greedyyy 02
 
5.1 greedy
5.1 greedy5.1 greedy
5.1 greedy
 
5.1 greedy 03
5.1 greedy 035.1 greedy 03
5.1 greedy 03
 
4.4 hashing02
4.4 hashing024.4 hashing02
4.4 hashing02
 
4.4 hashing
4.4 hashing4.4 hashing
4.4 hashing
 
4.4 hashing ext
4.4 hashing  ext4.4 hashing  ext
4.4 hashing ext
 
4.2 bst
4.2 bst4.2 bst
4.2 bst
 
4.2 bst 03
4.2 bst 034.2 bst 03
4.2 bst 03
 
4.2 bst 02
4.2 bst 024.2 bst 02
4.2 bst 02
 

Recently uploaded

ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 

Recently uploaded (20)

YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 

1.5 weka an intoduction

  • 1. Weka: An Introduction 1 Weka – An Introduction
  • 2. Weka: An Introduction 2 WEKA  Waikato Environment for Knowledge Analysis (WEKA)  Developed by the Department of Computer Science, University of Waikato, New Zealand  Machine learning/data mining software written in Java (distributed under the GNU Public License)  Used for research, education, and applications  http://www.cs.waikato.ac.nz/ml/weka/
  • 3. Weka: An Introduction 3 Weka Interfaces  Explorer  Preprocessing, attribute selection, learning, visualization  Knowledge Flow  Visual design of KDD process  Experimenter  testing and evaluating machine learning algorithms  Command-line
  • 4. Weka: An Introduction 4 Data Formats  Uses flat text files to describe the data  Can work with a wide variety of data files including its own “.arff” format and C4.5 file formats  Data can be imported from a file in various formats: ARFF, CSV, C4.5 etc.
  • 5. Weka: An Introduction 5 Data Formats (Contd.)  ARFF (Attribute Relation File Format) @relation person @attribute age numeric @attribute name string @attribute education {College, Masters, Doctorate} @attribute class {>50K,<=50K} @data  Supported Data types  Numeric  String  Nominal  Date  Relational
  • 6. Weka: An Introduction 6 Explorer  Supports Exploratory Data Analysis  Preprocess: Choose and modify the data being acted on.  Classify: Train and test learning schemes that classify or perform regression.  Cluster: Learn clusters for the data.  Associate: Learn association rules for the data.  Select attributes: Select the most relevant attributes in the data.  Visualize: View an interactive 2D plot of the data.
  • 7. Weka: An Introduction 7 Explorer - Preprocessing  Loading Data  Open file  Open URL  Open DB  Generate  Native format – ARFF  Supports file Conversions
  • 8. Weka: An Introduction 8 Explorer – Applying Filters  Supervised Vs Unsupervised Filters  Attribute Vs Instance Filters  Unsupervised Attribute Filters Add-Adds a new attribute Normalize-Scales all numeric values Remove-Remove Attributes (RemoveType / RemoveUseless)  Unsupervised Instance Filters Randomize- Randomize order of instance in a dataset RemoveWithValues- Filter out instances with certain attribute values  Supervised Attribute Filters AttributeSelection- Attribute Selection Methods Discretize- Convert Numeric attributes to nominal  Supervised Instance Filters Resample- Produce a random sub sample of a dataset
  • 9. Weka: An Introduction 9 Classifiers  Bayes - BayesNet, NaiveBayes  Trees - ID3, J48  Rules - OneR, Conjunctive Rule  Functions - Linear Regression, RBFNetwork, Multilayer Perceptron  Lazy - KStar, IBk  Miscellaneous- VFI
  • 10. Weka: An Introduction 10 Clusterers  OPTICS  DBScan  SimpleKMeans  Cobweb
  • 11. Weka: An Introduction 11 Associations  Apriori  Predictive Apriori  Filtered Associator
  • 12. Weka: An Introduction 12 Attribute Selection  Attribute Evaluators  CfsSubsetEval  ClassifierSubsetEval  GainRatioAttributeEval  InfoGainAttributeEval  Search Method  Best First  Exhaustive Search  Genetic Search  Rank Search
  • 13. Weka: An Introduction 13 Knowledge Flow Interface  Data-flow inspired interface to WEKA  process data in batches or incrementally  process multiple batches or streams in parallel (each separate flow executes in its own thread)  chain filters together  visualize performance of incremental classifiers during processing
  • 14. Weka: An Introduction 14 Experimenter Interface  Enables the user to create, run, modify, and analyse experiments in a more convenient manner  Modes of Operation  Simple  Advanced  Local / Remote Experiments are supported
  • 15. Weka: An Introduction 15 Command Line Interface  Plain text panel from where commands can be entered  java <classname> [<args>] invokes a java class with the given arguments (if any)  break stops the current thread, e.g., a running classifier, in a friendly manner  kill stops the current thread in an unfriendly fashion  cls clears the output area  exit exits the Simple CLI  help [<command>]
  • 16. Weka: An Introduction 16 Weka Operation  The Operating System’s command line interface can also be used after setting the CLASSPATH accordingly.  All the functionality supported by Weka can also be invoked from one’s own source code.
  • 17. Weka: An Introduction 17 Weka Extensions  BioWeka - Extension library for knowledge discovery in biology  WekaMetal - Meta learning extension to WEKA  Weka-Parallel - Parallel processing for WEKA  Grid Weka - Grid computing using WEKA
  • 18. Weka: An Introduction 18 References  Witten, I.H. and Frank, E. (2005) Data Mining: Practical machine learning tools and techniques. 2nd edition Morgan Kaufmann, San Francisco  Weka Knowledge Flow Tutorial, Mark Hall Peter Reutemann http://www.inf.fh- dortmund.de/personen/professoren/engels/dm/praktikum/WEKA- KnowledgeFlowTutorial-3-5-7.pdf  WEKA Manual for Version 3-6-2 - Remco R. Bouckaert, Eibe Frank et.al, January 11, 2010