SlideShare a Scribd company logo
1 of 60
Information Retrieval (IR)?
Overview
• Intro to IR
• History of IR
• Components of IR
• Issues in IR
• Open source engine frame works
• Impact of web on IR
• Role of AI in IR
• Components of Search Engine
What is Information Retrieval ?
Information retrieval, as the name implies,
concerns the retrieving of relevant
information from databases. It is basically
concerned with facilitating the user's access to
large amounts of (predominantly textual)
information.
What is information retrieval
• Gathering information from a source(s) based on
an information need usually from a query
– Major assumption - that the information need can be
specified
– Broad definition of information
– Most methods are automated - scaling
• Sources of information
– Searching in laptops
– Archived information (libraries, maps, etc.)
– E-mail search
– Web (search engines)
Information retrieval is more than just web search
information retrieval vs ?
• Information retrieval (IR) is the activity or process of
obtaining information resources relevant to an information
need from a collection of information resources.
• Data mining is the process that attempts to discover
patterns in large data sets.
• Information extraction (IE) is the task of automatically
extracting structured information from unstructured and/or
semi-structured machine-readable documents
Unstructured (text) vs. structured (database)
data in the mid-nineties (90’s)
6
DATABASE vs IR
DATABASE IR
What we are retrieving Structured data Mostly Unstructured
Queries we are posing Formally queries Expression in natural
language(free of queirs)
Results we get Exact. Always in correct
format
Sometimes relvant, Often
not
Interaction with system One-short query Interaction based.
Goal of IR
• Goal of IR to search large documents
collections to retrieve small subsets to the
user’s information need.
• Popular IR systems are
Internet Search Engine(Google , Bing,
Yahoo)
Digital Library Catalogues
how trap mice alive
The classic search model
Collection
User task
Info need
Query
Results
Search
engine
Query
refinement
Get rid of mice in a
politically correct way
Info about removing mice
without killing them
Misconception?
Misformulation?
Sear
ch
History of IR
COMPONENTS OF IR
• IR is study of finding needed information. It
helps us to find information that matches
their information needs.
• IR locates relevant documents , on the
basis of user input keywords or free text
queries.
General model of IR
Query
/user text
Matching
rule
Data store
Retrieval
result
ISSUES IN IR
GOAL OF IR
The goal of the IR system is to
retrieve all the items that are relevant to user
query. While retrieving as few non relevant
items as possible.
PROBLEMS IN IR?
• Document and query indexing
 how to represent best contents?
 query evaluation(retrieval process)
 To what extend does a query should respond?
How good is IR system?
 Are the retrieved documents relevant?
Are the all relevant documents has been retrieved.
Why IR is Difficult?
• Vocabularies mismatching
• synonyms : car vs vechicle
• : anna University vs annamalai
university
• Content representation may be inadequate and
incomplete.
• The user is the ultimate judge, so IR system is must
be so effective to retrieve the information
Challenges in IR
• High heterogeneity
• document structure , size and quality of data.
• What does the user expected be retrieved?
• Retrieval strategies
• Scale and distribution of data
• Relevance
– relevance is the fundamental concept in
information retrieval
– there are many factors that IR has portrayed a
particular document is retrieved
– vocabulary mismatch problem
• it is important to distinguish between topic relevant
and user relevant
– Retrieval Models:
– To address these problems in IR Retrieval
Models has been proposed.
– A good retrieval system will find documents
are likely to be considered relevant when the
user submit the query.
–
• Evaluation
– How quality the documents matches the
person’s expectation, since the quality of
ranking depends upon raking algorithms.
– Page ranking algorithm has been introduced.
OPEN SOURCE SEARCH
ENGINE FRAMEWORK
Definition
• Open source is an approach to design,
development, and distribution of software,
offering, practical accessibility to
software’s source code with free of charge.
Need of open source
• Demand of consumers as well as enterprises
are increasing with increase in information
technology usage. Information technology
solutions are required to satisfy their
different needs.
• Single solution provider cannot provide all
needed solutions
• open source, freeware, free software.
• 1970 and 1980 software organization uses
technical measures to prevent computer
users from being able to modify and use
software. 1980 copyright law has been
introduced.
• Richard stallman is the founder of Free
Software Foundation (FSF)
• The primary goal is to use application
software and os being shared among
different users with full freedom.
• OS: Linux, symbian, NetBSD.
• Servers: Apache, Tomcat, Drupal,
Wordpress, Eclipse,Joomla
• Programming languages: java, PhP,
Python, JavaScript
• Digital Content: Wikipedia, project
gutenburg.
Open source software Closed/Proprietary Software
Source code freely available Source code is kept secret
Modification are allowed Modifications are not allowed
Sublicensing is allowed Sublicensing is not allowed
No guarantee of further development Guarantee of further development
Wikipedia, Android os, google iOs, Microsoft windows
Advantages
• Right to use software in any way
• Usually no license cost and totally free of
cost.
• Higher flexibility
• Source code is open & can be modified
freely.
Application
• Social networking
• Animation
• Instant messaging
• Website development
• ERP
• Multimedia
• Freeware : It is a software that available
free of cost and can be easily distributed
without any restrictions.
• Free software:
• used to run free to run programs
• user is free distributed the program with anybody
• user will modify and improve the program
Reasons for choosing Open
source
• Development and maintenance of open
source is a community based activity
• open source allows us to study, modify and
distribute the software.
• open source allows customer enhancement.
Widely used open source
software license
• Apache license
• BSD license
• GNU General Public License
• Mozilla Public license
• Eclipse Public license
IMPACT OF WEB ON IR
• www is developed by Tim Berners lee in
1990 to organize research documents
available on the internet. It is an idea of
making documents available by FTP of
hypertext to link documents.
• Client use browser application to send URI
via http to server requesting a web page.
• Web pages are constructed using HTML.
• Servers returned with requested web pages.
• IR has 3 kinds of components file
organization, storage, and retrieval.
• One way to find relevant documents on the
web it to launch web robot. It also called
crawler, spider, worm.
• These software programs receive user
query , then explore web to locate
documents , evaluate their relevance and
return a rank order documents
• IR queries
– Keyword query
– Boolean query
– Phrase query
– Full document query
– Natural language query
Web challenges on IR
• www is expanding faster than any IR
models and web pages are update
frequently or dynamically.
• Many web pages are not indexed by search
engine , this phenomenon is called invisible
web.
• Two problems
– Problem with data
– Problem with user
• Problem with data
– Distributed data (different servers)
– Large volume (billions of separate documents)
– Quality of data
– High heterogeneity
– Unstructured data (30 % duplicate data)
• Problem with user
– How to specify the query
– How to interpret answer provided by the system
Role of AI in IR
• AI is collection of hard problems which can
be solved by humans and other things.
• Natural Language Processing (NLP) is a
major part of AI which serves as a filed of
application in IR.
• NLP techniques uses to make queries to
extract information, retrieve documents
from a collection and translate them from
one form to another.
• Two types of NLP is used in IR.
– NLP allows to respond to a range of large
inputs, to produce more experienced results.
– NLP allows text processing system to scan
source texts, to retrieve particular information
• 3 consideration in applying NLP to IR
– .selection of NLP technologies
– Choice of models
– Approach to extract information (acquisition)

More Related Content

What's hot

Information retrieval 1 introduction to ir
Information retrieval 1 introduction to irInformation retrieval 1 introduction to ir
Information retrieval 1 introduction to irVaibhav Khanna
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.Lanujessy
 
Information retrieval
Information retrievalInformation retrieval
Information retrievalhplap
 
Near Field Communications in Libraries & Healthcare
Near Field Communications in Libraries & HealthcareNear Field Communications in Libraries & Healthcare
Near Field Communications in Libraries & Healthcareayoungkin
 
File and data base management
File and data base managementFile and data base management
File and data base managementAsad Ahmed
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval ssilambu111
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrievalSadaf Rafiq
 
Fostering an Ecosystem for Smartphone Privacy
Fostering an Ecosystem for Smartphone PrivacyFostering an Ecosystem for Smartphone Privacy
Fostering an Ecosystem for Smartphone PrivacyJason Hong
 
An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004
An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004
An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004Jason Hong
 
Information Storage and Retrieval system (ISRS)
Information Storage and Retrieval system (ISRS)Information Storage and Retrieval system (ISRS)
Information Storage and Retrieval system (ISRS)Sumit Kumar Gupta
 
How to Make Your Content Smarter
How to Make Your Content SmarterHow to Make Your Content Smarter
How to Make Your Content SmarterBianca Pereira
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)silambu111
 
Bioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - IIBioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - IIDr. Rupak Chakravarty
 
IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...
IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...
IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...IAEME Publication
 

What's hot (20)

Information retrieval 1 introduction to ir
Information retrieval 1 introduction to irInformation retrieval 1 introduction to ir
Information retrieval 1 introduction to ir
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
 
Information retrieval
Information retrievalInformation retrieval
Information retrieval
 
Introduction to Data Management and Sharing
Introduction to Data Management and SharingIntroduction to Data Management and Sharing
Introduction to Data Management and Sharing
 
Near Field Communications in Libraries & Healthcare
Near Field Communications in Libraries & HealthcareNear Field Communications in Libraries & Healthcare
Near Field Communications in Libraries & Healthcare
 
File and data base management
File and data base managementFile and data base management
File and data base management
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval s
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
 
Fostering an Ecosystem for Smartphone Privacy
Fostering an Ecosystem for Smartphone PrivacyFostering an Ecosystem for Smartphone Privacy
Fostering an Ecosystem for Smartphone Privacy
 
An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004
An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004
An Architecture for Privacy-Sensitive Ubiquitous Computing at Mobisys 2004
 
DICTIONARY
DICTIONARYDICTIONARY
DICTIONARY
 
Information Storage and Retrieval system (ISRS)
Information Storage and Retrieval system (ISRS)Information Storage and Retrieval system (ISRS)
Information Storage and Retrieval system (ISRS)
 
How to Make Your Content Smarter
How to Make Your Content SmarterHow to Make Your Content Smarter
How to Make Your Content Smarter
 
Data management
Data management Data management
Data management
 
Web search vs ir
Web search vs irWeb search vs ir
Web search vs ir
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
 
Bioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - IIBioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - II
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...
IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...
IMPLEMENTATION OF DIGITAL LIBRARY SYSTEM BY USING DSPACE & ANDROID APPS AT AM...
 
AI and Smarter Libraries
AI and Smarter LibrariesAI and Smarter Libraries
AI and Smarter Libraries
 

Similar to Unit 1

CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notesAnandh Arumugakan
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information RetrievalCarsten Eickhoff
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...Prateek Singh
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalVikas Bhushan
 
Advanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsAdvanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsSloan Carne
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologiesenterprisesearchmeetup
 
Routine Maintenance of Computer Systems and Basic Internet Search Skills
Routine Maintenance of Computer Systems and Basic Internet Search SkillsRoutine Maintenance of Computer Systems and Basic Internet Search Skills
Routine Maintenance of Computer Systems and Basic Internet Search SkillsIdowu Adegbilero-Iwari
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM
 
Information RetrievalsT_I_materials.pptx
Information RetrievalsT_I_materials.pptxInformation RetrievalsT_I_materials.pptx
Information RetrievalsT_I_materials.pptxlekhacce
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information RetrievalRoi Blanco
 
Internet browsing techniques
Internet browsing techniquesInternet browsing techniques
Internet browsing techniquesTola Odugbesan
 
information Storage nd retrieval.pptx
information Storage nd retrieval.pptxinformation Storage nd retrieval.pptx
information Storage nd retrieval.pptxSiva Kumar
 
Harnessing search engines for KM
Harnessing search engines for KMHarnessing search engines for KM
Harnessing search engines for KMInvotra
 
A Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the WebA Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the Webmashups
 

Similar to Unit 1 (20)

CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
CS8080 IRT UNIT I NOTES.pdf
 
CS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdfCS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdf
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
CHAPTER -12 it.pptx
CHAPTER -12 it.pptxCHAPTER -12 it.pptx
CHAPTER -12 it.pptx
 
Web mining
Web miningWeb mining
Web mining
 
IRT Unit_I.pptx
IRT Unit_I.pptxIRT Unit_I.pptx
IRT Unit_I.pptx
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
 
Advanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsAdvanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU Investigators
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
IT for management
IT for managementIT for management
IT for management
 
Routine Maintenance of Computer Systems and Basic Internet Search Skills
Routine Maintenance of Computer Systems and Basic Internet Search SkillsRoutine Maintenance of Computer Systems and Basic Internet Search Skills
Routine Maintenance of Computer Systems and Basic Internet Search Skills
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
Information RetrievalsT_I_materials.pptx
Information RetrievalsT_I_materials.pptxInformation RetrievalsT_I_materials.pptx
Information RetrievalsT_I_materials.pptx
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Internet browsing techniques
Internet browsing techniquesInternet browsing techniques
Internet browsing techniques
 
information Storage nd retrieval.pptx
information Storage nd retrieval.pptxinformation Storage nd retrieval.pptx
information Storage nd retrieval.pptx
 
Harnessing search engines for KM
Harnessing search engines for KMHarnessing search engines for KM
Harnessing search engines for KM
 
A Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the WebA Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the Web
 

More from karthiksmart21

WEB TECHNOLOGY Unit-2.pptx
WEB TECHNOLOGY Unit-2.pptxWEB TECHNOLOGY Unit-2.pptx
WEB TECHNOLOGY Unit-2.pptxkarthiksmart21
 
WEB TECHNOLOGY Unit-3.pptx
WEB TECHNOLOGY Unit-3.pptxWEB TECHNOLOGY Unit-3.pptx
WEB TECHNOLOGY Unit-3.pptxkarthiksmart21
 
WEB TECHNOLOGY Unit-5.pptx
WEB TECHNOLOGY Unit-5.pptxWEB TECHNOLOGY Unit-5.pptx
WEB TECHNOLOGY Unit-5.pptxkarthiksmart21
 
WEB TECHNOLOGY Unit-4.pptx
WEB TECHNOLOGY Unit-4.pptxWEB TECHNOLOGY Unit-4.pptx
WEB TECHNOLOGY Unit-4.pptxkarthiksmart21
 
MOBILE COMPUTING Unit 3.pptx
MOBILE COMPUTING Unit 3.pptxMOBILE COMPUTING Unit 3.pptx
MOBILE COMPUTING Unit 3.pptxkarthiksmart21
 
MOBILE COMPUTING Unit 4.pptx
 MOBILE COMPUTING Unit 4.pptx MOBILE COMPUTING Unit 4.pptx
MOBILE COMPUTING Unit 4.pptxkarthiksmart21
 
MOBILE COMPUTING Unit 2.pptx
MOBILE COMPUTING Unit 2.pptxMOBILE COMPUTING Unit 2.pptx
MOBILE COMPUTING Unit 2.pptxkarthiksmart21
 
MOBILE COMPUTING Unit 1.pptx
MOBILE COMPUTING Unit 1.pptxMOBILE COMPUTING Unit 1.pptx
MOBILE COMPUTING Unit 1.pptxkarthiksmart21
 
MOBILE COMPUTING Unit 5.pptx
MOBILE COMPUTING Unit 5.pptxMOBILE COMPUTING Unit 5.pptx
MOBILE COMPUTING Unit 5.pptxkarthiksmart21
 

More from karthiksmart21 (9)

WEB TECHNOLOGY Unit-2.pptx
WEB TECHNOLOGY Unit-2.pptxWEB TECHNOLOGY Unit-2.pptx
WEB TECHNOLOGY Unit-2.pptx
 
WEB TECHNOLOGY Unit-3.pptx
WEB TECHNOLOGY Unit-3.pptxWEB TECHNOLOGY Unit-3.pptx
WEB TECHNOLOGY Unit-3.pptx
 
WEB TECHNOLOGY Unit-5.pptx
WEB TECHNOLOGY Unit-5.pptxWEB TECHNOLOGY Unit-5.pptx
WEB TECHNOLOGY Unit-5.pptx
 
WEB TECHNOLOGY Unit-4.pptx
WEB TECHNOLOGY Unit-4.pptxWEB TECHNOLOGY Unit-4.pptx
WEB TECHNOLOGY Unit-4.pptx
 
MOBILE COMPUTING Unit 3.pptx
MOBILE COMPUTING Unit 3.pptxMOBILE COMPUTING Unit 3.pptx
MOBILE COMPUTING Unit 3.pptx
 
MOBILE COMPUTING Unit 4.pptx
 MOBILE COMPUTING Unit 4.pptx MOBILE COMPUTING Unit 4.pptx
MOBILE COMPUTING Unit 4.pptx
 
MOBILE COMPUTING Unit 2.pptx
MOBILE COMPUTING Unit 2.pptxMOBILE COMPUTING Unit 2.pptx
MOBILE COMPUTING Unit 2.pptx
 
MOBILE COMPUTING Unit 1.pptx
MOBILE COMPUTING Unit 1.pptxMOBILE COMPUTING Unit 1.pptx
MOBILE COMPUTING Unit 1.pptx
 
MOBILE COMPUTING Unit 5.pptx
MOBILE COMPUTING Unit 5.pptxMOBILE COMPUTING Unit 5.pptx
MOBILE COMPUTING Unit 5.pptx
 

Recently uploaded

COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadhamedmustafa094
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesRAJNEESHKUMAR341697
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...Health
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...HenryBriggs2
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stageAbc194748
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Call Girls Mumbai
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086anil_gaur
 

Recently uploaded (20)

COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stage
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 

Unit 1

  • 2. Overview • Intro to IR • History of IR • Components of IR • Issues in IR • Open source engine frame works • Impact of web on IR • Role of AI in IR • Components of Search Engine
  • 3. What is Information Retrieval ? Information retrieval, as the name implies, concerns the retrieving of relevant information from databases. It is basically concerned with facilitating the user's access to large amounts of (predominantly textual) information.
  • 4. What is information retrieval • Gathering information from a source(s) based on an information need usually from a query – Major assumption - that the information need can be specified – Broad definition of information – Most methods are automated - scaling • Sources of information – Searching in laptops – Archived information (libraries, maps, etc.) – E-mail search – Web (search engines) Information retrieval is more than just web search
  • 5. information retrieval vs ? • Information retrieval (IR) is the activity or process of obtaining information resources relevant to an information need from a collection of information resources. • Data mining is the process that attempts to discover patterns in large data sets. • Information extraction (IE) is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents
  • 6. Unstructured (text) vs. structured (database) data in the mid-nineties (90’s) 6
  • 7. DATABASE vs IR DATABASE IR What we are retrieving Structured data Mostly Unstructured Queries we are posing Formally queries Expression in natural language(free of queirs) Results we get Exact. Always in correct format Sometimes relvant, Often not Interaction with system One-short query Interaction based.
  • 8. Goal of IR • Goal of IR to search large documents collections to retrieve small subsets to the user’s information need. • Popular IR systems are Internet Search Engine(Google , Bing, Yahoo) Digital Library Catalogues
  • 9. how trap mice alive The classic search model Collection User task Info need Query Results Search engine Query refinement Get rid of mice in a politically correct way Info about removing mice without killing them Misconception? Misformulation? Sear ch
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 23. • IR is study of finding needed information. It helps us to find information that matches their information needs. • IR locates relevant documents , on the basis of user input keywords or free text queries.
  • 24. General model of IR Query /user text Matching rule Data store Retrieval result
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 36. GOAL OF IR The goal of the IR system is to retrieve all the items that are relevant to user query. While retrieving as few non relevant items as possible.
  • 37. PROBLEMS IN IR? • Document and query indexing  how to represent best contents?  query evaluation(retrieval process)  To what extend does a query should respond? How good is IR system?  Are the retrieved documents relevant? Are the all relevant documents has been retrieved.
  • 38. Why IR is Difficult? • Vocabularies mismatching • synonyms : car vs vechicle • : anna University vs annamalai university • Content representation may be inadequate and incomplete. • The user is the ultimate judge, so IR system is must be so effective to retrieve the information
  • 39. Challenges in IR • High heterogeneity • document structure , size and quality of data. • What does the user expected be retrieved? • Retrieval strategies • Scale and distribution of data
  • 40. • Relevance – relevance is the fundamental concept in information retrieval – there are many factors that IR has portrayed a particular document is retrieved – vocabulary mismatch problem • it is important to distinguish between topic relevant and user relevant
  • 41. – Retrieval Models: – To address these problems in IR Retrieval Models has been proposed. – A good retrieval system will find documents are likely to be considered relevant when the user submit the query. –
  • 42. • Evaluation – How quality the documents matches the person’s expectation, since the quality of ranking depends upon raking algorithms. – Page ranking algorithm has been introduced.
  • 44. Definition • Open source is an approach to design, development, and distribution of software, offering, practical accessibility to software’s source code with free of charge.
  • 45. Need of open source • Demand of consumers as well as enterprises are increasing with increase in information technology usage. Information technology solutions are required to satisfy their different needs. • Single solution provider cannot provide all needed solutions • open source, freeware, free software.
  • 46. • 1970 and 1980 software organization uses technical measures to prevent computer users from being able to modify and use software. 1980 copyright law has been introduced. • Richard stallman is the founder of Free Software Foundation (FSF) • The primary goal is to use application software and os being shared among different users with full freedom.
  • 47. • OS: Linux, symbian, NetBSD. • Servers: Apache, Tomcat, Drupal, Wordpress, Eclipse,Joomla • Programming languages: java, PhP, Python, JavaScript • Digital Content: Wikipedia, project gutenburg.
  • 48. Open source software Closed/Proprietary Software Source code freely available Source code is kept secret Modification are allowed Modifications are not allowed Sublicensing is allowed Sublicensing is not allowed No guarantee of further development Guarantee of further development Wikipedia, Android os, google iOs, Microsoft windows
  • 49. Advantages • Right to use software in any way • Usually no license cost and totally free of cost. • Higher flexibility • Source code is open & can be modified freely.
  • 50. Application • Social networking • Animation • Instant messaging • Website development • ERP • Multimedia
  • 51. • Freeware : It is a software that available free of cost and can be easily distributed without any restrictions. • Free software: • used to run free to run programs • user is free distributed the program with anybody • user will modify and improve the program
  • 52. Reasons for choosing Open source • Development and maintenance of open source is a community based activity • open source allows us to study, modify and distribute the software. • open source allows customer enhancement.
  • 53. Widely used open source software license • Apache license • BSD license • GNU General Public License • Mozilla Public license • Eclipse Public license
  • 54. IMPACT OF WEB ON IR • www is developed by Tim Berners lee in 1990 to organize research documents available on the internet. It is an idea of making documents available by FTP of hypertext to link documents. • Client use browser application to send URI via http to server requesting a web page.
  • 55. • Web pages are constructed using HTML. • Servers returned with requested web pages. • IR has 3 kinds of components file organization, storage, and retrieval. • One way to find relevant documents on the web it to launch web robot. It also called crawler, spider, worm. • These software programs receive user query , then explore web to locate documents , evaluate their relevance and return a rank order documents
  • 56. • IR queries – Keyword query – Boolean query – Phrase query – Full document query – Natural language query
  • 57. Web challenges on IR • www is expanding faster than any IR models and web pages are update frequently or dynamically. • Many web pages are not indexed by search engine , this phenomenon is called invisible web. • Two problems – Problem with data – Problem with user
  • 58. • Problem with data – Distributed data (different servers) – Large volume (billions of separate documents) – Quality of data – High heterogeneity – Unstructured data (30 % duplicate data) • Problem with user – How to specify the query – How to interpret answer provided by the system
  • 59. Role of AI in IR • AI is collection of hard problems which can be solved by humans and other things. • Natural Language Processing (NLP) is a major part of AI which serves as a filed of application in IR. • NLP techniques uses to make queries to extract information, retrieve documents from a collection and translate them from one form to another.
  • 60. • Two types of NLP is used in IR. – NLP allows to respond to a range of large inputs, to produce more experienced results. – NLP allows text processing system to scan source texts, to retrieve particular information • 3 consideration in applying NLP to IR – .selection of NLP technologies – Choice of models – Approach to extract information (acquisition)