SlideShare a Scribd company logo
1 of 18
Presented By
Sadhana Patra
MLIS, 3rd Semester










Information retrieval is the activity of obtaining information
resources relevant to an information need from a collection of
information resources.
An information retrieval process begins when a user enters a query
into the system. Queries are formal statements of information
needs.
User queries are matched against the database information.
Depending on the application the data objects may be, for example,
text documents, images, audio, mind maps or videos.
Most IR systems compute a numeric score on how well each object
in the database matches the query, and rank the objects according to
this value.
The top ranking objects are then shown to the user. The process
may then be iterated if the user wishes to refine the query.


Every online database, every search engine,
everything that is searched online is based in
some way or another on principles developed
in IR
◦ IR is at the heart of searching used in systems such
as DIALOG, LexisNexis & others



Understanding the basics of IR is a
prerequisite for understanding how searching
of online systems works.
“Information retrieval embraces the intellectual
aspects of the description of information and
its specification for search, and also whatever
systems, techniques, or machines are
employed to carry out the operation.”
Calvin Mooers, 1951
Objective:
Provide the users with effective access to &
interaction with information resources.
1.

2.

3.

Document subsystem

a) Acquisition
b) Representation
c) File organization

User sub system

a) Problem
b) Representation
c) Query

Searching /Retrieval subsystem

a) Matching
b) Retrieved objects
Acquisition

(Document subsystem)




Selection of documents & other objects from
various web resources
Mostly text based documents
◦ full texts, titles, abstracts ...
◦ but also other objects:
 data, statistics, images, maps, trade marks, sounds ...



The data are collected by web crawler and
stored in data base.
Representation of documents,
objects(document subsystem)



Indexing – many ways :

◦ free text terms (even in full texts)
◦ controlled vocabulary - thesaurus
◦ manual & automatic techniques




Abstracting; summarizing
Bibliographic description:
◦ author, title, sources, date…
◦ metadata




Classifying, clustering
Organizing in fields & limits

◦ Basic Index, Additional Index. Limits
File organization

(Document subsystem)


Sequential
◦ record (document) by record



Inverted
◦ term by term; list of records under each term







Combination
indexes inverted, documents sequential
When citation retrieved only, need for
document files
Large file approaches
◦ for efficient retrieval by computers
Problem

(user subsystem)


Related to user‟s task, situation
◦ vary in specificity, clarity



Produces information need
◦ ultimate criterion for effectiveness of retrieval
 how well was the need met?



Information need for the same problem may
change, evolve, shift during the IR process adjustment in searching
◦ often more than one search for same problem over
time
 you will experience this in your term project
Representation
( user subsystem)








Converting a concept to query.
What we search for.
These are stemmed and corrected using
dictionary.
Focus toward a good result
Subject to feedback changes
Query - search statement
(user & system)



Translation into systems requirements & limits
◦ start of human-computer interaction

 query is the thing that goes into the computer




Selection of files, resources
Search strategy - selection of:
◦
◦
◦
◦



search terms & logic
possible fields, delimiters
controlled & uncontrolled vocabulary
variations in effectiveness tactics

Reiterations from feedback

◦ several feedback types: relevance feedback, magnitude
feedback..
◦ query expansion & modification






Question is what user asks and what you
may then have elaborated
Query is what is asked of computer to
match – what is put in
Question is transformed into query
Question:

◦ I am interested in major historical developments
in the area of information retrieval?



Query

◦ history information retrieval (in Google)
Matching - searching
(Searching subsystem)



Process of matching, comparing

◦ search: what documents in the file match the query as
stated?



Various search algorithms:
◦ exact match - Boolean

 still available in most, if not all systems

◦ best match - ranking by relevance
 increasingly used e.g. on the web

◦ hybrids incorporating both
 e.g. Target, Rank in DIALOG



Each has strengths, weaknesses
◦ no „perfect‟ method exists
 and probably never will
Retrieved documents -from system
to user (IR Subsystem)



Various order of output:
◦ Last In First Out (LIFO); sorted
◦ ranked by relevance
◦ ranked by other characteristics







Various forms of output
When citations only: possible links to
document delivery
Base for relevance, utility evaluation by users
Relevance feedback

What a user (or you) sees, gets,
judges – can be specified






Described three parts: Document subsystem,
User sub system, Searching /Retrieval
subsystem
There are many search engine like Google,
Bing and Yahoo etc., but they never disclose
their methods of Information Retrieval.
Lot more to know about Information
Retrieval.








http://nlp.stanford.edu/IRbook/pdf/irbookonlinereading.pdf
http://en.wikipedia.org/wiki/Information_retr
ieval
http://dss.ucsd.edu/~lwimberl/Lecture01.ppt

www.stevendroper.com/pls2253.htm
Information retrieval s

More Related Content

What's hot

Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introductionnimmyjans4
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrievalNanthini Dominique
 
Probabilistic information retrieval models & systems
Probabilistic information retrieval models & systemsProbabilistic information retrieval models & systems
Probabilistic information retrieval models & systemsSelman Bozkır
 
WEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMWEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMSai Kumar Ale
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3alaa223
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information RetrievalRoi Blanco
 
Information retrieval 7 boolean model
Information retrieval 7 boolean modelInformation retrieval 7 boolean model
Information retrieval 7 boolean modelVaibhav Khanna
 
The vector space model
The vector space modelThe vector space model
The vector space modelpkgosh
 
Latest trends in AI and information Retrieval
Latest trends in AI and information Retrieval Latest trends in AI and information Retrieval
Latest trends in AI and information Retrieval Abhay Ratnaparkhi
 
Information retrieval 9 tf idf weights
Information retrieval 9 tf idf weightsInformation retrieval 9 tf idf weights
Information retrieval 9 tf idf weightsVaibhav Khanna
 
Ontology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical StudyOntology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical StudyDebashisnaskar
 
Web ontology language (owl)
Web ontology language (owl)Web ontology language (owl)
Web ontology language (owl)Ameer Sameer
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalVikas Bhushan
 

What's hot (20)

Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
 
Vector space model in information retrieval
Vector space model in information retrievalVector space model in information retrieval
Vector space model in information retrieval
 
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I  PPT  IN PDFCS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I  PPT  IN PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
 
Probabilistic information retrieval models & systems
Probabilistic information retrieval models & systemsProbabilistic information retrieval models & systems
Probabilistic information retrieval models & systems
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
 
WEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMWEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEM
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
 
Digital library
Digital libraryDigital library
Digital library
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Information retrieval 7 boolean model
Information retrieval 7 boolean modelInformation retrieval 7 boolean model
Information retrieval 7 boolean model
 
The vector space model
The vector space modelThe vector space model
The vector space model
 
Latest trends in AI and information Retrieval
Latest trends in AI and information Retrieval Latest trends in AI and information Retrieval
Latest trends in AI and information Retrieval
 
Information retrieval 9 tf idf weights
Information retrieval 9 tf idf weightsInformation retrieval 9 tf idf weights
Information retrieval 9 tf idf weights
 
Ontology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical StudyOntology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical Study
 
Inverted index
Inverted indexInverted index
Inverted index
 
Web ontology language (owl)
Web ontology language (owl)Web ontology language (owl)
Web ontology language (owl)
 
Lec1,2
Lec1,2Lec1,2
Lec1,2
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
 

Viewers also liked (6)

Data retrieval
Data retrievalData retrieval
Data retrieval
 
Data retrieval tools
Data retrieval toolsData retrieval tools
Data retrieval tools
 
Sickle Cell Anemia
Sickle Cell AnemiaSickle Cell Anemia
Sickle Cell Anemia
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
 

Similar to Information retrieval s

information retrieval in artificial intelligence
information retrieval in artificial intelligenceinformation retrieval in artificial intelligence
information retrieval in artificial intelligencePriyadharshiniG41
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrievalcaptainmactavish1996
 
South Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis PanelSouth Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis PanelTrey Grainger
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineSalford Systems
 
Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Shahriar Rafee
 
An Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataAn Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataMelinda Watson
 
Bioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - IIBioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - IIDr. Rupak Chakravarty
 
Query formulation process
Query formulation processQuery formulation process
Query formulation processmalathimurugan
 
Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Marianne Sweeny
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefordsapollobgslibrary
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefordsapollobgslibrary
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
 
information Storage nd retrieval.pptx
information Storage nd retrieval.pptxinformation Storage nd retrieval.pptx
information Storage nd retrieval.pptxSiva Kumar
 

Similar to Information retrieval s (20)

information retrieval in artificial intelligence
information retrieval in artificial intelligenceinformation retrieval in artificial intelligence
information retrieval in artificial intelligence
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrieval
 
South Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis PanelSouth Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis Panel
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search Engine
 
Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Info 2402 irt-chapter_2
Info 2402 irt-chapter_2
 
An Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataAn Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured Data
 
Lec1
Lec1Lec1
Lec1
 
IRT Unit_I.pptx
IRT Unit_I.pptxIRT Unit_I.pptx
IRT Unit_I.pptx
 
Bioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - IIBioinformatioc: Information Retrieval - II
Bioinformatioc: Information Retrieval - II
 
Query formulation process
Query formulation processQuery formulation process
Query formulation process
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
CS8080 IRT UNIT I NOTES.pdf
 
CS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdfCS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdf
 
Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
information Storage nd retrieval.pptx
information Storage nd retrieval.pptxinformation Storage nd retrieval.pptx
information Storage nd retrieval.pptx
 
Unit 1
Unit 1Unit 1
Unit 1
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
BLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, SymplecticBLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, Symplectic
 

More from silambu111

More from silambu111 (9)

Chain indexing
Chain indexingChain indexing
Chain indexing
 
Search engine
Search engineSearch engine
Search engine
 
Sony
SonySony
Sony
 
Evaluation of medlars
Evaluation of medlarsEvaluation of medlars
Evaluation of medlars
 
Precis
PrecisPrecis
Precis
 
POPSI
POPSIPOPSI
POPSI
 
Citation indexing
Citation indexingCitation indexing
Citation indexing
 
Mam assign
Mam assignMam assign
Mam assign
 
Airtel
AirtelAirtel
Airtel
 

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Information retrieval s

  • 2.      Information retrieval is the activity of obtaining information resources relevant to an information need from a collection of information resources. An information retrieval process begins when a user enters a query into the system. Queries are formal statements of information needs. User queries are matched against the database information. Depending on the application the data objects may be, for example, text documents, images, audio, mind maps or videos. Most IR systems compute a numeric score on how well each object in the database matches the query, and rank the objects according to this value. The top ranking objects are then shown to the user. The process may then be iterated if the user wishes to refine the query.
  • 3.  Every online database, every search engine, everything that is searched online is based in some way or another on principles developed in IR ◦ IR is at the heart of searching used in systems such as DIALOG, LexisNexis & others  Understanding the basics of IR is a prerequisite for understanding how searching of online systems works.
  • 4. “Information retrieval embraces the intellectual aspects of the description of information and its specification for search, and also whatever systems, techniques, or machines are employed to carry out the operation.” Calvin Mooers, 1951 Objective: Provide the users with effective access to & interaction with information resources.
  • 5. 1. 2. 3. Document subsystem a) Acquisition b) Representation c) File organization User sub system a) Problem b) Representation c) Query Searching /Retrieval subsystem a) Matching b) Retrieved objects
  • 6.
  • 7. Acquisition (Document subsystem)   Selection of documents & other objects from various web resources Mostly text based documents ◦ full texts, titles, abstracts ... ◦ but also other objects:  data, statistics, images, maps, trade marks, sounds ...  The data are collected by web crawler and stored in data base.
  • 8. Representation of documents, objects(document subsystem)  Indexing – many ways : ◦ free text terms (even in full texts) ◦ controlled vocabulary - thesaurus ◦ manual & automatic techniques   Abstracting; summarizing Bibliographic description: ◦ author, title, sources, date… ◦ metadata   Classifying, clustering Organizing in fields & limits ◦ Basic Index, Additional Index. Limits
  • 9. File organization (Document subsystem)  Sequential ◦ record (document) by record  Inverted ◦ term by term; list of records under each term     Combination indexes inverted, documents sequential When citation retrieved only, need for document files Large file approaches ◦ for efficient retrieval by computers
  • 10. Problem (user subsystem)  Related to user‟s task, situation ◦ vary in specificity, clarity  Produces information need ◦ ultimate criterion for effectiveness of retrieval  how well was the need met?  Information need for the same problem may change, evolve, shift during the IR process adjustment in searching ◦ often more than one search for same problem over time  you will experience this in your term project
  • 11. Representation ( user subsystem)      Converting a concept to query. What we search for. These are stemmed and corrected using dictionary. Focus toward a good result Subject to feedback changes
  • 12. Query - search statement (user & system)  Translation into systems requirements & limits ◦ start of human-computer interaction  query is the thing that goes into the computer   Selection of files, resources Search strategy - selection of: ◦ ◦ ◦ ◦  search terms & logic possible fields, delimiters controlled & uncontrolled vocabulary variations in effectiveness tactics Reiterations from feedback ◦ several feedback types: relevance feedback, magnitude feedback.. ◦ query expansion & modification
  • 13.     Question is what user asks and what you may then have elaborated Query is what is asked of computer to match – what is put in Question is transformed into query Question: ◦ I am interested in major historical developments in the area of information retrieval?  Query ◦ history information retrieval (in Google)
  • 14. Matching - searching (Searching subsystem)  Process of matching, comparing ◦ search: what documents in the file match the query as stated?  Various search algorithms: ◦ exact match - Boolean  still available in most, if not all systems ◦ best match - ranking by relevance  increasingly used e.g. on the web ◦ hybrids incorporating both  e.g. Target, Rank in DIALOG  Each has strengths, weaknesses ◦ no „perfect‟ method exists  and probably never will
  • 15. Retrieved documents -from system to user (IR Subsystem)  Various order of output: ◦ Last In First Out (LIFO); sorted ◦ ranked by relevance ◦ ranked by other characteristics     Various forms of output When citations only: possible links to document delivery Base for relevance, utility evaluation by users Relevance feedback What a user (or you) sees, gets, judges – can be specified
  • 16.    Described three parts: Document subsystem, User sub system, Searching /Retrieval subsystem There are many search engine like Google, Bing and Yahoo etc., but they never disclose their methods of Information Retrieval. Lot more to know about Information Retrieval.