SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Information Retrieval Systems
By: Hussein Hazimeh
Lebanese University.
Main points










Introduction
Text operations and Indexing
Performance evaluation
Search engines as IR tools
Metasearch engines
IR Applications
Some current researches in IRS
Current conferences in information retrieval
Introduction


Information Retrieval (IR) is the discipline that deals with retrieval of
unstructured data, especially textual documents, in response to a
query .
User Interface
User need

Text Operations
Indexing
Inverted
file

Documents

Similarity Computation
(Searching)
Retrieved docs

Ranking

Ranked docs

Index
Text operation and Indexing


Text operations: reduce the complexity of the document
representation

Q=List of the European countries



List , Europe , country

Indexing: A simple alternative is to search the whole text
sequentially
Vocabular
y

beautiful
flowers
garden
house

70
45, 58
18, 29
6

Occurrences
Retrieval Performance Evaluation

Recall=|Ra|/|R|

Relevant Docs
In Answer Set
|Ra|

Precision=|Ra|/|A|

collection

Relevant Docs
|R|

Answer Set
|A|
Popular search engines



Google
Yahoo
Bing
…



Google search engine










Google search is based on priority
Priority rank used “PageRank” algorithm
Search Google can be using Boolean operators such as :
exclusion ( -aa ) , alternatives ( aa OR bb)
PageRank algorithm


PageRank is an algorithm used by Google search
engine to rank websites in their search engine
results.

PR(B) = PR(E) + PR(F) + PR(D) + P(C)
Googlebot : Google’s Web Crawler


Googlebot is Google’s web crawling robot, which finds
and retrieves pages on the web and hands them off to
the Google indexer.



Googlebot finds pages in two ways:



Through an add URL form, www.google.com/addurl.html
Finding links by crawling the web.
How Google process a query
Facebook as intelligent IR tool (Graph search)


Google vs. Facebook
Facebook as intelligent IR tool (continued..)


Google vs. Facebook
Metasearch engines


A meta search engine is a search tool that send user
requests to several other search engines and/or
databases and aggregate results into a single list or
displays them according to their source.



Metasearch engines enable users to enter search criteria
once and access several search engines simultaneously.
Metasearch engine
IR Applications
Mobile IR

Digital
Libraries
IR
Application
s
Enterpris
e Search

Desktop
Search
(Puggle)
Some current research topics in IRS


Visual Indexing



Indexing of (video, images, audio).
Visual content extraction



Machine learning in information retrieval



Web information retrieval (including blogs)



Mobile computing related information retrieval issues



Performance measures



Query languages and optimization
What is MapReduce ?


MapReduce is a programming model for processing
large data sets



The first is the map job, which takes a set of data
and converts it into another set of data, where
individual elements are broken down into tuples
(key/value pairs)



The reduce job takes the output from a map as input
and combines those data tuples into a smaller set of
tuples.
Motivations of MapReduce



Data processing > 1 TB



Massively parallel



Easy to use
Programming Model


Map(k1,v1) → list(k2,v2)
Reduce(k2, list (v2)) → list(v3)



Ex: 5 files







Toronto, 20
Whitby, 25
New York, 22
Rome, 32
Toronto, 4
Rome, 33
New York, 18
File 1
Programming Model (continued..)


we want to find the maximum tem-perature for each
city across all of the data files



Break this into 5 Map tasks



Each mapper work on 1 file and return the Max tem
in each city



All five of these output streams would be fed into the
reduce tasks, which combine the input results and
output a single value for each city, producing a final
result.
Programming Model(continued..)


Map(output) : (Toronto, 18) (Whitby, 27) (New York,
32) (Rome, 37)(Toronto, 32) (Whitby, 20) (New York,
33) (Rome, 38)(Toronto, 22) (Whitby, 19) (New York,
20) (Rome, 31)(Toronto, 31) (Whitby, 22) (New York,
19) (Rome, 30)



Reduce(output):(Toronto, 32) (Whitby, 27) (New
York, 33) (Rome, 38)
MapReduce uses


MapReduce is useful in a wide range of applications,
including distributed pattern-based searching, distributed
sorting, web link-graph reversal, term-vector per host,
web access log stats, inverted index construction,
document clustering, and machine learning



Moreover, the MapReduce model has been adapted to
several computing environments like multi-core systems,
desktop grids, dynamic cloud environments, and mobile
environments.



At Google, MapReduce was used to completely
regenerate Google's index of the World Wide Web. It
replaced the old ad hoc programs that updated the index
and ran the various analyses.
Current conferences in information retrieval


3rd Spanish Conference on Information Retrieval





The European Conference on Information Retrieval





2014 , June 20
Spain

2014, April 17
Netherland

7th International Workshop on Information Filtering
and Retrieval



2013, Dec 6
Italy
Search…
groph theories

Weitere ähnliche Inhalte

Was ist angesagt?

Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013Vahid Moosavi
 
Synthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingSynthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingGraph-TA
 
A survey of web clustering engines
A survey of web clustering enginesA survey of web clustering engines
A survey of web clustering enginesunyil96
 
3DIR: Exploiting Topological Relationships in Three-dimensional Information R...
3DIR: Exploiting Topological Relationships in Three-dimensional Information R...3DIR: Exploiting Topological Relationships in Three-dimensional Information R...
3DIR: Exploiting Topological Relationships in Three-dimensional Information R...pdemian
 
A semantic based approach for information retrieval from html documents using...
A semantic based approach for information retrieval from html documents using...A semantic based approach for information retrieval from html documents using...
A semantic based approach for information retrieval from html documents using...csandit
 
A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...
A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...
A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...cscpconf
 
Data collection for cultural project
Data collection for cultural projectData collection for cultural project
Data collection for cultural projectDanilo Supino
 
On nonmetric similarity search problems in complex domains
On nonmetric similarity search problems in complex domainsOn nonmetric similarity search problems in complex domains
On nonmetric similarity search problems in complex domainsunyil96
 
MPROP Pal: Helping Planners Work With Property Data
MPROP Pal: Helping Planners Work With Property DataMPROP Pal: Helping Planners Work With Property Data
MPROP Pal: Helping Planners Work With Property DataMKE Data
 
prie.ppt
prie.pptprie.ppt
prie.pptbutest
 
Semantic-based Process Analysis
Semantic-based Process AnalysisSemantic-based Process Analysis
Semantic-based Process AnalysisMauro Dragoni
 
Effective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataEffective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataRoi Blanco
 
Introduction to R
Introduction to RIntroduction to R
Introduction to RSetia Pramana
 
Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)Jamshaid Ashraf
 
Mp26 : A Quick Introduction to NetworkX
Mp26 : A Quick Introduction to NetworkXMp26 : A Quick Introduction to NetworkX
Mp26 : A Quick Introduction to NetworkXMontreal Python
 
04 --spatial-data
04 --spatial-data04 --spatial-data
04 --spatial-dataKarel Charvat
 
Improvement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data ConflationImprovement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data ConflationBeniamino Murgante
 
Algorithms for Query Processing and Optimization of Spatial Operations
Algorithms for Query Processing and Optimization of Spatial OperationsAlgorithms for Query Processing and Optimization of Spatial Operations
Algorithms for Query Processing and Optimization of Spatial OperationsNatasha Mandal
 

Was ist angesagt? (19)

Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013Bi g data_urban modeling_applications_23092013
Bi g data_urban modeling_applications_23092013
 
Synthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingSynthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modeling
 
A survey of web clustering engines
A survey of web clustering enginesA survey of web clustering engines
A survey of web clustering engines
 
3DIR: Exploiting Topological Relationships in Three-dimensional Information R...
3DIR: Exploiting Topological Relationships in Three-dimensional Information R...3DIR: Exploiting Topological Relationships in Three-dimensional Information R...
3DIR: Exploiting Topological Relationships in Three-dimensional Information R...
 
A semantic based approach for information retrieval from html documents using...
A semantic based approach for information retrieval from html documents using...A semantic based approach for information retrieval from html documents using...
A semantic based approach for information retrieval from html documents using...
 
A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...
A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...
A SEMANTIC BASED APPROACH FOR INFORMATION RETRIEVAL FROM HTML DOCUMENTS USING...
 
Data collection for cultural project
Data collection for cultural projectData collection for cultural project
Data collection for cultural project
 
On nonmetric similarity search problems in complex domains
On nonmetric similarity search problems in complex domainsOn nonmetric similarity search problems in complex domains
On nonmetric similarity search problems in complex domains
 
MPROP Pal: Helping Planners Work With Property Data
MPROP Pal: Helping Planners Work With Property DataMPROP Pal: Helping Planners Work With Property Data
MPROP Pal: Helping Planners Work With Property Data
 
prie.ppt
prie.pptprie.ppt
prie.ppt
 
Semantic-based Process Analysis
Semantic-based Process AnalysisSemantic-based Process Analysis
Semantic-based Process Analysis
 
Effective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataEffective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF data
 
Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)
 
Mp26 : A Quick Introduction to NetworkX
Mp26 : A Quick Introduction to NetworkXMp26 : A Quick Introduction to NetworkX
Mp26 : A Quick Introduction to NetworkX
 
04 --spatial-data
04 --spatial-data04 --spatial-data
04 --spatial-data
 
Iccsa stankuteha180611
Iccsa stankuteha180611Iccsa stankuteha180611
Iccsa stankuteha180611
 
Improvement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data ConflationImprovement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data Conflation
 
Algorithms for Query Processing and Optimization of Spatial Operations
Algorithms for Query Processing and Optimization of Spatial OperationsAlgorithms for Query Processing and Optimization of Spatial Operations
Algorithms for Query Processing and Optimization of Spatial Operations
 

Ähnlich wie IR tutorial

Map reduce advantages over parallel databases report
Map reduce advantages over parallel databases reportMap reduce advantages over parallel databases report
Map reduce advantages over parallel databases reportAhmad El Tawil
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications sathish sak
 
Survey on Parallel/Distributed Search Engines
Survey on Parallel/Distributed Search EnginesSurvey on Parallel/Distributed Search Engines
Survey on Parallel/Distributed Search EnginesYu Liu
 
Distributed computing poli
Distributed computing poliDistributed computing poli
Distributed computing poliivascucristian
 
Spatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use CasesSpatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use Casesmathieuraj
 
Big Data Hadoop (Overview)
Big Data Hadoop (Overview)Big Data Hadoop (Overview)
Big Data Hadoop (Overview)Rohit Srivastava
 
2004 map reduce simplied data processing on large clusters (mapreduce)
2004 map reduce simplied data processing on large clusters (mapreduce)2004 map reduce simplied data processing on large clusters (mapreduce)
2004 map reduce simplied data processing on large clusters (mapreduce)anh tuan
 
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...An Efficient Annotation of Search Results Based on Feature Ranking Approach f...
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...Computer Science Journals
 
Google Cluster Innards
Google Cluster InnardsGoogle Cluster Innards
Google Cluster InnardsMartin Dvorak
 
Searching Repositories of Web Application Models
Searching Repositories of Web Application ModelsSearching Repositories of Web Application Models
Searching Repositories of Web Application ModelsMarco Brambilla
 
Paper id 25201463
Paper id 25201463Paper id 25201463
Paper id 25201463IJRAT
 
Domain Specific Mashups
Domain Specific MashupsDomain Specific Mashups
Domain Specific MashupsMuhammad Imran
 
Data Analysis With Apache Flink
Data Analysis With Apache FlinkData Analysis With Apache Flink
Data Analysis With Apache FlinkDataWorks Summit
 
Data Analysis with Apache Flink (Hadoop Summit, 2015)
Data Analysis with Apache Flink (Hadoop Summit, 2015)Data Analysis with Apache Flink (Hadoop Summit, 2015)
Data Analysis with Apache Flink (Hadoop Summit, 2015)Aljoscha Krettek
 

Ähnlich wie IR tutorial (20)

G1803054653
G1803054653G1803054653
G1803054653
 
Map reduce advantages over parallel databases report
Map reduce advantages over parallel databases reportMap reduce advantages over parallel databases report
Map reduce advantages over parallel databases report
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
 
50120140505004
5012014050500450120140505004
50120140505004
 
Survey on Parallel/Distributed Search Engines
Survey on Parallel/Distributed Search EnginesSurvey on Parallel/Distributed Search Engines
Survey on Parallel/Distributed Search Engines
 
Big data
Big dataBig data
Big data
 
Distributed computing poli
Distributed computing poliDistributed computing poli
Distributed computing poli
 
Spatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use CasesSpatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use Cases
 
Big Data
Big DataBig Data
Big Data
 
Big Data Hadoop (Overview)
Big Data Hadoop (Overview)Big Data Hadoop (Overview)
Big Data Hadoop (Overview)
 
Map reduce
Map reduceMap reduce
Map reduce
 
2004 map reduce simplied data processing on large clusters (mapreduce)
2004 map reduce simplied data processing on large clusters (mapreduce)2004 map reduce simplied data processing on large clusters (mapreduce)
2004 map reduce simplied data processing on large clusters (mapreduce)
 
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...An Efficient Annotation of Search Results Based on Feature Ranking Approach f...
An Efficient Annotation of Search Results Based on Feature Ranking Approach f...
 
Google Cluster Innards
Google Cluster InnardsGoogle Cluster Innards
Google Cluster Innards
 
Searching Repositories of Web Application Models
Searching Repositories of Web Application ModelsSearching Repositories of Web Application Models
Searching Repositories of Web Application Models
 
Paper id 25201463
Paper id 25201463Paper id 25201463
Paper id 25201463
 
Domain Specific Mashups
Domain Specific MashupsDomain Specific Mashups
Domain Specific Mashups
 
Data Analysis With Apache Flink
Data Analysis With Apache FlinkData Analysis With Apache Flink
Data Analysis With Apache Flink
 
Data Analysis with Apache Flink (Hadoop Summit, 2015)
Data Analysis with Apache Flink (Hadoop Summit, 2015)Data Analysis with Apache Flink (Hadoop Summit, 2015)
Data Analysis with Apache Flink (Hadoop Summit, 2015)
 
A Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF GraphsA Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF Graphs
 

KĂźrzlich hochgeladen

FULL NIGHT — 9999894380 Call Girls In Patel Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Patel Nagar | DelhiFULL NIGHT — 9999894380 Call Girls In Patel Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Patel Nagar | DelhiSaketCallGirlsCallUs
 
FULL NIGHT — 9999894380 Call Girls In Saket | Delhi
FULL NIGHT — 9999894380 Call Girls In Saket | DelhiFULL NIGHT — 9999894380 Call Girls In Saket | Delhi
FULL NIGHT — 9999894380 Call Girls In Saket | DelhiSaketCallGirlsCallUs
 
sources of Hindu law kdaenflkjwwfererger
sources of Hindu law kdaenflkjwwferergersources of Hindu law kdaenflkjwwfererger
sources of Hindu law kdaenflkjwwferergerLakshayTewatia4
 
Hire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls Agency
Hire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls AgencyHire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls Agency
Hire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls AgencyNitya salvi
 
Barasat call girls 📞 8617697112 At Low Cost Cash Payment Booking
Barasat call girls 📞 8617697112 At Low Cost Cash Payment BookingBarasat call girls 📞 8617697112 At Low Cost Cash Payment Booking
Barasat call girls 📞 8617697112 At Low Cost Cash Payment BookingNitya salvi
 
Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...
Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...
Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...Nitya salvi
 
Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...
Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...
Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...Nitya salvi
 
Call Girls in Sakinaka 9892124323, Vashi CAll Girls Call girls Services, Che...
Call Girls in Sakinaka  9892124323, Vashi CAll Girls Call girls Services, Che...Call Girls in Sakinaka  9892124323, Vashi CAll Girls Call girls Services, Che...
Call Girls in Sakinaka 9892124323, Vashi CAll Girls Call girls Services, Che...Pooja Nehwal
 
FULL NIGHT — 9999894380 Call Girls In Paschim Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In  Paschim Vihar | DelhiFULL NIGHT — 9999894380 Call Girls In  Paschim Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In Paschim Vihar | DelhiSaketCallGirlsCallUs
 
FULL NIGHT — 9999894380 Call Girls In Anand Niketan | Delhi
FULL NIGHT — 9999894380 Call Girls In Anand Niketan | DelhiFULL NIGHT — 9999894380 Call Girls In Anand Niketan | Delhi
FULL NIGHT — 9999894380 Call Girls In Anand Niketan | DelhiSaketCallGirlsCallUs
 
Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...
Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...
Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...Nitya salvi
 
FULL NIGHT — 9999894380 Call Girls In Ashok Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In Ashok Vihar | DelhiFULL NIGHT — 9999894380 Call Girls In Ashok Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In Ashok Vihar | DelhiSaketCallGirlsCallUs
 
Moradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service Available
Moradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service AvailableMoradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service Available
Moradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service AvailableNitya salvi
 
FULL NIGHT — 9999894380 Call Girls In Kishangarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Kishangarh | DelhiFULL NIGHT — 9999894380 Call Girls In Kishangarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Kishangarh | DelhiSaketCallGirlsCallUs
 
FULL NIGHT — 9999894380 Call Girls In Uttam Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Uttam Nagar | DelhiFULL NIGHT — 9999894380 Call Girls In Uttam Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Uttam Nagar | DelhiSaketCallGirlsCallUs
 
Completed Event Presentation for Huma 1305
Completed Event Presentation for Huma 1305Completed Event Presentation for Huma 1305
Completed Event Presentation for Huma 1305jazlynjacobs51
 
❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.
❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.
❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.Nitya salvi
 
architect Hassan Khalil portfolio Year 2024
architect Hassan Khalil portfolio  Year 2024architect Hassan Khalil portfolio  Year 2024
architect Hassan Khalil portfolio Year 2024hassan khalil
 
Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...
Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...
Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...Sheetaleventcompany
 

KĂźrzlich hochgeladen (20)

FULL NIGHT — 9999894380 Call Girls In Patel Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Patel Nagar | DelhiFULL NIGHT — 9999894380 Call Girls In Patel Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Patel Nagar | Delhi
 
FULL NIGHT — 9999894380 Call Girls In Saket | Delhi
FULL NIGHT — 9999894380 Call Girls In Saket | DelhiFULL NIGHT — 9999894380 Call Girls In Saket | Delhi
FULL NIGHT — 9999894380 Call Girls In Saket | Delhi
 
sources of Hindu law kdaenflkjwwfererger
sources of Hindu law kdaenflkjwwferergersources of Hindu law kdaenflkjwwfererger
sources of Hindu law kdaenflkjwwfererger
 
Hire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls Agency
Hire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls AgencyHire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls Agency
Hire 💕 8617370543 Mumbai Suburban Call Girls Service Call Girls Agency
 
Barasat call girls 📞 8617697112 At Low Cost Cash Payment Booking
Barasat call girls 📞 8617697112 At Low Cost Cash Payment BookingBarasat call girls 📞 8617697112 At Low Cost Cash Payment Booking
Barasat call girls 📞 8617697112 At Low Cost Cash Payment Booking
 
Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...
Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...
Haridwar Call Girls 8617697112 Short 4000 Night 10000 Best call girls Service...
 
Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...
Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...
Mayiladuthurai Call Girls 8617697112 Short 3000 Night 8000 Best call girls Se...
 
Call Girls in Sakinaka 9892124323, Vashi CAll Girls Call girls Services, Che...
Call Girls in Sakinaka  9892124323, Vashi CAll Girls Call girls Services, Che...Call Girls in Sakinaka  9892124323, Vashi CAll Girls Call girls Services, Che...
Call Girls in Sakinaka 9892124323, Vashi CAll Girls Call girls Services, Che...
 
FULL NIGHT — 9999894380 Call Girls In Paschim Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In  Paschim Vihar | DelhiFULL NIGHT — 9999894380 Call Girls In  Paschim Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In Paschim Vihar | Delhi
 
FULL NIGHT — 9999894380 Call Girls In Anand Niketan | Delhi
FULL NIGHT — 9999894380 Call Girls In Anand Niketan | DelhiFULL NIGHT — 9999894380 Call Girls In Anand Niketan | Delhi
FULL NIGHT — 9999894380 Call Girls In Anand Niketan | Delhi
 
(INDIRA) Call Girl Dehradun Call Now 8617697112 Dehradun Escorts 24x7
(INDIRA) Call Girl Dehradun Call Now 8617697112 Dehradun Escorts 24x7(INDIRA) Call Girl Dehradun Call Now 8617697112 Dehradun Escorts 24x7
(INDIRA) Call Girl Dehradun Call Now 8617697112 Dehradun Escorts 24x7
 
Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...
Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...
Agonda Beach ( Call Girls ) Goa ✔ 8617370543 ✅ By Goa Call Girls For Pick Up ...
 
FULL NIGHT — 9999894380 Call Girls In Ashok Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In Ashok Vihar | DelhiFULL NIGHT — 9999894380 Call Girls In Ashok Vihar | Delhi
FULL NIGHT — 9999894380 Call Girls In Ashok Vihar | Delhi
 
Moradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service Available
Moradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service AvailableMoradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service Available
Moradabad Call Girls - 📞 8617697112 🔝 Top Class Call Girls Service Available
 
FULL NIGHT — 9999894380 Call Girls In Kishangarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Kishangarh | DelhiFULL NIGHT — 9999894380 Call Girls In Kishangarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Kishangarh | Delhi
 
FULL NIGHT — 9999894380 Call Girls In Uttam Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Uttam Nagar | DelhiFULL NIGHT — 9999894380 Call Girls In Uttam Nagar | Delhi
FULL NIGHT — 9999894380 Call Girls In Uttam Nagar | Delhi
 
Completed Event Presentation for Huma 1305
Completed Event Presentation for Huma 1305Completed Event Presentation for Huma 1305
Completed Event Presentation for Huma 1305
 
❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.
❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.
❤Personal Whatsapp Srinagar Srinagar Call Girls 8617697112 💦✅.
 
architect Hassan Khalil portfolio Year 2024
architect Hassan Khalil portfolio  Year 2024architect Hassan Khalil portfolio  Year 2024
architect Hassan Khalil portfolio Year 2024
 
Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...
Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...
Call Girl In Chandigarh ☎ 08868886958✅ Just Genuine Call Call Girls Chandigar...
 

IR tutorial

  • 1. Information Retrieval Systems By: Hussein Hazimeh Lebanese University.
  • 2. Main points         Introduction Text operations and Indexing Performance evaluation Search engines as IR tools Metasearch engines IR Applications Some current researches in IRS Current conferences in information retrieval
  • 3. Introduction  Information Retrieval (IR) is the discipline that deals with retrieval of unstructured data, especially textual documents, in response to a query . User Interface User need Text Operations Indexing Inverted file Documents Similarity Computation (Searching) Retrieved docs Ranking Ranked docs Index
  • 4. Text operation and Indexing  Text operations: reduce the complexity of the document representation Q=List of the European countries  List , Europe , country Indexing: A simple alternative is to search the whole text sequentially Vocabular y beautiful flowers garden house 70 45, 58 18, 29 6 Occurrences
  • 5. Retrieval Performance Evaluation Recall=|Ra|/|R| Relevant Docs In Answer Set |Ra| Precision=|Ra|/|A| collection Relevant Docs |R| Answer Set |A|
  • 6. Popular search engines  Google Yahoo Bing …  Google search engine       Google search is based on priority Priority rank used “PageRank” algorithm Search Google can be using Boolean operators such as : exclusion ( -aa ) , alternatives ( aa OR bb)
  • 7. PageRank algorithm  PageRank is an algorithm used by Google search engine to rank websites in their search engine results. PR(B) = PR(E) + PR(F) + PR(D) + P(C)
  • 8. Googlebot : Google’s Web Crawler  Googlebot is Google’s web crawling robot, which finds and retrieves pages on the web and hands them off to the Google indexer.  Googlebot finds pages in two ways:   Through an add URL form, www.google.com/addurl.html Finding links by crawling the web.
  • 10. Facebook as intelligent IR tool (Graph search)  Google vs. Facebook
  • 11. Facebook as intelligent IR tool (continued..)  Google vs. Facebook
  • 12. Metasearch engines  A meta search engine is a search tool that send user requests to several other search engines and/or databases and aggregate results into a single list or displays them according to their source.  Metasearch engines enable users to enter search criteria once and access several search engines simultaneously.
  • 15. Some current research topics in IRS  Visual Indexing   Indexing of (video, images, audio). Visual content extraction  Machine learning in information retrieval  Web information retrieval (including blogs)  Mobile computing related information retrieval issues  Performance measures  Query languages and optimization
  • 16. What is MapReduce ?  MapReduce is a programming model for processing large data sets  The first is the map job, which takes a set of data and converts it into another set of data, where individual elements are broken down into tuples (key/value pairs)  The reduce job takes the output from a map as input and combines those data tuples into a smaller set of tuples.
  • 17. Motivations of MapReduce  Data processing > 1 TB  Massively parallel  Easy to use
  • 18. Programming Model  Map(k1,v1) → list(k2,v2) Reduce(k2, list (v2)) → list(v3)  Ex: 5 files    Toronto, 20 Whitby, 25 New York, 22 Rome, 32 Toronto, 4 Rome, 33 New York, 18 File 1
  • 19. Programming Model (continued..)  we want to find the maximum tem-perature for each city across all of the data files  Break this into 5 Map tasks  Each mapper work on 1 file and return the Max tem in each city  All five of these output streams would be fed into the reduce tasks, which combine the input results and output a single value for each city, producing a final result.
  • 20. Programming Model(continued..)  Map(output) : (Toronto, 18) (Whitby, 27) (New York, 32) (Rome, 37)(Toronto, 32) (Whitby, 20) (New York, 33) (Rome, 38)(Toronto, 22) (Whitby, 19) (New York, 20) (Rome, 31)(Toronto, 31) (Whitby, 22) (New York, 19) (Rome, 30)  Reduce(output):(Toronto, 32) (Whitby, 27) (New York, 33) (Rome, 38)
  • 21. MapReduce uses  MapReduce is useful in a wide range of applications, including distributed pattern-based searching, distributed sorting, web link-graph reversal, term-vector per host, web access log stats, inverted index construction, document clustering, and machine learning  Moreover, the MapReduce model has been adapted to several computing environments like multi-core systems, desktop grids, dynamic cloud environments, and mobile environments.  At Google, MapReduce was used to completely regenerate Google's index of the World Wide Web. It replaced the old ad hoc programs that updated the index and ran the various analyses.
  • 22. Current conferences in information retrieval  3rd Spanish Conference on Information Retrieval    The European Conference on Information Retrieval    2014 , June 20 Spain 2014, April 17 Netherland 7th International Workshop on Information Filtering and Retrieval   2013, Dec 6 Italy
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.

Hinweis der Redaktion

  1. Digital libraries: video recordings, ppt slides, presentations, audio recordings, …The electronic content may be stored locally, or accessed remotely via computer networksEnterprise search is how your organization helps people seek the information they need from anywhere, in any format, from anywhere inside their company – in databases, document management systems, on paper, wherever. Just because there are powerful search tools available, does not mean that you should not organize your content. Desktop search all pc + internet browsing + mails
  2. Result : (Toronto, 18) (Whitby, 27) (New York, 32) (Rome, 37)(Toronto, 32) (Whitby, 20) (New York, 33) (Rome, 38)(Toronto, 22) (Whitby, 19) (New York, 20) (Rome, 31)(Toronto, 31) (Whitby, 22) (New York, 19) (Rome, 30)(Toronto, 32) (Whitby, 27) (New York, 33) (Rome, 38)