SlideShare ist ein Scribd-Unternehmen logo
1 von 156
FACETED APPROACH TO WEB
MSLIS
1st
year (2nd
semester)
DRTC, ISIBC
Introduction
Dibakar Sen
Evolution of Knowledge Society
Hunter gathering Society Pastoral Society
Agricultural Society
Industrial SocietyInformation SocietyKnowledge Society
Background of Classification
● Aristotle may have been the last person to know everything there was to
be known in his own time.
● All philosophy is “a series of footnotes of Pluto”- then all theory and
practice of KR are surely a series of footnotes of Aristotle.
- Hierarchycal structure (taxonomy)
- Dichotomy
- Decachotomy (from thee point of the nature and growth of
the development of knowledge it is unrealistic to bind the universe of
subjects to a dechacotomy, because it growes in different direction and
different stages)
- Polychotomy
(these hierarchical struc. are used in knowedge organisation)
Species of Scheme
● Enumerative Scheme (Unidimensional)
- listing or enumerating the subject.
● Faceted Scheme : to organise the multidisciplinery /
interdisciplinary subject.
- subject formation
- phase relation
- Inter subject
- Intra facet
- Intra array
- a multidimensional approach. ( Cockroach – recipie,
cooking, pest , medicine, insect etc.)
Person Team Award
-Player - Cricket - Arjuna
-Sachin Tendulkar - Pakistan
Test debute against
Sachin Tendulkar Pakistan Cricket Team
Arjuna award
received
● Web directory use hierarchical structure (like DDC)
Ex. - dmoz, Yahoo directory etc.
● Library mainly deals with – Subjects
Web deals with both - subjects and objects
- faceted metadata
- facet term and facet value
(a homogenious group or category derived according
to the principle of facet analysis – SRR).
References
● Kumar, Krishan(2008), Theory of classification,
Vikas Publication,New Delhi.
● Patel, Dimple (2002), Organizing the web: a
faceted approach, DRTC, ISI bangalore.
● http://en.wikipedia.org/wiki/Society
TOPICS SPEAKERS
Faceted Approach Sandip Das
Faceted knowledge organisation
framework
Anwesha Bhattacharya
Linking of information: Semantic
annotation
Mohit Garg
Searching the web Samhati Soor
Query processing Manash Kumar
User interface Jayanta Kr. Nayek
Project: FATKS-HUM Manasa Rath
Evaluation Criteria Shiv Shakti Ghosh
Conclusion Tanmoy Mondal
Faceted Approach
Sandip Das
Facet
A generic term to denote any component-be it basic
subject or an isolate idea-of a compound subject.
A generic term to denote facet idea, facet term and facet
number.
According to A. G. Taylor “Clearly defined, mutually
exclusive, and collectively exhaustive aspects, properties or
characteristics of a class or a specific subject.”
e.g- Injuries to corn in france in 1978.
Facet Analysis
● Personality
● Matter
● Energy
● Space
● Time
Faceted Approach
According to Dr. S. R. Ranganathan “faceted
approach involves breaking down of a subject
into a basic subject and isolate ideas.”
e.g-Treatment of Lung Cancer in India during 80's.
Enumerative Classification
An enumerative scheme for classification consists
essentially of a single schedule enumerating all
subjects –of the past, the present, and the
anticipatable future.
e.g- French poetry, Italian drama,
Pros and Cons of Faceted Classification
Pros -
● Offers flexibility for interdisciplinary subjects.
● Ease of accommodating new concept.
● Citation order is not essential.
● Extra facets can be added.
Cons -
● Complicated class number.
● They are very complex and are difficult to use without
experience.
Pros and Cons of Enumerative Classification
Pros-
● Easy to construct class number of compound subjects.
● Generaly accepted and widely used throughout the world
● A fairly short and uncomplicated notation can be used.
● “Notationally” it is easier to display the stucture of the scheme.
Cons-
● Difficult to accommadate new subjects.
● There can be lack of accommodation for even simple subject..
Classification tools
Faceted – Colon Classification
Bibliographic Classification
UDC
Enumerative - LC
DDC
Faceted Classifications on the Web
● Tower Records
(http://www.towerrecords.com )
●
Recipes
( http://www.epicurious.com )
● Annotated Wordnet
(http://www.siderean.com/wordnet17.jsp )
● Te Kete Ipurangi: the Online Learning Centre
(http://www.tki.org.nz/e/search)
Tower Records(http://www.towerrecords.com )
Enumerative Classification on Web
● DDC
ADAM
(http://adam.ac.uk/)
Biz/ed
(http://bized.ac.uk/)
BUBL
(http://bubl.ac.uk/)
● LCC
Cyberstacks(sm)
(http://www.public.iastate.edu/~CYBERSTACKS/)
ADAM (http://adam.ac.uk/)
References
● Chowdhury, GC and Chowdhury, Sudatta.(2007), Organizing
Information : from the shelf to the web, facet publisher,
london
● Ranganathan, S.R. (2006), Prolegomena to library
classification, ess ess publications, bangalore
● Kumar, Krishan(2008), Theory of classification, vikas
publication,new delhi
● Kumari, Nitu (2009), Evaluation of facted portal, DRTC, ISI
Bangalore
● S, Suman and karmakar, Debanshu (2002), The role library
classification in organizing the web, DRTC, ISI bangalore
● Patel, Dimple (2002), Organizing the web: a faceted
approach, DRTC, ISI bangalore
Faceted Knowledge Organization Framework
Anwesha Bhattacharya
Traditional Knowledge Organization Tools
Are the traditional tools enough in the web
environment too?
Can the philosophy of facet analysis applied to
organize the web?
Evolution of the General Knowledge
Modelling Environment
Problems of Keyword-based search
For answers to be relevant, a user must ask the appropriate
query in order to retrieve the desired information and fulfill the
information need (IN).
For keyword-based search a high number of keywords is
necessary to narrow down the search according to the
information need
The semantic ambiguity of querying languages are:
– Built upon natural language
– The query length
The only solution is query refinement.
DEPA
DEPA stands for Discipline, Entity, Property & Action
It is a query refinement method that enables the extraction of the
'DEPA facets' of a user query for search and retrieval purpose
The method uses the general principles of facet analysis
The method provides a user with additional and essential
contextual information, in form of list of new facets
The user can narrow down the search by expanding the original
query with the suggested facets
Example
● The content is represented as a string of fundamental categories DEPA (Discipline,
Entity, Property and Action) that are conceptually equivalent to ‘facets’ (Bhattacharya)
e.g “Treating Apple trees for bacterial disease in Trentino” is represented by a
concept description C1 in a context C
The facet repository FR(C) contains:
⟨ C1 : Agriculture; AppleTrees;Disease; Treating ⟩
Lacuna of DEPA
● Does not have explicit representation of relations
between entities
● Lacks semantics and is more syntactic
● Works better in traditional library scenario than the
web environment
● Need to be improved by formalizing the description
using classification ontology.
This is achieved in Light weight Ontology.
Light weight Ontology
● Lightweight ontologies are ontologies with a tree structure
where each node is associated with a natural language label.
The labels of nodes are organized according to certain
predefined patterns which capture different aspects of meaning,
i.e., facets.
Faceted lightweight ontology
Faceted lightweight ontology is a lightweight ontology
whose terms are extracted from a background knowledge
organized in terms of facets. Using facets allows us to build
ontologies and organize them, which in general exploits the
structure and terms of the four basic DEPA categories
MEDICINE BACKGROUND KNOWLEDGE
Advantages
● The main advantage of the faceted approach is that it makes explicit the
logical relations among the concepts and concept groups and removes
the limitations of traditional hierarchies.
● Each time, by providing the context, the faceted approach allows for the
representation of different concepts.
For example, a cow can be described as an animal, as a pet, as a food
item, as a commodity, as a God for a particular community, and so on,
depending on the domain
● Faceted lightweight ontologies have a well defined structure and, as such,
they are easier to create, to share among users, and they also provide more
organized input to faceted search and navigation.
A new approach: DERA
How to build high quality and scalable ontologies?
DERA is faceted as it is inspired to the principles and canons of the faceted
approach by Ranganathan
DERA is a KR approach as it models entities of a domain (D) by their entity
classes (E), relations (R) and attributes (A)
It is a more general framework to encode the knowledge about the
entities e.g school, cloth etc. , not limited to documents.
Classification Ontologies
Classification ontologies mainly used to describe, classify and search for documents.
In these ontologies, terms denote sets of documents.
Hierarchical BT/NT relations between the terms denote superset/subset relations.
Descriptive ontologies in different domains
Ontologies built with the purpose of describing about real world entities.
Hierarchical is-a relations indicate subset relations.
Steps in DERA(I)
Step 1: Identification of the atomic concepts
(E) watercourse, stream: a natural body of running water flowing on or under the earth
Step 2: Analysis
a body of water
a flowing body of water
no fixed boundary
confined within a bed and stream banks
larger than a brook
Steps in DERA(II)
Step 3: Synthesis.
(E) Body of water
(is-a) Flowing body of water
(is-a) Stream
(is-a) Brook
(is-a) River
(is-a) Still body of water
(is-a) Pond
(is-a) Lake
Step 4: Standardization.
(E) stream, watercourse: a natural body of running water flowing on or under the earth
Steps in DERA(III)
Step 5: Ordering
Terms and concepts in the facets are ordered
Step 6: Formalization
Descriptive ontologies are translated into Description Logic formal ontologies, e.g.,:
River Stream⊑
River (Volga)
Length (Volga, 3692)
Properties of DERA
DERA facets have explicit semantics and are modeled as descriptive ontologies
DERA facets inherits all the nice properties of the faceted approach, such as robustness
and scalability
It allows:
Very expressive document search by any entity property
Automated reasoning via the formalization into Description Logics ontologies
Modeling of relevant entities of the domain and their E/R/A properties.
References
● Agostini A., Madalli, D.P., Prasad, A.R.D.(2011).
Faceted Approach To Diverse Query Processing.
● Giunchiglia et al. (2009b). Giunchiglia, F. Dutta, B.
Maltese, V. (2009). Faceted lightweight ontologies.
● Giunchiglia, F., Dutta, B., Maltese, V.(2013). From
Knowledge Organization to Knowledge
Representation.
Linking of Information: Semantic
Annotation
Mohit Garg
Semantic Annotation
●
For the Semantic Web, we need information in
a hierarchical structure.
●
Idea is that we attach semantic metadata to the
documents,pointing to concepts.
●
An annotation, is a form of meta-data attached
to a particular section of document content
●
SA is an annotation indicating the presence of a
(semantic) entity in a particular place in a text
What SA does
●
Attaches metadata to documents, which makes
them more useful and more easily processable
●
They can then be used for searching and
hyperlinking,categorising, and monitoring
●
Adds value to content of libraries, enabling user
interaction with content.
●
Enhanced capability for cross-referencing and
dynamic document classification
Metadata
● Metadata of a document is an important source
of knowledge about various properties of the
document.
● Subject metadata is of vital significance,
expressing the thought content of the
document.
● The usual practice is to list the names of
concepts dealt within the document using
knowledge organisation tools like a thesaurus,
taxonomy,subject heading list, etc
● But in an automatic information processing
environment, the metadata is meant to be
processed by the machine. For machine
processing of subject metadata, concept names
do not give enough indication of their subject
matter.
● The subject metadata is supposed to represent
the specific subject of the document.
● The specific subject of the document is that
division of knowledge whose extension and
intention are equal to those of its thought
content
FACETISING SEMANTIC ANNOTATION
● In faceted approach to subject analysis, any
compound and complex subjects are made up
of the combination of one or more facets.
● Faceted classification schemes and subject
indexing languages label each concept with a
facet type depending on its role in a given
specific subject.
Facetising semantic annotation
There are three basic components –
●
Concept
●
Facet
●
Category
(1) Concept. A generic term to denote a piece of
idea.
(2) Facet. The concepts are grouped under a
class according to a single characteristic. This
group/class is termed a facet. Facet is also
defined as a class having subclasses by
applying a “train of characteristics”
(Ranganathan).
● (3) Category. Denotes a basic group/class of
concepts which can be commonly identified in
all subjects of the universe of knowledge.
Ranganathan proposed these elementary
categories: personality, matter, energy, space
and time.
Operations in SDA
Faceted semantic annotation is a process for
preparings subject-propositions and consists
primarily of:-
●
concept identification
●
facet analysis
●
facet coordination
●
RDF representation of the facets
Step 1: concept identification
●
In this step the key terms are extracted from the
given sentence. Concepts which are implicit in
the sentence (such as medicine, human body,
disease, etc.) are also identified.
● In medical radiology, determination of depth
dose in Roentgen Rotation Therapy using
Ionization pocket chamber
● Medicine, human body, disease, treatment,
radiation therapy, X-ray therapy, treatment
using rotation technique, determination of depth
dose, using ionized packet chamber.
Step 2: facet analysis
●
The process of facet analysis starts with
labelling extracted concepts as facets
according to their characteristics/role in the
specific subject.
●
For ex.“human body” has been labelled as the
entity facet as it is the key component around
which all other concepts, such as “disease” and
“treatment”, are concentrated.
Step 3: facet coordination
●
Discipline followed by entity (with or without
modifiers) appropriately interpolated or
extrapolated by action and property (with or
without modifiers) is a logical sequence of the
elements of a basic chain manifesting in a
compound subject-proposition. Any action or
property may have further sets of action(s)
and/or property(ies) directly related to it. Their
positions are always after the action or property
to which they relate. Ex. Document annotation
in POPSI
Step 4: RDF representation of the
facets
● The main application of the SDA is for
automatic categorisation of web resources in
web directories. A resource can have more than
one subject proposition.
● This will allow expression of the multiple
dimensions of the specific subject of the
resource. Simultaneously, this will provide an
equal number of access points to the resource
at the time of the user’s query.
References
●
Broughton, V. (2004), “Faceted classification: a
tool for subject access in the twenty first
century”,Signum, Vol. 8, pp. 5-18.
●
Broughton, V. (2006), “The need for a faceted
classification as the basis of all methods of
information retrieval”, Aslib Proceedings, Vol.
58 No. 1, pp. 49-72.
●
Cimiano, P. and Handschuh, S. (2003),
“Ontology-based linguistic annotation”,
Proceedings of the ACL 2003 Workshop on
Linguistic Annotation: Getting the Model Right,
Vol. 19, pp. 14-21.
Searching the Web
Samhati Soor
Searching Information in Web
There are a number of basic ways
to access information on the
Web:
● Going directly to a site if we
have the address
● Browsing
● Conducting a search using a
Web search engine
● Exploring a subject directory
● Exploring the information
stored in live databases on the
Web, known as the "deep
Web"
● Joining an e-mail discussion
group
These attempts broadly fall into two categories:
Webpages in hierarchical categories (taxonomy-
based) - directory structure - human-empowered
directories - Faceted Searching
Yahoo!
Open Directory Project
dmoz
Web Search Engines – Keyword Searching
Excite
AltaVista
Google
Indexing in Web
● Human Assisted Indexing/
Manual Indexing
- Human-empowered
directories/ Faceted
Approach
● Automatic Indexing/ Full-
text indexing
- Full-text index Search
Engines
Human Assisted Indexing
IDENTIFICATION OF KEYWORDS IN A DOCUMENT
STANDARIZATION OF KEYWORDS
CHOICE OF A MODEL
PREPARATION OF ENTRIES
FILLING OF ENTRIES
Human-assisted Indexing on Web
● Depending on humans
for its listings.
● Submitting a short
description to the
directory for entire site or
writing by Editors
● A search looks for
matches only in the
descriptions submitted.
● Changing our web pages
has no effect on our
listing.
●
Following pre-
cordinating
indexing system,
because here the
coordinatiion is done
before giving the query.
Automatic Indexing
CHOOSING ALL WORDS IN A DOCUMENT
ELIMINATING COMMON FUNCTION WORDS
BY CONSULTING A STOP-WORD LIST
COMPUTING THE FREQUENCY OCCURENCE
OF ALL THE WORDS IN EACH DOCUMENT
ASSIGNING MOST
FREQUENTLY OCCURRING TERMS
AS THE INDEX TERMS
Automatic Indexing on Web
Three elements are here :
1. Crawler or Spider
2. Index (Post-coordinate Indexing)
3. Search Engine Software
Natural Language Search
● Natural language words or keywords
we use everyday.
● Flexible and can be combined in
different ways
● A lot of irrelvant searches
● The search engine has to look
everywhere for that word, whether in
the full text of the document or the title
of it.
●
We get more hits, recall is high, but
results are necessarily better.
● This type of searching deals with
automatic Indexing.
Controlled Vocabulary Search
● Preselected
● Used by the indexers of a database or a
catalog to describe a subject so that it can
be easily found.
● We may use a lot of words for the same
topic and that makes searching difficult.
● Helping to bring together under a single
word or phrase, all the material that is
available on a particular topic
● The results are usually more relevant to our
topic, so precision is high.
● The trick is to find the correct subject
headings which is sometimes not so easy.
● This type of search deals with Manual
indexing.
Faceted Searching vs Keyword Searching
FACETED KEYWORD
COST EXPENSIVE INEXPENSIVE
TIME MORE LESS
EXTENT OF
INDEXABLE MATTER
SUMMARIZATION COMPLETE TEXT
EXHAUSTIVITY MORE SELECTIVE VERY LESS SELECTIVE
SPECIFICITY GENERIC
TERMINOLOGY
VERY SPECIFIC
TERMINOLOGY
HEADING STRUCTURE MULTI-TERM CONTEXT LIMITED TERM-
COMBONATION
SEARCHING WIDE-RANGE MORE-SOPHISTICATED
VOCABULARY SMALLER LARGER
SURROGATION NOT OFTEN USED FREQUENTLY USED
RECALL LESS MORE
PRECISION MORE LESS
Faceted+Keyword
References
Indexing The World Wide Web:The Journey So Far
Abhishek Das
Google Inc., USA
Ankit Jain
Google Inc., USA
Towards a Framework for Adaptive FacetedSearch on Twitter
Ilknur Celik, Fabian Abel, Patrick Siehndel
Web Information Systems, Delft University of Technologyfcelik,abelg@tudelft.nl
L3S Research Center, Leibniz University Hannover, Germanysiehndel@l3s.de
Finding Information on the Internet
Session 1:
Searching the World Wide Web
Dr. Hesham Azmi
Program of Information Science
Dept. of Mass Comm.& Information Science
http://www.tourolib.org/services/students/subject-headings
Index and Indexing: Amitabha Chatterjee
Query Processing
Manash Kumar
Query Processing
● The user can make three kind of query
(Alessandro Agostini,Devika P. Madalli and
A.R.D. Prasad. Faceted Approach To Diverse
Query Processing):
– keyword-based
– by focus
– on subject
The Facets Repository
● The facets from facet analysis can be use to
build the facet repository available to a user to
refine a query.
● The facets repository is organized around two
main notions of the DEPA paradigm for facet
analysis
– subjetcs and facets
Concept terms
● A cluster’s name in representation language is
referred to as concept term.
● Two kinds of semantics are provided to a concept
term:
– an extensional semantics, defined over the documents
in the cluster named by the concept term.
– and an intensional semantics, defined by the unique
position of the concept term in a given ‘focus’.
Focused Terms
● A focus consists of an ordered set of related
concept terms.
● A focus is a path of concept terms corresponding to a
path in a given context.
Cx:Fruit
Orange Trentino
Apple
Cx:Fruit
Trentino
Apple
X.doc X.doc
Figure: An example of focus (right)
Facet Engine
● A facet engine is that computes the matching
between the focused terms of a input context and
the predefined set of facets stored in the facet
repository for a number of concepts
● The facet engine looks at all keywords generated
for each concept name in a focus for all focuses of
the hierarchy, and browse through the focus from
the root to the leaf to identify what keywords are
DEPA facets stored in facet repository.
Keyword-based querying
● The user types one or more keywords in the search box.
● Each keyword is mapped to zero or more concept terms in the
context . It is done using an exact string match of the keyword
to the concept term or one of its alias names, namely, its
focused terms.
● If no concept term and its alias names match any keyword, no
concept description is available to the facet engine, and as a
consequence no facets for query refinement are shown to the
user.
Keyword-based querying
● If one concept term or its alias names match
some keywords, then the concept description C
of the concept term is generated and processed
for query expansion.
● The facets that occur in the query expansion
are shown to the user
● When selecting one of the new facets, the user
will narrow down the search by expanding the
original query with the suggested facet.
Keyword-based querying
● If multiple concept terms match some
keywords, then the concept description of
each term is generated and processed for
query expansion
● The user is given the option to refine their
query to indicate which concept term, namely,
keyword they meant the most.
Querying by Focus
● Sometimes user knows at least something about the
subject to search, and the user’s knowledge comes from
documents stored and polyhierarchically organized in the
user’s document collection.
● In this case, it would always be desiderable for the user
to get better and better understanding of the hidden
content of the query, as it is automatically generated by a
suitable method, so as to discover new facets of the
original query that the user was not aware of before.
Querying by Focus
Cx:Fruit
Trentinoorange
Apple
Querying by Focus
● For example, suppose the query is ‘apple’ as contextualized.
The user clicks on a concept term in a context C , ie. the user
selects a focus in C.
● Alternatively, the user types some keywords as in keyword-
based querying, but in a specific order to mean a focus in C.
● For example, the user may click on (an appropriate graphic-
version of) ‘Apple’ in context or either type keywords ‘fruit’,
‘trentino’, ‘apple’ in this order, as to mean
Cx:Fruit>Trentino>Apple
Querying by Focus
● By selecting the facet ‘Fruit’ the user would
narrow down the search space by excluding
all subjects about Apple computer and related
subjects as search results.
● By selecting facet ‘Trentino’ the user would be
able to narrow down the search space by
excluding all subjects about fruits that are not
related toTrentino’s production of apples.
Querying on Subject
● ‘Subject’ refers to the topical intent of a query.
● In faceted approach to representation of
documents in collection ,‘subjects’ are broken
down into distinct divisions, the facets of
subject.
● A typical ‘query-on-subject’ is deemed to relate
to a specific subject of a pre-existing faceted
classification.
Querying on Subject
For example:
A subject-based query is-”What are the documents on the
effects of nitrogen fertilizers on rice plants?”
The subject of the concept subsumed by this query is one of
possibly many focuses.
– Cx1:Rice plants>nitrogen fertilizers>effects.
– Cx2: Agriculture>rice plants>nitrogen fertilizers>effects.
– Cx3:Agriculture> effects of nitrogen> fertilizers>rice plants.
Querying on Subject
● A number of different but equivalent focuses
could exists for a given subject-based query.
● If multiple focuses are computed from the
query’s subject, the user is given the option to
refine the original query to indicate which
focus they meant for the searched subject.
Reference
● Agostini A., Madalli, D.P., Prasad, A.R.D.
(2011). Faceted Approach To Diverse Query
Processing.
User Interface
Jayanta Kr. Nayek
User Interface
● A user interface is the system by which people (users) interact with
a machine.
● The goal of this interaction is effective operation and control of
the machine on the user's end, and feedback from the machine,
which aids the operator in making operational decisions.
● The prototype system provides three interfaces for general users to
search for relevant information:
Browser interface: It allows all current and perspective users to browse and navigate
information
Advanced search interface: Here users can combine their searching through multiple
selection
Basic search interface: It allows entering any number of keywords or phrases. It
supports Boolean queries and present th results with relevance ranking.
Faceted Search
● Facets refer to categories used to characterise information items in a collection.
● Faceted search is a technique for accessing information organized according to a
faceted classification system, allowing users to explore a collection of information by
applying multiple filters.
● It allows the user to narrow down search results with multiple filters,so user quickly
see which results are most relevant for him.
● Eg.Suppose a user want to filter a flight on price, number of stops and the total
flight duration and this query he search on Google,it will not give a relevant result but
if he will search on a faceted portal which are related to Flights and hotels like(Home
and Abroad,fare cost,Inside trip etc.) he will find relevant result in a short period.
Faceted search at homeflights,farecast,airlines,time
schedule
Faceted Navigation
● Faceted navigation allows the user to elaborate a query progressively,
seeing the effect of each choice in one facet on the available choices in
other facets.
● Faceted navigation can also be seen as an alternative for Advanced
search where users can 'search' on the information 'facets' rather than
seen the facet come back in the 'normal' navigation.
● Advantages:
More intuitive : Easy to guess what's behind each door.
Dynamic selection of categories allowed : Supports multiple
perspectives
Systematic advantages : Fewer elements-Its ability to handle
compound search.
List of portals on web use faceted approach
● Flameno :http://flamenco.berkeley.edu/index.html
● Wine.com: http://www.wine.com/
● Amazon.com:http://www.amazon.com/
● Ebay.com:http://www.ebay.in/
● Flipkart:http://www.flipkart.com/
● Epicurious.com://www.epicurious.com/
● Etoys.com:http://www.etoys.com/home/index.jsp
The Flamenco Search Interface Project
Search Interfaces that Flow
● FLAMENCO stands for FLexible information Access using MEtadata in Novel
COmbinations.
● The Flamenco search interface framework has the primary design goal of allowing users to
move through large information spaces in a flexible manner without feeling lost.
● A key property of the interface is the explicit exposure of category metadata, to guide
the user toward possible choices, and to organize the results of keyword searches.
● The interface uses hierarchical faceted metadata in a manner that allows users to both
refine and expand the current query, while maintaining a consistent representation of the
collection's structure.
● This use of metadata is integrated with free-text search, allowing the user to follow links,
then add search terms, then follow more links, without interrupting the interaction
flow.
Flamenco Documentation
Preparing Your Data
● A Flamenco collection is a set of items that are all the same kind (for
example, all items are books, or all items are songs, and so on).
● The metadata about any given item consists of its facet values and attribute
values. The first step in preparing your collection is to decide which
information will be in facets and which will be in attributes.
● Facet values are used to organize items into categories, whereas
attribute values are only displayed with individual items.
● In the sample collection, for instance, prize is a facet indicating the type of
Nobel Prize won, whereas name is an attribute for the name of the
winner. That's because it makes sense to group Nobel Prize winners into
categories by the type of prize, but not by their names.
Flamenco Documentation
Preparing Your Data
● For Flamenco to load your collection, the metadata about the collection has to be
provided in tab-delimited files (also known as TSV files, with a ".tsv" extension).
● The TSV files you need to provide are:
●
attrs.tsv: gives the list of attributes. Each line in this file represents one attribute.
● facets.tsv: gives the list of facets. Each line in this file represents one facet.
● items.tsv : gives the IDs and attribute values for all the items. Each line of the file
represents one item.
● facet_terms.tsv (for each facet):gives the tree of category terms in the facet.
● facet_map.tsv (for each facet): assigns items to the category terms for that facet.
Each line in this file has two fields.
● sortkeys.tsv (optional): indicates which facets or attributes are to be used for
sorting result lists.
● text.tsv (optional) : supports the text search feature of Flamenco.
Faceted Serch in Wine.com
Selection of Facet in Wine.com:
1. Religion: California
2. Type: Red wine, White, Bubbly
3. Style: Red- light & fruity
4. Price: $20 and below
5. Publications: International wine cellar
Faceted search in Wine.com
Faceted Navigation
References
● Tunkelang, Daniel (2009), Faceted search,
Morgan &Claypool.
● Kumari, Nitu (2009), Evaluation of facted portal,
DRTC, ISI Bangalore.
● http://flamenco.berkeley.edu/index.html
● http://www.wine.com/v6/wineshop/
Project : FATKS-HUM
Manasa Rath
FATKS-HUM (Background)
●To investigate the feasibility of using FAT to develop a
knowledge structure suitable for the digital environment
●To develop and evaluate a prototype implementation in
collaboration with the Arts and Humanities Data Service
(AHDS) and the Humbul Humanities Hub
FAT-HUM
➢serves to test and demonstrate facet analysis in
humanities
➢consists of the three distinct but closely interconnected
classifications of concepts:
-broad classification representing UoK
-more detailed faceted classification tested in two areas of humanities: religion
and visual arts
-classification of generally applicable concepts
MODEL of FAT-HUM
●stems from three faceted/analytic synthetic
classificatory schemes
● Bliss Classification scheme
● Universal Decimal Classification
● Broad System Ordering
Broad Classification Schedule
✔ broad general knowledge classification containing around 300 classes
✔ serves as a basic structure of disciplines to which faceted classification
for humanities is linked
Applicable Concepts
applicable throughout the classification are structured as
'external' vocabulary facets
kept as separate classification schedules called Common
auxiliaries
faceted classification for humanities needs to relate to the
concepts that are not particular to humanities only (e.g. place,
time, persons etc.)
purpose of this research project a test vocabulary compiled from
the BC2 and Universal Decimal Classification was created
MICRO (Inner classification
structure)
-FAT-HUM is a faceted classification developed by organizing
concepts of a specific field of knowledge according to the
following fundamental concept categories (facets) :
Thing - Kind - Part - Property - Material - Process - Operation - Patient -
Product - Byproduct- Agent - Place – Time
- facets of products, by product and material, for instance, may
be less relevant for humanities while some others need to be
introduced, such as 'theory and philosophy'
Syntax FATHUM's analytico-synthetic
feature
-Citation Order-sequence of facets in a
combined class symbol is not accidental
a) Combining concepts within the same facet
b) Combining concepts between facets
c) Combining concepts between humanities and common
auxiliaries
d)Combining concepts between different disciplines
Notation
Notation in the broad classification of knowledge
 Notation in the facets of common auxiliaries
(B4932) Critical reviews
 Notation in the humanities
590 Religion. Theology
 590A Theory and philosophy of religion
 Subject Relationships using symbols
Bias phase showed by 420<<590 'Religion for educational
purposes'
 Filling order
540+941 stands for Religion and Art
Other related project
SALT (Standard based Access service to
multilingual Lexicons and Terminologies)
REFERENCE
The Official Website of Facet Analytical Theory of
Knowledge Structures
http://www.ucl.ac.uk/fatks/classification_system.htm
Evaluation Criteria
Shiv Shakti Ghosh
Aim
• To know the intended design, implementation,
outputs, outcomes and popularity due to ease
of use.
• To provide information to project designers on
how to improve their design, the extent their
project achieved.
• To know how much a user is benefited because
of faceted portal.
Criteria
Content Organization
• It should show a clear and logical structure to
typical users.
• It includes putting critical information near the
top of the site, grouping related elements.
Faceted Navigation
• It should help users locate and link to
destination pages.
• Site maps, feedback on the user’s location
within the site, clickable list of page contents,
glosses should be provided.
Faceted Metadata
• For a huge and organized collection metadata
should be provided.
Facet exposure
• For maximum exposure facets should be placed
across the top.
Layout of Labels within Facets
• For facets having large labels column-oriented
layout should be preferred.
    
Faceted Category Interface
• The retrieve result can be selected by keyword
search by pre assigned metadata terms or by a
combination of both.
Facet Hierarchy Navigation
• Faceted portals should provide faceted hierarchy
navigation.
• In this user selects anyone facet and refines within
its hierarchy.
Multifaceted Navigation
• Users should be able to filter out large sets of
products or content by a variety of product
attributes like (size, color, feature, price range,
specifications).
Directed Navigation
• Combining information found within structured
fields(product name, size or manufacturer) with
unstructured content(product description).
Breadcrumb navigation
• When a user clicks through the site’s hierarchy,
each successive link should be indicated as a
text link.
• When a user ends up with a string of section
and subsection names where he is and where
he had been should be shown exactly.
Browser Interface
• Should help users to search after previewing
summary
Advanced Search Interface
• Users should be able to combine their searching
within categories of facets through multiple
selection boxes
Basic Search Interface
• Should allow users to enter any number of
keywords or phrases and should also support
Boolean queries.
Customization & Personalization
• Skins of the same layout can be used to
express different themes, fonts, and colors can
be used.
• User should be able to save and develop own
user interface templates.
• Facilities for creating user account, profile
should be provided so that user can organize
his/her information.
Reference
●Kumari, Nitu (2009), Evaluation of facted portal,
DRTC, ISI bangalore
Conclusion
Tanmay Mondal
Present Pattern
 User types in a search
 Search engine gives back matching results
 User reads the results and picks the best one
 Users' have to find out relevancy
Why Need?
●Users generally do not adopt new search
interfaces
●How to show a lot more information without
overwhelming or confusing
●Most users prefer simplicity unless complexity really
makes a difference
●Small details information are provided
Advantages
 Lets the user decide how to start, and how to explore
 After refinement, categories that are not relevant to the current
results disappear.
 Very easy to build up complex queries
 Reduces mental work & Provide Simple Error Handling
 Suggests alternatives
Advantages
 Easier to explore the collection
 Helps users infer what kinds of things are in the collection
 Seamless to add new facets and subcategories
 Use the metadata to show where to go next
 More flexible than canned hyperlinks
“The more successful the Web, the greater the problem of
information and resource discovery”
(Wallis and Burden).
Factors :
Expert
Time and Economy
Service providers
Summary
●Conversion: Customers can’t buy what they can’t find
●Efficiency: Good navigation increases productivity
●Confidence: Faceted navigation increases information scent
●Aboutness: Facets show semantic make-up of a collection
●Reduced Uncertainty: Not required to specify precise queries
●Guided Experience: Browsing categories provides a different
experience than keyword search
Reference
http://www.oknamfacetedmetadata.pdf
C6 final
C6 final
C6 final

Weitere ähnliche Inhalte

Ähnlich wie C6 final

Ontology and its various aspects
Ontology and its various aspectsOntology and its various aspects
Ontology and its various aspectssamhati27
 
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...ariadnenetwork
 
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyKnowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyBhojaraju Gunjal
 
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...andrea huang
 
Shalini canons of library classification ( idea plane) pdf
Shalini canons of library classification ( idea plane) pdfShalini canons of library classification ( idea plane) pdf
Shalini canons of library classification ( idea plane) pdfDr Shalini Lihitkar
 
Interaction for a Shared Knowledge with Reperio: The Cardano Case
Interaction for a Shared Knowledge with Reperio: The Cardano CaseInteraction for a Shared Knowledge with Reperio: The Cardano Case
Interaction for a Shared Knowledge with Reperio: The Cardano CaseDamiana Luzzi
 
Philosophies of Technology & Education, and Taxonomies of Learning
Philosophies of Technology & Education, and Taxonomies of LearningPhilosophies of Technology & Education, and Taxonomies of Learning
Philosophies of Technology & Education, and Taxonomies of LearningKelli Buckreus
 
Swoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic SearchSwoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic SearchIDES Editor
 
M1. sem web & ontology introd
M1. sem web & ontology introdM1. sem web & ontology introd
M1. sem web & ontology introdMichele Missikoff
 
Jarrar.lecture notes.aai.2011s.ontology part2_whatisontology
Jarrar.lecture notes.aai.2011s.ontology part2_whatisontologyJarrar.lecture notes.aai.2011s.ontology part2_whatisontology
Jarrar.lecture notes.aai.2011s.ontology part2_whatisontologyPalGov
 
A structured approach to cultural studies of architectural space
A structured approach to cultural studies of architectural spaceA structured approach to cultural studies of architectural space
A structured approach to cultural studies of architectural spacedocanas35
 
Ontology visualization methods—a survey
Ontology visualization methods—a surveyOntology visualization methods—a survey
Ontology visualization methods—a surveyunyil96
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mappingsamhati27
 
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Michele Pasin
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalgowthamnaidu0986
 
Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...
Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...
Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...Antonio Lieto
 
Mining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMarko Rodriguez
 
How to model digital objects within the semantic web
How to model digital objects within the semantic webHow to model digital objects within the semantic web
How to model digital objects within the semantic webAngelica Lo Duca
 
Lloyd Swarmfest 2010 Presentation
Lloyd   Swarmfest 2010 PresentationLloyd   Swarmfest 2010 Presentation
Lloyd Swarmfest 2010 Presentationkalloyd
 

Ähnlich wie C6 final (20)

Ontology and its various aspects
Ontology and its various aspectsOntology and its various aspects
Ontology and its various aspects
 
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
 
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyKnowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
 
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
 
Shalini canons of library classification ( idea plane) pdf
Shalini canons of library classification ( idea plane) pdfShalini canons of library classification ( idea plane) pdf
Shalini canons of library classification ( idea plane) pdf
 
Interaction for a Shared Knowledge with Reperio: The Cardano Case
Interaction for a Shared Knowledge with Reperio: The Cardano CaseInteraction for a Shared Knowledge with Reperio: The Cardano Case
Interaction for a Shared Knowledge with Reperio: The Cardano Case
 
Philosophies of Technology & Education, and Taxonomies of Learning
Philosophies of Technology & Education, and Taxonomies of LearningPhilosophies of Technology & Education, and Taxonomies of Learning
Philosophies of Technology & Education, and Taxonomies of Learning
 
Swoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic SearchSwoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic Search
 
M1. sem web & ontology introd
M1. sem web & ontology introdM1. sem web & ontology introd
M1. sem web & ontology introd
 
Jarrar.lecture notes.aai.2011s.ontology part2_whatisontology
Jarrar.lecture notes.aai.2011s.ontology part2_whatisontologyJarrar.lecture notes.aai.2011s.ontology part2_whatisontology
Jarrar.lecture notes.aai.2011s.ontology part2_whatisontology
 
A structured approach to cultural studies of architectural space
A structured approach to cultural studies of architectural spaceA structured approach to cultural studies of architectural space
A structured approach to cultural studies of architectural space
 
Ontology visualization methods—a survey
Ontology visualization methods—a surveyOntology visualization methods—a survey
Ontology visualization methods—a survey
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professional
 
Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...
Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...
Functional and Structural Models of Commonsense Reasoning in Cognitive Archit...
 
Mining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network Research
 
How to model digital objects within the semantic web
How to model digital objects within the semantic webHow to model digital objects within the semantic web
How to model digital objects within the semantic web
 
Lloyd Swarmfest 2010 Presentation
Lloyd   Swarmfest 2010 PresentationLloyd   Swarmfest 2010 Presentation
Lloyd Swarmfest 2010 Presentation
 
Ontology
OntologyOntology
Ontology
 

Kürzlich hochgeladen

horny (9316020077 ) Goa Call Girls Service by VIP Call Girls in Goa
horny (9316020077 ) Goa  Call Girls Service by VIP Call Girls in Goahorny (9316020077 ) Goa  Call Girls Service by VIP Call Girls in Goa
horny (9316020077 ) Goa Call Girls Service by VIP Call Girls in Goasexy call girls service in goa
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...APNIC
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGAPNIC
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsstephieert
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024APNIC
 
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night StandHot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Standkumarajju5765
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)Damian Radcliffe
 
SEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistSEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistKHM Anwar
 
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...sonatiwari757
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445ruhi
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebJames Anderson
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Delhi Call girls
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 

Kürzlich hochgeladen (20)

horny (9316020077 ) Goa Call Girls Service by VIP Call Girls in Goa
horny (9316020077 ) Goa  Call Girls Service by VIP Call Girls in Goahorny (9316020077 ) Goa  Call Girls Service by VIP Call Girls in Goa
horny (9316020077 ) Goa Call Girls Service by VIP Call Girls in Goa
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOG
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girls
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night StandHot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
SEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistSEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization Specialist
 
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
Call Girls In Noida 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In Noida 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In Noida 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In Noida 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
 

C6 final

  • 1. FACETED APPROACH TO WEB MSLIS 1st year (2nd semester) DRTC, ISIBC
  • 3. Evolution of Knowledge Society Hunter gathering Society Pastoral Society Agricultural Society Industrial SocietyInformation SocietyKnowledge Society
  • 4. Background of Classification ● Aristotle may have been the last person to know everything there was to be known in his own time. ● All philosophy is “a series of footnotes of Pluto”- then all theory and practice of KR are surely a series of footnotes of Aristotle. - Hierarchycal structure (taxonomy) - Dichotomy - Decachotomy (from thee point of the nature and growth of the development of knowledge it is unrealistic to bind the universe of subjects to a dechacotomy, because it growes in different direction and different stages) - Polychotomy (these hierarchical struc. are used in knowedge organisation)
  • 5. Species of Scheme ● Enumerative Scheme (Unidimensional) - listing or enumerating the subject. ● Faceted Scheme : to organise the multidisciplinery / interdisciplinary subject. - subject formation - phase relation - Inter subject - Intra facet - Intra array - a multidimensional approach. ( Cockroach – recipie, cooking, pest , medicine, insect etc.)
  • 6.
  • 7. Person Team Award -Player - Cricket - Arjuna -Sachin Tendulkar - Pakistan Test debute against Sachin Tendulkar Pakistan Cricket Team Arjuna award received
  • 8. ● Web directory use hierarchical structure (like DDC) Ex. - dmoz, Yahoo directory etc. ● Library mainly deals with – Subjects Web deals with both - subjects and objects - faceted metadata - facet term and facet value (a homogenious group or category derived according to the principle of facet analysis – SRR).
  • 9.
  • 10.
  • 11. References ● Kumar, Krishan(2008), Theory of classification, Vikas Publication,New Delhi. ● Patel, Dimple (2002), Organizing the web: a faceted approach, DRTC, ISI bangalore. ● http://en.wikipedia.org/wiki/Society
  • 12. TOPICS SPEAKERS Faceted Approach Sandip Das Faceted knowledge organisation framework Anwesha Bhattacharya Linking of information: Semantic annotation Mohit Garg Searching the web Samhati Soor Query processing Manash Kumar User interface Jayanta Kr. Nayek Project: FATKS-HUM Manasa Rath Evaluation Criteria Shiv Shakti Ghosh Conclusion Tanmoy Mondal
  • 14. Facet A generic term to denote any component-be it basic subject or an isolate idea-of a compound subject. A generic term to denote facet idea, facet term and facet number. According to A. G. Taylor “Clearly defined, mutually exclusive, and collectively exhaustive aspects, properties or characteristics of a class or a specific subject.” e.g- Injuries to corn in france in 1978.
  • 15. Facet Analysis ● Personality ● Matter ● Energy ● Space ● Time
  • 16. Faceted Approach According to Dr. S. R. Ranganathan “faceted approach involves breaking down of a subject into a basic subject and isolate ideas.” e.g-Treatment of Lung Cancer in India during 80's.
  • 17. Enumerative Classification An enumerative scheme for classification consists essentially of a single schedule enumerating all subjects –of the past, the present, and the anticipatable future. e.g- French poetry, Italian drama,
  • 18. Pros and Cons of Faceted Classification Pros - ● Offers flexibility for interdisciplinary subjects. ● Ease of accommodating new concept. ● Citation order is not essential. ● Extra facets can be added. Cons - ● Complicated class number. ● They are very complex and are difficult to use without experience.
  • 19. Pros and Cons of Enumerative Classification Pros- ● Easy to construct class number of compound subjects. ● Generaly accepted and widely used throughout the world ● A fairly short and uncomplicated notation can be used. ● “Notationally” it is easier to display the stucture of the scheme. Cons- ● Difficult to accommadate new subjects. ● There can be lack of accommodation for even simple subject..
  • 20. Classification tools Faceted – Colon Classification Bibliographic Classification UDC Enumerative - LC DDC
  • 21. Faceted Classifications on the Web ● Tower Records (http://www.towerrecords.com ) ● Recipes ( http://www.epicurious.com ) ● Annotated Wordnet (http://www.siderean.com/wordnet17.jsp ) ● Te Kete Ipurangi: the Online Learning Centre (http://www.tki.org.nz/e/search)
  • 23. Enumerative Classification on Web ● DDC ADAM (http://adam.ac.uk/) Biz/ed (http://bized.ac.uk/) BUBL (http://bubl.ac.uk/) ● LCC Cyberstacks(sm) (http://www.public.iastate.edu/~CYBERSTACKS/)
  • 25. References ● Chowdhury, GC and Chowdhury, Sudatta.(2007), Organizing Information : from the shelf to the web, facet publisher, london ● Ranganathan, S.R. (2006), Prolegomena to library classification, ess ess publications, bangalore ● Kumar, Krishan(2008), Theory of classification, vikas publication,new delhi ● Kumari, Nitu (2009), Evaluation of facted portal, DRTC, ISI Bangalore ● S, Suman and karmakar, Debanshu (2002), The role library classification in organizing the web, DRTC, ISI bangalore ● Patel, Dimple (2002), Organizing the web: a faceted approach, DRTC, ISI bangalore
  • 26. Faceted Knowledge Organization Framework Anwesha Bhattacharya
  • 28. Are the traditional tools enough in the web environment too? Can the philosophy of facet analysis applied to organize the web?
  • 29. Evolution of the General Knowledge Modelling Environment
  • 30. Problems of Keyword-based search For answers to be relevant, a user must ask the appropriate query in order to retrieve the desired information and fulfill the information need (IN). For keyword-based search a high number of keywords is necessary to narrow down the search according to the information need The semantic ambiguity of querying languages are: – Built upon natural language – The query length The only solution is query refinement.
  • 31. DEPA DEPA stands for Discipline, Entity, Property & Action It is a query refinement method that enables the extraction of the 'DEPA facets' of a user query for search and retrieval purpose The method uses the general principles of facet analysis The method provides a user with additional and essential contextual information, in form of list of new facets The user can narrow down the search by expanding the original query with the suggested facets
  • 32. Example ● The content is represented as a string of fundamental categories DEPA (Discipline, Entity, Property and Action) that are conceptually equivalent to ‘facets’ (Bhattacharya) e.g “Treating Apple trees for bacterial disease in Trentino” is represented by a concept description C1 in a context C The facet repository FR(C) contains: ⟨ C1 : Agriculture; AppleTrees;Disease; Treating ⟩
  • 33. Lacuna of DEPA ● Does not have explicit representation of relations between entities ● Lacks semantics and is more syntactic ● Works better in traditional library scenario than the web environment ● Need to be improved by formalizing the description using classification ontology. This is achieved in Light weight Ontology.
  • 34. Light weight Ontology ● Lightweight ontologies are ontologies with a tree structure where each node is associated with a natural language label. The labels of nodes are organized according to certain predefined patterns which capture different aspects of meaning, i.e., facets.
  • 35. Faceted lightweight ontology Faceted lightweight ontology is a lightweight ontology whose terms are extracted from a background knowledge organized in terms of facets. Using facets allows us to build ontologies and organize them, which in general exploits the structure and terms of the four basic DEPA categories
  • 37. Advantages ● The main advantage of the faceted approach is that it makes explicit the logical relations among the concepts and concept groups and removes the limitations of traditional hierarchies. ● Each time, by providing the context, the faceted approach allows for the representation of different concepts. For example, a cow can be described as an animal, as a pet, as a food item, as a commodity, as a God for a particular community, and so on, depending on the domain ● Faceted lightweight ontologies have a well defined structure and, as such, they are easier to create, to share among users, and they also provide more organized input to faceted search and navigation.
  • 38. A new approach: DERA How to build high quality and scalable ontologies? DERA is faceted as it is inspired to the principles and canons of the faceted approach by Ranganathan DERA is a KR approach as it models entities of a domain (D) by their entity classes (E), relations (R) and attributes (A) It is a more general framework to encode the knowledge about the entities e.g school, cloth etc. , not limited to documents.
  • 39. Classification Ontologies Classification ontologies mainly used to describe, classify and search for documents. In these ontologies, terms denote sets of documents. Hierarchical BT/NT relations between the terms denote superset/subset relations.
  • 40. Descriptive ontologies in different domains Ontologies built with the purpose of describing about real world entities. Hierarchical is-a relations indicate subset relations.
  • 41. Steps in DERA(I) Step 1: Identification of the atomic concepts (E) watercourse, stream: a natural body of running water flowing on or under the earth Step 2: Analysis a body of water a flowing body of water no fixed boundary confined within a bed and stream banks larger than a brook
  • 42. Steps in DERA(II) Step 3: Synthesis. (E) Body of water (is-a) Flowing body of water (is-a) Stream (is-a) Brook (is-a) River (is-a) Still body of water (is-a) Pond (is-a) Lake Step 4: Standardization. (E) stream, watercourse: a natural body of running water flowing on or under the earth
  • 43. Steps in DERA(III) Step 5: Ordering Terms and concepts in the facets are ordered Step 6: Formalization Descriptive ontologies are translated into Description Logic formal ontologies, e.g.,: River Stream⊑ River (Volga) Length (Volga, 3692)
  • 44. Properties of DERA DERA facets have explicit semantics and are modeled as descriptive ontologies DERA facets inherits all the nice properties of the faceted approach, such as robustness and scalability It allows: Very expressive document search by any entity property Automated reasoning via the formalization into Description Logics ontologies Modeling of relevant entities of the domain and their E/R/A properties.
  • 45. References ● Agostini A., Madalli, D.P., Prasad, A.R.D.(2011). Faceted Approach To Diverse Query Processing. ● Giunchiglia et al. (2009b). Giunchiglia, F. Dutta, B. Maltese, V. (2009). Faceted lightweight ontologies. ● Giunchiglia, F., Dutta, B., Maltese, V.(2013). From Knowledge Organization to Knowledge Representation.
  • 46. Linking of Information: Semantic Annotation Mohit Garg
  • 47. Semantic Annotation ● For the Semantic Web, we need information in a hierarchical structure. ● Idea is that we attach semantic metadata to the documents,pointing to concepts. ● An annotation, is a form of meta-data attached to a particular section of document content ● SA is an annotation indicating the presence of a (semantic) entity in a particular place in a text
  • 48. What SA does ● Attaches metadata to documents, which makes them more useful and more easily processable ● They can then be used for searching and hyperlinking,categorising, and monitoring ● Adds value to content of libraries, enabling user interaction with content. ● Enhanced capability for cross-referencing and dynamic document classification
  • 49. Metadata ● Metadata of a document is an important source of knowledge about various properties of the document. ● Subject metadata is of vital significance, expressing the thought content of the document.
  • 50.
  • 51. ● The usual practice is to list the names of concepts dealt within the document using knowledge organisation tools like a thesaurus, taxonomy,subject heading list, etc ● But in an automatic information processing environment, the metadata is meant to be processed by the machine. For machine processing of subject metadata, concept names do not give enough indication of their subject matter.
  • 52. ● The subject metadata is supposed to represent the specific subject of the document. ● The specific subject of the document is that division of knowledge whose extension and intention are equal to those of its thought content
  • 54. ● In faceted approach to subject analysis, any compound and complex subjects are made up of the combination of one or more facets. ● Faceted classification schemes and subject indexing languages label each concept with a facet type depending on its role in a given specific subject.
  • 55. Facetising semantic annotation There are three basic components – ● Concept ● Facet ● Category
  • 56. (1) Concept. A generic term to denote a piece of idea. (2) Facet. The concepts are grouped under a class according to a single characteristic. This group/class is termed a facet. Facet is also defined as a class having subclasses by applying a “train of characteristics” (Ranganathan).
  • 57. ● (3) Category. Denotes a basic group/class of concepts which can be commonly identified in all subjects of the universe of knowledge. Ranganathan proposed these elementary categories: personality, matter, energy, space and time.
  • 58. Operations in SDA Faceted semantic annotation is a process for preparings subject-propositions and consists primarily of:- ● concept identification ● facet analysis ● facet coordination ● RDF representation of the facets
  • 59. Step 1: concept identification ● In this step the key terms are extracted from the given sentence. Concepts which are implicit in the sentence (such as medicine, human body, disease, etc.) are also identified.
  • 60. ● In medical radiology, determination of depth dose in Roentgen Rotation Therapy using Ionization pocket chamber ● Medicine, human body, disease, treatment, radiation therapy, X-ray therapy, treatment using rotation technique, determination of depth dose, using ionized packet chamber.
  • 61. Step 2: facet analysis ● The process of facet analysis starts with labelling extracted concepts as facets according to their characteristics/role in the specific subject. ● For ex.“human body” has been labelled as the entity facet as it is the key component around which all other concepts, such as “disease” and “treatment”, are concentrated.
  • 62.
  • 63. Step 3: facet coordination ● Discipline followed by entity (with or without modifiers) appropriately interpolated or extrapolated by action and property (with or without modifiers) is a logical sequence of the elements of a basic chain manifesting in a compound subject-proposition. Any action or property may have further sets of action(s) and/or property(ies) directly related to it. Their positions are always after the action or property to which they relate. Ex. Document annotation in POPSI
  • 64.
  • 65. Step 4: RDF representation of the facets
  • 66. ● The main application of the SDA is for automatic categorisation of web resources in web directories. A resource can have more than one subject proposition. ● This will allow expression of the multiple dimensions of the specific subject of the resource. Simultaneously, this will provide an equal number of access points to the resource at the time of the user’s query.
  • 67. References ● Broughton, V. (2004), “Faceted classification: a tool for subject access in the twenty first century”,Signum, Vol. 8, pp. 5-18. ● Broughton, V. (2006), “The need for a faceted classification as the basis of all methods of information retrieval”, Aslib Proceedings, Vol. 58 No. 1, pp. 49-72. ● Cimiano, P. and Handschuh, S. (2003), “Ontology-based linguistic annotation”, Proceedings of the ACL 2003 Workshop on Linguistic Annotation: Getting the Model Right, Vol. 19, pp. 14-21.
  • 69. Searching Information in Web There are a number of basic ways to access information on the Web: ● Going directly to a site if we have the address ● Browsing ● Conducting a search using a Web search engine ● Exploring a subject directory ● Exploring the information stored in live databases on the Web, known as the "deep Web" ● Joining an e-mail discussion group
  • 70. These attempts broadly fall into two categories: Webpages in hierarchical categories (taxonomy- based) - directory structure - human-empowered directories - Faceted Searching Yahoo! Open Directory Project dmoz Web Search Engines – Keyword Searching Excite AltaVista Google
  • 71. Indexing in Web ● Human Assisted Indexing/ Manual Indexing - Human-empowered directories/ Faceted Approach ● Automatic Indexing/ Full- text indexing - Full-text index Search Engines
  • 72. Human Assisted Indexing IDENTIFICATION OF KEYWORDS IN A DOCUMENT STANDARIZATION OF KEYWORDS CHOICE OF A MODEL PREPARATION OF ENTRIES FILLING OF ENTRIES
  • 73. Human-assisted Indexing on Web ● Depending on humans for its listings. ● Submitting a short description to the directory for entire site or writing by Editors ● A search looks for matches only in the descriptions submitted. ● Changing our web pages has no effect on our listing. ● Following pre- cordinating indexing system, because here the coordinatiion is done before giving the query.
  • 74. Automatic Indexing CHOOSING ALL WORDS IN A DOCUMENT ELIMINATING COMMON FUNCTION WORDS BY CONSULTING A STOP-WORD LIST COMPUTING THE FREQUENCY OCCURENCE OF ALL THE WORDS IN EACH DOCUMENT ASSIGNING MOST FREQUENTLY OCCURRING TERMS AS THE INDEX TERMS
  • 75. Automatic Indexing on Web Three elements are here : 1. Crawler or Spider 2. Index (Post-coordinate Indexing) 3. Search Engine Software
  • 76.
  • 77. Natural Language Search ● Natural language words or keywords we use everyday. ● Flexible and can be combined in different ways ● A lot of irrelvant searches ● The search engine has to look everywhere for that word, whether in the full text of the document or the title of it. ● We get more hits, recall is high, but results are necessarily better. ● This type of searching deals with automatic Indexing.
  • 78. Controlled Vocabulary Search ● Preselected ● Used by the indexers of a database or a catalog to describe a subject so that it can be easily found. ● We may use a lot of words for the same topic and that makes searching difficult. ● Helping to bring together under a single word or phrase, all the material that is available on a particular topic ● The results are usually more relevant to our topic, so precision is high. ● The trick is to find the correct subject headings which is sometimes not so easy. ● This type of search deals with Manual indexing.
  • 79. Faceted Searching vs Keyword Searching FACETED KEYWORD COST EXPENSIVE INEXPENSIVE TIME MORE LESS EXTENT OF INDEXABLE MATTER SUMMARIZATION COMPLETE TEXT EXHAUSTIVITY MORE SELECTIVE VERY LESS SELECTIVE SPECIFICITY GENERIC TERMINOLOGY VERY SPECIFIC TERMINOLOGY HEADING STRUCTURE MULTI-TERM CONTEXT LIMITED TERM- COMBONATION SEARCHING WIDE-RANGE MORE-SOPHISTICATED VOCABULARY SMALLER LARGER SURROGATION NOT OFTEN USED FREQUENTLY USED RECALL LESS MORE PRECISION MORE LESS
  • 81. References Indexing The World Wide Web:The Journey So Far Abhishek Das Google Inc., USA Ankit Jain Google Inc., USA Towards a Framework for Adaptive FacetedSearch on Twitter Ilknur Celik, Fabian Abel, Patrick Siehndel Web Information Systems, Delft University of Technologyfcelik,abelg@tudelft.nl L3S Research Center, Leibniz University Hannover, Germanysiehndel@l3s.de Finding Information on the Internet Session 1: Searching the World Wide Web Dr. Hesham Azmi Program of Information Science Dept. of Mass Comm.& Information Science http://www.tourolib.org/services/students/subject-headings Index and Indexing: Amitabha Chatterjee
  • 83. Query Processing ● The user can make three kind of query (Alessandro Agostini,Devika P. Madalli and A.R.D. Prasad. Faceted Approach To Diverse Query Processing): – keyword-based – by focus – on subject
  • 84. The Facets Repository ● The facets from facet analysis can be use to build the facet repository available to a user to refine a query. ● The facets repository is organized around two main notions of the DEPA paradigm for facet analysis – subjetcs and facets
  • 85. Concept terms ● A cluster’s name in representation language is referred to as concept term. ● Two kinds of semantics are provided to a concept term: – an extensional semantics, defined over the documents in the cluster named by the concept term. – and an intensional semantics, defined by the unique position of the concept term in a given ‘focus’.
  • 86. Focused Terms ● A focus consists of an ordered set of related concept terms. ● A focus is a path of concept terms corresponding to a path in a given context. Cx:Fruit Orange Trentino Apple Cx:Fruit Trentino Apple X.doc X.doc Figure: An example of focus (right)
  • 87. Facet Engine ● A facet engine is that computes the matching between the focused terms of a input context and the predefined set of facets stored in the facet repository for a number of concepts ● The facet engine looks at all keywords generated for each concept name in a focus for all focuses of the hierarchy, and browse through the focus from the root to the leaf to identify what keywords are DEPA facets stored in facet repository.
  • 88. Keyword-based querying ● The user types one or more keywords in the search box. ● Each keyword is mapped to zero or more concept terms in the context . It is done using an exact string match of the keyword to the concept term or one of its alias names, namely, its focused terms. ● If no concept term and its alias names match any keyword, no concept description is available to the facet engine, and as a consequence no facets for query refinement are shown to the user.
  • 89. Keyword-based querying ● If one concept term or its alias names match some keywords, then the concept description C of the concept term is generated and processed for query expansion. ● The facets that occur in the query expansion are shown to the user ● When selecting one of the new facets, the user will narrow down the search by expanding the original query with the suggested facet.
  • 90. Keyword-based querying ● If multiple concept terms match some keywords, then the concept description of each term is generated and processed for query expansion ● The user is given the option to refine their query to indicate which concept term, namely, keyword they meant the most.
  • 91. Querying by Focus ● Sometimes user knows at least something about the subject to search, and the user’s knowledge comes from documents stored and polyhierarchically organized in the user’s document collection. ● In this case, it would always be desiderable for the user to get better and better understanding of the hidden content of the query, as it is automatically generated by a suitable method, so as to discover new facets of the original query that the user was not aware of before.
  • 93. Querying by Focus ● For example, suppose the query is ‘apple’ as contextualized. The user clicks on a concept term in a context C , ie. the user selects a focus in C. ● Alternatively, the user types some keywords as in keyword- based querying, but in a specific order to mean a focus in C. ● For example, the user may click on (an appropriate graphic- version of) ‘Apple’ in context or either type keywords ‘fruit’, ‘trentino’, ‘apple’ in this order, as to mean Cx:Fruit>Trentino>Apple
  • 94. Querying by Focus ● By selecting the facet ‘Fruit’ the user would narrow down the search space by excluding all subjects about Apple computer and related subjects as search results. ● By selecting facet ‘Trentino’ the user would be able to narrow down the search space by excluding all subjects about fruits that are not related toTrentino’s production of apples.
  • 95. Querying on Subject ● ‘Subject’ refers to the topical intent of a query. ● In faceted approach to representation of documents in collection ,‘subjects’ are broken down into distinct divisions, the facets of subject. ● A typical ‘query-on-subject’ is deemed to relate to a specific subject of a pre-existing faceted classification.
  • 96. Querying on Subject For example: A subject-based query is-”What are the documents on the effects of nitrogen fertilizers on rice plants?” The subject of the concept subsumed by this query is one of possibly many focuses. – Cx1:Rice plants>nitrogen fertilizers>effects. – Cx2: Agriculture>rice plants>nitrogen fertilizers>effects. – Cx3:Agriculture> effects of nitrogen> fertilizers>rice plants.
  • 97. Querying on Subject ● A number of different but equivalent focuses could exists for a given subject-based query. ● If multiple focuses are computed from the query’s subject, the user is given the option to refine the original query to indicate which focus they meant for the searched subject.
  • 98. Reference ● Agostini A., Madalli, D.P., Prasad, A.R.D. (2011). Faceted Approach To Diverse Query Processing.
  • 100. User Interface ● A user interface is the system by which people (users) interact with a machine. ● The goal of this interaction is effective operation and control of the machine on the user's end, and feedback from the machine, which aids the operator in making operational decisions. ● The prototype system provides three interfaces for general users to search for relevant information: Browser interface: It allows all current and perspective users to browse and navigate information Advanced search interface: Here users can combine their searching through multiple selection Basic search interface: It allows entering any number of keywords or phrases. It supports Boolean queries and present th results with relevance ranking.
  • 101. Faceted Search ● Facets refer to categories used to characterise information items in a collection. ● Faceted search is a technique for accessing information organized according to a faceted classification system, allowing users to explore a collection of information by applying multiple filters. ● It allows the user to narrow down search results with multiple filters,so user quickly see which results are most relevant for him. ● Eg.Suppose a user want to filter a flight on price, number of stops and the total flight duration and this query he search on Google,it will not give a relevant result but if he will search on a faceted portal which are related to Flights and hotels like(Home and Abroad,fare cost,Inside trip etc.) he will find relevant result in a short period.
  • 102. Faceted search at homeflights,farecast,airlines,time schedule
  • 103. Faceted Navigation ● Faceted navigation allows the user to elaborate a query progressively, seeing the effect of each choice in one facet on the available choices in other facets. ● Faceted navigation can also be seen as an alternative for Advanced search where users can 'search' on the information 'facets' rather than seen the facet come back in the 'normal' navigation. ● Advantages: More intuitive : Easy to guess what's behind each door. Dynamic selection of categories allowed : Supports multiple perspectives Systematic advantages : Fewer elements-Its ability to handle compound search.
  • 104.
  • 105. List of portals on web use faceted approach ● Flameno :http://flamenco.berkeley.edu/index.html ● Wine.com: http://www.wine.com/ ● Amazon.com:http://www.amazon.com/ ● Ebay.com:http://www.ebay.in/ ● Flipkart:http://www.flipkart.com/ ● Epicurious.com://www.epicurious.com/ ● Etoys.com:http://www.etoys.com/home/index.jsp
  • 106.
  • 107. The Flamenco Search Interface Project Search Interfaces that Flow ● FLAMENCO stands for FLexible information Access using MEtadata in Novel COmbinations. ● The Flamenco search interface framework has the primary design goal of allowing users to move through large information spaces in a flexible manner without feeling lost. ● A key property of the interface is the explicit exposure of category metadata, to guide the user toward possible choices, and to organize the results of keyword searches. ● The interface uses hierarchical faceted metadata in a manner that allows users to both refine and expand the current query, while maintaining a consistent representation of the collection's structure. ● This use of metadata is integrated with free-text search, allowing the user to follow links, then add search terms, then follow more links, without interrupting the interaction flow.
  • 108. Flamenco Documentation Preparing Your Data ● A Flamenco collection is a set of items that are all the same kind (for example, all items are books, or all items are songs, and so on). ● The metadata about any given item consists of its facet values and attribute values. The first step in preparing your collection is to decide which information will be in facets and which will be in attributes. ● Facet values are used to organize items into categories, whereas attribute values are only displayed with individual items. ● In the sample collection, for instance, prize is a facet indicating the type of Nobel Prize won, whereas name is an attribute for the name of the winner. That's because it makes sense to group Nobel Prize winners into categories by the type of prize, but not by their names.
  • 109. Flamenco Documentation Preparing Your Data ● For Flamenco to load your collection, the metadata about the collection has to be provided in tab-delimited files (also known as TSV files, with a ".tsv" extension). ● The TSV files you need to provide are: ● attrs.tsv: gives the list of attributes. Each line in this file represents one attribute. ● facets.tsv: gives the list of facets. Each line in this file represents one facet. ● items.tsv : gives the IDs and attribute values for all the items. Each line of the file represents one item. ● facet_terms.tsv (for each facet):gives the tree of category terms in the facet. ● facet_map.tsv (for each facet): assigns items to the category terms for that facet. Each line in this file has two fields. ● sortkeys.tsv (optional): indicates which facets or attributes are to be used for sorting result lists. ● text.tsv (optional) : supports the text search feature of Flamenco.
  • 110.
  • 111.
  • 112.
  • 113.
  • 114.
  • 115. Faceted Serch in Wine.com Selection of Facet in Wine.com: 1. Religion: California 2. Type: Red wine, White, Bubbly 3. Style: Red- light & fruity 4. Price: $20 and below 5. Publications: International wine cellar
  • 116. Faceted search in Wine.com
  • 118. References ● Tunkelang, Daniel (2009), Faceted search, Morgan &Claypool. ● Kumari, Nitu (2009), Evaluation of facted portal, DRTC, ISI Bangalore. ● http://flamenco.berkeley.edu/index.html ● http://www.wine.com/v6/wineshop/
  • 120. FATKS-HUM (Background) ●To investigate the feasibility of using FAT to develop a knowledge structure suitable for the digital environment ●To develop and evaluate a prototype implementation in collaboration with the Arts and Humanities Data Service (AHDS) and the Humbul Humanities Hub
  • 121. FAT-HUM ➢serves to test and demonstrate facet analysis in humanities ➢consists of the three distinct but closely interconnected classifications of concepts: -broad classification representing UoK -more detailed faceted classification tested in two areas of humanities: religion and visual arts -classification of generally applicable concepts
  • 122.
  • 123. MODEL of FAT-HUM ●stems from three faceted/analytic synthetic classificatory schemes ● Bliss Classification scheme ● Universal Decimal Classification ● Broad System Ordering
  • 124. Broad Classification Schedule ✔ broad general knowledge classification containing around 300 classes ✔ serves as a basic structure of disciplines to which faceted classification for humanities is linked
  • 125. Applicable Concepts applicable throughout the classification are structured as 'external' vocabulary facets kept as separate classification schedules called Common auxiliaries faceted classification for humanities needs to relate to the concepts that are not particular to humanities only (e.g. place, time, persons etc.) purpose of this research project a test vocabulary compiled from the BC2 and Universal Decimal Classification was created
  • 126. MICRO (Inner classification structure) -FAT-HUM is a faceted classification developed by organizing concepts of a specific field of knowledge according to the following fundamental concept categories (facets) : Thing - Kind - Part - Property - Material - Process - Operation - Patient - Product - Byproduct- Agent - Place – Time - facets of products, by product and material, for instance, may be less relevant for humanities while some others need to be introduced, such as 'theory and philosophy'
  • 127. Syntax FATHUM's analytico-synthetic feature -Citation Order-sequence of facets in a combined class symbol is not accidental a) Combining concepts within the same facet b) Combining concepts between facets c) Combining concepts between humanities and common auxiliaries d)Combining concepts between different disciplines
  • 128. Notation Notation in the broad classification of knowledge  Notation in the facets of common auxiliaries (B4932) Critical reviews  Notation in the humanities 590 Religion. Theology  590A Theory and philosophy of religion  Subject Relationships using symbols Bias phase showed by 420<<590 'Religion for educational purposes'  Filling order 540+941 stands for Religion and Art
  • 129. Other related project SALT (Standard based Access service to multilingual Lexicons and Terminologies)
  • 130. REFERENCE The Official Website of Facet Analytical Theory of Knowledge Structures http://www.ucl.ac.uk/fatks/classification_system.htm
  • 132. Aim • To know the intended design, implementation, outputs, outcomes and popularity due to ease of use. • To provide information to project designers on how to improve their design, the extent their project achieved. • To know how much a user is benefited because of faceted portal.
  • 133. Criteria Content Organization • It should show a clear and logical structure to typical users. • It includes putting critical information near the top of the site, grouping related elements. Faceted Navigation • It should help users locate and link to destination pages. • Site maps, feedback on the user’s location within the site, clickable list of page contents, glosses should be provided.
  • 134. Faceted Metadata • For a huge and organized collection metadata should be provided. Facet exposure • For maximum exposure facets should be placed across the top. Layout of Labels within Facets • For facets having large labels column-oriented layout should be preferred.     
  • 135.
  • 136. Faceted Category Interface • The retrieve result can be selected by keyword search by pre assigned metadata terms or by a combination of both. Facet Hierarchy Navigation • Faceted portals should provide faceted hierarchy navigation. • In this user selects anyone facet and refines within its hierarchy.
  • 137.
  • 138. Multifaceted Navigation • Users should be able to filter out large sets of products or content by a variety of product attributes like (size, color, feature, price range, specifications). Directed Navigation • Combining information found within structured fields(product name, size or manufacturer) with unstructured content(product description).
  • 139.
  • 140.
  • 141. Breadcrumb navigation • When a user clicks through the site’s hierarchy, each successive link should be indicated as a text link. • When a user ends up with a string of section and subsection names where he is and where he had been should be shown exactly.
  • 142.
  • 143. Browser Interface • Should help users to search after previewing summary Advanced Search Interface • Users should be able to combine their searching within categories of facets through multiple selection boxes Basic Search Interface • Should allow users to enter any number of keywords or phrases and should also support Boolean queries.
  • 144. Customization & Personalization • Skins of the same layout can be used to express different themes, fonts, and colors can be used. • User should be able to save and develop own user interface templates. • Facilities for creating user account, profile should be provided so that user can organize his/her information.
  • 145. Reference ●Kumari, Nitu (2009), Evaluation of facted portal, DRTC, ISI bangalore
  • 147. Present Pattern  User types in a search  Search engine gives back matching results  User reads the results and picks the best one  Users' have to find out relevancy
  • 148. Why Need? ●Users generally do not adopt new search interfaces ●How to show a lot more information without overwhelming or confusing ●Most users prefer simplicity unless complexity really makes a difference ●Small details information are provided
  • 149. Advantages  Lets the user decide how to start, and how to explore  After refinement, categories that are not relevant to the current results disappear.  Very easy to build up complex queries  Reduces mental work & Provide Simple Error Handling  Suggests alternatives
  • 150. Advantages  Easier to explore the collection  Helps users infer what kinds of things are in the collection  Seamless to add new facets and subcategories  Use the metadata to show where to go next  More flexible than canned hyperlinks
  • 151. “The more successful the Web, the greater the problem of information and resource discovery” (Wallis and Burden). Factors : Expert Time and Economy Service providers
  • 152. Summary ●Conversion: Customers can’t buy what they can’t find ●Efficiency: Good navigation increases productivity ●Confidence: Faceted navigation increases information scent ●Aboutness: Facets show semantic make-up of a collection ●Reduced Uncertainty: Not required to specify precise queries ●Guided Experience: Browsing categories provides a different experience than keyword search

Hinweis der Redaktion

  1. Fig. 2. For instance, the term horses denotes documents about horses (animals), while the fact that it is placed under transportation means indicates that documents about horses are also documents about transportation means.
  2. KR employs descriptive ontologies, i.e., ontologies built at the purpose of describing and reason about real world entities. In these ontologies, terms denote sets of real world entities, hierarchical is-a relations provide the backbone structure to these ontologies and indicate a subset relation, while the individuals include any real world entity. For instance, the relation horse is-a animal indicates that horses are a subset of all animals. This is called real world semantics in (Giunchiglia et al. 2009b). Descriptive ontologies provide knowledge about entities in terms of classes, attributes and relations.
  3. 940 THE ARTS: AN OUTLINE 940C Art period in general sense. Art in a certain time 940D Art by place in general sense, art in certain place 940E Agents in art. Ethnic groups. Persons. Tools and equipment 590-RELIGION 590A THEORY AND PHILOSOPHY OF RELIGION 590A1 Schools of theology characterised by various attributes 590A2 Concepts in religion. Religious ideas. Theology 590A3 The Holy. The sacred. The supernatural. Object(s) of religion/worship 590A4 God. Gods (Personalised god(s) as distinct from immanent spirits)
  4. In religion, for instance, the following 9 facets are identified to be suitable: Thing (entity) - i.e. the main facet of Religions. Faiths Part Property Processes Operations Patient Agent Time Theory and philosophy of Religion
  5. 2&amp;gt; users looking for information within a specific school, can select ‘Research’ from “purpose’ facet as well as ‘school’ from ‘area’ facet.