SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Indexing Language: Concept,
types & characteristics
Dr. Utpal Das
Dibrugarh University,
Dibrugarh, Assam
utpalishaan@gmail.com
Introduction:
A subject is then any concept or combination of concepts
which is expressed in the document. The readers’ task is
to interpret the words and sentences in the document in
order to understand the concepts. Whether a reader
understands a document depends on how precisely the
author expresses the concepts he refers to and whether
the reader is aware of the concepts the author expresses.
The basic idea is that the concepts exist before the
author writes the document and the reader reads the
document.
• Similarly, the indexer’s task is to identify concepts in the
document and re-express these in indexing terms. This is
done first by establishing the subject content, or in other
words the content of concepts in the document.
Thereafter the principal concept presented in the
subject content is identified, and finally, the concepts
are expressed in the indexing language. The indexing is
successful when the document and the indexing term
express the same concepts.
What is indexing languages?
The term ‘indexing languages’ may be understood as same as the term
‘indexing’ in the broader sense, that is, in a general sense.
 Indexing language is a set of items (vocabulary) and devices for
handling the relationships between them in a system for providing
index descriptions. Indexing language is also referred to as retrieval
language.
 Indexing language is the process of creating set of vocabularies that
helps to provide access to objects of information, books,
documents, articles, etc. Like any other language, it will consist of
two parts: vocabulary and syntax.
this process of creating and providing access to objects of
information could either be manual or through computer
technology.
The above definitions will help us define indexing language
in the following different ways:
• As terms or vocabularies used to represent
document or content of document which are extracted from
document text or assigned from authority list adhering some
process or techniques
• Serving as access points for searching
• Possibly being extracted or derived from document text:
natural language
• Possibly being assigned from authority control list:
controlled vocabulary
So in a nutshell
A system for naming subjects using subject-terms or
vocabularies and also devices for handling the
relationships between them to provide a systematic
index descriptions is called an indexing language.
Like any other language, it will consist of two
parts: vocabulary and syntax.
Again, we need to understand that:
If we use terms or vocabulary as they appear in documents
without modification, we are using natural language.
However, using natural language always may lead to
problems. Because, as per as vocabulary is concerned,
different authors may use different terms to express the same
idea or they may use synonyms to express same idea. If that is
so, it will lead to a decrease in recall while searching with any
one term (idea) appear in documents which is against the
whole purpose of indexing and retrieval.
For example: the same idea may be expressed in more than
one way as per syntax is concerned, like : paediatric or child
disease; geriatric or health care of old people; child
psychology or psychology of children; adult education or
education of adults.
For these reasons, assigned indexing systems introduce a
measure of control over the terms used: we use a controlled
vocabulary.
We also formalize a flexible syntax of natural language by
permitting only certain constructions, as for example, instead of
heat treatment of aluminium, we use aluminium-heat
treatment; instead of using libraries for children, or children’s
libraries, we use libraries, children’s. This is what called using
a structured language or controlled vocabulary
A controlled vocabulary and formalized structure are features of
an artificial indexing language.
The extreme example of an artificial indexing is the notation of
a classification scheme; instead of natural language terms, heat
treatment of aluminium, or the more formalized aluminium-
heat treatment, we use 669.71.04.
Once the subject analysis of the document is
completed, the final step is to represent the selected
concepts in the language of indexing system (as index
entries). The indexer should be familiar with the
indexing tools, and their working rules and procedures
in order to ensure that concepts are organized in a
usable and accessible form. The process of subject
indexing involves basically three steps:
Familiarization => Analysis => Representation
Let us now look at how indexing languages are actually
conceptualized and created.
All indexing languages originate as natural language, or the
language found in documents. Natural language does
not refer to writing style, but to the fact that the
language is not under authority control.
Language under authority control is called controlled
vocabulary. There is nothing special about the words in
controlled vocabulary except the fact that they are
standardized for use in certain systems.
The following diagram illustrates the processes involved in
translating natural language (NL) terms into controlled
vocabulary (CV) terms for entry in database records.
The diagram helps explain why . . .
1. Natural indexing languages are also called derived-term
approaches
2. Controlled indexing languages are also called assigned-
term approaches
Abstracting and Indexing Process
Processes Involved in Translating Natural Language Terms
into Controlled Vocabulary
Full-Text
Document
Abstract NL Record Field
NL Record Field
CV Record FieldAuthority File
Natural Language
Controlled
Vocabulary
Enter in
Enter in
Enter inChose from
Write into
To review, subject analysis requires you to
1. become familiar with document content;
2. extract significant concepts and terms;
3. translate extracted terms into the language—
often controlled—of the system; and
4. formalize the terms (format them, etc.) according
to input rules.
Types of Indexing languages
As the above discussion suggest, there are three
types of Indexing language
Natural Language or Natural indexing language
Controlled Vocabulary or Controlled indexing language
Free indexing language
Natural indexing language:
• This is a slightly broader language in which the description of the
document can be done using any of the terms present in the
document. Any term that is used to define or describe the content
within a document is known as a ‘subject term’. That is why
indexing language some time is called ‘subject indexing’ of
‘subject indexing language’.
• In Natural indexing language, a subject term can be used to
describe/search for a specific document based primarily on its
content.
• A subject term may also be described as a compact synonym or
surrogate for a specific subject representation.
Controlled indexing language:
• Controlled indexing language refers to the indexing language in
which only approved terms are allowed to be used to describe the
document. These subject terms are controlled vocabulary under
subject authority file.
• For subject terms under authority control (or vocabulary control), a
subject authority file or list . . .
may be described as a list of terms that are permitted to be
used in describing or representing specific subjects
May be said to standardize one of two synonyms that are
used to assign or represent specific topics
May be used to determine the preferred term when multiple
terms are used to define or describe a single topic
May be used to provide cross references for terms that are
on par with, hierarchical or alternate in position or
relationships
• Cataloguing and indexing professionals have created
different subject authority control structures:
Subject headings lists are used by cataloguers in cases
where subject terms have been used as subject headings.
A thesaurus is used by indexers where subject terms are
known as descriptors.
Free indexing language:
As the name suggests, this type of indexing language brings
into use any term within or outside the document for its
description.
In today’s times, the searching mechanism and trends have
changed and there is a higher use of free text search. This
demands that the natural language with the highest
possible indexing ideally indexing every text be done. Of
course, whether free text search or expert-driven well-
chosen vocabularies is being done to check which is more
efficient is a matter of research.
Here's how the processes differs for natural language and
controlled vocabulary:
Natural language Controlled vocabulary
Terms are based on existing
vocabulary of documents (which
may be inconsistent)
Terms are based on standardized
vocabulary intended to describe
concepts consistently
Indexers / cataloguers extract
terms from documents and
enter them (or their own terms)
in various subject fields extract
terms from documents,
Indexers / cataloguers choose
appropriate authorized terms from
controlled vocabulary list, and
enter terms in designated
controlled vocabulary field
Searchers may enter any search
terms that are likely to occur in
natural language
Searchers must enter search terms
that are in controlled vocabulary
Basics of Subject Indexing
MEANING:
In the literature of LIS, the phrases subject cataloguing and
subject indexing are used more or less interchangeably. But
it should be understood that subject cataloguing is
intended to embrace only that cataloguing activity which
provides a verbal subject approach to library collections,
especially macro documents (i.e. books). It refers
determining and assigning of suitable entries for the
subject component of a document for use in a library’s
catalogue, i.e. subject catalogue is a representation of
documents. The primary purpose of the subject catalogue
is to show which books on a specific subject are possessed
by the library.
Subject indexing refers to that indexing activity
which provides a verbal subject approach to
micro documents (e.g., journal articles, research
reports, patent literature, etc.). Subject indexing
provides a subject entry for every topic
associated with the content of a micro
document, i.e. subject index is a representation
the knowledge expressed by documents
The representation of documents and the knowledge
expressed by them is one of the central and unique areas
of study within Library and Information Science (LIS) and
is commonly referred to as subject indexing. Subject
approach to information has been a long and extensive
concern of librarianship and is assumed to be the major
approach (access method) of users for a very long
period. Indexes facilitate retrieval of information in both
traditional manual systems and newer computerised
systems. Without proper indexing and indexes, search
and retrieval are virtually impossible.
A subject is then any concept or combination of concepts
which is expressed in the document. The readers’ task is
to interpret the words and sentences in the document in
order to understand the concepts. Whether a reader
understands a document depends on how precisely the
author expresses the concepts he refers to and whether
the reader is aware of the concepts the author expresses.
The basic idea is that the concepts exist before the
author writes the document and the reader reads the
document.
END

Weitere ähnliche Inhalte

Was ist angesagt?

Library congress subject headings
Library congress subject headings Library congress subject headings
Library congress subject headings MahendraAdhikari7
 
Chain indexing
Chain indexingChain indexing
Chain indexingsilambu111
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & TechniquesDr. Utpal Das
 
Subject analysis, subject heading principles
Subject analysis, subject heading principlesSubject analysis, subject heading principles
Subject analysis, subject heading principlesRichard.Sapon-White
 
Anglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 pptAnglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 pptUniversity of Delhi
 
Common communication format
Common communication formatCommon communication format
Common communication formatavid
 
Classified catalogue
Classified  catalogueClassified  catalogue
Classified cataloguetonyviamll89
 
Relationship of information science with library science
Relationship of information science with library scienceRelationship of information science with library science
Relationship of information science with library scienceSadaf Batool
 
Library automation history Anandraj.L
Library automation history Anandraj.LLibrary automation history Anandraj.L
Library automation history Anandraj.Lanujessy
 
key word indexing and their types with example
key word indexing and their types with example key word indexing and their types with example
key word indexing and their types with example Sourav Sarkar
 
Virtual reference srevices
Virtual reference srevicesVirtual reference srevices
Virtual reference srevicesiqra Mubeen
 

Was ist angesagt? (20)

International Standard Bibliographic Description: background and recent devel...
International Standard Bibliographic Description: background and recent devel...International Standard Bibliographic Description: background and recent devel...
International Standard Bibliographic Description: background and recent devel...
 
Canons of cataloguing
Canons of cataloguingCanons of cataloguing
Canons of cataloguing
 
Library congress subject headings
Library congress subject headings Library congress subject headings
Library congress subject headings
 
Uniterm indexing
Uniterm indexing Uniterm indexing
Uniterm indexing
 
Chain indexing
Chain indexingChain indexing
Chain indexing
 
Classified Catalogue Code (ccc)
Classified Catalogue Code (ccc)Classified Catalogue Code (ccc)
Classified Catalogue Code (ccc)
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & Techniques
 
Subject analysis, subject heading principles
Subject analysis, subject heading principlesSubject analysis, subject heading principles
Subject analysis, subject heading principles
 
Desidoc
DesidocDesidoc
Desidoc
 
Anglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 pptAnglo-American Cataloguing Rules AACR 2 ppt
Anglo-American Cataloguing Rules AACR 2 ppt
 
Common communication format
Common communication formatCommon communication format
Common communication format
 
Classified catalogue
Classified  catalogueClassified  catalogue
Classified catalogue
 
Relationship of information science with library science
Relationship of information science with library scienceRelationship of information science with library science
Relationship of information science with library science
 
Digital Library Software
Digital Library SoftwareDigital Library Software
Digital Library Software
 
Library automation history Anandraj.L
Library automation history Anandraj.LLibrary automation history Anandraj.L
Library automation history Anandraj.L
 
Ifla
IflaIfla
Ifla
 
key word indexing and their types with example
key word indexing and their types with example key word indexing and their types with example
key word indexing and their types with example
 
ISO 2709
ISO 2709ISO 2709
ISO 2709
 
Virtual reference srevices
Virtual reference srevicesVirtual reference srevices
Virtual reference srevices
 
Unisist ppt
Unisist pptUnisist ppt
Unisist ppt
 

Ähnlich wie Indexing language concept types and characteristics

Indexing languages (2)
Indexing languages (2)Indexing languages (2)
Indexing languages (2)yhen06
 
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...Sarah Morrow
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
Lecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptxLecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptxsabinafarmonova02
 
Beyond the sentence
Beyond the sentenceBeyond the sentence
Beyond the sentenceMelisa Berto
 
Corpus study design
Corpus study designCorpus study design
Corpus study designbikashtaly
 
Discourse analysis new
Discourse analysis newDiscourse analysis new
Discourse analysis newHarry Subagyo
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptxIhsanSani4
 
4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databaseskeithstanger
 
English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1RanelRabago
 
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdfenglishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdfGinaTabling1
 
Concordancer
ConcordancerConcordancer
ConcordancerCt Hajar
 

Ähnlich wie Indexing language concept types and characteristics (20)

Indexing
IndexingIndexing
Indexing
 
Indexing languages (2)
Indexing languages (2)Indexing languages (2)
Indexing languages (2)
 
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Lecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptxLecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptx
 
Beyond the sentence
Beyond the sentenceBeyond the sentence
Beyond the sentence
 
Index Language.pptx
Index Language.pptxIndex Language.pptx
Index Language.pptx
 
Corpus study design
Corpus study designCorpus study design
Corpus study design
 
Discourse analysis new
Discourse analysis newDiscourse analysis new
Discourse analysis new
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
 
4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases
 
English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1
 
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdfenglishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
 
PPT FOR M1.pdf
PPT FOR M1.pdfPPT FOR M1.pdf
PPT FOR M1.pdf
 
LANGUAGE.pptx
LANGUAGE.pptxLANGUAGE.pptx
LANGUAGE.pptx
 
Cl35491494
Cl35491494Cl35491494
Cl35491494
 
Kieli analytics
Kieli analyticsKieli analytics
Kieli analytics
 
NLP todo
NLP todoNLP todo
NLP todo
 
Concordancer
ConcordancerConcordancer
Concordancer
 

Mehr von Dr. Utpal Das

Metrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptxMetrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptxDr. Utpal Das
 
Plagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptxPlagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptxDr. Utpal Das
 
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...Dr. Utpal Das
 
How to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptxHow to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptxDr. Utpal Das
 
Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...Dr. Utpal Das
 
Avoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availabilityAvoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availabilityDr. Utpal Das
 
Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it Dr. Utpal Das
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismDr. Utpal Das
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismDr. Utpal Das
 
Truth, fact and ethics in academic research
Truth, fact and ethics in academic researchTruth, fact and ethics in academic research
Truth, fact and ethics in academic researchDr. Utpal Das
 
Ethics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarismEthics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarismDr. Utpal Das
 
Success and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normalSuccess and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normalDr. Utpal Das
 
Information seeking and information use behaviour in libraries
Information seeking  and information use behaviour in librariesInformation seeking  and information use behaviour in libraries
Information seeking and information use behaviour in librariesDr. Utpal Das
 
Information literacy
Information literacyInformation literacy
Information literacyDr. Utpal Das
 
Chemical factors of deterioration of documents
Chemical factors of deterioration of documentsChemical factors of deterioration of documents
Chemical factors of deterioration of documentsDr. Utpal Das
 
Remedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritageRemedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritageDr. Utpal Das
 
Definition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of ManuscriptsDefinition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of ManuscriptsDr. Utpal Das
 
Manuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in AssamManuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in AssamDr. Utpal Das
 
Information storage and retrieval
Information storage and  retrievalInformation storage and  retrieval
Information storage and retrievalDr. Utpal Das
 

Mehr von Dr. Utpal Das (20)

Metrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptxMetrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptx
 
Citation Database
Citation Database Citation Database
Citation Database
 
Plagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptxPlagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptx
 
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
 
How to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptxHow to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptx
 
Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...
 
Avoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availabilityAvoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availability
 
Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarism
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarism
 
Truth, fact and ethics in academic research
Truth, fact and ethics in academic researchTruth, fact and ethics in academic research
Truth, fact and ethics in academic research
 
Ethics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarismEthics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarism
 
Success and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normalSuccess and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normal
 
Information seeking and information use behaviour in libraries
Information seeking  and information use behaviour in librariesInformation seeking  and information use behaviour in libraries
Information seeking and information use behaviour in libraries
 
Information literacy
Information literacyInformation literacy
Information literacy
 
Chemical factors of deterioration of documents
Chemical factors of deterioration of documentsChemical factors of deterioration of documents
Chemical factors of deterioration of documents
 
Remedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritageRemedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritage
 
Definition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of ManuscriptsDefinition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of Manuscripts
 
Manuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in AssamManuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in Assam
 
Information storage and retrieval
Information storage and  retrievalInformation storage and  retrieval
Information storage and retrieval
 

Kürzlich hochgeladen

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 

Kürzlich hochgeladen (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 

Indexing language concept types and characteristics

  • 1. Indexing Language: Concept, types & characteristics Dr. Utpal Das Dibrugarh University, Dibrugarh, Assam utpalishaan@gmail.com
  • 2. Introduction: A subject is then any concept or combination of concepts which is expressed in the document. The readers’ task is to interpret the words and sentences in the document in order to understand the concepts. Whether a reader understands a document depends on how precisely the author expresses the concepts he refers to and whether the reader is aware of the concepts the author expresses. The basic idea is that the concepts exist before the author writes the document and the reader reads the document.
  • 3. • Similarly, the indexer’s task is to identify concepts in the document and re-express these in indexing terms. This is done first by establishing the subject content, or in other words the content of concepts in the document. Thereafter the principal concept presented in the subject content is identified, and finally, the concepts are expressed in the indexing language. The indexing is successful when the document and the indexing term express the same concepts.
  • 4. What is indexing languages? The term ‘indexing languages’ may be understood as same as the term ‘indexing’ in the broader sense, that is, in a general sense.  Indexing language is a set of items (vocabulary) and devices for handling the relationships between them in a system for providing index descriptions. Indexing language is also referred to as retrieval language.  Indexing language is the process of creating set of vocabularies that helps to provide access to objects of information, books, documents, articles, etc. Like any other language, it will consist of two parts: vocabulary and syntax. this process of creating and providing access to objects of information could either be manual or through computer technology.
  • 5. The above definitions will help us define indexing language in the following different ways: • As terms or vocabularies used to represent document or content of document which are extracted from document text or assigned from authority list adhering some process or techniques • Serving as access points for searching • Possibly being extracted or derived from document text: natural language • Possibly being assigned from authority control list: controlled vocabulary
  • 6. So in a nutshell A system for naming subjects using subject-terms or vocabularies and also devices for handling the relationships between them to provide a systematic index descriptions is called an indexing language. Like any other language, it will consist of two parts: vocabulary and syntax.
  • 7. Again, we need to understand that: If we use terms or vocabulary as they appear in documents without modification, we are using natural language. However, using natural language always may lead to problems. Because, as per as vocabulary is concerned, different authors may use different terms to express the same idea or they may use synonyms to express same idea. If that is so, it will lead to a decrease in recall while searching with any one term (idea) appear in documents which is against the whole purpose of indexing and retrieval. For example: the same idea may be expressed in more than one way as per syntax is concerned, like : paediatric or child disease; geriatric or health care of old people; child psychology or psychology of children; adult education or education of adults.
  • 8. For these reasons, assigned indexing systems introduce a measure of control over the terms used: we use a controlled vocabulary. We also formalize a flexible syntax of natural language by permitting only certain constructions, as for example, instead of heat treatment of aluminium, we use aluminium-heat treatment; instead of using libraries for children, or children’s libraries, we use libraries, children’s. This is what called using a structured language or controlled vocabulary A controlled vocabulary and formalized structure are features of an artificial indexing language. The extreme example of an artificial indexing is the notation of a classification scheme; instead of natural language terms, heat treatment of aluminium, or the more formalized aluminium- heat treatment, we use 669.71.04.
  • 9. Once the subject analysis of the document is completed, the final step is to represent the selected concepts in the language of indexing system (as index entries). The indexer should be familiar with the indexing tools, and their working rules and procedures in order to ensure that concepts are organized in a usable and accessible form. The process of subject indexing involves basically three steps: Familiarization => Analysis => Representation
  • 10. Let us now look at how indexing languages are actually conceptualized and created. All indexing languages originate as natural language, or the language found in documents. Natural language does not refer to writing style, but to the fact that the language is not under authority control. Language under authority control is called controlled vocabulary. There is nothing special about the words in controlled vocabulary except the fact that they are standardized for use in certain systems.
  • 11. The following diagram illustrates the processes involved in translating natural language (NL) terms into controlled vocabulary (CV) terms for entry in database records. The diagram helps explain why . . . 1. Natural indexing languages are also called derived-term approaches 2. Controlled indexing languages are also called assigned- term approaches
  • 12. Abstracting and Indexing Process Processes Involved in Translating Natural Language Terms into Controlled Vocabulary Full-Text Document Abstract NL Record Field NL Record Field CV Record FieldAuthority File Natural Language Controlled Vocabulary Enter in Enter in Enter inChose from Write into
  • 13. To review, subject analysis requires you to 1. become familiar with document content; 2. extract significant concepts and terms; 3. translate extracted terms into the language— often controlled—of the system; and 4. formalize the terms (format them, etc.) according to input rules.
  • 14. Types of Indexing languages As the above discussion suggest, there are three types of Indexing language Natural Language or Natural indexing language Controlled Vocabulary or Controlled indexing language Free indexing language
  • 15. Natural indexing language: • This is a slightly broader language in which the description of the document can be done using any of the terms present in the document. Any term that is used to define or describe the content within a document is known as a ‘subject term’. That is why indexing language some time is called ‘subject indexing’ of ‘subject indexing language’. • In Natural indexing language, a subject term can be used to describe/search for a specific document based primarily on its content. • A subject term may also be described as a compact synonym or surrogate for a specific subject representation.
  • 16. Controlled indexing language: • Controlled indexing language refers to the indexing language in which only approved terms are allowed to be used to describe the document. These subject terms are controlled vocabulary under subject authority file. • For subject terms under authority control (or vocabulary control), a subject authority file or list . . . may be described as a list of terms that are permitted to be used in describing or representing specific subjects May be said to standardize one of two synonyms that are used to assign or represent specific topics May be used to determine the preferred term when multiple terms are used to define or describe a single topic May be used to provide cross references for terms that are on par with, hierarchical or alternate in position or relationships
  • 17. • Cataloguing and indexing professionals have created different subject authority control structures: Subject headings lists are used by cataloguers in cases where subject terms have been used as subject headings. A thesaurus is used by indexers where subject terms are known as descriptors.
  • 18. Free indexing language: As the name suggests, this type of indexing language brings into use any term within or outside the document for its description. In today’s times, the searching mechanism and trends have changed and there is a higher use of free text search. This demands that the natural language with the highest possible indexing ideally indexing every text be done. Of course, whether free text search or expert-driven well- chosen vocabularies is being done to check which is more efficient is a matter of research.
  • 19. Here's how the processes differs for natural language and controlled vocabulary: Natural language Controlled vocabulary Terms are based on existing vocabulary of documents (which may be inconsistent) Terms are based on standardized vocabulary intended to describe concepts consistently Indexers / cataloguers extract terms from documents and enter them (or their own terms) in various subject fields extract terms from documents, Indexers / cataloguers choose appropriate authorized terms from controlled vocabulary list, and enter terms in designated controlled vocabulary field Searchers may enter any search terms that are likely to occur in natural language Searchers must enter search terms that are in controlled vocabulary
  • 20. Basics of Subject Indexing MEANING: In the literature of LIS, the phrases subject cataloguing and subject indexing are used more or less interchangeably. But it should be understood that subject cataloguing is intended to embrace only that cataloguing activity which provides a verbal subject approach to library collections, especially macro documents (i.e. books). It refers determining and assigning of suitable entries for the subject component of a document for use in a library’s catalogue, i.e. subject catalogue is a representation of documents. The primary purpose of the subject catalogue is to show which books on a specific subject are possessed by the library.
  • 21. Subject indexing refers to that indexing activity which provides a verbal subject approach to micro documents (e.g., journal articles, research reports, patent literature, etc.). Subject indexing provides a subject entry for every topic associated with the content of a micro document, i.e. subject index is a representation the knowledge expressed by documents
  • 22. The representation of documents and the knowledge expressed by them is one of the central and unique areas of study within Library and Information Science (LIS) and is commonly referred to as subject indexing. Subject approach to information has been a long and extensive concern of librarianship and is assumed to be the major approach (access method) of users for a very long period. Indexes facilitate retrieval of information in both traditional manual systems and newer computerised systems. Without proper indexing and indexes, search and retrieval are virtually impossible.
  • 23. A subject is then any concept or combination of concepts which is expressed in the document. The readers’ task is to interpret the words and sentences in the document in order to understand the concepts. Whether a reader understands a document depends on how precisely the author expresses the concepts he refers to and whether the reader is aware of the concepts the author expresses. The basic idea is that the concepts exist before the author writes the document and the reader reads the document.
  • 24. END