Search engines use bots to index words on websites and create a huge database. When a user searches keywords, the search engine accesses its own index to return results based on a secret ranking system. However, search engines only index about 40-50% of the web and miss dynamically generated and framed pages. Subject directories organize websites by categories chosen by people rather than keywords. They provide a broad index of high-quality sites for a topic but fewer links than search engines.
2. Search engines
• Search engines are the
means by which most
people search the
Web.
• Common examples
are Google, Altavista,
Direct Hit.
MADE BY AMEER
3. Yet they don’t search the Internet
• Yet a search engine does not
actually search the Web
during your search.
• A search engine searches
itself.
• It’s a three-step process.
MADE BY AMEER
4. 1) Bots index words
• Search engines continually
send out hundreds of
“robots” or “bots” (or
“spiders” or “crawlers” )
• Bots visit web sites, read
word by word, and then
index those words.
MADE BY AMEER
5. 2) A database is created
• A huge database of Web
sites thus is gathered
and indexed by word.
• These databases can be
huge, with millions of
links.
MADE BY AMEER
6. 3) The Interface gives you access
• Using the
keywords you
give it, a
search engine
then searches
its own current
index.
MADE BY AMEER
7. Interfaces are based on rankings
• Search engines
return results based
on a ranking system.
• Ranking is the order
that files are listed
when they are
retrieved.
MADE BY AMEER
8. The ranking system is secret
• These systems are proprietary and
often “secret.” In general:
• Altavista ranks web pages higher if
your search terms are found in the
first few words of the page
• Google ranks by document
“popularity” with other similar
searches
• Direct Hit ranks by the length of
time other users spent at the site
MADE BY AMEER
9. Not even half the Web
• With all of this software
and sophistication, even
the best search engines
cover only 40-50% of
the Web.
• And they miss much
else on the Internet.
MADE BY AMEER
10. Bots hit and miss
Bots miss:
XML pages, pdf files
Dynamically created HTML pages
Frames-based pages
New pages or recent updated text
Some say the Invisible Web is 500
times larger than Web
MADE BY AMEER
11. Subject Directories
• A subject directory is
also a database of web
sites and references.
• But a subject directory
is organized not by
keywords but by
category or subject.
MADE BY AMEER
12. Yahoo!
• Yahoo! Is the most
popular subject
directory.
• www. about.com takes
the idea a step further
with subject guides for
selected topics.
MADE BY AMEER
13. Subjects are organized by people.
• Information is
selected, organized and
cataloged by a person,
not software.
• You can usually be
more assured that the
search results will
make sense.
MADE BY AMEER
14. You get an index of sites.
• Subject directories
will not often provide
you with ranked web
sites.
• Instead, you will get a
broad index related to
your topic, divided
further by
subheadings.
MADE BY AMEER
15. Use for early searching.
• Use a subject directory
early in your search
process to learn about
your subject.
• You will get fewer
links of higher quality.
• When you get more
specific questions, you
should use a search
engine.
MADE BY AMEER