Week10

How Internet Search Engines Work

* Search engine system
(1,2)

* The indexing software
(3)

* How to use the search engine
(4)

* The Database and The Search Engine
(5,6)

Search engine system

Each search engine system called “crawler” or “spider”.

It crawls all home pages are not contained ignore sites and its
links on the Internet and check its contents (graphics files,
sound files, and animation files).
It can take for several seconds to many minutes to crawl each
site it finds, depending on the size and complexity of the site.

The spider sends the information about the contents to
indexing software, as finds documents and URLs.

Crawler and Spider

http://kamezo.cc/blog/entry/44272
http://www.white-cube.jp/cat8/

The indexing software

*About indexing software

For example: SKY Index, MACREX, CINDEX™.etc...

*Researchers have been trying for many years to develop
linguistic processing systems.

● Receives the documents and URLs from the agent.
● Extract information from the document(into and put
database)
● Each search engine
○ extracts and index

■ →different type information.

*Quotation by:
● Sentence:
○ http://www.indexers.org.uk/index.php?id=211
● Picture :
○ http://home.windstream.net/wordsmith/
○ http://www.anindexer.com/about/sw/swindex.html

How to use the search engine

When you want to know something , you will be use the Internet.

But, do you know how to use the Internet?

If you want to know something , you type words on a web page that describe the
information you want to find.

Depending on the search engine , more than just keywords can be used.

For example, you can search by date and other criteria with some search engines.

The image was quoted from:
http://www.highposition.net/news/what-does-the-recession-mean-for-search-
engine-marketing/

The Database and The Search Engine
● The database is searched by the search engine based on the
standard you have set.

● The search engine returns the results in HTML pages.

● The way to return the results is many different.

● You are sent straight to the document that you are interested in.

● The document itself does not exist in the database or on the
search engine site.

http://yutakarlson.blogspot.com/2009/05/funny-restaurant.html

References

Wikipedia
http://ja.wikipedia.org/wiki/

Google
http://www.google.co.jp/

Week10

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (16)

Ähnlich wie Week10

Ähnlich wie Week10 (20)

Week10