|
Instructional Module W17c
|
|
||
|
|
|||
| |
|---|
|
Web search engines are made of three parts:
Figure 1 shows the relationship between the three parts.
Read: |
| |
|
|---|---|
|
What They All Do
|
Crawlers are software programs that run on computers belonging to the search engine. (They don't actually move from one computer to another.) The job of the crawlers is to automatically harvest information. This is the general pattern of their actions:
In this way, information from millions of Web pages is sent back to the search engines. Keeping Crawlers OutSome Websites prefer not to appear in search engines. There are two ways to ask the Web crawlers not to enter your site:
|
|
Differences
|
Search engine crawlers aren't all programmed the same. There are a number of ways in which their designers try to make them better, or more specialized.
Read: |
| |
|
|---|---|
|
What They All Do
What is
a hit? |
The User Interface (UI) is what we see when we use a search engine. It's more than a pretty page, of course! The UI is responsible for these jobs:
|
|
Differences
|
Differences in the UI are the most obvious differences between search engines. They include:
Read: |
| |
|
|---|---|
GOOGLE BULKS UP AS COMPETITION LOOMSGoogle added an additional 1 billion pages to its Web index yesterday, increasing the number of pages it indexes from 3.3 billion to 4.28 billion. The search leader said it also had doubled the number of images in its index from 400 million to 880 million. Even those impressive numbers don't come close to covering the whole Web, however, which is estimated at somewhere around 10 billion pages. Meanwhile, rivals Yahoo and Microsoft are girding for battle. Yahoo plans to dump Google as its search engine and switch over to technology acquired through its purchase last year of Inktomi and Overture. At the same time, Microsoft is spending millions to develop its own proprietary search engine to use on MSN.com. According to comScore Media Metrix, Google's Web sites handled 35% of all Web searches in December, while Yahoo claimed 27% and Microsoft 15%. AOL and other Web sites owned by Time Warner made up 16% of the market. (AP/Los Angeles Times 18 Feb 2004) Quoted in NewsScan Daily, 18 February 2004 There are a few really well known search engines, and many special purpose ones. In addition, there are indexes or directories of Web content. To keep track of what's out there now, the best source of information is Danny Sullivan's Search Engine Watch. Read: |
|
|
Meta Engines
|
With all the differences between search engines, it's often a good idea to ask more than one. This is where the meta-search engines come in. They submit your search to multiple search engines, and organize the results for you. There are two kinds of meta engines:
Read: |
| |
|
|---|---|
![]() |
Click here for review questions. |
|
Audience |
|
| Objectives | On successful completion of this module, you will be able to:
|
| Module W17c: Search Engine Operational Principles |
This document is part of a modular instruction
series in Computer Instruction. For more information, see the overview
or the list of modules in this series, W: World Wide Web..
This document has been used in the following classes: INP
160.
|
| History: |
Original: 16 October 2003, by Laurence J. Krieg
Last modification: Thursday, 18-Nov-2004 21:45:19 EST |
| Copyright: |
Copyright © 2003-2004, Laurence
J. Krieg, Washtenaw Community College Instructors: You may point to this file in your Web-based materials; however, its location may change without notice. Students: You are welcome to make a copy for your personal use. All other uses: Please contact the author, Laurence J. Krieg, for permission: krieg@ieee.org. |