banner_450x90.gif (12558 bytes)
Table of Contents
Site Map

Searching for Web Pages and Web Sites

WWW Search Tools:

While most of the web search engines do not offer all the features that database interfaces offer in shaping retrieval and precision, many have features that can eliminate spurious items.

One major difference between searching the web and databases is that many major databases are indexed for controlled language searching, as opposed to the more common natural language searching. Natural language terms are ordinary terms in common usage. Controlled language terms are specifically selected to represent concepts, and are a further step in organizing the content of databases to enhance precision of retrieval. Retrieval refers to the number of items or "hits" that the search produces; precision refers to the number of items within the retrieval that are relevant to the searcher. Searches for specific words, phrases, or names are generally the easiest type to perform, and result in low retrieval with high precision.

Subject searching is much more difficult, and generally results in higher retrieval (especially by web search engines!) with less precision (i.e., more irrelevant "hits"). The best general advice for online searching is to learn one or two search engines well, back them up with the meta engines, and practice! For help in selecting the proper search tool for your needs, refer to Debbie Abilock's excellent summary on WWW search engines. Before attempting any searches, make sure that you understand the major search concepts discussed below.

We will discuss in general the kinds of searches that can be performed on the WWW via the browser interface. WWW hierarchical subject directories (e.g. Yahoo!), WWW search engines (e.g. Alta Vista) or WWW meta search engines (e.g. MetaCrawler) can be used to locate either pages in websites containing relevant information, or used to locate databases accessible via the WWW which can be searched further. Many WWW databases have interfaces designed to facilitate searching from the web browser. Many large websites use their own search engine at the site to locate content within the web site. These generally are simple search engines that search for user-supplied keywords.

Web Browser Tips:

To view the demo movies for the browser tips, click on the hyperlink; then view the demo.  When motion stops in the screen, close the RealPlayer window to return to this page.

Page text search (Demo)

Use the Find or Find Again command (generally in the Edit menu of your browser) to locate specific text strings on large web pages.

Graphics no load (Demo)

To speed up page loading, turn off graphics loading in the browser Preferences or Options menu. To look at specific graphics, click on them. This is useful for modem connections.

Save (Demo)

Rather than read long passages of text online, save the page to a file and peruse it without the constraints of modem line fees.

View command (dates) (Demo)

This can be used to find out Info, which tells when the page was created and last updated. This helps in determining the currency of the information it contains (see evaluation section). The View command also allows you to view the HTML source code for the page, an aid to learning to create your own HTML documents.
 
Print (Demo)
 
Like the save page command, a page can be sent to the printer and viewed offline to minimize connect charges.
 
Copy and paste (Demo)
 
Text or charts cited in your authored works can be copied and pasted to your pages. It is absolutely important to correctly cite them to credit the original author and avoid plagiarism (see citing online sources).  Also, URLs can be copied from text and pasted in the browser location window.
 
Incremental truncation of the URL (Demo)
 
If the location that you seek cannot be accessed, start deleting elements of the directory path (between the slashes "/") in the browser location box sequentially from the right end towards the left.
 
Go choice on the menubar (Demo)
 
Clicking on this menubar item displays a drop-down menu of the most recently visited websites accessed during your current browser session; this can be used to directly access sites you have visited instead of using the arrow buttons to move forward or backward sequentially through the pages.
 
History choice in Netscape Communicator (Demo)
 
This item can be found by clicking the Communicator item in the menubar; it offers a more extensive list of websites that you have visited, stretching back several days.

WWW Search Engines:

  • Basic interface
    • The simple search interface is generally a text box where you can type search terms without specifying relationships or using search syntax. The retrieval from these searches is generally quite large, and the precision is generally low..
  • Advanced interface
    • The advanced interface offers the opportunity for introducing relationships between terms with search syntax, and for limiting the output of the search. This produces smaller, more specific sets with less irrelevancy (lower retrieval and higher precision).

Meta search engines combine several web search engines to conduct wider searches very quickly. Because each component search engine may have its intrinsic search syntax and logic, usually only very broad and general subject searches can be performed. Meta engines are very useful in searching for a specific word, name, or phrase.

WWW Directories & Lists:

These tools attempt to simplify searching by identifying subject headings and arranging them in hierarchies for searchers. They are often a good place to start when one doesn't have the necessary background to specify search terms or synonyms. They may also incorporate a simple search interface to get the searcher to the most specific page within a topic heading (e.g. Yahoo!). Others may be simply lists of lists of hyperlinks that are dedicated to a specific subject(s).

Specific Directories:

These can be searched for phone numbers, email addresses, maps, software, etc.

 

top.gif (1204 bytes) <date last modified:08/30/00>