ACCC Home Page Academic Computing and Communications Center  
Accounts / Passwords Email Labs / Classrooms Telecom Network Security Software Computing and Network Services Education / Teaching Getting Help
 
The ADN Connection, September/October 1994 The A3C Connection
Sept/Oct 1994 Contents Welcome to the New Year and the New ADN ADN Free Public Micro Labs Welcome to the Wonderful World Wide Web The Mosaic/WWW Alphabet Soup A Small Sample of HTML
Do you want to be a WWW server yourself? Meeting the WWW Challenge By Your Netid You Shall Be Known Free Seminars for Fall 1994 About the ADN Connection  

Welcome to the Wonderful World Wide Web

 
The Campus Beat
WWW Everyone
 
     
 
     
Click
 

Click, Switzerland. Click, Seattle. Click, Sweden. Click, Paris and a tour of the Louvre. Click, Silicon Valley and the latest offerings. Click, click -- order flowers, see maps, hear sound bites, view movie clips, read electronic books, analyze the federal budget, search the world for whatever it takes to scratch that itch. And still be back in time for lunch.

The World Wide Web or WWW makes it easy like never before. The Web was developed at CERN (a high-energy physics lab in Geneva) to facilitate sharing of information. Its popularity, and the number of information providers, has soared over the past year due to some high-quality Web browsers, notably Mosaic from the National Center for Supercomputer Applications in Urbana. Mosaic and the Web have been written up in the popular press, and have attracted serious attention from some no-nonsense companies. There may well be some truth to the claim that Mosaic is the first "killer application" of the Internet, useful to beginner and expert alike.

Fundamentally, the World Wide Web is a worldwide hypertext system. When you read a hypertext document, certain words will appear specially marked. Point to a marked word, click your mouse, and additional information that the word pointed to is displayed. The additional information may be another location in the document you're reading or it may be in another document entirely. Most hypertext systems put all the necessary files on one machine. But the WWW documents are scattered on Web servers throughout the world; your next click could take you next door or to the next continent.

Hypertext is conceptually very similar to books with footnotes and bibliographic citations. But when you encounter a reference to a cited work in a book, you must physically travel to the library to check out the citation. In a Web document, on the other hand, the citation might include the electronic location of the referenced work, and a click of the mouse will bring it to you. If the author of the document has done a good job marking citations, this is a significant convenience.

As if hypertext among multiple servers wasn't enough, the Web's concept of a "document" includes mixing words with pictures, sounds, and video clips. In fact, it's possible to mark pictures as well as words with hypertext pointers to other documents. There are some truly amazing examples of what can be done on the Web in the "Best of the Web" selection at the bottom of the ADN Web home page. But do keep in mind that your appreciation of the dazzling display might be somewhat limited if your WWW client (usually called a "browser") doesn't do videos or sound clips.

 
     
Get Started on the Web the Easy Way:
  NCSA's Mosaic Web browser is already installed on the ADN public MS-Windows and Macintosh personal computers. Just click on the Mosaic icon (a whirling globe) to open the ADN home page and begin your exploration.

Mosaic is also part of the Network Services Kit for Windows and for Macs. Call the CSO to arrange to have it installed on your desktop computer, and you can have the world on your desk.

Be careful -- it's habit forming!

 
     
Gopher vs. WWW
  Aside from the fancy sound-and-light show, WWW has much in common with Gopher. Both use the client server model. Both involve multiple, worldwide servers, where any server can point to a document on any other server. Both let you "walk" from server to server by reading a screen of information, making a selection, reading another screen, and so forth. Gopher makes you walk from one menu to another until you finally get a document, whereas WWW lets you walk from one document directly to the next document. In fact, most Web browsers can be used to explore Gopher. (The reverse is not true; gopher clients can't be used to browse the Web.)

Gopher started as a simple system to make documents available remotely. Its simplicity is a serious virtue. The ease of constructing gopher clients and servers led to its rapid proliferation, as gophers are wont, and today there is probably more useful information stored on gopher servers than on generic WWW servers.

The balance between Gopher and WWW is changing, though, and it will not be long before WWW is the preferred method for both clients and servers. The reason is that WWW browsers use four key ideas that retain a lot of Gopher's simplicity, yet are easily extensible in ways that will keep WWW current in technology for years to come.

 
     
Web Does Gopher, and a Lot More...
  Gopher clients only "talk" Gopher protocol, but WWW browsers use multiple protocols for information exchange. They can "talk" HTTP, Gopher, anonymous FTP, news (as in Usenet/neatness), WAIS and possibly others. Thus Web browsers can access information on many different kinds of servers. In particular, they can act as Gopher clients, depending on the type of pointer they are following. There is no need for a Gopher-only client on any system that can support a Web browser.

How does a WWW client know what protocol to use in each instance? Each marked word or phrase in a WWW document is associated with a pointer that leads to another item. The pointer is called a Uniform Resource Locator, or URL. The protocol to be used for the new item is included in the URL, along with its name and exact location. (See "What is a URL?".)

 
     
Web Provides Marked Text
  Most files from a Gopher server are plain text, and are displayed verbatim with no changes in fonts or line breaks. Although Web browsers can certainly deal with plain text files in the same way, they excel at displaying HTML files (HyperText Markup Language). HTML files are text files, but not "plain" ones. They contain instructions that indicate that this is a title, that's a heading, this group of words forms a list, a paragraph begins here, and most importantly, that this particular word or phrase is to function as a hypertext pointer. (See "A Small Sample of HTML".)

Markup, at its best, is used to mark the function of the words. This leaves the rendering of the words to the client program, which can take best advantage of available hardware and selected user options. Thus, when you view an HTML file with a Web browser, the browser uses the markup to decide how to display the file. Titles are displayed in one font, headings in another; some browsers (Mosaic in particular) allow you to select the style and size of font it uses. Lists are formatted as lists; hypertext pointers are highlighted in some way underlined or maybe bolded; and the text is sized specifically for the window it is in. Often there are additional things the browser can do with the HTML file -- you can ask it to show you the original HTML file, to convert it to a plain text file that you can save on your disk, to generate a PostScript file for printing, or possibly something else.

 
     
Web Browsers Understand HTTP
  HTTP (Hypertext Transport Protocol), one of the protocols supported on the Web, is rapidly becoming the most popular for Web servers. HTTP servers, like gopher servers, are very efficient in serving a large number of clients with quick make-interact-break connections. (You make a selection, your client connects to the appropriate server and relays your request. The server receives and processes your request, sends back the requested information, then immediately breaks the connection.)

But unlike Gopher, HTTP relies heavily on the MIME (Multipart Internet Mail Exchange) format used to send sound, video, graphics, binary files, and even regular text through electronic mail systems. HTTP first tells the client what kind of files it is about to send, and then sends them. Since the client knows, for example, that the first file it's receiving is really a sound file in .AU format, and the second file is really a graphic image in GIF format, it can properly display them.

HTTP goes beyond simply labeling the types of files that are sent. When a client makes a connection, it typically sends the name of an interaction "method" before sending a request for a particular file. The "get" method is most common, and it means to simply fetch a file. But another method called "put" can be used by the client to send information to the server. For example, by using the put method, a client can send a sophisticated database query to an HTTP server, and have the server act on the query by activating a special program. All kinds of custom interactive applications can be built this way, using the generic functions of HTTP.

 
     
Some Browsers Use External Viewers
  Not all Web browsers use external viewers, but external viewers are used particularly well by the National Center for Supercomputer Applications' WWW browser, Mosaic. To be practical, Mosaic by itself does not provide an appropriate means to view every kind of graphic or sound format for all different kinds of hardware. Instead, when Mosaic downloads a particular type of file, it looks up that type in a table, and runs an independent program to view that file. You can add hardware or change the available viewers, and after you update the table, you can still use your original Mosaic program. Thus Mosaic can continue to act as the core client, and deal with file formats and viewers that were not even envisioned when Mosaic was written.

To be fair to Gopher, there is an improved Gopher protocol, Gopher+, that addresses many deficiencies in the original gophers. But the advantages of WWW combine in a natural and extensible way that Gopher+ can't quite match. The fact that good WWW browsers are now becoming widely available has probably turned the tide in the long run.

 
     
There's a WWW conference in Chicago this fall:
  The Second International WWW Conference 94: Mosaic and the Web, sponsored by NCSA, the Open Software Foundation, the National Science Foundation, and CERN, October 17-20. A URL for more information: http://www.ncsa.uiuc.edu/SDG/IT94/General.html  
     
Reports of Gopher's Death are Premature
  Despite this glowing review of WWW, Gopher will not die a quick death because it still retains some important advantages, even aside from its installed base:
  • Good hypertext is hard and very labor intensive. A human brain has to decide how to mark the text, and that takes time. Sure, there are automatic conversion programs for many formats, but a good, clear document design still depends on real Homo Sapiens, not just software. It is much easier to make plain text available, and gopher has a natural simplicity in dealing with plain text.
  • The Gopher menu format might seem rigid, but it forces every entry to have an associated title. Reasonable titles are clear and descriptive, and they can be searched by programs like Veronica. URLs embedded in HTML documents, in contrast, are not required to have any associated description at all. The item a URL points to might have a title, but you won't know what it might be until you fetch it.
  • WWW documents tend to become collections of small subdocuments, all pointing to each other, whereas Gopher documents tend to be larger, more cohesive pieces. This is perhaps an overgeneralization, and neither scheme really forces a particular style of document organization. The multiple subdocument structure is well suited to hypertext manuals, where instant look up is valuable, but it is less useful when one wants to print out the whole set to get an overview. Also, the multiple document collection needs to be well thought out, because each document will be read in many different contexts.
 
     
WWW at the ADN
  Gopher is now the mainstay of our Campus Wide Information System, and will stay this way for some time, for three reasons. First, we don't have the person-power to turn all of our Gopher plain text files into HTML quickly, and there is no advantage to serving plain files with HTTP rather than Gopher. Second, there is no good Web browser for CMS, and until recently there has not been a browser for plain DOS, but Gopher clients exist for essentially all platforms on campus. And third, many information providers on the UIC campus already know how to deal with Gopher and our Gopher-related tools, and many information seekers are already familiar with Gopher and with the organization of our Gopher server. Of course the superior technology of WWW is too enticing to ignore; in the long run, WWW will supplant Gopher.  
     
How to Jump on the WWW Bandwagon
  By all means, try out the World Wide Web. Take a look at the ADN Web home page; use Mosaic on a public Macintosh or MS-Windows computer, or point your browser to the URL http://www.uic.edu/. Don't miss "Best of the Web, 1994", under "The World" at the bottom of the home page.

Mosaic is part of the Network Services Kit for Macs or MS- Windows. Mosaic for X Windows is available from ftp.ncsa.uiuc.edu. Lynx for UNIX/VT100 is available for public use on tigger (enter lynx), or you can get a copy for your own workstation from the anonymous FTP site ftp2.cc.ukans.edu, in the directory pub/lynx.

How about making your own information available on the Web? It's quite easy to make and test a simple HTML file there's enough information in "A Small Sample of HTML" to get started, and "Do You Want to be a WWW Server Yourself?" explains how to make your HTML files public. We may be able to advise you on some tools to make larger jobs fairly straightforward, too. And, of course, if you find something interesting on the Web, please let us know. We're always on the lookout for more useful information to be added to our growing collections.

Comments are welcome; please send them to
Robert Goldstein, bobg@uic.edu
 
The ADN Connection, Sept/Oct 1994 Previous:  ADN Free Public Micro Labs Next:  The Mosaic/WWW Alphabet Soup


1999-10-4  connect@uic.edu
UIC Home Page Search UIC Pages Contact UIC