The World Wide Web
History
Overview of Structure
Overview of Search Engines

History of the Web
1945 � Vannevar Bush, former dean of engineering at MIT, proposed the �Memex�
1965 � Ted Nelson proposed �Literary Machines,� computers that would allow writing and publishing of nonsequential text � hypertext � and the Xanadu Project � allthe world�s information hyperlinked together

History of the Web
Late 1960s � Doug Engelbart at SRI developed the oNLine System (NLS), software for the about-to-be ARPANET that allowed hyperlinking between files on different computers
1968 � Engelbart gives �the mother of all demos� in SF: word processing, windows, hypertext, the mouse, and video conferencing

History of the Web
1970 � no Web�
1980 � still no Web�although Tim Berners-Lee wrote a program, �ENQUIRE,� that hyperlinked between local computers
1984 � still no Web, but the mouse becomes mainstream in the Apple Macintosh, 15 years after Engelbart invents it!

History of the Web
1989-90 � Berners-Lee proposes that CERN create a �global hypertext system�
9/90 � approval to buy a NeXT computer
10/90 � TBL writes first browser program, names it �World Wide Web��

Slide 6

History of the Web
1989-90 � Berners-Lee proposes that CERN create a �global hypertext system�
9/90 � approval to buy a NeXT computer
10/90 � TBL writes first browser program, names it �World Wide Web��
12/90 � first access to Internet news articles
8/91 � software released on the Internet

History of the Web
12/91 � first international demo at a hypertext conference in San Antonio
12/12/91 � first Web server installed in U.S. at SLAC (Stanford Linear Accelerator)
5/92 � �Viola� browser for UNIX
11/92 � 26 �reasonably reliable� servers

History of the Web
9/93 � �Mosaic� browser for PC; Web traffic measures 1% of traffic on NSFnet backbone
1/94 � O�Reilly Associates releases �Internet in a Box� software for home users
3/94 � Marc Andreessen et al. form precursor of Netscape
10/94 � first banner ads on hotwired.com for Zima and AT&T
1995 � AOL, Compuserve provide Web access

Web Sites

Web Structure
There is no right or wrong way to display the �structure� of the Web.
But there is one fundamental difference between Internet and Web structures�
Internet structure is controlled by wiring,
which leads to the possibility of finding this:
http://www.shibumi.org/EotI

Slide 12

Internet Mapping Project
Creating the �peacock on the windshield� with traceroute�
http://research.lumeta.com/ches/map/index.html
And selling the results (it�s all about money)
http://www.peacockmaps.com

Web Structure
There is no right or wrong way to display the �structure� of the Web.
But there is one fundamental difference between Internet and Web structures�
Internet structure is controlled by wiring
Web structure is controlled by hyperlinks
But why should we care what either looks like?�

Slide 15

Web Structure
There is no right or wrong way to display the �structure� of the Web.
But there is one fundamental difference between Internet and Web structures�
Internet structure is controlled by wiring
Web structure is controlled by hyperlinks

Web Structure
Portrayals of Web structure
Illustrative � almost infinite
http://www.cybergeography.org

Steve Martin
��������� ?
Robert DeNiro
�������� ?
Julia Roberts

Steve Martin
Darryl Hannah
Tom Hanks
Meg Ryan
Billy Crystal
Robert DeNiro
Michael Pollard
Faye Dunaway
Ren� Russo
Mel Gibson
Julia Roberts

Web Structure
1999 � �Six Degrees of Kevin Bacon� Theory:�Any two randomly chosen Web pages are, on average, 19 clicks away from each other.�This was taken as a measure of the �diameter� of the Web�
2000 � The Bow Tie Theory says it�s much more complicated�

The Bow Tie Theory
500 million Web pages examined
Pieces/pages are very disconnected
Core diameter is at least 28�
Overall diameter is at least 500�

Web Structure
1999 � �Six Degrees of Kevin Bacon� Theory:�Any two randomly chosen Web pages are, on average, 19 clicks away from each other.�This was taken as a measure of the �diameter� of the Web�
2000 � The Bow Tie Theory says it�s much more complicated�
And then there�s The Invisible Web�

Search Engines
Again, there is no right or wrong way�
But there are three organizational categories:
directories, which use people

Search Engines
Again, there is no right or wrong way�
But there are three organizational categories:
directories � Yahoo
indexes, which use spiders or crawlers

Search Engines
Again, there is no right or wrong way to search�
But there are three organizational categories:
directories � Yahoo
indexes � AltaVista
metasearchers, which use other search engines

Search Engines
Again, there is no right or wrong way to search�
But there are three organizational categories:
directories � Yahoo
indexes � AltaVista
metasearchers � MetaCrawler
We will focus on the first two types.