[Air-l] Worldwide Web Pages

Ellis Godard ellisgodard at starband.net
Tue Aug 28 19:06:43 PDT 2001


Google's index includes dead pages which are cached but otherwise lost,
FWTW.

-----Original Message-----
From: air-l-admin at aoir.org [mailto:air-l-admin at aoir.org]On Behalf Of
Danyel Fisher
Sent: Tuesday, August 28, 2001 11:53 AM
To: air-l at aoir.org
Subject: [Air-l] Worldwide Web Pages



Check the slightly-out-of-date, fairly technical article from the WWW9
conference (2000). It made a lot of headlines at the time, as it was a
collaboration between IBM and AltaVista / Compaq.

http://www.almaden.ibm.com/cs/k53/www9.final/

They estimate only / as many as 56 million "strongly connected" web pages,
which they found by a variety of techniques, including random IP address
checking and web crawls, in a larger set of 200 million pages.

One might question their technique, as Google claims to have indexed over
"1,387,529,000 web pages".

Danyel


_______________________________________________
Air-l mailing list
Air-l at aoir.org
http://www.aoir.org/mailman/listinfo/air-l





More information about the Air-L mailing list