Flutterby™! : Wayback Machine


Wayback Machine

2002-01-23 20:47:10+01 by Dan Lyke 0 comments

Last night at the Weblogger Interest Group shindig, Ev said that "How the Wayback Machine Works" was a good read: a look at building a 100 terabyte database, mostly on basic commodity hardware.

So if a book is a megabyte, which is about what it is, and the Library of Congress has 20 million books, that's 20 terabytes. This is 100 terabytes. At that size, this is the largest database ever built. It's larger than Walmart's, American Express', the IRS. It's the largest database ever built. And it's receiving queries -- because every page request when people are surfing around is a query to this database -- at the rate of 200 queries per second.
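
For what it's worth, the arithmetic in that quote holds up. Here is a quick back-of-envelope sketch, assuming decimal units (1 MB = 10^6 bytes, 1 TB = 10^12 bytes), which the quote doesn't specify. The variable names are mine, and the per-day figure assumes the 200-per-second rate is sustained around the clock, which the quote doesn't actually claim.

```python
# Back-of-envelope check of the figures in the quote.
# Assumption: decimal units, since the quote does not say which it means.
MB = 10**6
TB = 10**12

book_bytes = 1 * MB          # "a book is a megabyte"
loc_books = 20_000_000       # "the Library of Congress has 20 million books"
archive_bytes = 100 * TB     # "This is 100 terabytes"
queries_per_second = 200     # "200 queries per second"

# Library of Congress estimate, in terabytes.
loc_bytes = book_bytes * loc_books
loc_terabytes = loc_bytes / TB

# How many Library-of-Congress estimates fit in the archive.
archive_vs_loc = archive_bytes / loc_bytes

# Hypothetical: the quoted query rate held constant for 24 hours.
queries_per_day = queries_per_second * 86_400

print(f"Library of Congress estimate: {loc_terabytes:.0f} TB")
print(f"Archive is {archive_vs_loc:.0f}x that estimate")
print(f"At a sustained 200/s: {queries_per_day:,} queries per day")
```

So by the quote's own numbers the archive is about five Libraries of Congress, and a steady 200 queries per second works out to a bit over 17 million a day.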

[ related topics: Free Software Cool Science Open Source ]
