How I'd build the next Google
How I'd build the next Google
-or-
How to exploit the fact that only the first 10 pages of a search result are really necessary, to do a distributed map-reduce that doesn't bottleneck on the difficult-to-parallelise 'reduce'.