 |
So Yahoo gets your query and they send it to Inktomi
in Virginia, even though there is a cluster in Santa Clara that we will
talk about in a little bit. The cluster gives them back the answer
in real time and they then do the presentation, which is to say they convert
it to HTML, and they insert whatever advertisement they'd like and whatever
other things, little icons and such, that show up on the page. Maybe
it's Yahoo getting your stock quotes or something like that. So they
actually do the presentation and then you get your answer. So there
are actually quite a few steps in that. There's a step to Yahoo, then
a step from Yahoo to Inktomi. It turns out most search engines work
this way. In fact, what is interesting is that there is one big cluster
that does most of the searches on the internet. It's actually not very
far from here, it's off the Great America highway, I don't know exactly,
but about a mile or 2 miles from here.
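
The steps just described (the portal receives the query, forwards it to the back-end cluster, gets raw results back, and then does the presentation) can be sketched roughly as follows. This is only an illustration of the division of labor, not how Yahoo or Inktomi actually implemented it: the names `TOY_INDEX`, `backend_search`, `present`, and `handle_query` are hypothetical, and the in-memory dictionary merely stands in for the real cluster.

```python
from html import escape

# Hypothetical stand-in for the back-end search cluster: a tiny in-memory index.
TOY_INDEX = {
    "disk drives": ["http://example.com/drives", "http://example.com/storage"],
}


def backend_search(query):
    """Back-end step: return raw result URLs only, with no presentation."""
    return TOY_INDEX.get(query.lower(), [])


def present(query, results, ad_html):
    """Portal step: convert raw results to HTML and insert an advertisement."""
    items = "".join(
        f'<li><a href="{escape(url)}">{escape(url)}</a></li>' for url in results
    )
    return (
        "<html><body>"
        f"{ad_html}"
        f"<h1>Results for {escape(query)}</h1>"
        f"<ul>{items}</ul>"
        "</body></html>"
    )


def handle_query(query):
    """Portal entry point: forward the query, then do the presentation."""
    results = backend_search(query)                # step from portal to back end
    ad_html = '<div class="ad">ad goes here</div>'  # placeholder advertisement
    return present(query, results, ad_html)        # portal builds the final page
```

Calling `handle_query("Disk Drives")` returns a single HTML page in which the back end contributed only the list of URLs, while the surrounding markup and the ad slot come from the portal.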
That main cluster now has 166 nodes and each of
those has 2 CPUs. When I say a large virtual machine, what I mean is
a machine that's got 166 nodes in it, more than 300 processors,
that solve the search engine queries. It works as a virtual computer.
It also has a lot of disks, this is where I am gonna go with this.
But I think what is interesting is that this picture implies that there
is an infrastructure being built in the background that actually does
what we call the heavy lifting. And it's important because it is
not only a centralization of CPU resources but a centralization of disk
resources. In fact, I think that the most important trend from the
internet for the disk drive industry is that most storage that end users
own won't be in their house, it won't be in their laptops. It's gonna
be in the infrastructure. This means that the market is going to
have to shift a little bit. |