large distributed systems
So I have recently been paying a lot of attention to systems with huge amounts of data in them.
be it Relegence that deals with lots of incoming news stories and figuring out what they are about in real time or the data layer that is dealing with click streams and recommendation engines.
One of the interesting questions is how we make this data available to the publishing ...