Looks like Yahoo! can now refresh its entire database 34% faster, meaning its index will now be updated more often overall than Google's (although Google is still the king of indexing content on a 'rate of freshness' scale).
There is Lisp in there? I am betting it's Elisp for an Emacs major mode or something. But it's 72% Java. The C/C++ would be for POSIX interfaces, probably some core VM code, and perhaps HotSpot stuff.
Hadoop seems to be the quasi-secret sauce in a number of projects.
A comparison between Hadoop and Google's Sawzall is at: http://glinden.blogspot.com/2007/04/yahoo-pig-and-google-saw...
The NLP search engine Powerset also uses it. http://blog.powerset.com/2007/10/16/powerset-empowered-by-ha...