Wednesday, March 18, 2009

Microsoft loves Hadoop, do you?

Search Engines, spot the odd man out...
  • Google
  • Yahoo
  • Microsoft
  • A9
Increasingly Google dominates search, certainly it works really well and fast. Try Yahoo, it works really well now, they spent millions rewriting their search to use Hadoop.

Corporate search behind the file wall, how does your search perform..?

How many of use corporate search built using Microsoft IT Architecture patterns. What are these architecture patterns, a single 64-bit SQL Server, 4 replicated SQL servers? Really these patterns are stepped in Windows client server technologies, not high performance computing.

Maybe it's all about to change, Microsoft also loves HPC and Hadoop!

Microsoft purchased Powerset in June 2008 for $100 million, to help it catch up with the Hadoop based search engine companies. ZDnet reports Yahoo has over 10,000 Linux servers. Google is rumored to have many more servers. But how can ISVs bennefit from Hadoop?

In a couple of ways, Hadoop technologies are likely to be interwoven in Microsoft Azure services through Powersets natural language search built on Hadoop and Mapreduce. Microsoft even changed the internal policys, to allow the Powerset dev team to continue contibuting back to the Hadoop open source project. If not Hadoop then Microsoft research has their own Dryad approach (pointer c/o David Chappell)














Another way to access Hadoop rleated technlogies, a new company Cloudera staffed by rockstar developers is set to commericalise open source Hadoop.

Marten Mickos, sold MySQL to SUN for $1B, he's now an investor in Cloudera. Seeing another growth story?

Some ISV's will use Cloud services, another set will build out own data centers?

So, how many data servers will you have in 5 years?