Three Ways Enterprises Can Use Hadoop

Hadoop has recently gained a lot of attention from enterprises. Just think about the rapid growth in attendance at Hadoop Summit. There are many different ways to leverage Hadoop in an enterprise, but in general they fall into three major usage patterns, detailed below.

As a Framework


This is what Hadoop was initially intended to be, and it continues to be one of the major approaches in the short term. It means that an enterprise needs to invest in custom application development on top of Hadoop, which normally costs more than buying off-the-shelf applications.

In the long term, I expect usage will slowly move toward the next two approaches. But the framework approach will remain at a certain level, because you simply cannot buy every application you need on the market. Also, you want to control an application if it's a core competence that sets you apart from your competitors.
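
To make "custom application development" a bit more concrete, here is a minimal sketch of the kind of logic an enterprise would write itself when using Hadoop as a framework. It assumes a map/reduce style job, and the word count task and the function names (map_words, reduce_counts, run_job) are illustrative choices of mine, not anything specific to a particular product. It runs locally and only simulates the sort-and-group step that Hadoop performs between the two phases.

```python
from itertools import groupby
from operator import itemgetter


def map_words(lines):
    """Map phase: emit a (word, 1) pair for every word in the input."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def reduce_counts(pairs):
    """Reduce phase: sum the counts for each distinct word.

    On a real cluster the framework sorts and groups pairs by key
    before the reducer sees them; sorting here stands in for that step.
    """
    ordered = sorted(pairs, key=itemgetter(0))
    for word, group in groupby(ordered, key=itemgetter(0)):
        yield word, sum(count for _, count in group)


def run_job(lines):
    """Run map then reduce in-process (hypothetical local driver)."""
    return dict(reduce_counts(map_words(lines)))


if __name__ == "__main__":
    sample = ["big data big", "data Hadoop"]
    for word, count in sorted(run_job(sample).items()):
        print(f"{word}\t{count}")
```

With something like Hadoop Streaming, the map and reduce steps would typically live in separate scripts that read from standard input and write tab-separated key/value lines. The point here is only that the enterprise owns and maintains this code, which is where the extra cost comes from.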

As a Platform

When an enterprise can buy a Hadoop application from an app store, it can run the application on its own Hadoop clusters. Some configuration will be needed, but no software development is involved.

To get there, there has to be some standardization of data formats, for both input and output data, as well as stable Hadoop interfaces. Without these prerequisites, it's hard to run applications this way.

As an Application

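For certain big data applications, it's quite possible to embed Hadoop in the application itself. Consider a Web application that bundles Tomcat: users run the application without thinking about the server inside it. In the same way, you don't even notice the existence of Hadoop, because every detail is hidden inside the application.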

This approach offers the best encapsulation and simplicity, but may not be as efficient as the second approach, where applications can share the same clusters. It is justified when, for example, the underlying cluster is small and the cost of a dedicated cluster is therefore relatively low, or when the application is so demanding that it uses nearly all the resources of a cluster.



    My company has created products like vSearch ("Super vCenter"), vijavaNG APIs, EAM APIs, and the ICE tool. We also help clients with custom development and training for virtualization and cloud computing. Should you, or someone you know, need these products and services, please feel free to contact me: steve __AT__

    Me: Steve Jin, VMware vExpert, who authored VMware VI and vSphere SDK (Prentice Hall) and created the de facto open source vSphere Java API while working in VMware engineering. Companies like Cisco, EMC, NetApp, HP, Dell, and VMware are among the users of the API and other tools I developed, which they use in their products, internal IT orchestration, and test automation.