The Data Stack: The Next Focus of Cloud Computing?

Many of us have already heard of the term “software stack.” It shows the software layers in boxes stacking up on each other, all the way from operating system, to middleware, and to applications. When these layers are offered as services, we have IaaS (Infrastructure As A Service), PaaS (Platform As A Service), and SaaS (Software As A Service) respectively for so called cloud service stack. These two stacks are essentially similar if not the same.

If we stop for a moment and think more about the bigger picture, we can find out that the stack is actually aligned with the compute, which is key part of the compute, network, and storage triad.

Bothered by SLOW Web UI to manage vSphere? Want to manage ALL your VMware vCenters, AWS, Azure, Openstack, container behind a SINGLE pane of glass? Want to search, analyze, report, visualize VMs, hosts, networks, datastores, events as easily as Google the Web? Find out more about vSearch 3.0: the search engine for all your private and public clouds.

How about network stack? There is the famous OSI model with several layers of protocols (physical layer, data link layer, network layer, transport layer, session layer, presentation layer, application layer). So networking is covered. I actually think the software stack was inspired by the OSI model, but haven’t done research on this.

Now is the last part, which is the storage or data. Interestingly, while searching for data stack, I got many top pages on stack of data structure – the one that pushes or pops an item at a time on the top. This is of course not what I intended to discuss here.

What I intend to discuss is at high level – how the data is layered in terms of its meaning abstraction of values. This may or may not relate to the software stack. In other words, there is no direct mapping from a software layer to a data layer.

The following is a diagram on data stack:


At the bottom is the storage, which only stores bits and has no idea about what these bits mean. These bits can well be binary programs, thus not data at all.

On top of that is the file system, which interprets and organizes the bits into files of bytes and characters with specific encodings like UTF-8 for Unicode. These files are more human consumable than the underlying storage. They are also better organized with folders (or directories) in a file system.

Moving up is the model layer, which gives some level of meanings to the underlying files. For example, the underlying text file is interpreted as an XML with a specific schema, therefore can be parsed by XML parsers. There are standard content formats defined by MIME.

At the top of the layers is the domain, which clearly defines the meaning of the data, and makes it ready for a software application to make full sense of it. The meaning is more than the literal interpretation but closely related to domains which set up context for a piece of data.

In the same way in which software stack maps to cloud service stack, the data stack will be offered as data services as well: storage as service (SaS), file as service (FaS), model as service (MaS), and domain as service (DaS).

With this data stack, the story of stack architecture of computing is now complete.

This entry was posted in Cloud Computing and tagged , , , . Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

  • NEED HELP?


    My company has created products like vSearch ("Super vCenter"), vijavaNG APIs, EAM APIs, ICE tool. We also help clients with virtualization and cloud computing on customized development, training. Should you, or someone you know, need these products and services, please feel free to contact me: steve __AT__ doublecloud.org.

    Me: Steve Jin, VMware vExpert who authored the VMware VI and vSphere SDK by Prentice Hall, and created the de factor open source vSphere Java API while working at VMware engineering. Companies like Cisco, EMC, NetApp, HP, Dell, VMware, are among the users of the API and other tools I developed for their products, internal IT orchestration, and test automation.