| By Dave Graham | Article Rating: |
|
| January 5, 2009 11:15 AM EST | Reads: |
9,200 |
Dave Graham's Blog
With the advent of Cloud Computing and the general resurgence of computing grids, data storage has been taken for granted. However, as cloud computing’s storage and access demands continue to grow, the need for an optimized storage layer and hardware accompaniment become even more critical.
The general focus has been on computational power, integration points via software (API access, for example), and code portability. Storage, on the other hand, was considered a commodity to be taken advantage of; a simple pool of storage for whatever data needed some level of retention and access. However, as cloud computing’s storage and access demands continue to grow, the need for an optimized storage layer and hardware accompaniment become even more critical.
In this series of articles (which represent a paper I am writing), I will attempt to examine key areas where a Cloud Optimized Storage Solution (referred to as COSS through the remainder of this document) can both bolster the general availability of the cloud at large and concurrently provide appropriate performance for the cloud application set based on Service Level Agreements (SLAs). As part of this examination, I will look at the following key areas: the type of content being stored and how it is being allocated to storage, the expected performance “tiers” or metrics associated with performance, SLA types and examples, COSS hardware performance, the interoperability and portability of the COSS solution, compliance and authentication characteristics for storage, and the integration of environmental considerations in the design (i.e. “green” computing). Further, I will also attempt to introduce the concept of neural networking as an integration model for COSS. In order to enhance the understanding of each of these categories, I will include visuals where appropriate as well as tabular data explanations for consideration.
Part 1: What Content is being stored on COSS?

Data within the global Cloud is considered to be of two distinct varieties: structured and unstructured. Structured data is best defined as: “data that ha(s) been represented in a manner that allows computation with those data .” Structured data includes (but is not limited to) meta-data, XML/XHTML, database frameworks and underlying structures, and email (some). Structured data is therefore “information that has been organised to allow identification and separation of the context of the information from its content .” Unstructured data, on the other hand, is best defined as: “Data that is not in tabular or delimited format ” or “Data which is not structured such as free-text . The computer cannot automatically extract properties and relationships… ” Unstructured data, in praxis, refers to content such as audio, video, graphic images, email (some), documents (some), and some variations of XHTML (not tagged).
Each type of content, whether it be structured or unstructured, has different influencing factors affecting its storage and retrieval. For example, unstructured data, like audio or video, typically has some method of compression applied to it (MP3, AAC, WMV, MP4, etc) that limits the actions that can be applied to them by storage systems or host-level software. Conversely, structured data can contain high levels of commonality which influence the ability to provide de-duplication level services. In either case, how this content can be managed is a function of both its inherent nature as well as how it is being stored.
[This appeared originally here and is republished in full by kind permission of the author, who retains copyright.]
Published January 5, 2009 Reads 9,200
Copyright © 2009 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By Dave Graham
Dave Graham is a Technical Consultant with EMC Corporation where he focused on designing/architecting private cloud solutions for commercial customers.
- Ubuntu-based Open Source Linux Mint Tests KDE Version
- Linux Virtualization and Tired Open Source Myths
- IGEL Supports Red Hat Enterprise Virtualization 3.0
- CloudLinux Announces Support for Atomia
- Amazon Kindle Fire Gets Its Own 'Personal Cloud Desktop' with AlwaysOnPC App Launch
- SPIRIT DSP Receives 2011 INTERNET TELEPHONY Product of the Year Award
- Hadoop Quickstart: Use Whirr to automate standup of your distributed cluster on Rackspace
- Jury Gets Novell Antitrust Case Against Microsoft
- The Utility Infrastructure Security Market 2012-2022: Cybersecurity & Smart Grids
- FORTUNE Magazine Names Rackspace Among “100 Best Companies to Work For”
- iFollowOffice Turns to Virtual Bridges and Savvis for On-Demand Virtual Desktop Services
- EnterpriseDB Announces Availability of Postgres Plus Cloud Database
- i-Technology in 2012: Five Industry Predictions
- Ubuntu-based Open Source Linux Mint Tests KDE Version
- Amazon to Rent Out Supercomputers
- Amazon Émigré Starts Network Monitoring Firm
- HP’s Putting a Back Door in the Itanium Alamo
- Linux Virtualization and Tired Open Source Myths
- CloudLinux Announces Preferred Partner Program
- MapR Pushes the Hadoop Envelope
- Rightware Announces Gaming Performance Benchmark for OpenGL ES 3.0/Halti
- IGEL Supports Red Hat Enterprise Virtualization 3.0
- CloudLinux Announces Support for Atomia
- 3Dconnexion Announces its Newest 3D Mouse - the SpaceMouse Pro
- The i-Technology Right Stuff
- Linux.SYS-CON.com Exclusive: Linus Discloses *Real* Fathers of Linux
- After Ubuntu, Windows Looks Increasingly Bad, Increasingly Archaic, Increasingly Unfriendly
- A Closer Look at Damn Small Linux
- Linus' Top Ten SCO Barbs
- SCO CEO Posts Open Letter to the Open Source Community
- Netscape Co-Founder's 12 Reasons for Growth of Open Source
- Where Are RIA Technologies Headed in 2008?
- *POINT - COUNTERPOINT SPECIAL* What's Wrong with the Open Source Community?
- Introducing "Cooperative Linux" - Linux for Windows, No Less
- Linux.SYS-CON.com Exclusive: What Would UserLinux Look Like?
- Why Recovering a Deleted Ext3 File Is Difficult . . .
















