YOUR FEEDBACK
Rapid Module Development for DotNetNuke
MICHEAL SMITH wrote: GO TO THE LINK, U HAVE EVERYTHING U WANT THERE. MICHEAL...


2007 West
GOLD SPONSORS:
Active Endpoints
Your SOA Needs BPEL for Orchestration
BEA
Virtualized SOA: Adaptive Infrastructure for Demanding Applications
Nexaweb
Overcoming Bandwidth Challenges with Nexaweb
TIBCO
What is Service Virtualization?
SILVER SPONSORS:
WSO2
Using Web Services Technologies and FOSS Solutions
Click For 2007 East
Event Webcasts

2008 East
PLATINUM SPONSORS:
Appcelerator
Think Fast: Accelerate AJAX Development with Appcelerator
GOLD SPONSORS:
DreamFace Interactive
The Ultimate Framework for Creating Personalized Web 2.0 Mashups
ICEsoft
AJAX and Social Computing for the Enterprise
Kaazing
Enterprise Comet: Real–Time, Real–Time, or Real–Time Web 2.0?
Nexaweb
Now Playing: Desktop Apps in the Browser!
Sun
jMaki as an AJAX Mashup Framework
POWER PANELS:
The Business Value
of RIAs
What Lies Beyond AJAX?
KEYNOTES:
Douglas Crockford
Can We Fix the Web?
Anthony Franco
2008: The Year of the RIA
Click For 2007 Event Webcasts
SYS-CON.TV
TOP LINKS YOU MUST CLICK ON


Linux.SYS-CON.com Feature - Best Practices in Cluster Management and HPC
How to deliver high ROI, improved system performance, and minimal business risk

Digg This!

Page 1 of 2   next page »

Albert Einstein defined success as 10 percent inspiration and 90 percent perspiration. Although he had no inkling about the emergence of Linux Clusters for High Performance Computing (HPC), his words ring true for designing, building, and managing compute clusters.

Linux Computing Is Here
A Disruptive Computing Paradigm
Businesses, research labs, and universities that require intensive computing power have recently discovered that Linux cluster architectures can meet their ever-increasing demand for computing power, application speed, and "always-on" availability. Clusters have proven to be an excellent solution for compute-intensive technical and business computing tasks requiring significant data crunching.

Linux clusters have enjoyed widespread adoption in performance-driven sectors such as financial services, government, energy, life sciences, and advanced manufacturing. Linux clusters are used for a wide variety of demanding applications including derivative trading, drug design, risk analysis, weapons simulation, petroleum exploration, and design automation.

As penetration has moved beyond the early adoption phase, it's no longer a question of whether to deploy Linux clusters, but rather how to implement cluster computing to deliver high ROI, improved system performance, and minimal business risk. Fundamentally, Linux computing success is also about how clusters are implemented and managed and not just about the choice of hardware or software tools.

Scaling Out Linux Clusters
Compelling Benefits but Hidden Costs and Challenges
For small, 2 to 8 node clusters, with minimal or no scheduling requirements, cluster management is relatively straightforward. However, clusters tend to have a strong growth impetus, given their attractive economies and the typical insatiable application demand for computing power. At a certain point, often around 16-64 nodes, cluster management gets problematic and expensive due to growing workload scheduling requirements, increasing cluster size, and the increasing (and inherent) heterogeneity of software and hardware components. Every node added at this point increases management attention, magnifies complexity and business risk, and multiplies both direct (overhead, labor) and indirect (lateness to market, hassle) costs. Adding to the complexity is a myriad of clustering tools and applications, interconnects, MPIs, compilers and libraries, and management tools.

Large cluster sizes makes it even tougher to deploy applications quickly, fully control and monitor all resources, or respond to rapid changes to user or business requirements. All of these issues conspire to reduce the organization's return on IT assets and overall business performance. Consequently, as new clusters scale or existing Linux infrastructures grow, there is an ongoing and critical need for software management solutions that can quickly and economically scale, manage, and troubleshoot clusters.

"Clusters are enabling more work to be undertaken and therefore are a major driver in the growth of the HPC marketplace...there still are clearly large hurdles that clusters will need to overcome in order to expand the addressability of the technology...[including] the management of a large number of nodes and greater levels of parallelization of cluster applications as well as the conversion of applications to cluster operating environments," according to a 2004 IDC study. The pain around cluster management is shared across industries, processing needs, and user roles, representing a significant user concern.

Fortunately, there are management solutions that reduce the pain and cost of building and scaling HPC clusters, even when the cluster size exceeds 1,000 nodes. Powerful, commercially available software solutions can simplify and automate the management, scale out, and support of Linux clusters by bundling industry standard servers, operating systems, and application software into an easily managed Linux datacenter.

After deciding Linux clusters are the way to go, knowing how to plan and implement a cluster is equally crucial. This requires a significant amount of upfront analysis and a detailed, yet realistic, implementation methodology.

Implementing Linux Clusters
Document, Plan, Test, Launch
The process of implementing clusters is no different than the usual project-planning methodology including a review of goals, resources, staffing, and features. There are, however, some guidelines that can shorten time to implementation and results. For example, Scali has identified a number of best practices for deploying, scaling, and managing clusters. In general, investing more time upfront in planning and testing will bear rewards later on in terms of reduced rework, cost, and deployment time.

Baseline Information Needs
Before beginning an implementation, two important lists should be created: a hardware inventory including specific information on minimum hardware requirements; performance tests and specifications on how to set up and configure nodes; and an application inventory that would include information on install applications, fix run errors, and run and test performance.

Hardware Proof
To minimize risk and the time associated with "going live," it is useful to set up a mini proof of concept to avoid unanticipated problems during actual deployment. The proof of concept helps to verify cluster software installation and review the performance and reliability of the network and hardware. Installation requirements may be customized to support hardware or other software unique to the cluster and performance-testing tools to optimize performance. A management solution can handle replication of those requirements across additional nodes.

Application Proof
All applications must be tested on the mini proof of concept to ensure high performance and robustness for stakeholders. As there are likely many different kinds of off-the-shelf, proprietary, and custom applications in the environment, this process may take time with many fits and starts. It is critical to configure, run, and test the applications to highlight missing functionality or corrupted operation instead of relying only on third-party documentation. This process should also be repeated for all software, to ensure a complete environment.

Following configuring and testing, the next important phase is implementing the cluster. Successful cluster implementation should incorporate the following steps:

  1. Communicate critical information on policies, system information, support, and application performance to cluster users.
  2. Brief the operator on the design philosophy, system architecture, and user interface as well as management tools, scripting, and nomenclature.
  3. Prepare the system for running your application software through hardware migration. This may include power testing of any new hardware (i.e., master node, compute nodes, power systems), PXE boots, or changes to nodal BIOS settings.
  4. Once the new nodes are active, the system is ready for application migration and testing.
  5. Upon completion of application proofing, commissioning can begin. At this stage, users and operators should be made aware that the cluster can now run production jobs and be aware of any missing or delayed features.
Completion of these steps helps ensure a robust cluster implementation.


Page 1 of 2   next page »

About Rick Friedman
Rick Friedman is director of marketing at Scali.

Linux News Desk wrote: Linux World Feature - Best Practices in Cluster Management and HPC. Albert Einstein defined success as 10 percent inspiration and 90 percent perspiration. Although he had no inkling about the emergence of Linux Clusters for High Performance Computing (HPC), his words ring true for designing, building, and managing compute clusters.
read & respond »
LinuxWorld News Desk wrote: Linux World Feature - Best Practices in Cluster Management and HPC. Albert Einstein defined success as 10 percent inspiration and 90 percent perspiration. Although he had no inkling about the emergence of Linux Clusters for High Performance Computing (HPC), his words ring true for designing, building, and managing compute clusters.
read & respond »
LATEST LINUX STORIES
Kevin Hoffman's Review of Iron Man
I took the advice of a friend of mine and steered clear of the 'normal' movie theaters and went a little out of the way to go to a DLP movie theater. The experience of comparing a regular movie theater to a DLP movie theater is like comparing standard def analog TV with a 1080i HDTV si
3rd International Virtualization Conference & Expo: Themes & Topics
From Application Virtualization to Xen, a round-up of the virtualization themes & topics being discussed in NYC June 23-24, 2008 by the world-class speaker faculty at the 3rd International Virtualization Conference & Expo being held by SYS-CON Events in The Roosevelt Hotel, in midtown
Verizon Becomes a Counter-Android Linux Convert
Verizon Wireless is snubbing Google's Linux-based Android initiative to go with the LiMo Foundation's mobile Linux spec for its next wave of mobile phones expected next year. Along with Verizon, Mozilla signed up - giving the consortium its first major open source ISV - and a key one f
Adaptec Launches New Series 2 RAID Controller For Linux Users
Adaptec unveiled a new family of entry-level Unified Serial RAID controllers. The new low-profile Series 2 RAID controllers, built on the same Adaptec dual core RAID-on-Chip (ROC) architecture used in its successful Series 5 RAID controllers, provide significant performance enhancement
JavaOne 2008: Sun Challenges Linux
Sun's mule train has finally pulled into Indiana after three years on the road. Indiana is the Linux-friendly Fedora-like OpenSolaris project meant to move the Solaris-shy Linux community off Linux and on to Solaris tempted by Solaris widgetry like the highly scalable, rollback-easy, 1
Curl Announces Support for Ubuntu for Enterprise RIA Platform
Curl announced it has released the availability of an Ubuntu Installer for the Curl Rich Internet Application (RIA) platform. Curl is a Rich Internet Application platform that competes with Adobe AIR/Flex, Silverlight, and Ajax. Curl has been shipping with Linux support for RedHat 9, S
SUBSCRIBE TO THE WORLD'S MOST POWERFUL NEWSLETTERS
SUBSCRIBE TO OUR RSS FEEDS & GET YOUR SYS-CON NEWS LIVE!
Click to Add our RSS Feeds to the Service of Your Choice:
Google Reader or Homepage Add to My Yahoo! Subscribe with Bloglines Subscribe in NewsGator Online
myFeedster Add to My AOL Subscribe in Rojo Add 'Hugg' to Newsburst from CNET News.com Kinja Digest View Additional SYS-CON Feeds
Publish Your Article! Please send it to editorial(at)sys-con.com!

Advertise on this site! Contact advertising(at)sys-con.com! 201 802-3021

SYS-CON FEATURED WHITEPAPERS

ADS BY GOOGLE