<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>The Grid Guy</title>
	<atom:link href="http://gridguy.net/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://gridguy.net</link>
	<description>My Musings on All Things Grid</description>
	<pubDate>Sat, 09 Dec 2006 01:17:52 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Too Good to Be True?</title>
		<link>http://gridguy.net/?p=18</link>
		<comments>http://gridguy.net/?p=18#comments</comments>
		<pubDate>Sat, 28 Oct 2006 14:25:53 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Database]]></category>

		<category><![CDATA[Storage]]></category>

		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=18</guid>
		<description><![CDATA[There&#8217;s the old adage &#8230;
&#8220;If it looks too good to be true, it probably is.&#8221;
I&#8217;ve counseled others with that same advice.Â  But now, I find myself frequently fighting that very conception &#8211;Â most recently, this past week at Oracle Open World.

On Monday, I had the opportunity to present NetApp&#8217;s Snapshot and FlexClone technologies and their role [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s the old adage &#8230;</p>
<blockquote><p>&#8220;If it looks too good to be true, it probably is.&#8221;</p></blockquote>
<p>I&#8217;ve counseled others with that same advice.Â  But now, I find myself frequently fighting that very conception &#8211;Â most recently, this past week at <a href="http://www.oracle.com/openworld">Oracle Open World</a>.</p>
<p><span id="more-18"></span></p>
<p>On Monday, I had the opportunity to present NetApp&#8217;s <a href="http://www.netapp.com/products/software/snapshot.html">Snapshot</a> and <a href="http://www.netapp.com/products/software/flexclone.html">FlexClone</a> technologies and their role in supporting <a href="http://en.wikipedia.org/wiki/Rapid_application_development">rapid application development (RAD)</a> methodologies for database development, test, and QA:</p>
<blockquote><p>S283383: &#8220;Reducing Time to Deployment: Improving Agility for Database Development, Test, QA, and Migration&#8221;</p>
<p><em>Rapidly creating copies of production and test databases for use in a variety of development, test, quality assurance, and migration scenarios is fundamental to today&#8217;s application development lifecycle. This session reviews traditional and emerging approaches to &#8220;database cloning&#8221; and demonstrates how space and time-efficient copies, such as Network Appliance&#8217;s FlexClone technology, can drastically reduce time to deployment and improve business agility for Oracle Applications and other database-centric application development.</em></p></blockquote>
<p>I had <a href="http://www.netapp.com/go/techontap/matl/0506techontap.pdf">written previously on this topic</a> in NetApp&#8217;s <a href="http://www.netapp.com/techontap/">Tech OnTap</a> - a monthly newsletter aimed at IT professionals that implement and manageÂ NetApp storage technology.Â  And I&#8217;ve spoken on many occasions with NetApp customers about the ways in which these technologies can transform database development.</p>
<p>So once again, after describing <a href="http://www.netapp.com/library/tr/3002.pdf">how NetApp&#8217;s underlying storage system (WAFL) works</a>, and explaining how Snapshot and FlexClone technologies leverage that implementation for space-efficient, near-instantaneous copies of file systems and LUNs, I proceeded to explain how NetApp customers can make working copies of multi-hundred gigabyte databases nearly instantaneously (a few minutes) without consuming any additional storage.</p>
<p>You&#8217;d have thought I was selling <a href="http://en.wikipedia.org/wiki/Snake_oil">snake oil</a>!Â  As often happens, the discussionÂ was met with a range of responses, from keen interest &#8212; because, if true (which it is!), it can revolutionize the manner in which database development is done &#8212; to <a href="http://www.sos.mo.gov/archives/history/slogan.asp">Missouri-like healthy skepticism</a>.</p>
<p>I guess I should have felt slighted, when, after my presentation a number of attendees came to the front of the room - not to discuss the topic with me, but to speak with a young woman who had volunteeredÂ the fact thatÂ she was a NetApp customer, and in fact had experienced some of these effects in her organization.</p>
<p>But I&#8217;veÂ experience this reaction before - and fully expect that I&#8217;ll continue to experience it in the future.</p>
<p>After all, it really doesÂ seem too good to be true &#8230;</p>
<p>References</p>
<p>[1] <a href="http://www.netapp.com/library/tr/3347.pdf">&#8220;A Thorough Introduction to FlexClone(tm) Volumes&#8221;, NetApp TR-3347</a>.</p>
<p>[2] <a href="http://www.netapp.com/library/tr/3355.pdf">&#8220;Oracle 9i for UNIX: Cloning Database Using NetApp Filer Flexible Volumes in NAS and SAN Environments&#8221;, NetApp TR-3355</a>.</p>
<p>[3] <a href="http://www.netapp.com/library/tr/3460.pdf">&#8220;DB2: Cloning a Database using NetApp FlexClone(tm) Technology&#8221;, NetApp TR-3460</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=18</wfw:commentRss>
		</item>
		<item>
		<title>Throwing Down the Gauntlet</title>
		<link>http://gridguy.net/?p=17</link>
		<comments>http://gridguy.net/?p=17#comments</comments>
		<pubDate>Thu, 26 Oct 2006 02:45:47 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Database]]></category>

		<category><![CDATA[Musings]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=17</guid>
		<description><![CDATA[Not sure how many of you saw it, but Larry Ellison threw down the gauntlet today (at least in this bloggers opinion) at Oracle Open World, announcing that Oracle would maintain and support (it&#8217;s own?) RedHat-basedÂ Linux distribution, Unbreakable Linux.
Reminds me of an old cartoon &#8230;

Pinky: &#8220;Gee, Brain, what do you want to do tonight?&#8221;
Brain: &#8220;The [...]]]></description>
			<content:encoded><![CDATA[<p>Not sure how many of you saw it, but <a href="http://www.oracle.com/pls/ebn/live_viewer.main?p_shows_id=4954117&#038;p_referred=undefined">Larry Ellison threw down the gauntlet</a> today (at least in this bloggers opinion) at <a href="http://www.oracle.com/openworld">Oracle Open World</a>, announcing that Oracle would maintain and support (it&#8217;s own?) RedHat-basedÂ Linux distribution, <a href="http://www.oracle.com/linux">Unbreakable Linux</a>.</p>
<p>Reminds me of an old cartoon &#8230;</p>
<p><span id="more-17"></span></p>
<blockquote><p>Pinky: &#8220;Gee, Brain, what do you want to do tonight?&#8221;</p>
<p>Brain: &#8220;The same thing we do every night, Pinky, try to take over the world!&#8221; [1]</p></blockquote>
<p>The plan includes:</p>
<ul>
<li>a sweetheart deal to get RedHat subscribers to switch to an Oracle Unbreakable Linux subscription</li>
<li>the promise of lower longterm maintenance costs, and</li>
<li>indemnification from <a href="http://www.sco.com/">SCO</a>, who has devolved a once innovativeÂ business into one of the <a href="http://www.groklaw.net/staticpages/index.php?page=20051113153750111">most despicable business models I&#8217;ve ever seen</a> (sue, sue, sue).</li>
</ul>
<p>Larry claims that Oracle&#8217;s initiative is an effort to accelerate the robustness of Linux for enterprise data center applications.Â  Looks like investors have a slightly different take &#8230; <a href="http://finance.yahoo.com/q?s=rhat">RHAT</a> is down more than 16% in after hours trading.</p>
<p>So the Oracle vs. Microsoft title bout looks to be shaping up nicely - with both parties sporting a full software stack - OS, storage, database, mail, apps, &#8230;Â  Wonder who they&#8217;re gonna get from the DOJ to officiate?</p>
<p>Oh well, maybe Larry will follow <a href="http://www.gatesfoundation.org/default.htm">Bill</a> and <a href="http://www.theregister.co.uk/2006/06/26/buffet_gates_charity/">Warren&#8217;s</a>Â lead and decide that his continued good fortune should beÂ used to help address some of the world&#8217;s ills.</p>
<p>References</p>
<p>[1] <a href="http://youtube.com/watch?v=MpZsbZ5sqyw">Pinky and the Brain</a>, YouTube.Â  Or try theÂ <a href="http://youtube.com/watch?v=Oqt-5eUoPxw">politically slanted alternative version</a> if you&#8217;re so inclined.</p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=17</wfw:commentRss>
		</item>
		<item>
		<title>Drunken what?</title>
		<link>http://gridguy.net/?p=16</link>
		<comments>http://gridguy.net/?p=16#comments</comments>
		<pubDate>Wed, 25 Oct 2006 14:51:02 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Storage]]></category>

		<category><![CDATA[Tech Computing]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=16</guid>
		<description><![CDATA[I knew it couldn&#8217;t possibly end there &#8230;
I recently sent Jon ToigoÂ (DrunkenData) a formal response to questions he and some of his readers had about Data ONTAP GX, NetApp&#8217;s scale-out storage architecture.
In a followon comment, one of Jon Toigo&#8217;s readers, who failed to indicate his affiliation, but whose name links toÂ LeftHand Networks :-), wrote:
&#8220;Itâ€™s nice [...]]]></description>
			<content:encoded><![CDATA[<p>I knew it couldn&#8217;t possibly end there &#8230;</p>
<p>I recently sent <a href="http://www.drunkendata.com">Jon ToigoÂ (DrunkenData)</a> a <a href="http://drunkendata.com/?p=622">formal response</a> to questions he and some of his readers had about <a href="http://www.netapp.com/products/software/ontap-gx.html">Data ONTAP GX</a>, NetApp&#8217;s scale-out storage architecture.</p>
<p>In a followon comment, one of Jon Toigo&#8217;s readers, who failed to indicate his affiliation, but whose name links toÂ LeftHand Networks :-), wrote:</p>
<blockquote><p><em>&#8220;Itâ€™s nice to see NetApp being open about its GX architecture. This being said, the architectural deficiencies were clearly spun in a positive light with marketing hype.&#8221;</em></p></blockquote>
<p>So, in the interest of educating that readerÂ (and perhaps some other readers) about <a href="http://gridguy.net/?p=11">scale-out storage andÂ the scale-outÂ computing revolution</a>, I thought I&#8217;d take the opportunity here to share some of my <a href="http://gridguy.net/?page_id=2">experience</a> and personal observations.<span id="more-16"></span></p>
<p>And &#8230; sorry to disappoint,Â but no &#8220;spin&#8221; here.</p>
<p>In that post, Jon&#8217;s reader missed two key points, imho.</p>
<p>First, the *primary* purpose of almost all scale-out storage architectures is to provide scalable AGGREGATE performance for a large number of clients - not to speed single stream performance to a single host.Â  This is true both in technical computing apps and in largescale enterprise deployments.</p>
<p>In fact, most hosts in cluster computing configurations (or enterprise server environments) haveÂ only a single GbE (or 2Gb FC) into the client fabric - so you&#8217;ll never get more than a wire&#8217;s worth of bandwidth (90 or 100 MB/s for GbE) to a single host anyway.</p>
<p>For higher performance interconnects - including 10GbE and IB, it *is* possible for single hosts to obtain higher throughputs, but you still won&#8217;t exceed the single wire (or single &#8220;logical wire&#8221; if you&#8217;re using and can leverage link agg) performance from host to storage subsystem.Â  So the fact that a host goes through an N-blade in the GX architecture to get the data is irrelevant as long as the N-blade can deliver the single session bandwidth expected by the host - which, of course, it can.</p>
<p>That having been said, technical computing apps requiring high aggregate I/O for &#8220;single problem&#8221; scenarios do exist.Â  But those applications are structured in one of two ways to leverage parallel computing - either as a set of &#8220;embarrassingly parallel&#8221; processes, or as part of a large parallel job (using, e.g. MPI).</p>
<p>In the former case, this is simply an aggregate I/O problem with lots of hosts accessing lots of files on the shared storage system (perhaps in the same directory, perhaps in multiple directories).Â  In the latter case, the individual &#8220;workers&#8221; in an MPI communication group (node/process set) will EACH perform their own I/O - either accessing individual files or accessing (typically) disjoint portions of large files (perhaps through a parallel I/O layer such as MPI-IO).Â  For either case, GX does just as it is supposed to - it spreads the accesses across *BOTH* the N-blades and the D-blades to provide scalable I/O.</p>
<p>This is basically the manner in which *all* parallel / clustered file systems are employed for scale-out computing applications.Â  There are URLs to many of the companies developing such systems in my aforementioned post on <a href="http://gridguy.net/?p=11">The Scale-out Evolution</a>.</p>
<p>As I indicated in my response to Jon, pNFS is an NFSv4 proposed extension that will effectively allow you to take the VLDB functionality &#8220;out-of-band&#8221;, so that a pNFS-savvy client can get a &#8220;map&#8221; of the locations of all of the file segments through the metadata server (at time of open or in response to file extend callbacks), then go directly to those segments for its &#8220;data access&#8221; commands (read, write) - in essence, eliminate the &#8220;hop&#8221; that Jon initially asked about.</p>
<p>There are other performance/efficiency options that can be effected in such architectures as well - both in an out-of-band metadata scenario AND in the current N-Blade implementation.Â  These have to do with how those components may choose to do read-ahead across distributed segments of a large shared file - effectively issuing requests for multiple portions of the file at the same time to &#8220;fill&#8221; a larger pipe back to the requestor.Â  With a pNFS client, the multiple segment read-ahead could be effected by the host.Â  With the current GX implementation, by the N-Blade.</p>
<p>Bottom line &#8230; yes, it&#8217;s true that a single host&#8217;s I/O will be limited by the *smaller* of: (a) it&#8217;s network ingress capability (typically 1 GbE); (b) the network egress capability of the N-Blade to which the host session (mount) is connected.Â  Not a problem for most apps I&#8217;ve ever seen given the hardware on which GX currently runs.</p>
<p>The reader&#8217;s second pointÂ has to do with the functionality that is part of the current <a href="http://www.netapp.com/products/software/ontap-gx.html">ONTAP GX</a> release.Â  The initial release of ONTAP GX, as I had indicated, is targeted at technical computing applications, for which the set ofÂ featuresÂ currently provided are critical.</p>
<p>Technical computing applications work almost exclusively against files and filesystems.Â  What I think Jon&#8217;sÂ reader is missingÂ is the difference between a block storage platform (in which you can addÂ bricks and controllers under a single management domain and provide striped LUNs - stuff many of the vendors&#8217; volume managers have been doing for years) and a scale-out <strong>FILESYSTEM</strong>, which ONTAP GX presents.</p>
<p>As many users of parallel and clustered filesystems know, this is a very important distinction.Â  Large storage systems can effectively be crippled by a filesystem that hasn&#8217;t been adequately designed forÂ scalability.Â  The old &#8220;metadata bottleneck&#8221; in SAN filesystems.</p>
<p>ONTAP GX provides both a scale-out storage platform (distributed storage &#8220;bricks&#8221;, distributed controllers, single management domain) as well as a parallel filesystem with <strong>DISTRIBUTED</strong> <strong>metadata</strong> - alleviating file system bottlenecks that would otherwise prevent full utilization of the underlying architecture.</p>
<p>What you&#8217;re seeing at NetApp, and I believe elsewhere in the storage industry, is the gradualÂ maturation of scale-out storage technology (and particularly parallel/clustered filesystems) - across the industry, as companies look to leverage these innovations first in production technical computing applications (digital animation, EDA, bioinformatics, automotive/aerospace design, &#8230;), and subsequently as highly scalable enterprise storage systems (e.g. for enterprise grid deployments).</p>
<p>The truth of the matter, despite what some marketing departments will tell you, is that (a) building parallel/clustered filesystemsÂ is hard (especially in delivering scalable, full featured, robust, enterprise-class services); and (b) we (as an industry) are very much in the early days of the scale-out storage revolution.Â  Just ask someone who&#8217;s tried to deploy any of these systems, or do an in-depth survey of the scale-out storage solutions and ask them (and their customers) about feature sets and robustness.</p>
<p>From my (admittedly biased) perspective, the HUGE advantage NetApp has is a very successful enterprise business based on a proven scale-up approach to <a href="http://www.netapp.com/products/unified-storage.html">unified storage</a> (SAN, iSAN, NAS) withÂ Data ONTAP, and an emerging scale-out storage platform based on the Spinnaker legacy &#8212; all based on the same physical hardware components,Â  same core storage &#8220;container&#8221; architecture (WAFL), and same data management services (snapshots, mirrors, etc.).</p>
<p>Getting all of that right requires significant innovation, and takes time.Â  Hence the evolutionary, multi-release rollout of ONTAP GX.</p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=16</wfw:commentRss>
		</item>
		<item>
		<title>A Helluva Town &#8230;</title>
		<link>http://gridguy.net/?p=15</link>
		<comments>http://gridguy.net/?p=15#comments</comments>
		<pubDate>Fri, 20 Oct 2006 00:32:44 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Travel]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=15</guid>
		<description><![CDATA[New York, New York.Â  It&#8217;s aÂ helluva town.
Leonard Bernstein (or more accurately, Betty Comden and Adolph Green) had it right.
As I do from time to time, I had the chance to travel to New York on business this week.Â  And as usual, as I came through Queens and the Manhattan skyline approached, I fell into the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/New_York,_New_York_(On_The_Town)">New York, New York.Â  It&#8217;s aÂ helluva town.</a></p>
<p>Leonard Bernstein (or more accurately, Betty Comden and Adolph Green) had it right.</p>
<p><span id="more-15"></span>As I do from time to time, I had the chance to travel to New York on business this week.Â  And as usual, as I came through Queens and the Manhattan skyline approached, I fell into the magical grip of the city.</p>
<p>Circumstances this trip brought me to town a day early, and afforded me the chance to explore.Â  It was a beautiful day, so I set out from my midtown hotelÂ late Sunday morning for a stroll through Central Park.</p>
<p>I&#8217;ve posted some of the <a href="http://moxons.net/moxons/Bruce/Travel%20Pics/NYC_Oct2006/index.html">pictures I took on my walk</a>.</p>
<p>One of the highlights of my afternoon came whenÂ I stumbled across the <a href="http://www.cpdsa.org/">Central Park Dance Skaters Association</a>, a group of skaters on inline and traditional roller skates all skate-dancing to the disco and funk sounds of a couple of DJs.Â  The energy was infectious, and the ebullient skaters drew quite a crowd.</p>
<p>Only in New York &#8230; you gotta love this town!</p>
<p><a href="http://www.cpdsa.org/" /></p>
<p>Â </p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=15</wfw:commentRss>
		</item>
		<item>
		<title>DRM Comes of Age</title>
		<link>http://gridguy.net/?p=10</link>
		<comments>http://gridguy.net/?p=10#comments</comments>
		<pubDate>Mon, 02 Oct 2006 01:27:52 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Grid Defs and Refs]]></category>

		<category><![CDATA[Tech Computing]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=10</guid>
		<description><![CDATA[Distributed Resource Management (DRM) solutions had their origin as simple batch schedulers capable of dispatching jobs to â€œfreeâ€ cpu resources for technical computing applications.Â  Over the years, these systems have evolved to provide a wide range of job management services that include sophisticated rule-based dispatching mechanisms, environment replication, preemption, checkpoint/restart, and reporting.Â  Now, as some [...]]]></description>
			<content:encoded><![CDATA[<p>Distributed Resource Management (DRM) solutions had their origin as simple batch schedulers capable of dispatching jobs to â€œfreeâ€ cpu resources for technical computing applications.Â  Over the years, these systems have evolved to provide a wide range of job management services that include sophisticated rule-based dispatching mechanisms, environment replication, preemption, checkpoint/restart, and reporting.Â  Now, as some of these technical computing approaches find their way into the core of the enterprise, DRM is establishing itself as a key technology to enable Enterprise Grid computing models.</p>
<p><span id="more-10"></span>Technical Computing Roots [1]</p>
<p>The rapid adoption of commodity-based servers and the Beowulf distributed computing architecture has led to large compute farms in many of todayâ€™s production technical computing applications â€“ including electronic design automation, digital animation, seismic processing, bioinformatics, and complex simulation.Â  Facilities with 1,000 or more hosts are not uncommon.Â Â </p>
<p>As these compute farms have continued to grow, the cpu bottleneck that once limited system throughput, was replaced by other resource constraints that reduced utilization.Â  Software licenses, for example, became the limiting factor in many EDA implementations in the early part of this decade.Â  As a result, license-based scheduling criteria became a key part of many DRM systems.</p>
<p>As compute servers become more capable (64-bit processing, large memory bandwidth, GbE- or IB-based connectivity), and as networks expand to deliver increasing bandwidth, many organizations are encountering throughput limiting bottlenecks in other areas of the Grid Computing â€œstackâ€ (see Figure).</p>
<p><img title="Grid Computing Stack" style="width: 530px; height: 352px" alt="Grid Computing Stack" src="http://gridguy.net/images/DRMComesofAge_pic1.jpg" /></p>
<p>One of the most frequently cited bottlenecks is networked storage, which now has to service concurrent requests from hundreds or thousands of data-hungry compute servers.Â  <a href="http://gridguy.net/?p=11"><font color="#800080">Scalable parallel file systems</font></a> are being deployed to satiate some of the increasing I/O demand.Â  However, even with these systems in place, I/O bottlenecks that reduce system throughput often occur.</p>
<p>To handle the increasing pressures being placed on networked storage systems by scale-out computing architectures, some of these DRM systems are evolving to include storage monitoring and storage resource management as part of their scheduling policies.</p>
<p>Heading for the Enterprise</p>
<p>This expansion in scope is part of a generalization of the DRM model that is well aligned with some of the needs of <a href="http://gridguy.net/?p=4"><font color="#800080">enterprise (utility) computing grid environments</font></a>.Â  Utility computing relies heavily on the principle of consolidation, where physical resources are shared by a number of applications.Â  In such an environment, it is essential that system resources be effectively managed to ensure adequate application service levels.Â  â€œBindingâ€ an application to a set of resources based on the applicationâ€™s requirements, current resource utilization levels, availability of application data, and application service level requirements, is essentially the same problem that DRM systems have been solving for years.</p>
<p>As a result, we are seeing systems like <a href="http://www.platform.com/Products/Platform.Enterprise.Grid.Orchestrator/Product.Information/"><font color="#800080">Platform Computingâ€™s Enterprise Grid Orchestrator (EGO)</font></a> and <a href="http://www.qlusters.com/products"><font color="#800080">Qlustersâ€™ openQRM</font></a> move squarely into the enterprise data center management space.</p>
<p>As Enterprise Grids continue to be deployed, and service-oriented architectures (SOAs) mature, look to DRM-derived solutions to form the foundation of enterprise application management suites.</p>
<p>References</p>
<p>[1] Drawn from â€œStorage-aware Job Schedulingâ€, a presentation to the <a href="http://www.platform.com/gridconference/"><font color="#800080">Platform Grid Conference 2006</font></a>, San Francisco, CA.</p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=10</wfw:commentRss>
		</item>
		<item>
		<title>Extinction of the Dinosaurs: The Scale-out Evolution and Implications for Future Storage Generations</title>
		<link>http://gridguy.net/?p=11</link>
		<comments>http://gridguy.net/?p=11#comments</comments>
		<pubDate>Fri, 29 Sep 2006 03:59:12 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Storage]]></category>

		<category><![CDATA[Tech Computing]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=11</guid>
		<description><![CDATA[It was only 25 years or so ago that Dinosaurs ruled the earth.
No, really.Â  Computing Dinosaurs, that is.
Back in the early 1980s, most technical computing was done on &#8220;large iron&#8221; - Cray supercomputers and the like.Â  These systems took many years to design, cost many millions of dollars, required custom facilities, and employed a cadre [...]]]></description>
			<content:encoded><![CDATA[<p>It was only 25 years or so ago that Dinosaurs ruled the earth.</p>
<p align="left">No, really.Â  Computing Dinosaurs, that is.</p>
<p><span id="more-11"></span>Back in the early 1980s, most technical computing was done on &#8220;large iron&#8221; - Cray supercomputers and the like.Â  These systems took many years to design, cost many millions of dollars, required custom facilities, and employed a cadre of highly trained programmers who could extract every last drop of performance from those architectures.</p>
<p>I&#8217;ll never forget the first time I saw a Cray-2 supercomputer.Â  The Cray-2 was <a href="http://www.electronics-cooling.com/Resources/EC_Articles/MAY96/may96_04.htm">liquid cooled</a>, and you could actually see the fluorocarbon cooling liquid bubbling over the electronic circuit boards.</p>
<p>In the late 1980s and early 1990s, parallel computing was the rage, and a number of approaches were explored &#8212; a &#8220;diversity bloom&#8221;, if you will.Â Â Massively Parallel Processors based on custom bitwise processorsÂ like Thinking Machines&#8217; <a href="http://en.wikipedia.org/wiki/Connection_Machine">Connection Machine</a>Â and <a href="http://www.top500.org/orsc/1996/node15.html">MasPar&#8217;s MP-1 and MP-2</a>, those based on commodity microprocessors, such as the <a href="http://www.netlib.org/benchmark/top500/reports/report94/Architec/node20.html">Intel iPSC</a>, and the <a href="http://www.paralogos.com/DeadSuper/Misc/BBN.html">BBN Butterfly</a>, and those based on parallel/vector architecturesÂ like the <a href="http://www.paralogos.com/DeadSuper/Alliant/#FX%20Series">Alliant FX</a> series, and <a href="http://www.paralogos.com/DeadSuper/Convex/index.html">Convex Computer</a> systems all roamed the earth.</p>
<p>Towards the mid-1990s most of these architectures had died out, giving way to two predominant high performance computing architectures:</p>
<ul>
<li>Symmetric Multiprocessors (SMPs) -Â the scale-up approach</li>
<li>Beowulf clusters -Â the scale-out approach</li>
</ul>
<p>(For a more detailed treatise on scale-up and scale-out, see the case study I wrote for Microsoft a few years ago on the <a href="http://download.microsoft.com/documents/customerevidence/6286_CornellFinal.doc">Cornell Theory Center&#8217;s use of scalable computing technology</a>).</p>
<p>In the early part of this decade, many of the large SMP architecturesÂ  also fell, leaving a landscape in which Beowulf (scale-outÂ or cluster computing architectures employing commodity servers and networking) rules the land.Â  Indeed, the <a href="http://top500.org">Top500 web site</a>, which lists the 500 largest computer systems in the world, indicates that &#8220;365 systems are labeled as clusters, making this the most common architecture in the TOP500&#8243;, with Intel, AMD, and PowerPC processors at the core of nearly all of the systems in the list.</p>
<p>Today, scale-out computing architectures are clearly the dominant architecture for technical computing applications.</p>
<p>I contend that we are presently inÂ the midst of aÂ similarÂ transition in the storage industry.Â  The monolithic storage arrays of the past twenty years are giving way toÂ midrange systems, and eventually to scale-out storage architectures employing commodity-component -based &#8220;storage bricks&#8221;.</p>
<p>I believe we&#8217;re seeing now the equivalent of the parallel computing diversity bloom in the storage market &#8212; proliferation of a wide variety of scale-out storage systems and clustered file systems, including <a href="http://www.bluearc.com">BlueArc</a>, <a href="http://www-03.ibm.com/systems/clusters/software/gpfs.html">IBM&#8217;s GPFS</a>, <a href="http://www.ibrix.com">Ibrix</a>, <a href="http://www.isilon.com">Isilon</a>, <a href="http://www.lustre.org">Lustre</a>, <a href="http://www.panasas.com">Panasas</a>, <a href="http://www.polyserve.com">Polyserve</a>, <a href="http://www.sgi.com/products/storage/tech/file_systems.html">SGI&#8217;s CXFS</a>, <a href="http://www.terrascale.com">Rackable&#8217;s Terrascale</a>, and, of course,Â <a href="http://www.netapp.com/products/software/ontap-gx.html">NetApp&#8217;s ONTAP GX</a>.</p>
<p>And, as with scale-out computing architectures, which are now at the core of enterprise computing operations, I believe we will seeÂ these approaches evolve and mature - and no doubt consolidate - into the enterprise storage systems of tomorrow.</p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=11</wfw:commentRss>
		</item>
		<item>
		<title>Stockholm</title>
		<link>http://gridguy.net/?p=13</link>
		<comments>http://gridguy.net/?p=13#comments</comments>
		<pubDate>Sat, 23 Sep 2006 00:38:01 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Travel]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=13</guid>
		<description><![CDATA[I had the pleasure of visiting Stockholm, Sweden last week in conjunction with my work, presentingÂ at NetApp&#8217;s first Tech Ontap Live event.
What an incredibly beautiful city.Â  Of course it helped that the weather was simply spectacular during my visit, 70 F.
I took some photos in a morning walk around the city.
Thanks of course to my [...]]]></description>
			<content:encoded><![CDATA[<p>I had the pleasure of visiting Stockholm, Sweden last week in conjunction with my work, presentingÂ at NetApp&#8217;s first <a href="http://www.netapp.com/techontap/">Tech Ontap</a> Live event.</p>
<p>What an incredibly beautiful city.Â  Of course it helped that the weather was simply spectacular during my visit, 70 F.</p>
<p>I took some <a href="http://www.moxons.net/moxons/Bruce/Travel%20Pics/Stockholm.2006-Sep/index.html">photos</a> in a morning walk around the city.</p>
<p>Thanks of course to my NetApp hosts - you know who you are <img src='http://gridguy.net/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=13</wfw:commentRss>
		</item>
		<item>
		<title>Internet Bubble 2.0 ?</title>
		<link>http://gridguy.net/?p=12</link>
		<comments>http://gridguy.net/?p=12#comments</comments>
		<pubDate>Fri, 22 Sep 2006 15:02:31 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Digital Media]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=12</guid>
		<description><![CDATA[Every week, a new announcement in the internet and digital media space &#8230;

Google signs a $900m deal with News Corp to provide search and advertising for MySpace.com and other Fox Interactive properties
Apple announces an initiative to provide movie downloads from theÂ iTunes store
Yahoo announces an online video programming venture with Current TV
Yahoo is reportedly in &#8220;serious [...]]]></description>
			<content:encoded><![CDATA[<p>Every week, a new announcement in the internet and digital media space &#8230;</p>
<ul>
<li><a href="http://news.bbc.co.uk/2/hi/business/5254642.stm">Google signs a $900m deal with News Corp</a> to provide search and advertising for MySpace.com and other Fox Interactive properties</li>
<li><a href="http://news.zdnet.com/2100-1040_22-6114835.html?tag=nl">Apple announces an initiative to provide movie downloads from theÂ iTunes store</a></li>
<li><a href="http://www.nytimes.com/2006/09/20/business/20gore.html?ex=1316404800&#038;en=4642a6c8ba35a6c5&#038;ei=5088&#038;partner=rssnyt&#038;emc=rss">Yahoo announces an online video programming venture with Current TV</a></li>
<li><a href="http://www.foxnews.com/story/0,2933,214972,00.html">Yahoo is reportedly in &#8220;serious talks&#8221; to acquireÂ social networking site Facebook for $1B</a></li>
</ul>
<p>So, is this deja vu all over again?Â  Are we experiencing the frantic upside of a second internet bubble?Â  Will it burst too?</p>
<p><span id="more-12"></span></p>
<p>While there are perhaps similarities to the Great Internet Bubble (note I don&#8217;t say &#8220;First Great &#8230;&#8221;) in the fervor with which <a href="http://en.wikipedia.org/wiki/Web_2">web 2.0</a> applications seem to be catching the attention of the business community, I think there is one very significant difference.</p>
<p>And that is the fact that there is *real* money behind these deals - not the worthless-stock-for-worthless-stock transactions of the late &#8217;90s.Â  But real, honest-to-God money, comingÂ from very profitable media companies.</p>
<p>Will there be a shakeout once the fervor settles down?Â  Undoubtedly.Â  Will there be a wholesale bubble burst?Â  I don&#8217;t think so.</p>
<p>The internet is becoming the new entertainment medium - pulling viewers and advertising dollars from traditional broadcast and cable TV networks.Â  What we&#8217;re seeing is a transition in consumer behavior from passive viewer to active participant - in an effort to take back control of how we choose to entertain ourselves (not &#8220;be entertained&#8221;).</p>
<p>So, rather than asking when the internet bubble 2.0 will burst, maybe we should all be humming &#8220;Internet killed the &#8216;cast TV star&#8221; [1]</p>
<p>[1] shameful, unabashed reference to the <a href="http://www.youtube.com/watch?v=NwhcfqYjUww">Buggles&#8217; &#8220;Video Killed the Radio Star&#8221;</a></p>
<p>Â </p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=12</wfw:commentRss>
		</item>
		<item>
		<title>What&#8217;s a Storage Grid?</title>
		<link>http://gridguy.net/?p=6</link>
		<comments>http://gridguy.net/?p=6#comments</comments>
		<pubDate>Fri, 12 May 2006 21:10:00 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Grid Defs and Refs]]></category>

		<category><![CDATA[Storage]]></category>

		<guid isPermaLink="false">http://gridguy.net/?p=6</guid>
		<description><![CDATA[So, is there such a thing as a &#8220;Storage Grid&#8221;?Â  And, if so, what is it?
I contend thatÂ a Storage Grid is indeed a useful architectural concept.Â 
The usual definition of â€œGridâ€ is more than the â€œnetwork architectureâ€ used to connect components together.Â  â€œGridâ€ really is the concept of an infrastructure (not just wires) into which a [...]]]></description>
			<content:encoded><![CDATA[<p>So, is there such a thing as a &#8220;Storage Grid&#8221;?Â  And, if so, what is it?<span id="more-6"></span></p>
<p>I contend thatÂ a Storage Grid is indeed a useful architectural concept.Â </p>
<p>The usual definition of â€œGridâ€ is more than the â€œnetwork architectureâ€ used to connect components together.Â  â€œGridâ€ really is the concept of an infrastructure (not just wires) into which a variety of devices (appliances) can plug to provide network-based services to applications. I often use the following definitions of Grid when talking withÂ enterprise-oriented customersÂ â€¦</p>
<ul>
<li>Cross-application (enterprise-wide), holistic IT model based on complete virtualization of underlying resources (virtual everything).</li>
<li>A heterogeneous environment that supports distributed application deployment, execution, management, and monitoring</li>
</ul>
<p>In which caseÂ we can talk about the Storage Grid being a virtualization of underlying storage resources that provides a range of services to applications â€“ data storage, data management, replication, backup/recovery, compliance, â€¦ at different performance levels and price points.Â  Throw in transparent data migration across backend tiers, and you have the Information Lifecycle Management (ILM) story.</p>
<p>Â </p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=6</wfw:commentRss>
		</item>
		<item>
		<title>Grid Computing Defined</title>
		<link>http://gridguy.net/?p=4</link>
		<comments>http://gridguy.net/?p=4#comments</comments>
		<pubDate>Fri, 12 May 2006 18:02:47 +0000</pubDate>
		<dc:creator>GridGuy</dc:creator>
		
		<category><![CDATA[Grid Defs and Refs]]></category>

		<guid isPermaLink="false">http://catalystinformatics.com/blog/?p=4</guid>
		<description><![CDATA[Grid Computing is a term that has arisen in the last few years to describeÂ a number of computer architecture approaches based on some simple but powerful principles.
The initial definition, as posed in Ian Foster&#8217;s &#8220;The Anatomy of the Grid&#8221; [1], encompassed &#8220;coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations&#8221;.Â Â  As indicated in [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Grid Computing</strong> is a term that has arisen in the last few years to describeÂ a number of computer architecture approaches based on some simple but powerful principles.<span id="more-4"></span></p>
<p>The initial definition, as posed in Ian Foster&#8217;s &#8220;The Anatomy of the Grid&#8221; [1], encompassed &#8220;coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations&#8221;.Â Â  As indicated in a subsequent paper, &#8220;the key concept is the ability to negotiate resource-sharing arrangements among a set of participating parties (providers and consumers) and then to use the resulting resource pool for some purpose&#8221; [2].</p>
<p>Since then, the term has broadenedÂ toÂ refer generally to the use ofÂ shared (commodity) computer components &#8212; processing and storage &#8212; in a distributed networked architecture.Â  In essence, an architectural alternativeÂ to theÂ development of monolithic, centralized computation and storage architectures.</p>
<p>That having been said, there are at least three common uses of the term &#8220;Grid&#8221; in the present IT lexicon:</p>
<ul>
<li>
<div><strong>Technical Computing Grids</strong> employ rack-mount computer systems in scale-out configurations to bring the aggregate processing power of many CPUs to bare on problems of interest.</div>
</li>
<li>
<div align="left">(Enterprise) <strong>Utility Computing Grids</strong>Â provide an agile, on-demand model for application provisioning and migration based on sharing ofÂ common infrastructure resources implemented throughÂ commodity computer components (CPUs, networking, storage).</div>
</li>
<li>
<div><strong>DataÂ Grids</strong> provide for the distributed capture, management, and sharing of information (and sometimes instrumentation) &#8212; typically across mutliple authority domains.</div>
</li>
</ul>
<p><strong>Technical Computing Grids</strong></p>
<p>This artchitectural approach is common in High Performance Computing (HPC) applications &#8212; ones employingÂ large amounts of computing power to solve computationally intense problems.</p>
<p>Traditional HPC apps in the scientific computing space include:</p>
<ul>
<li>
<p align="left">Energy research and simulation, and high energy physics research</p>
</li>
<li>Earth, ocean, and atmospheric sciences, global change, and weather prediction</li>
<li>Complex multi-physics simulations for aerospace design</li>
<li>Seismic data analysis</li>
<li>Large scale signal and image processing applications</li>
</ul>
<p>These are the types of applications that used to run on large supercomputers, such as those developed in the early 80s by Cray and others.Â  As commodity-based cluster computing (Beowulf clusters) emerged, many of these application were re-hosed on large (1000+ node) compute clusters (often called &#8220;grids&#8221;).</p>
<p>Additionally, many new applications have arisen, in part due to the increased availability of these new commodity supercomputers.Â  These applications are increasingly at the core of business critical operations, and include:</p>
<ul>
<li>Drug discovery (computational chemistry, genomics research)</li>
<li>Circuit design simulations</li>
<li>Automobile design simulations (aerodynamics and crash analysis)</li>
<li>Risk analysis of financial portfolios</li>
<li>Digital media applications (animation and rendering)</li>
</ul>
<p><strong>Utility Computing Grids</strong></p>
<p>Utility computing is all about leveraging modular compute, network, and storage components to improve resource utilization, increase enterprise agility through rapid application provisioning (and re-provisioning), and simplify IT operations.Â  In short, creating a more nimble, more cost effective IT organization.</p>
<p>The terms Grid, Utility Computing, and On-demand computing are often used almost interchangeably to describe a wide variety of approaches thatÂ are generally aimed at these objectives.Â  These approaches are typically based on twoÂ key principles - the foundational pillars - of utility computing: Consolidation and Virtualization.</p>
<p align="left">These two principles go hand-in-hand to support hosting of multiple applications - either concurrently, or in a time-share model - on the same physical resources.Â  E.g. server virtualization, as exemplified by VMWare, Xen, and Microsoft Virtual Server supports the hosting of multiple (virtual) servers on a single physical host.Â  The physical resources of the host - cpu, memory, I/O, network connectivity - are shared amongst a number of virtual servers.Â  Each looks like a standalone server - with its own IP address(es), its own network and security settings, and its own OS and applications - but shares the underlying physical resources.Â  Server virtualization improves utilization by consolidating multiple applications onto a common physical hardware platform - eliminating capex and opex costs associated with deploying multiple physical servers.Â  This approach is particularly effective in containing the &#8220;server sprawl&#8221; that has occurred in many IT organizations where every application instance required its own server (and local storage).</p>
<p>Similarly, network virtualization strategies (including vlans) and storage virtualization strategies serve to allow applications shared use of those infrastructure resources, often employing Quality-of-service (QoS) provisions.</p>
<p>Storage virtualization strategies, in particular, aim at delivering logical storage containers (filesystems and LUNs or volumes) that transcend the physical nature of storage systems â€“ disks and controllers.Â  Together with tiered storage strategies (using different classes of storage for different types of data) and transparent data migration, they deliver a â€œstorage as servicesâ€ model where those services are provided in the storage network and not by individual physical devices or servers.Â  A realization of what many have been calling information lifecycle management (ILM).</p>
<p><strong>DataÂ Grids</strong></p>
<p>The notion of a data grid may be the closest concept to the original Grid concept developed by Foster et al. It represents a physically distributed set of information resources (services) contributed by multiple authorities under a common set of protocols. In some ways, the World Wide Web represents the first generation data grid, in which information is â€œpublishedâ€ by individual sites, indexed by crawlers, and accessed via search engines and explicitly represented links.</p>
<p>The San Diego Supercomputer Centerâ€™s Storage Resource Broker is another data grid model that presents cataloged data â€œcollectionsâ€ presentable to a community of interest [3]. SRB â€œpresents the user with a single file hierarchy for data distributed across multiple storage systems. It has features to support the management, collaboration, controlled sharing, publication, replication, transfer, and preservation of distributed data.â€</p>
<p>In many ways, these efforts represent first generation data grids â€“ with static or quasi-static data â€œpublishedâ€ into web-based documents or files, and consumed by browsers and file-savvy applications. The next generation of data â€“ or information â€“ grid is one based on Web 2.0 technologies that affords a richer model for compositing individual information sources into a rich set of distributed web services.</p>
<p align="left"><strong>References</strong></p>
<p>[1] Ian Foster, Carl Kesselman, Steven Tuecke,Â &#8221;The Anatomy of the Grid&#8221;, <a href="http://www.globus.org/alliance/publications/papers/anatomy.pdf">http://www.globus.org/alliance/publications/papers/anatomy.pdf</a></p>
<p>[2] Ian Foster, &#8220;What is the Grid?Â  A Three Point Checklist.&#8221;, <a href="http://www-fp.mcs.anl.gov/~foster/Articles/WhatIsTheGrid.pdf">http://www-fp.mcs.anl.gov/~foster/Articles/WhatIsTheGrid.pdf</a></p>
<p>[3] SDSC Storage Resource Broker, <a href="http://www.sdsc.edu/srb/index.php/Main_Page">http://www.sdsc.edu/srb/index.php/Main_Page</a></p>
]]></content:encoded>
			<wfw:commentRss>http://gridguy.net/?feed=rss2&amp;p=4</wfw:commentRss>
		</item>
	</channel>
</rss>
