Matt Heusser

Cloud Computing: Keeping Things Simple, Except When It Doesn't

MDMConsult
7/7/2012 7:17:59 PM
User Rank
Platinum
Re: Balance in reporting
Cloud computing providers will need to support hundreds of thousands of users and services while ensuring the highest quality. Robust and dynamic infrastructures are critical, which means transparency, scalability, monitoring/management, and security.

Matt Heusser
7/5/2012 2:34:26 PM
User Rank
Blogger
Re: Balance in reporting
"One would expect the production site to fail over to the DR site when there is an outage at the production site."

 

The point I was trying to get at with the cloud was that, if something fails, that's not supposed to be my problem. We are supposed to have abstracted away the whole concept ... and we're not there yet. Perhaps my standards are too high, but I seem to recall that was a great deal of the rhetoric that got this whole cloud thing started, no? :-)


Matt Heusser
7/5/2012 2:33:11 PM
User Rank
Blogger
Re: Balance in reporting
Good point, Rich. The EC2 SLA is 99.95% ( http://aws.amazon.com/ec2-sla/ ), though some of the coverage is reporting much longer outages; I published the ones that were best confirmed. There was a similar outage on June 18th ( http://www.zdnet.com/blog/btl/amazon-web-services-suffers-partial-outage/79981 ) and, if memory serves, on June 6th.

Last night, our power grid went out in my tiny town in West Michigan, and it does go down for a few hours a year. Perhaps the cloud is a grid, and my problem is one of expectations. :-)


Rich Bruklis
7/4/2012 11:45:19 AM
User Rank
Blogger
Re: Balance in reporting
Amazon offers 99.95% uptime, which works out to about 4 hours and 22 minutes of downtime per year. It seems to me that they go down twice a year for about 2 hours each time, so I think they know what their risks and recovery are.
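
For anyone who wants to check that figure, here is a minimal sketch of the arithmetic. It assumes a 365-day year and that the uptime percentage is measured over the whole year; the actual SLA terms may define the measurement period differently. The function name is made up for illustration.

```python
def allowed_downtime_minutes(uptime_percent: float, days: float = 365.0) -> float:
    """Minutes of downtime permitted over `days` at the given uptime percentage.

    Assumption: the percentage applies to the whole period, around the clock.
    """
    downtime_fraction = 1.0 - uptime_percent / 100.0
    return downtime_fraction * days * 24 * 60


if __name__ == "__main__":
    minutes = allowed_downtime_minutes(99.95)
    hours, remainder = divmod(minutes, 60)
    # Roughly 4 hours plus a bit under 23 minutes for a 365-day year.
    print(f"{int(hours)} hours and {remainder:.1f} minutes per year")
```

The same function makes it easy to compare other uptime levels, for example `allowed_downtime_minutes(99.9)` versus `allowed_downtime_minutes(99.99)`.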

I'd bet their data center is about 10 times better than most companies' data centers when it comes to security, efficiency, scalability, and uptime.

I think these large, publicly traded cloud companies (Amazon, Verizon/Terremark, Rackspace, etc.) are like the jury system in the US: it isn't perfect, but it's the best there is.

Dr.T
7/4/2012 11:25:56 AM
User Rank
Platinum
Re: Balance in reporting

Thanks for the update, Matt. What is important is the architecture of the environment from the redundancy and DR points of view. One would expect the production site to fail over to the DR site when there is an outage at the production site. That should happen regardless of the nature of the outage; the site would not be considered a DR site if a natural disaster could impact both the production and DR sites.

Veretax
7/3/2012 1:26:06 PM
User Rank
Steel
The Cloud isn't the only risk.
In an era of ever-increasing severe, and at times unpredictable, weather, I find the story of Amazon's EC2 services to be just one of many. Look at the recent storm, which affected several million citizens in just one night, from Indiana all the way through Ohio, West Virginia, Virginia, Maryland, DC, etc. It isn't just cloud-hosted solutions that are vulnerable to outages of this type. Non-cloud solutions are no safer at times.

A great example is Point of Sale systems, such as those used by so many grocers, retailers, gas stations, convenience stores, restaurants, etc. When the power goes out at many of these locations, the expectation is that it will quickly come back on. Yet as a West Virginian looking at the news, I cannot help but notice major chains like Kroger, Walmart, Food Lion, Foodland, etc., which serve a large area and, after more than two days with little if any power, have had to dispose of millions of dollars of perishable products: everything from meats, cheeses, refrigerated juices, prepared meals, deli meats and side items to highly time-sensitive produce.

 

It saddens me that, in the effort to fine-tune profit lines, companies like these, which provide vital services, are caught just like the local populace without any type of failover plan. At some stores, even things that could be useful and necessary in the short term, like candles, matches, propane, grills, charcoal, and bottled water, remained on shelves because the stores had no failover plan for handling inventory when the computers and power were down. Retailers of all sorts turned away people without cash, because credit card and even check-verifying machines were unable to connect due to lack of power or downed telephone lines, which left many desperate consumers stranded. Even when people had cash, though, many stores had insufficient cash reserves, or could not process and sell merchandise without the scanner-based UPC lookup machines that so many retailers depend upon.

 

What's worse? People looked far outside their normal shopping zones to shop at other stores of the chain, or even at competitors, who no doubt raked in cash hand over fist on simple commodities like generators, ice, bottled water, paper plates, cleaning supplies, and canned and other non-perishable items. Those retailers who had a plan positioned themselves not only to help the people seeking products, but likely to increase their profit margins in the short term, which investors in any major retailer will no doubt appreciate.

 

For me at least, this is something that all companies need to address, whether they are cloud-related or not. When I read that retailer X or Y contemplated buying a generator at some point but thought it would be used too infrequently to justify its cost, I can't help but shake my head, as more money is walking out the door into dumpsters than into their bank accounts. With freak ice storms, out-of-control wildfires, rain storms, hurricanes, tornadoes, and other such weather phenomena becoming more common, if I were a CEO I'd be rethinking my failover plans at the local level for when connectivity and power go down.

Matt Heusser
7/3/2012 12:59:09 PM
User Rank
Blogger
Balance in reporting
My friend Wayne Rash, in a recent post authored hours after mine went up, pointed out that the situation was what the law might consider an "act of God", and that Amazon did everything it could have. In his words, Amazon's Data Center was "fully redundant in itself, and served by redundant backup power and redundant power grids, redundant network access went down under the combined onslaught of massive power outages, massive Internet outages, phone line outages and cell system outages. Not only did everything go down, but nobody could call for backup. And, of course, even if the staff had known that this event was happening, they couldn't have traveled there anyway. Most of the roads were blocked." Wayne goes on to point out that most failover/restore systems in North America aren't nearly as well prepared as Amazon's, and, if you aren't on EC2, it may be time to look at yourself before pointing fingers.

The man has a point, and I thought it was worth a brief follow-up to mention.
