My Photo

Enter your email address:

Delivered by FeedBurner

« Seesmic with Halley Suitt | Main | Internet Advertising spend sets another record »

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8341bf9da53ef00e54f9540e08834

Listed below are links to weblogs that reference Data center power outages and down time:

Comments

Brian P Halligan

What do you recommend for those of us that want exceptional availability, but don't want to invest $500m in a data center. (smile)

Bh.

Ari Newman

There are only so many ways to architect and operate a Tier1 hosting facility, and even the best of them have failures. Ironically, it's usually the fail-safe software or switching systems that fall apart when they are called upon. Even the best datacenters need to have full-system failure tests done monthly (not virtual tests of the system, real-deal "pull-the-plug" tests to ensure the systems are working.

As a SaaS site operator, the only way to control your destiny is to take matters into your own hands and run at least two sites on different networks. Using the same hosting provider is even risky because in some cases it's the network routing backbone (BGP/OSPF) that melts down).

Google and MSFT will be able to control their destiny better but will still face the same issues and need to actively test the redundancies built into the architecture. Sometimes all it takes is water in the fuel line of a diesel genny to take down an entire DC. No one is immune.

john rowell

Ari,

Good comment.

The challenge for most ISVs is that multi-site deployments are 'hard', look what happened to SalesForce when they implemented HA, and that there is little to no premium placed on redundant locations by your customers.

In other words. You can spend a significant amount of time and money (multiply by 3 if you are buying/running the gear on your own) on a multi-location deployment. Just don't expect your customers to be willing to foot the bill.

Real key here is how does the industry provide the multi-site capability to ISVs for little to no premium.

Walter Lounsbery

I think neither of the hosting services you mentioned were startups. I remember RackSpace advertising on TV more than 5 years ago, and 365 Main hosted services that predate the Web. The failure is more a statement about the reliability of statements about failure-proof reliability than the maturity of hosting companies or massive capitalization.

For comparison, the redundant and protected air traffic control system at the Dallas/Fort Worth airport (as state of the art as the FAA gets) was taken down for several hours by a maintenance technician that dropped a screwdriver in a very bad place. This happened several years ago. Houston had similar problems due to flooding of basement "protected" systems during Hurricanes Katrina and Rita.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.

Subscribe