What happens when Amazon or RightScale have outages?

Background Information

What exactly happens when the RightScale Dashboard is unavailable or there is a major outage in a cloud infrastructure like EC2? Do your servers disappear? Can you still access your servers? Is all of your data lost forever? How can you plan for such outages and keep them from impacting your site?

 


Answer

The breakdown below covers the impact of each type of outage and the actions to take in each case.

RightScale Platform: Planned Downtime/Maintenance

Periodically, the RightScale Dashboard will be unavailable for brief periods in order to perform regular maintenance and site upgrades. For more detail, see the FAQ "What are the effects of RightScale Planned Maintenance?"

How will it affect my servers?
Existing servers will continue to run and operate as intended. ServerTemplates cannot be used to launch new servers, and autoscaling will not work within your environment during this window.

How will it affect the RightScale platform?
Maintenance windows occur once every 4 to 6 weeks and are only necessary when rolling out new features to the platform. Customers receive 7 days' notice via email, and an email reminder is sent the day before. Alerts are also displayed in the Dashboard upon login. During the window, a maintenance page is displayed that prevents access to the Dashboard. These windows are scheduled and usually last between 10 and 60 minutes (the email notification describes "up to 1 hour"). The platform is restored as soon as the updates are complete.

What actions must be taken?
Direct connection to your servers via SSH remains possible for administration of existing servers. Scheduling any needed scaling ahead of the window may be advisable.
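Because autoscaling is unavailable during a maintenance window, one way to approach "scaling ahead of the window" is to size the deployment for the expected peak before the window opens. The following is a minimal sketch, not a RightScale feature: the function names, the 25% headroom, and the 30-minute lead time are all assumptions chosen for illustration.

```python
import math
from datetime import datetime, timedelta


def prescale_target(expected_peak_rps: float, rps_per_server: float,
                    headroom: float = 0.25) -> int:
    """Servers needed to carry the expected peak, plus headroom, with no autoscaling.

    All three inputs are assumptions you would take from your own traffic data.
    """
    if rps_per_server <= 0:
        raise ValueError("rps_per_server must be positive")
    return math.ceil(expected_peak_rps * (1 + headroom) / rps_per_server)


def prescale_deadline(window_start: datetime,
                      lead: timedelta = timedelta(minutes=30)) -> datetime:
    """Latest time to finish launching extra servers before the window opens."""
    return window_start - lead
```

For example, an expected peak of 900 requests per second on servers that each handle 100 would call for 12 servers with the default headroom.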

 

RightScale Platform: Unplanned Downtime/Outage

How will it affect my servers?
Existing servers will continue to run and operate as intended. Uptime of the platform is not linked to the uptime of your servers. Access to monitoring and other features of the RightScale Dashboard would be impeded. You may not be able to launch additional servers from ServerTemplates, and autoscaling or other automated actions based on alerts/escalations may not work within your environment.

How will it affect the RightScale platform?
An unplanned outage would most likely manifest as transient unavailability (503 errors, etc.) or general slowness of the site. RightScale is usually able to maintain the availability of most services for the duration of the downtime. In the worst case, a maintenance page would be posted that prevents access to the Dashboard. A breakdown of individual RightScale components will follow.

What actions must be taken?
Direct connection to your servers via SSH remains possible for administration of existing servers. To report an outage, we offer a 24/7 Emergency Line for RightScale in addition to our own proactive monitoring and remediation. RightScale Services would proactively convey the scope of the problem to you and provide a best estimate of the time to full recovery of the platform.
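Since direct SSH access does not depend on the Dashboard, a quick way to confirm that your servers are still reachable during a platform outage is to test whether their SSH port accepts connections. The following is a minimal sketch using only the Python standard library; the function name and the default timeout are assumptions, and the check only confirms that the port is open, not that a login would succeed.

```python
import socket


def ssh_reachable(host: str, port: int = 22, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to the host's SSH port can be opened."""
    try:
        # create_connection resolves the host and applies the timeout to connect()
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures
        return False
```

Running this against each server's address from a machine outside the affected platform gives a rough picture of which servers can still be administered by hand.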

 

Cloud Infrastructure: Server/Instance Failure

How will it affect my servers?
One component of your application would fail, depending on which server is affected. However, a properly designed software architecture plans for the failure of a component.

How will it affect the RightScale platform?
The platform would not be affected.

What actions must be taken?
Launch a new server of the same type from your ServerTemplate, or set up automated relaunch within RightScale by monitoring for server failure and configuring alerts/escalations for notification and remediation. RightScale recommends redundancy as a best practice and strongly encourages it in the documentation.
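The logic behind automated relaunch can be illustrated independently of any particular tool: count consecutive failed health checks per server, and trigger a relaunch once a threshold is crossed so that a single missed check does not replace a healthy server. The following is a sketch of that idea only. `RelaunchPolicy`, `watch_once`, and the `is_healthy` and `relaunch` callbacks are hypothetical names, not part of RightScale's alert/escalation system.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class RelaunchPolicy:
    """Relaunch a server after `threshold` consecutive failed health checks."""
    threshold: int = 3
    failures: Dict[str, int] = field(default_factory=dict)

    def record(self, server: str, healthy: bool) -> bool:
        """Record one check result; return True when a relaunch is due."""
        if healthy:
            self.failures[server] = 0
            return False
        self.failures[server] = self.failures.get(server, 0) + 1
        if self.failures[server] >= self.threshold:
            self.failures[server] = 0  # reset so the replacement starts clean
            return True
        return False


def watch_once(servers: Iterable[str],
               is_healthy: Callable[[str], bool],
               relaunch: Callable[[str], None],
               policy: RelaunchPolicy) -> List[str]:
    """Run one monitoring pass; relaunch servers that crossed the threshold."""
    relaunched = []
    for name in servers:
        if policy.record(name, is_healthy(name)):
            relaunch(name)  # hypothetical hook: launch a replacement server
            relaunched.append(name)
    return relaunched
```

In practice `watch_once` would be called on a fixed interval, with `is_healthy` wrapping whatever check suits the component (an HTTP probe, a port check, a database ping).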

 

Cloud Infrastructure: Availability Zone Failure (Amazon)

How will it affect my servers?
Components of the application would fail, depending on the zone in which they were deployed. Servers could not be relaunched in that zone for an indefinite period.

How will it affect the RightScale platform?
Platform performance may be affected as RightScale responds to the zone failure. We architect our application across multiple zones, including redundant application and database servers in different datacenters.

What actions must be taken?
Relaunch your servers in an alternate availability zone. Depending on your zone launch settings, this can be automated in the Dashboard. RightScale recommends architecting across multiple zones for high availability.
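When relaunching into alternate zones by hand or by script, it helps to spread the displaced servers across the remaining zones rather than piling them into one. The following is a minimal sketch of such a plan, assuming you track which zone each server runs in. The function name is hypothetical, the zone names in the usage note are illustrative, and the sketch only computes a plan; it does not launch anything.

```python
from typing import Dict, Iterable


def plan_zone_failover(placement: Dict[str, str],
                       zones: Iterable[str],
                       failed_zone: str) -> Dict[str, str]:
    """Map each server in the failed zone to the least-loaded healthy zone."""
    healthy = [z for z in zones if z != failed_zone]
    if not healthy:
        raise ValueError("no healthy zone left to relaunch into")

    # Count the servers already running in each healthy zone
    load = {z: 0 for z in healthy}
    for zone in placement.values():
        if zone in load:
            load[zone] += 1

    plan = {}
    for server in sorted(s for s, z in placement.items() if z == failed_zone):
        # Ties are broken by zone name so the plan is deterministic
        target = min(healthy, key=lambda z: (load[z], z))
        plan[server] = target
        load[target] += 1
    return plan
```

For example, with `app1` and `db1` in a failed zone `us-east-1a` and one server each in `us-east-1b` and `us-east-1c`, the plan sends `app1` to `us-east-1b` and `db1` to `us-east-1c`.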

 

Cloud Infrastructure: Geographic Region Failure (Amazon)

How will it affect my servers?
Components of the application would fail, depending on the geographic region in which they are deployed. Servers could not be relaunched in that geographic region for an indefinite period.

How will it affect the RightScale platform?
RightScale runs across multiple geographic regions. Currently, most core RightScale services run in the US-East region. However, RightScale is architected to recover these services in an alternate zone. Some downtime may occur during the transition, but it would be limited.

What actions must be taken?
Relaunch your servers in an alternate geographic region. ServerTemplates are, by default, able to launch in multiple geographic regions.

 

Cloud Infrastructure: Total Service Failure -- All Zones and All Regions (Amazon)

How will it affect my servers?
All of your servers would be unavailable.

How will it affect the RightScale platform?
RightScale would be unavailable. RightScale has a footprint outside of the cloud called "meta" that is used to restore services and customer accounts should total failure occur. In the future, RightScale will be able to transition to an alternate cloud provider in the case of total service failure. These services are not yet production ready.

What actions must be taken?
Servers must be relaunched once the cloud and the RightScale platform have recovered. This scenario is highly unlikely because geographic regions do not share any common network infrastructure. In the future, you will be able to architect for, or transition to, other cloud providers. These services are not yet production ready.
Last Modified
21:26, 16 May 2013

© 2006-2014 RightScale, Inc. All rights reserved.
RightScale is a registered trademark of RightScale, Inc. All other products and services may be trademarks or servicemarks of their respective owners.