Note: Please go to docs.rightscale.com to access the current RightScale documentation set. Also, feel free to Chat with us!
Home > Guides > RightScale 101 > Management Tools > Monitoring System

Monitoring System

screen-MonitoringOverview-v1.pngThe monitoring system is one of the advanced features of the RighScale Dashboard, which allows you to view real-time graphical data for all running instances.  The monitoring system is based on two open source projects: collectd (http://collectd.org) and RRDtool (http://rrdtool.org).

Note: The monitoring features are only available for our pay editions.  If you have a Developer account, you need to upgrade in order to unlock this feature.  Please contact us at sales@rightscale.com.

 

How does the monitoring system work?

The overall process of RightScale's monitoring system is as follows:

  • Collectd is installed on an instance at boot time using the SYS Monitoring install RightScript.
  • The website passes the hostname of the monitoring server to the instance using the EC2 launch data (ex: RS_sketchy=skethcy1-1.rightscale.com)
  • Collectd auto-detects data sources (disks, processes, etc), starts collecting data, and sends the data every 20 seconds via UDP to the specified monitoring server.
  • The monitoring server stores the data in an RRD database.
  • When you view a Monitoring tab in the Dashboard for an individual server or deployment, the request is proxied through to the monitoring server.
  • The monitoring server produces graphs using the data in the RRD database.

Can websites that scale way up view monitoring information too?

Customers whose application scales into the tens, hundreds or even thousands of instances via a scalable server array may wonder if they can still view monitoring data in the RightScale Dashboard. Furthermore, can they view aggregate data from all of the servers?  The answer is yes.  However, we did have to implement a policy in order to maintain performance while user view their data. Remember that each server instance registers with the load balancers (e.g. HAProxy) as the site scales so that the workload can be evenly balanced between application servers in the array.  As an example, you could have 1000 instances registered with two front-end servers running load balancing software.  If each registered server instance (1000 in our example) had its own individual set of graphs as well as contributed to an aggregate graph, the amount of data would be overwhelming and take too long to retrieve and graph.  Hence, the concept of active and inactive Servers with respect to Monitoring had to be defined.  Servers are considered inactive if they have not sent data for a period of one day (or more).  The most important points with respect to viewing monitoring data of sites with many servers are:

  • Cumulative graphs show the total activity of all servers but provide detailed information only for the active servers (inactive server information is not available)
  • Active server graphing data is always available (either in thumbnail graphs or by providing a link to produce the graph on demand)
You must to post a comment.
Last modified
13:15, 15 Jul 2013

Tags

Classifications

This page has no classifications.

Announcements

None


© 2006-2014 RightScale, Inc. All rights reserved.
RightScale is a registered trademark of RightScale, Inc. All other products and services may be trademarks or servicemarks of their respective owners.