[Discuss] Monitoring your AWS instances

Edward Ned Harvey (blu) blu at nedharvey.com
Sun Sep 28 11:05:42 EDT 2014


None of our systems are scheduled for reboot, *and* the machines triggering our alerts have not been rebooting - they just become unavailable on the network for a few minutes and then reappear, without any sort of crash or reboot or anything affecting "uptime."  *And* this has been a general issue progressively getting worse and worse over the last several months (but especially bad lately.)

For the sake of discussion here, I just pulled reports on the numbers of alerts isolated to Amazon:

2013 Aug: 4
2013 Sep: 0
2013 Oct: 1
2013 Nov: 0
2013 Dec: 0
2014 Jan: 6
2014 Feb: 0
2014 Mar: 0
2014 Apr: 0
2014 May: 0
2014 Jun: 0
2014 Jul: 6
2014 Aug: 8
2014 Sep: from 9/1 to 9/12:         8
2014 Sep: weekend of 9/13 & 9/14:   58
2014 Sep: from 9/15 to 9/19:        18
2014 Sep: weekend of 9/20 & 9/21:   8
2014 Sep: from 9/22 to 9/26:        24
2014 Sep: weekend of 9/27 & 9/28:   70 (so far)
2014 Sep: 186 (so far)



More information about the Discuss mailing list