Over the last several months we’ve designed and built an events dashboard that lets you inspect very large amounts of system event data quickly (interactively). This feature was driven by customer requests and feedback. The initial proof of concept established its usefulness right away, when customers began remarking that they’d diagnosed server issues by noticing events such as database restarts, replication failures, and configuration changes. At least one customer told us this saved them a long wild-goose chase.
The key concept is that event data is loaded fully into the browser and then you can thin-slice and drill down without reloading any events. And it’s fast. Really fast – it remains responsive even with hundreds of thousands of events. Here’s what the last 30 days of fine-grained events looks like on our own systems:
The skyline along the top shows events by count over time. The left-hand pane lets you …
[Read more]