A while ago I wrote a blog post about time series database requirements that has been amazingly popular. Somewhere close to a dozen companies have told me they’ve built custom in-house time series databases, and that blog post was the first draft of a design document for it.
One of the things I said in the post was that I had no use for the “tagging” functionality I’ve seen in time series databases such as OpenTSDB. I’ve since reconsidered, although I think the functionality I now want is a bit different.
What does it mean to “tag” metrics? Typically, many time series databases let you add name=value pairs (tags) to individual time series points (observations). For example, you measure CPU usage to be 59% at 3:41PM on host inst413, and you tag this measurement as “shard=81” because inst413 holds data for shard 81. …
[Read more]