I generate this report daily using cron as to top exceptions across all datacentres and top queries across all datacentre and top urls across all datacentres and send them via email.
Problem is that after every release the no goes up and down as due to some bug a new exception will popup or some exception will resurrect. How do I trend and correlate these dynamic counters.
Solution came from my colleague in just an informal chat and he recommended I should md5-hash the url and create a graphite counter for it and in the email just make the count a hyperlink like shown below.
Now I can trend the query as clicking on this shows me a graph as shown below. My next plan is to inline the graph for top 10 urls in the email itself so I don't even need to click them.
Problem is that after every release the no goes up and down as due to some bug a new exception will popup or some exception will resurrect. How do I trend and correlate these dynamic counters.
Solution came from my colleague in just an informal chat and he recommended I should md5-hash the url and create a graphite counter for it and in the email just make the count a hyperlink like shown below.
Now I can trend the query as clicking on this shows me a graph as shown below. My next plan is to inline the graph for top 10 urls in the email itself so I don't even need to click them.
Comments
Post a Comment