Story Detail of id 48279134 | Liveview Hacker News

awithrow10 hours ago | on: Incident with Actions and Pages

that is absolutely not the case for any system of size and scale. that would just burn out the on-call team and not result in improvements. Error rates/budgets are used instead.

hnlmorg9 hours ago | parent

It depends what you're monitoring. If it's response codes from user generated queries, then I'd agree with you.

But if it is synthetic queries sent from the monitoring platform, then you control the user agent, payload, and endpoints. So any failed requests are a symptom of a misconfiguration and/or failure that should be investigated. Albeit not necessarily as a P1 priority.

#visit	13,397,084
#session	74,665
#live-session	0