Alertmanager will have its own set of services around it (irc/phab relays and metricsinfra-specific authorization proxies for example), it's easier than prometheus to set up in a HA configuration and the exact ways of scaling prometheus on metricsinfra are still unknown, so it makes sense (at least to me) to split alertmanager to live on a separate VM than where Prometheus itself lives.
Description
Details
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | fgiunchedi | T205862 Expand modern metrics infrastructure coverage (2018-19 Q2 goal) | |||
Resolved | colewhite | T183454 Deprovision Diamond collectors no longer in use | |||
Resolved | MoritzMuehlenhoff | T210993 Deprecate Diamond collectors in Cloud VPS | |||
Resolved | taavi | T336774 Current status of cloudmetrics and its components | |||
Resolved | taavi | T326266 Remove the WMCS statsd/Graphite service | |||
Open | dcaro | T313444 Streamline WMCS Alerting and Paging | |||
Resolved | taavi | T317032 Remove Diamond? | |||
Resolved | taavi | T264920 Grafana "cloud-vps-project-board" needs to be migrated from Graphite to Prometheus | |||
Open | None | T194333 [Epic] Provide logging/metrics/monitoring SaaS for Cloud VPS tenants | |||
Resolved | taavi | T266050 Build Prometheus service for use by all Cloud VPS projects and their instances | |||
Resolved | taavi | T286335 Split metricsinfra alertmanager to separate hosts from prometheus |
Event Timeline
Change 703708 had a related patch set uploaded (by Majavah; author: Majavah):
[operations/puppet@production] metricsinfra: Add HAProxy for distributing http traffic
Change 704522 had a related patch set uploaded (by Majavah; author: Majavah):
[operations/puppet@production] metricsinfra: Remove alertmanager apache proxy
Change 703708 merged by Bstorm:
[operations/puppet@production] metricsinfra: Add HAProxy for distributing http traffic
Change 705014 had a related patch set uploaded (by Majavah; author: Majavah):
[operations/puppet@production] metricsinfra: Add separate alertmanager support
Change 704522 merged by Bstorm:
[operations/puppet@production] metricsinfra: Remove alertmanager apache proxy
Change 705014 merged by Bstorm:
[operations/puppet@production] metricsinfra: Add separate alertmanager support
Change 705632 had a related patch set uploaded (by Majavah; author: Majavah):
[operations/puppet@production] metricsinfra: remove alertmanager from prometheus role
Change 705632 merged by Bstorm:
[operations/puppet@production] metricsinfra: remove alertmanager from prometheus role