[go: up one dir, main page]

Page MenuHomePhabricator

Export useful metrics from haproxy logs for Thumbor
Closed, ResolvedPublic

Description

The prometheus-haproxy-exporter exports avg response time per 1024 successful connections, which is not very helpful. We need to export more meaningful metrics using haproxy's logs.

Related Objects

View Standalone Graph
This task is connected to more than 200 other tasks. Only direct parents and subtasks are shown here. Use View Standalone Graph to show more of the graph.

Event Timeline

Gilles triaged this task as Medium priority.

Change 502967 had a related patch set uploaded (by Gilles; owner: Gilles):
[mediawiki/vagrant@master] Expose haproxy total request time via mtail

https://gerrit.wikimedia.org/r/502967

Change 502967 merged by jenkins-bot:
[mediawiki/vagrant@master] Expose haproxy total request time via mtail

https://gerrit.wikimedia.org/r/502967

Change 502972 had a related patch set uploaded (by Gilles; owner: Gilles):
[operations/puppet@production] Expose haproxy total request time via mtail

https://gerrit.wikimedia.org/r/502972

Change 502972 merged by Effie Mouzeli:
[operations/puppet@production] haproxy: improve metrics (via mtail) and logging

https://gerrit.wikimedia.org/r/502972

Change 504284 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] thumbor: enable haproxy mtail metrics

https://gerrit.wikimedia.org/r/504284

Change 504323 had a related patch set uploaded (by Gilles; owner: Gilles):
[operations/puppet@production] Add tests for haproxy mtail program

https://gerrit.wikimedia.org/r/504323

Change 504284 merged by Effie Mouzeli:
[operations/puppet@production] thumbor: enable haproxy mtail metrics

https://gerrit.wikimedia.org/r/504284

Change 504323 merged by Effie Mouzeli:
[operations/puppet@production] Add tests for haproxy mtail program

https://gerrit.wikimedia.org/r/504323

Change 504335 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] thumbor: Fix mtail group and log path

https://gerrit.wikimedia.org/r/504335

Change 504335 merged by Effie Mouzeli:
[operations/puppet@production] thumbor: Fix mtail group and log path

https://gerrit.wikimedia.org/r/504335

I'm not seeing the metrics show up in the "eqiad/prometheus" ops datasource in Grafana. I'm not sure how prometheus is supposed to be configured to collect the data from the thumbor hosts, though.

Change 504924 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] prometheus: Fix haproxy mtail stats for thumbor

https://gerrit.wikimedia.org/r/504924

Change 504924 merged by Effie Mouzeli:
[operations/puppet@production] prometheus: Fix haproxy mtail stats for thumbor

https://gerrit.wikimedia.org/r/504924

Change 504978 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] thumbor: Inlcude mtail in ferm configuration

https://gerrit.wikimedia.org/r/504978

Change 504978 merged by Effie Mouzeli:
[operations/puppet@production] thumbor: Inlcude mtail in ferm configuration

https://gerrit.wikimedia.org/r/504978

@Gilles This is fixed now, I will though revert back to nginx for the weekend. We do have data we can work with from today.

I've added the relevant panels to your dashboard, mirroring the data we were tracking for nginx:

Capture d'écran 2019-04-22 17.11.02.png (1×3 px, 472 KB)

@Gilles thank you! I added the relevant codfw ones