Let's lookout for errors and errorish status codes in the logs for the maps servers.
Document what we find and see if it fits any existing task.
Outcomes
New logstash dashboard for debugging Kartotherian: https://logstash.wikimedia.org/goto/9e502017ee8d1c81b8ea94ebf5573ecd
Obvious bugs and cleanup work have been split into subtasks.
Out of roughly 1M log messages across all hosts in 1 week, these are the main categories and frequencies of noteworthy errors,
Message | Task | Impact | Share |
groupIds not available | T308223 | Missing all annotations | |
Marker symbol '%s' is invalid | T145475 (and some are just invalid) | Missing map | 7% |
Failed to parse color: "#function fill() { [native code] }" | T308560 | Missing map | 3% |
Bad geojson - unknown type ExternalData | T308223 | Missing map | 1.5% |
SPARQL query result contains non-unique ID | T308223 | No effect | 0.4% |
ETIMEDOUT | Missing map | 0.4% | |
XML document not well formed | 0.2% | ||
"ids" or "query" parameter must be given | Missing map | 0.1% | |
image created from bytes must be 2048 pixels or fewer on each side | 0.07% | ||
ESOCKETTIMEDOUT | 0.05% | ||
A few errors also show that SPARQL queries and GeoJSON are sometimes invalid.
Looking at the "groupIds not available" errors specifically, taking a sample of 1 week and roughly 900k missing groupIds messages,
Category | Task | Share |
Wikidata thumbnails missing title | T309695 | 48% |
Transliterated text needs language parameter | T246314 | 27% |
Other errors where revid is not present | T309773 | 17% |
Errors although revid is present | T309702 | 7.3% |
Wikivoyage template malfunctioning? Errors like "groupIds not available: Maske,Track,Aktivität,Anderes,Anreise,Ausgehen,Aussicht,Besiedelt,Fehler,Gebiet,Kaufen,Küche,Sehenswert,Unterkunft,aquamarinblau,cosmos,gold,hellgrün,orange,pflaumenblau,rot,silber,violett" | 1.3% | |
There should be no more requests without revid, and those with revid should succeed almost always, so these are surprising statistics. The wikidata subtask is an easy win.