Private account of @Lucas_Werkmeister_WMDE (he/him, Berlin timezone). Anything I do here is on volunteer time, even if it looks work-related :)
User Details
- User Since
- Jun 5 2016, 4:36 PM (442 w, 2 d)
- Availability
- Available
- IRC Nick
- lucaswerkmeister
- LDAP User
- Lucas Werkmeister
- MediaWiki User
- Lucas Werkmeister [ Global Accounts ]
Mon, Nov 25
Renders fine (with an error message) at https://test.wikidata.org/w/index.php?curid=959:
(Switching hats, sorry.)
Sun, Nov 24
(And as expected, it’s still trying to remove sh-latn.json in the next pull request.)
Sat, Nov 23
Thu, Nov 21
That’s great, thank you!
Currently, the number of links to Template:SDC statement has value (as counted by search – this lags somewhat behind the “real” number, as it depends on a further job, but it should be a decent approximation)
Per IRC discussion, marking as High priority. @AntiCompositeNumber reports that this results in category changes being slow to propagate (Category:Johann Baptist Hops not showing up in Category:Hops (surname) yet).
I request that the secrets not be removed in this case — the only sensitive secret is the Spur.us API key which is already expired per T380193
Mon, Nov 18
The export kept deleting the file through several re-exports; I’ve now manually restored it to unblock all the other translation updates for now.
Thanks for deploying!
(Placing the task back up for grabs because the rest is deployment and I have no idea how to do that.)
Sat, Nov 9
kubectl describe pods showed
Last State: Terminated Reason: OOMKilled
so I tried bumping the memory limit. (Bit annoying that the OOMKilled wasn’t shown in kubectl logs nor kubectl get events.) So far, the pod looks more stable:
tools.wikiloves@tools-bastion-13:~$ kubectl get pods NAME READY STATUS RESTARTS AGE wikiloves-575bb94984-cp7sp 1/1 Running 1 (2m44s ago) 2m46s
https://wikiloves.toolforge.org/ just shows me a 403, I don’t know if that’s correct or not.
Fri, Nov 8
Reopening, as I’d respectfully ask you to reconsider installing mysqldump on the bastion directly. toolforge jobs does not support streaming the output, so to actually get an offsite backup I either have to put together an awful kubectl run command myself (I’m at least 15 minutes in and nowhere near anything usable yet – and even if I end up with something usable, I would be relying on a whole bunch of Toolforge internals, including how envvars map to k8s secrets) or use NFS (and we all love NFS). With mysqldump on the bastion, on the other hand, it’s very straightforward to stream the dump straight into another machine.
If you want to reimplement the tool (my very naive guess, without ever having seen the tool in action, is that it seems like a manageable task in principle – the existing source code I can see is some 200 lines of Python, 50 lines of CSS, and 120 lines of XML), I think we could still add you to the tool (after removing the old code), so that the rewrite can be reached under the same tool name. I can’t say whether this makes sense or not for this tool (whether a lot of users will want to continue using the same URLs, or whether it would be more confusing to users if a new implementation shows up there). Of course, you can also do a rewrite in a new tool.
Wed, Nov 6
In the meantime I’m holding off on merging that PR (unless you say I should merge it anyway).
Tue, Nov 5
Thanks! Do you know by any chance why that PR removes the sh-latn translations? (They still seem to exist on-wiki.)
Mon, Nov 4
The tool is currently offline, having been disabled as part of the grid engine deprecation about a year ago, so currently no external resources are being loaded there anymore. If @MichaelSchoenitzer plans to bring the tool back, it would be great to fix this; otherwise this task could also be closed, I think.
More to the point, plausible.io is an analytics service. While I commend the Humaniki team on not using Google Analytics, I’m still not sure we want this in Wikimedia Cloud VPS (though it’s hard to say for sure, because the Cloud Services Cross Site Policy, as linked from the Cloud Services Terms of use, is unfortunately still stuck in the draft stage). That is to say, I suspect the resolution for this task should not be “load the tracking JS from a wmcloud domain” but rather “do not load the tracking JS at all”.
Tool was apparently deleted / archived almost two years ago, nothing more to be done here:
lucaswerkmeister@tools-nfs-2:~$ ls -lh /srv/tools/archivedtools/activity.tgz -rw-r--r-- 1 root root 96K Jan 29 2023 /srv/tools/archivedtools/activity.tgz
In T378976: Lexeme-forms on Toolforge returns error, a webservice was dead for a few hours until I manually stopped and started it; might be related to this upgrade? (The startup probe continuously got connect: connection refused, apparently.)
Looks like that worked 🤷
The pod didn’t log anything else before it died, apparently:
No idea what’s wrong:
Sat, Nov 2
Wed, Oct 30
Oct 25 2024
It’s happening again. https://gitlab.wikimedia.org/toolforge-repos/wd-image-positions/-/tree/twn?ref_type=heads currently has green CI, yet no merge request was created.
Oct 20 2024
Oct 15 2024
Oct 11 2024
Yes, looks like it’s working again – thank you!
Oct 10 2024
Yay, thank you!
That turned out to be a different problem (the “bad content format” pages at the bottom of https://quickcategories.toolforge.org/batch/7877/ used to make the whole background runner crash).
Oct 9 2024
Oct 8 2024
Alright, thanks – I’ve updated the translations on TWN now, so we should hopefully see some changes on Thursday.
Oct 7 2024
That’s not how it worked in the past – https://gitlab.wikimedia.org/toolforge-repos/wd-image-positions/-/merge_requests/4 was initially created with CI errors according to the comments. And then I got a notification about the merge request, saw the CI errors, and fixed them. Without the merge request, that’s not going to happen, so if this is an intentional change, I’d like to revert it: a workflow where l10n-bot doesn’t create merge requests with failing CI doesn’t work for me at all.
The warning still appears, by the way – I guess someone™ still needs to deploy the new version? (I might technically have the required permissions but wouldn’t know how to do it…)
Oct 5 2024
This is still happening, by the way, which means the Wikidata Image Positions translations are now over a month outdated :/
Sep 28 2024
Oh wait, I totally forgot I signed an NDA for T314527 as well. (And indeed L3 tells me I’ve signed it.) That probably means this task is done?
Sep 25 2024
Works for me now, thanks \o/
Sep 24 2024
Still happening. I got redirected like this:
- https://quarry.wmcloud.org/login?next=/
- https://meta.wikimedia.org/w/index.php?title=Special%3AOAuth%2Fauthenticate&oauth_token=X&oauth_consumer_key=Y
- https://quarry.wmcloud.org/oauth-callback?oauth_verifier=Z&oauth_token=X
- http://quarry.wmcloud.org/
Though the error Firefox shows me is different:
Sep 22 2024
For the record, I talked with @Dogu on Telegram about the specifics of the builder pattern, and some of the results we arrived at were:
Looks like it already restarted itself once:
Sep 20 2024
Let’s close this for now, feel free to reopen if it comes back.
Sep 19 2024
Do we have to be bound by Bash’s syntax limitations? . and - work just fine between env and Flask / Python, even if there is a Bash in between:
Sep 18 2024
Alright, the background runner should now have a health check and hopefully restart itself if it gets stuck.
Sep 17 2024
As mentioned in T370474#10098474, @Lucas_Werkmeister_WMDE has signed some kind of NDA already. I don’t know if that also covers me (same legal person, different social role) – I’m happy to sign anything if required, and will leave that decision to the Legal team.
Sep 15 2024
I looked a bit through the Git history but didn’t find a lot more explanation for the pattern.
Sep 14 2024
Something’s definitely running again:
Sep 11 2024
If this sounds reasonable to the OAuth maintainers, I can try to implement it, but I would love to hear first whether people think it stands a chance of being merged or not ^^
Sep 9 2024
Possible improvement to the error page: https://gerrit.wikimedia.org/r/c/operations/puppet/+/1071715
Well, apparently there were some random Wikimedia errors:
Sep 5 2024
That read() seems to be happily blocking forever, by the way:
If we can’t figure out the underlying issue, I suppose I could:
I’m leaving the background runner in its current state for a bit in case someone else wants to take a look, but at some point I’ll restart it again to get the edits unstuck.
Sep 4 2024
Note: I don’t know what the pipeline does and have no idea if pointing the script at the fork’s pipeline run would be enough. (E.g. maybe the original repo has some secrets that the pipeline run needs?)
Sep 3 2024
I guess so :P since I already started looking into this after noticing the output (I’m guessing it’s due to the Kubernetes upgrade yesterday?)
(Kubernetes 1.31 apparently adds a --wait option again but who knows if it does the same thing.)
FWIW, according to the commit deprecating this and other flags, --wait had no effect (since at least 1.26, which is the version we’re running), so it should be safe to just remove. (I haven’t been able to find any old documentation that would tell us whether it used to do something or not, but if the --wait was ever needed for something then it probably already broke whenever an older k8s version made the flag have no effect.)
Sep 2 2024
Feel free to reopen if you want to make the experimental highlighter plugin an optional dependency after all, but otherwise I think this is resolved.
Aug 30 2024
Anyway, after installing the plugin, wbsearchentities works \o/