[go: up one dir, main page]

Page MenuHomePhabricator

QuarryComponent
ActivePublic

Members (7)

Watchers (7)

Details

Description

Service to let users execute SQL queries against Wikipedia databases on Wikimedia Cloud Services (Homepage).

Analytics projects aim to give the wiki movement a data services platform: Providing insight into community activity (Data Engineering team).

Recent Activity

Wed, Nov 13

rook added a comment to T289531: Switch to using prefix puppet instead of direct-on-instance puppet.

The move to k8s appears to have made this ticket mostly unactionable.

Wed, Nov 13, 3:11 PM · cloud-services-team, Quarry
rook closed T288982: Productionize quarry a bit as Resolved.
Wed, Nov 13, 3:07 PM · cloud-services-team, Quarry, Epic
rook closed T289531: Switch to using prefix puppet instead of direct-on-instance puppet, a subtask of T288982: Productionize quarry a bit, as Declined.
Wed, Nov 13, 3:07 PM · cloud-services-team, Quarry, Epic
rook closed T289531: Switch to using prefix puppet instead of direct-on-instance puppet as Declined.
Wed, Nov 13, 3:07 PM · cloud-services-team, Quarry

Thu, Nov 7

rook closed T378978: update build-and-push as Resolved.
Thu, Nov 7, 3:11 PM · Quarry
rook added a comment to T378978: update build-and-push.

https://github.com/toolforge/quarry/pull/71

Thu, Nov 7, 3:10 PM · Quarry
rook closed T373528: unused dns proxies? as Resolved.
Thu, Nov 7, 2:56 PM · Quarry
rook closed T373134: PR usually not posting to phabricator as Declined.
Thu, Nov 7, 1:04 PM · Quarry, PAWS

Mon, Nov 4

rook closed T348873: update github action, a subtask of T378978: update build-and-push, as Resolved.
Mon, Nov 4, 2:27 PM · Quarry
rook closed T348873: update github action as Resolved.
Mon, Nov 4, 2:27 PM · PAWS, Quarry
rook added a parent task for T348873: update github action: T378978: update build-and-push.
Mon, Nov 4, 2:26 PM · PAWS, Quarry
rook added a subtask for T378978: update build-and-push: T348873: update github action.
Mon, Nov 4, 2:26 PM · Quarry
rook added a parent task for T348873: update github action: T378977: Update build-and-push.
Mon, Nov 4, 2:26 PM · PAWS, Quarry
rook created T378978: update build-and-push.
Mon, Nov 4, 2:26 PM · Quarry

Wed, Oct 30

joanna_borun closed T169452: Replace Quarry with an installation of Superset as Declined.
Wed, Oct 30, 3:05 PM · cloud-services-team (FY2024/2025-Q1-Q2), superset.wmcloud.org, Quarry

Mon, Oct 28

rook added a comment to T360041: Set query result retention time.

I believe PII such as email addresses, password hashes, and IPs is scrubbed by the replicas? Quarry isn't a system I think of as having PII in it. All the data it queries is public, I think.

Mon, Oct 28, 5:46 PM · Quarry
Novem_Linguae added a comment to T360041: Set query result retention time.

Though none of it gets at the central issue of PII

Mon, Oct 28, 5:36 PM · Quarry
rook closed T360041: Set query result retention time as Resolved.
Mon, Oct 28, 4:40 PM · Quarry
rook added a comment to T360041: Set query result retention time.

I appreciate the commentary. Though none of it gets at the central issue of PII, and the reality that quarry is not designed to keep data in perpetuity. Data persistence is an expensive process and not being applied to quarry as such we're one system crash from the data being gone regardless. There are the current download options to export query results. If additional export options are desired, patches are welcome.

Mon, Oct 28, 3:08 PM · Quarry

Wed, Oct 23

bking added a comment to T360596: Figure out a plan to move forward with regarding Redis License changes.

Forgive the drive-by comment, but at the 6-month anniversary of this ticket, it might be worth checking how our upstream production applications (such as gitlab, netbox etc) are handling this change, if it all. For example, I noticed that netbox-docker is now using valkey .

Wed, Oct 23, 1:34 PM · cloud-services-team, GitLab (Infrastructure), Patch-For-Review, User-aborrero, serviceops, MediaWiki-Platform-Team (Radar), collaboration-services, Release-Engineering-Team (Radar), Quarry, Toolforge, Software-Licensing, Infrastructure-Foundations, netbox, Core Platform Team Initiatives (API Gateway), ChangeProp, MediaWiki-File-management, SRE
jijiki moved T360596: Figure out a plan to move forward with regarding Redis License changes from Incoming 🐫 to 💾 Datastores on the serviceops board.
Wed, Oct 23, 12:11 PM · cloud-services-team, GitLab (Infrastructure), Patch-For-Review, User-aborrero, serviceops, MediaWiki-Platform-Team (Radar), collaboration-services, Release-Engineering-Team (Radar), Quarry, Toolforge, Software-Licensing, Infrastructure-Foundations, netbox, Core Platform Team Initiatives (API Gateway), ChangeProp, MediaWiki-File-management, SRE
jijiki added a parent task for T360596: Figure out a plan to move forward with regarding Redis License changes: T325243: Evaluate out redis_misc cluster.
Wed, Oct 23, 11:54 AM · cloud-services-team, GitLab (Infrastructure), Patch-For-Review, User-aborrero, serviceops, MediaWiki-Platform-Team (Radar), collaboration-services, Release-Engineering-Team (Radar), Quarry, Toolforge, Software-Licensing, Infrastructure-Foundations, netbox, Core Platform Team Initiatives (API Gateway), ChangeProp, MediaWiki-File-management, SRE

Mon, Oct 21

rook closed T377010: [bug] Quarry queries are stopped as Declined.
Mon, Oct 21, 8:18 PM · Quarry
rook added a comment to T377010: [bug] Quarry queries are stopped.

It is possible that you were encountering the three hour time limit for analytics searches. If there was some lag it could have increased your query time from what looks like an hour to later. I'm unsure of how additional data could be provided, though it may be possible. Likely though it is easier to check https://replag.toolforge.org/ for lag which if there was much would suggest long running queries may not complete.

Mon, Oct 21, 8:17 PM · Quarry

Fri, Oct 18

GTrang closed T375988: Quarry shows error: This web service cannot be reached as Resolved.

And now Quarry is working again.

Fri, Oct 18, 2:08 PM · Quarry
rook added a comment to T375988: Quarry shows error: This web service cannot be reached.

Quarry is working again. Though I didn't have time to investigate what is happening so this may happen again. Opening T375997 to investigate the underlying issue.

Indeed, the same error is showing up at Quarry again right now.

Fri, Oct 18, 10:31 AM · Quarry

Oct 17 2024

GTrang reopened T375988: Quarry shows error: This web service cannot be reached as "Open".

Quarry is working again. Though I didn't have time to investigate what is happening so this may happen again. Opening T375997 to investigate the underlying issue.

Oct 17 2024, 1:43 AM · Quarry
GTrang added a parent task for T375997: worker nodes issue with garbage collection: T375988: Quarry shows error: This web service cannot be reached.
Oct 17 2024, 1:41 AM · Quarry
GTrang added a subtask for T375988: Quarry shows error: This web service cannot be reached: T375997: worker nodes issue with garbage collection.
Oct 17 2024, 1:41 AM · Quarry

Oct 16 2024

Prototyperspective added a comment to T377010: [bug] Quarry queries are stopped.

Seems like much less or no issues now. In any case, please add some info when queries are stopped.

Oct 16 2024, 3:43 PM · Quarry

Oct 12 2024

Prototyperspective added a comment to T377010: [bug] Quarry queries are stopped.

Please prevent queries from getting stopped. One went through but the other still gets stopped all the time.

Oct 12 2024, 6:35 PM · Quarry

Oct 11 2024

Prototyperspective added a comment to T377010: [bug] Quarry queries are stopped.

@rook Yes, I did not press the stop button for any of the queries that were stopped and it only displays the above two lines and not any further info like some error code. Other example: https://quarry.wmcloud.org/query/86864

Oct 11 2024, 6:03 PM · Quarry
rook added a comment to T377010: [bug] Quarry queries are stopped.

It's been awhile since I've looked at that code. When I worked on it it was to have the stopped status appear when someone manually presses the "stop" button, I thought I added it just for that, but maybe it existed for something else as well. So, ideally, the status doesn't mean to indicate that the query failed or otherwise couldn't run, just that it was stopped while it was running by the user. Are you seeing your queries end with this status without pressing the stop button?

Oct 11 2024, 6:01 PM · Quarry
Prototyperspective created T377010: [bug] Quarry queries are stopped.
Oct 11 2024, 3:34 PM · Quarry

Oct 10 2024

Base added a comment to T360041: Set query result retention time.

If you do do this, it would be good to only remove the older runs results, but leave the most recent run result for each query, or as a less desirable alternative to keep those for only published queries (but I am one of the people who rarely publishes queries even when they are finished). My queries often refer to some article contest, or some on wiki investigation, or just a bit of curiosity that does make sense to be kept for many years and is sometimes linked with the assumption of some stability from on-wiki. Re-running queries every 90 days would be an unnecessary burden, might lead to corrupted results, such as a query that was used to determine a contest winner will no longer should that person as winner because of some things happening in between, and because of schema changes it might also be a maintenance burden (I still have a lot of query results active from the pre-multi-DB select cancellation times, or from the pre actor migration times) that either require a tweak or are even impossible to rewrite, but the results of their older runs are somewhat useful to this day.

Oct 10 2024, 12:47 AM · Quarry

Sep 30 2024

Aklapper renamed T375988: Quarry shows error: This web service cannot be reached from Quarry not working today to Quarry shows error: This web service cannot be reached.
Sep 30 2024, 9:00 AM · Quarry
rook closed T375988: Quarry shows error: This web service cannot be reached as Resolved.
Sep 30 2024, 7:32 AM · Quarry
rook added a comment to T375988: Quarry shows error: This web service cannot be reached.

Quarry is working again. Though I didn't have time to investigate what is happening so this may happen again. Opening T375997 to investigate the underlying issue.

Sep 30 2024, 7:30 AM · Quarry
rook created T375997: worker nodes issue with garbage collection.
Sep 30 2024, 7:29 AM · Quarry
rook added a comment to T375988: Quarry shows error: This web service cannot be reached.

Looks like k8s is having trouble with garbage collection

Warning  FreeDiskSpaceFailed  118s (x159 over 13h)  kubelet  Failed to garbage collect required amount of images. Attempted to free 4281512755 bytes, but only found 0 bytes eligible to free.
Sep 30 2024, 7:04 AM · Quarry
Liz edited projects for T375988: Quarry shows error: This web service cannot be reached, added: Quarry; removed Cloud-Services.
Sep 30 2024, 3:08 AM · Quarry

Sep 28 2024

Maintenance_bot edited projects for T151158: Support queries against Quarry's own database and ToolsDB, added: User-notice-archive; removed User-notice.
Sep 28 2024, 7:30 PM · User-notice-archive, cloud-services-team (FY2024/2025-Q1-Q2), Quarry

Sep 25 2024

LucasWerkmeister added a comment to T361471: Quarry login fails due to redirect to plaintext HTTP URL.

Works for me now, thanks \o/

Sep 25 2024, 1:59 PM · Quarry
taavi closed T361471: Quarry login fails due to redirect to plaintext HTTP URL as Resolved.
Sep 25 2024, 1:59 PM · Quarry
github-toolforge-bot added a comment to T361471: Quarry login fails due to redirect to plaintext HTTP URL.

supertassu closed https://github.com/toolforge/quarry/pull/70

Sep 25 2024, 1:57 PM · Quarry

Sep 24 2024

github-toolforge-bot added a comment to T361471: Quarry login fails due to redirect to plaintext HTTP URL.

supertassu opened https://github.com/toolforge/quarry/pull/70

Sep 24 2024, 5:28 PM · Quarry
LucasWerkmeister added a comment to T361471: Quarry login fails due to redirect to plaintext HTTP URL.

Still happening. I got redirected like this:

Though the error Firefox shows me is different:

Secure Site Not Available – You’ve enabled HTTPS-Only Mode for enhanced security, and a HTTPS version of quarry.wmcloud.org is not available.

Sep 24 2024, 5:18 PM · Quarry

Sep 20 2024

1234qwer1234qwer4 created T375292: Emtpy Quarry query names are not linked/should not be allowed.
Sep 20 2024, 5:50 PM · Quarry

Sep 18 2024

Quiddity moved T151158: Support queries against Quarry's own database and ToolsDB from In current Tech/News draft to Already announced/Archive on the User-notice board.
Sep 18 2024, 6:37 PM · User-notice-archive, cloud-services-team (FY2024/2025-Q1-Q2), Quarry

Sep 14 2024

Zache added a comment to T360041: Set query result retention time.

I have experience with three different types of queries where keeping the results has been important.

Sep 14 2024, 3:16 AM · Quarry