[go: up one dir, main page]

Page MenuHomePhabricator

Data-Platform-SRE (2024.07.08 - 2024.07.28)Milestone
ArchivedPublic

Members (6)

Watchers

  • This project does not have any watchers.
  • View All

Details

Description

Milestone for Data Platform SRE work

Recent Activity

Wed, Nov 20

gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1093333 merged by Btullis:

[operations/puppet@production] Upgrade the remainder of the cephosd cluster to nftables

https://gerrit.wikimedia.org/r/1093333

Wed, Nov 20, 1:30 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review
gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1093333 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/puppet@production] Upgrade the remainder of the cephosd cluster to nftables

https://gerrit.wikimedia.org/r/1093333

Wed, Nov 20, 1:24 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review
gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1089716 merged by Btullis:

[operations/puppet@production] Canary cephosd1001 to use nftables instead of iptables

https://gerrit.wikimedia.org/r/1089716

Wed, Nov 20, 9:51 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review

Mon, Nov 11

gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1089716 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/puppet@production] Canary cephosd1001 to use nftables instead of iptables

https://gerrit.wikimedia.org/r/1089716

Mon, Nov 11, 12:05 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review

Oct 21 2024

brouberol added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

As an aside, we have enabled both the ceph-csi-rdb and ceph-csi-cephfs (https://phabricator.wikimedia.org/T376401) storage interfaces in dse-k8s-eqiad.

Oct 21 2024, 5:31 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review
gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1079542 abandoned by Brouberol:

[operations/puppet@production] envoy: Fix firewall_srange not being taken into account

https://gerrit.wikimedia.org/r/1079542

Oct 21 2024, 1:48 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review

Oct 11 2024

gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1079545 merged by Btullis:

[operations/puppet@production] Revert cephosd servers from nftables to ferm

https://gerrit.wikimedia.org/r/1079545

Oct 11 2024, 3:35 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review
gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1079545 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/puppet@production] Revert cephosd servers from nftables to ferm

https://gerrit.wikimedia.org/r/1079545

Oct 11 2024, 3:33 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review
gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1079542 had a related patch set uploaded (by Brouberol; author: Brouberol):

[operations/puppet@production] ceph.server: invert firewall::src_sets and firewall::srange hiera values

https://gerrit.wikimedia.org/r/1079542

Oct 11 2024, 3:04 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review
gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1050331 merged by Btullis:

[operations/puppet@production] cephosd: Switch to use nftables instead of iptables

https://gerrit.wikimedia.org/r/1050331

Oct 11 2024, 9:35 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review

Oct 10 2024

gerritbot added a comment to T327259: Enable the Container Storage Interface (CSI) and the Ceph CSI plugin on dse-k8s cluster.

Change #1050330 merged by Btullis:

[operations/puppet@production] Switch cephosd1001 to use the nftables based firewall

https://gerrit.wikimedia.org/r/1050330

Oct 10 2024, 4:45 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review

Oct 9 2024

Maintenance_bot removed a project from T364367: Create dedicated UIs for wdqs graph split endpoints: Patch-For-Review.
Oct 9 2024, 12:30 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1078664 merged by jenkins-bot:

[operations/deployment-charts@master] wikidata-query-gui: remove experimental endpoints

https://gerrit.wikimedia.org/r/1078664

Oct 9 2024, 11:55 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata

Oct 8 2024

gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1078664 had a related patch set uploaded (by Jelto; author: Jelto):

[operations/deployment-charts@master] wikidata-query-gui: remove experimental endpoints

https://gerrit.wikimedia.org/r/1078664

Oct 8 2024, 1:09 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata

Sep 25 2024

Michael closed T368750: Newcomer Homepage: Suggested Edits (mobile preview) empty state when there are no suggested edits, a subtask of T368405: Special:Homepage is rendered much slower (<1 sec to 2+ sec), as Resolved.
Sep 25 2024, 9:32 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Growth-Team (FY2024-25 Q1 Sprint 1), MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), Data Products, User-Michael, Data-Platform, Performance Issue, GrowthExperiments-Homepage

Sep 13 2024

Maintenance_bot removed a project from T371210: Create the airflow-analytics-test domain and certs: Patch-For-Review.
Sep 13 2024, 7:30 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28)
gerritbot added a comment to T371210: Create the airflow-analytics-test domain and certs.

Change #1057830 abandoned by Stevemunene:

[operations/puppet@production] trafficserver: add airflow-analytics-test discovery record

Reason:

Abandoned in favour of airflow-test-k8s done here https://gerrit.wikimedia.org/r/c/operations/puppet/+/1063848

https://gerrit.wikimedia.org/r/1057830

Sep 13 2024, 7:03 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28)

Sep 6 2024

Ahoelzl set the point value for T367848: Publishing conda environments with WMF Data Workflow Utils is broken to 1.
Sep 6 2024, 8:53 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Data-Platform-SRE (2024.07.08 - 2024.07.28), Product-Analytics

Aug 26 2024

RKemper updated the task description for T364367: Create dedicated UIs for wdqs graph split endpoints.
Aug 26 2024, 7:52 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1066812 merged by Ryan Kemper:

[operations/puppet@production] wdqs graph split: routing for wdqs backends

https://gerrit.wikimedia.org/r/1066812

Aug 26 2024, 7:49 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
gerritbot added a project to T364367: Create dedicated UIs for wdqs graph split endpoints: Patch-For-Review.
Aug 26 2024, 5:19 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1066812 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/puppet@production] wdqs graph split: routing for wdqs backends

https://gerrit.wikimedia.org/r/1066812

Aug 26 2024, 5:19 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
Ahoelzl moved T367848: Publishing conda environments with WMF Data Workflow Utils is broken from Incoming (new tickets) to Q1 2024 July 1st - September 30th on the Data-Engineering board.
Aug 26 2024, 4:55 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Data-Platform-SRE (2024.07.08 - 2024.07.28), Product-Analytics

Aug 16 2024

brouberol closed T372620: Report on the initial growthbook installation PoC, a subtask of T365839: Deploy an instance of GrowthBook to Kubernetes, as Resolved.
Aug 16 2024, 10:09 AM · Patch-For-Review, Data-Platform-SRE (2024.07.08 - 2024.07.28), Data Products

Aug 13 2024

Stevemunene closed T356230: Conda-Analytics packages incompatible with latest versions of Pandas and Numpy as Resolved.

Marking this as resolved in favour of the created tasks on upgrading numpy, pyarrow, pandas and pyspark.

Aug 13 2024, 2:52 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Movement-Insights

Aug 3 2024

Maintenance_bot removed a project from T364367: Create dedicated UIs for wdqs graph split endpoints: Patch-For-Review.
Aug 3 2024, 1:30 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1053765 merged by Ryan Kemper:

[operations/puppet@production] wdqs graph split: routing for wdqs backends

https://gerrit.wikimedia.org/r/1053765

Aug 3 2024, 12:45 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata

Jul 31 2024

bking updated the task description for T371061: Update CirrusSearch dashboards to use new metrics/refresh dashboards.
Jul 31 2024, 2:24 PM · Data-Platform-SRE (2024.11.09 - 2024.11.29), Discovery-Search, CirrusSearch
matmarex archived Data-Platform-SRE (2024.07.08 - 2024.07.28).
Jul 31 2024, 1:27 PM
bking updated Other Assignee for T368760: Configure airflow webserver under Kubernetes to use OIDC authentication, added: bking.
Jul 31 2024, 1:11 PM · Data-Platform-SRE (2024.08.17 - 2024.09.06)
gerritbot added a comment to T364368: Create separate pybal pools for wdqs graph split (main vs scholarly).

Change #1046120 abandoned by Stevemunene:

[operations/puppet@production] [WIP] wdqs: create wdqs split pybal pools

Reason:

Duplicate of https://gerrit.wikimedia.org/r/c/operations/puppet/+/1054520

https://gerrit.wikimedia.org/r/1046120

Jul 31 2024, 10:06 AM · Data-Platform-SRE (2024.08.17 - 2024.09.06), Patch-For-Review, Discovery-Search, Wikidata-Query-Service, Wikidata

Jul 30 2024

Maintenance_bot removed a project from T363001: Create a helm chart for airflow that is appropriate to our needs: Patch-For-Review.
Jul 30 2024, 9:32 PM · Data-Platform-SRE (2024.08.17 - 2024.09.06)
gerritbot added a comment to T363001: Create a helm chart for airflow that is appropriate to our needs.

Change #1041759 merged by Bking:

[operations/deployment-charts@master] dse-k8s-services: Add net-new chart for Airflow

https://gerrit.wikimedia.org/r/1041759

Jul 30 2024, 9:24 PM · Data-Platform-SRE (2024.08.17 - 2024.09.06)
bking added a comment to T368033: Design a suitable DAG deployment method.

Thanks @hashar and @Ottomata for following up. Based on this exchange as well as the comments in the Gitlab HDFS synchronizer design doc , I think DPE SRE and Releng are on the same page as far as the potential security issues posed by this CD implementation.

Jul 30 2024, 9:17 PM · Data-Platform-SRE (2024.11.09 - 2024.11.29), Data-Engineering
bking added a parent task for T371061: Update CirrusSearch dashboards to use new metrics/refresh dashboards: T350597: Audit and prioritize metrics for conversion to statslib that are used for graphite-based alerting.
Jul 30 2024, 8:26 PM · Data-Platform-SRE (2024.11.09 - 2024.11.29), Discovery-Search, CirrusSearch
gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1057889 merged by Ryan Kemper:

[wikidata/query/gui-deploy@production] wdqs-main: set title to "WDQS main graph"

https://gerrit.wikimedia.org/r/1057889

Jul 30 2024, 7:06 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
gerritbot added a comment to T364367: Create dedicated UIs for wdqs graph split endpoints.

Change #1057889 had a related patch set uploaded (by Ryan Kemper; author: DCausse):

[wikidata/query/gui-deploy@production] wdqs-main: set title to "WDQS main graph"

https://gerrit.wikimedia.org/r/1057889

Jul 30 2024, 7:05 PM · Data-Platform-SRE (2024.07.08 - 2024.07.28), collaboration-services, Discovery-Search, Wikidata-Query-Service, Wikidata
Maintenance_bot removed a project from T368757: Create a git-sync container image to be used with airflow: Patch-For-Review.
Jul 30 2024, 2:31 PM · Data-Platform-SRE (2024.07.29 - 2024.08.16)
Stevemunene moved T365449: Upgrade Airflow to 2.9.3 from In Progress to To Be Deployed on the Data-Platform-SRE (2024.07.08 - 2024.07.28) board.

The airflow v2.9.3 is ready to be deployed for testing on the an-test-client1002

Jul 30 2024, 2:11 PM · Patch-For-Review, Release-Engineering-Team (Radar), Data-Platform-SRE (2024.07.29 - 2024.08.16), collaboration-services, Data Pipelines, Data-Engineering
CodeReviewBot added a comment to T368757: Create a git-sync container image to be used with airflow.

bking merged https://gitlab.wikimedia.org/repos/data-engineering/git-sync/-/merge_requests/11

Jul 30 2024, 1:58 PM · Data-Platform-SRE (2024.07.29 - 2024.08.16)
Stashbot added a comment to T368518: decommission clouddb1021.

Mentioned in SAL (#wikimedia-operations) [2024-07-30T13:58:18Z] <marostegui> Remove clouddb1021 from zarcillo database T368518

Jul 30 2024, 1:58 PM · SRE, DC-Ops, ops-eqiad, Data-Platform-SRE (2024.07.29 - 2024.08.16), decommission-hardware
Marostegui added a comment to T368518: decommission clouddb1021.

I am going to remove this host from zarcillo database - even if it is used for reimage tests it will be eventually decommissioned.

Jul 30 2024, 1:58 PM · SRE, DC-Ops, ops-eqiad, Data-Platform-SRE (2024.07.29 - 2024.08.16), decommission-hardware
CodeReviewBot added a project to T368757: Create a git-sync container image to be used with airflow: Patch-For-Review.

bking opened https://gitlab.wikimedia.org/repos/data-engineering/git-sync/-/merge_requests/11

Jul 30 2024, 1:56 PM · Data-Platform-SRE (2024.07.29 - 2024.08.16)
Stevemunene added a comment to T365449: Upgrade Airflow to 2.9.3.

With the tags work around we have been able to release the airflow 2.9.3 upgrade, I don't think we shall need to rework the pipeline as is since we are moving to dse-k8s. Thanks for the help @Jelto and @LSobanski

Jul 30 2024, 1:50 PM · Patch-For-Review, Release-Engineering-Team (Radar), Data-Platform-SRE (2024.07.29 - 2024.08.16), collaboration-services, Data Pipelines, Data-Engineering
Stevemunene moved T371209: Create airflow-test-k8s OIDC configuration from In Progress to Blocked / Waiting on the Data-Platform-SRE (2024.07.08 - 2024.07.28) board.
Jul 30 2024, 11:04 AM · Data-Platform-SRE (2024.08.17 - 2024.09.06)
Stevemunene closed T371210: Create the airflow-analytics-test domain and certs as Resolved.
Jul 30 2024, 10:56 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28)
Stevemunene closed T371210: Create the airflow-analytics-test domain and certs, a subtask of T368760: Configure airflow webserver under Kubernetes to use OIDC authentication, as Resolved.
Jul 30 2024, 10:56 AM · Data-Platform-SRE (2024.08.17 - 2024.09.06)
gerritbot added a comment to T371209: Create airflow-test-k8s OIDC configuration.

Change #1057805 merged by Stevemunene:

[operations/dns@master] dns: provision airflow-analytics-test domain

https://gerrit.wikimedia.org/r/1057805

Jul 30 2024, 10:46 AM · Data-Platform-SRE (2024.08.17 - 2024.09.06)
gerritbot added a comment to T371210: Create the airflow-analytics-test domain and certs.

Change #1057805 merged by Stevemunene:

[operations/dns@master] dns: provision airflow-analytics-test domain

https://gerrit.wikimedia.org/r/1057805

Jul 30 2024, 10:46 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28)

Jul 29 2024

LSobanski moved T365449: Upgrade Airflow to 2.9.3 from Incoming to Consultation on the collaboration-services board.
Jul 29 2024, 3:32 PM · Patch-For-Review, Release-Engineering-Team (Radar), Data-Platform-SRE (2024.07.29 - 2024.08.16), collaboration-services, Data Pipelines, Data-Engineering