Data Products (Data Products Sprint 17) · Milestone
Archived · Public

Details

Description

Sprint 17 Jul 24, 2024 - Aug 14, 2024

Recent Activity

Tue, Nov 5

apaskulin moved T370335: Update guide to creating an instrument with Metrics Platform from Backlog to Done on the Tech-Docs-Team board.
Tue, Nov 5, 3:54 PM · Data Products (Data Products Sprint 17), Tech-Docs-Team, Metrics Platform

Sep 7 2024

TheDJ added a comment to T342267: Investigate surprising "10% Other" portion of Analytics Browsers report.

I had missed that this had been tackled.
The new numbers make a lot more sense to me. Thank you for the work @Milimetric !

Sep 7 2024, 2:14 PM · Data Products (Data Products Sprint 17), Analytics-Data-Problem, MediaWiki-Platform-Team (Radar), Data-Engineering, Data-Engineering-Dashiki

Sep 6 2024

VirginiaPoundstone closed T368253: MetricsPlatform: Add performance instrumentation, a subtask of T366234: Deploy the Metrics Platform extension, as Resolved.
Sep 6 2024, 12:07 PM · Metrics Platform, Data Products (Data Products Sprint 17), Wikimedia-extension-review-queue, Wikimedia-Extension-setup
VirginiaPoundstone closed T371115: [SPIKE] Gather technical requirements for MPIC Alpha, a subtask of T371096: [SPRINT 17 GOAL] Implementation Plan for MPIC ALPHA, as Resolved.
Sep 6 2024, 12:07 PM · Data Products (Data Products Sprint 17)

Sep 3 2024

VirginiaPoundstone archived Data Products (Data Products Sprint 17).
Sep 3 2024, 2:36 PM
VirginiaPoundstone closed T371237: MPIC Onboarding for Surbhi as Resolved.
Sep 3 2024, 2:35 PM · Data Products (Data Products Sprint 17)
VirginiaPoundstone closed T342267: Investigate surprising "10% Other" portion of Analytics Browsers report as Resolved.
Sep 3 2024, 2:35 PM · Data Products (Data Products Sprint 17), Analytics-Data-Problem, MediaWiki-Platform-Team (Radar), Data-Engineering, Data-Engineering-Dashiki
VirginiaPoundstone closed T369917: MPIC: Prevent premature saving of instrument when adding/removing contextual attributes as Resolved.
Sep 3 2024, 2:35 PM · Patch-For-Review, Data Products (Data Products Sprint 17), Metrics Platform
VirginiaPoundstone closed T367057: [SPIKE] Document decision to use a single table per base schema as Resolved.
Sep 3 2024, 2:35 PM · Data Products (Data Products Sprint 17), Spike, Documentation, Metrics Platform
VirginiaPoundstone closed T371031: Spike: Deep Dive on Growthbook data pipeline as Resolved.
Sep 3 2024, 2:35 PM · Data Products (Data Products Sprint 17)
VirginiaPoundstone closed T369222: MPIC: Fix dark mode as Resolved.
Sep 3 2024, 2:34 PM · Patch-For-Review, Data Products (Data Products Sprint 17), Metrics Platform
VirginiaPoundstone closed T366234: Deploy the Metrics Platform extension as Resolved.
Sep 3 2024, 2:34 PM · Metrics Platform, Data Products (Data Products Sprint 17), Wikimedia-extension-review-queue, Wikimedia-Extension-setup
VirginiaPoundstone closed T369856: MPIC: Fix dates and timezones issue as Resolved.
Sep 3 2024, 2:34 PM · Patch-For-Review, Data Products (Data Products Sprint 17), Metrics Platform
VirginiaPoundstone closed T369233: MPIC: Populate the Progress column in Catalog view as Resolved.
Sep 3 2024, 2:34 PM · Data Products (Data Products Sprint 17), Metrics Platform
VirginiaPoundstone closed T370335: Update guide to creating an instrument with Metrics Platform as Resolved.
Sep 3 2024, 2:34 PM · Data Products (Data Products Sprint 17), Tech-Docs-Team, Metrics Platform
VirginiaPoundstone closed T372364: Bug: pivot does not handle varied casing, a subtask of T342267: Investigate surprising "10% Other" portion of Analytics Browsers report, as Resolved.
Sep 3 2024, 2:34 PM · Data Products (Data Products Sprint 17), Analytics-Data-Problem, MediaWiki-Platform-Team (Radar), Data-Engineering, Data-Engineering-Dashiki
VirginiaPoundstone closed T371579: MPIC: Update unit test cases as Resolved.
Sep 3 2024, 2:34 PM · Metrics Platform, Data Products (Data Products Sprint 17)
VirginiaPoundstone closed T371121: MPIC: Remove instrument_sample_rates table as Resolved.
Sep 3 2024, 2:34 PM · Metrics Platform, Data Products (Data Products Sprint 17)
VirginiaPoundstone closed T371583: MPIC: Fix empty toast message when an error occurs as Resolved.
Sep 3 2024, 2:34 PM · Metrics Platform, Data Products (Data Products Sprint 17)
VirginiaPoundstone closed T372364: Bug: pivot does not handle varied casing as Resolved.
Sep 3 2024, 2:34 PM · Data Products (Data Products Sprint 17), Data-Engineering
VirginiaPoundstone closed T372047: MPIC: Review instructions to run MPIC locally for all possible scenarios as Resolved.
Sep 3 2024, 2:34 PM · Metrics Platform, Data Products (Data Products Sprint 17)

Aug 21 2024

mmartorana closed T366233: Application Security Review Request : Metrics Platform extension, a subtask of T366234: Deploy the Metrics Platform extension, as Resolved.
Aug 21 2024, 5:11 PM · Metrics Platform, Data Products (Data Products Sprint 17), Wikimedia-extension-review-queue, Wikimedia-Extension-setup

Aug 20 2024

mforns moved T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking from To Deploy to Done on the Data Products (Data Products Sprint 17) board.
Aug 20 2024, 2:06 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
mforns added a comment to T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking.

I just deployed to the production cluster.
I will be following the executions of webrequest loading in case there are issues.
Otherwise, this is done :-)

Aug 20 2024, 2:05 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
mforns updated the task description for T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking.
Aug 20 2024, 2:05 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform

Aug 19 2024

mforns added a comment to T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking.

I deployed the changes to the test cluster.
I will wait until tomorrow, to make sure there are no unexpected issues,
and then will deploy to the production cluster.

Aug 19 2024, 8:47 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
Stashbot added a comment to T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking.

Mentioned in SAL (#wikimedia-analytics) [2024-08-19T20:45:03Z] <mforns> deployed airflow-dags to analytics_test instance for T368303

Aug 19 2024, 8:45 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
mforns updated the task description for T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking.
Aug 19 2024, 8:28 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
mforns updated the task description for T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking.
Aug 19 2024, 7:45 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
WDoranWMF closed T371096: [SPRINT 17 GOAL] Implementation Plan for MPIC ALPHA as Resolved.
Aug 19 2024, 4:26 PM · Data Products (Data Products Sprint 17)
WDoranWMF closed T369736: [SPRINT 16 GOAL] Draft presentation for Wikimania as Resolved.
Aug 19 2024, 4:26 PM · Data Products (Data Products Sprint 17)
WDoranWMF closed T369739: [SPRINT 16 GOAL] Stand up at least one POC for a 3rd party solution on WMCS as Resolved.
Aug 19 2024, 4:26 PM · Data Products (Data Products Sprint 17)
WDoranWMF moved T371237: MPIC Onboarding for Surbhi from In Process to Done on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:24 PM · Data Products (Data Products Sprint 17)
cjming moved T366234: Deploy the Metrics Platform extension from Sign Off to Done on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:21 PM · Metrics Platform, Data Products (Data Products Sprint 17), Wikimedia-extension-review-queue, Wikimedia-Extension-setup
cjming moved T366234: Deploy the Metrics Platform extension from Code Review / Tech Input to Sign Off on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:21 PM · Metrics Platform, Data Products (Data Products Sprint 17), Wikimedia-extension-review-queue, Wikimedia-Extension-setup
WDoranWMF moved T371115: [SPIKE] Gather technical requirements for MPIC Alpha from Code Review / Tech Input to Sign Off on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:20 PM · Data Products (Data products Sprint 18)
WDoranWMF moved T368303: REQUEST: Add Special:AllEvents to allowlist for campaigns-product pageview tracking from Code Review / Tech Input to To Deploy on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:16 PM · Data Products (Data products Sprint 18), Event-Discovery, Data-Platform
WDoranWMF moved T372364: Bug: pivot does not handle varied casing from Code Review / Tech Input to Done on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:16 PM · Data Products (Data Products Sprint 17), Data-Engineering
WDoranWMF moved T371031: Spike: Deep Dive on Growthbook data pipeline from To Deploy to Done on the Data Products (Data Products Sprint 17) board.
Aug 19 2024, 4:15 PM · Data Products (Data Products Sprint 17)
Milimetric added a comment to T342267: Investigate surprising "10% Other" portion of Analytics Browsers report.

So to confirm, this means:

  • over 1% of page views (after deducting known bots and spiders) are coming from clients with user agents that are entirely unknown to ua-parser. That is, the "Other" is already there in the raw wmf.webrequest_text dataset, and we've not created or normalized anything else to "Other".
  • 0.26% is "Redacted", where our pipeline replaces/normalizes/summarises browser/OS names for privacy reasons.
Aug 19 2024, 2:21 PM · Data Products (Data Products Sprint 17), Analytics-Data-Problem, MediaWiki-Platform-Team (Radar), Data-Engineering, Data-Engineering-Dashiki
Sfaci added a comment to T368466: MPIC: Backend integration tests.

Just in case it helps, I would like to add that we are very interested in automating these integration tests. We already created a ticket to explore this (T371922: MPIC: Automate integration tests). At the moment we have to run them manually because they need a database, but it seems that GitLab services could run a database as a container within the pipeline (is that feature supported by our current GitLab installation?). That way these tests could run automatically every time we push or merge something.
In fact, this is something we also miss when working with the APIs and their existing test suite. If I'm not wrong, we already explored this feature there and concluded that it is not supported by the pipeline we have there (Gerrit + Jenkins).

Aug 19 2024, 11:35 AM · Data Products (Data Products Sprint 22), Quality-and-Test-Engineering-Team, Metrics Platform
Sfaci moved T372557: MPIC: [SPIKE] Create disaster recovery plan for MPIC from Backlog to Metrics Platform Instrument Configurator on the Metrics Platform board.
Aug 19 2024, 10:53 AM · Data Products (Data Products Sprint 20 🎯), Spike, Metrics Platform
Sfaci updated the task description for T368801: [EPIC] MPIC: Fix outstanding issues.
Aug 19 2024, 10:50 AM · Data Products (Epics Timeline), Epic, Metrics Platform
KCVelaga_WMF added a comment to T369687: Develop a reusable Metrics Platform schema fragment for translation workflows.

Thank you @mforns

Aug 19 2024, 5:15 AM · Data Products (Data products Sprint 18), Product-Analytics, LPL Analytics

Aug 17 2024

Krinkle added a comment to T342267: Investigate surprising "10% Other" portion of Analytics Browsers report.

Hm.. it seems the "Other" bucket has grown slightly larger than our prediction of 0.26% at T342267#9998984

[…] The "Other" that we're seeing now is just coming from UA parser actually identifying the stuff as "Other". […]

The thing that we figured would be ~ 0.26% in total would be all the items with "Redacted" in every dimension added together. […]

Aug 17 2024, 1:27 AM · Data Products (Data Products Sprint 17), Analytics-Data-Problem, MediaWiki-Platform-Team (Radar), Data-Engineering, Data-Engineering-Dashiki

Aug 16 2024

apaskulin added a comment to T370335: Update guide to creating an instrument with Metrics Platform.

To-dos have been opened as subtasks of T329506: User Facing Metrics Platform Documentation

Aug 16 2024, 11:55 PM · Data Products (Data Products Sprint 17), Tech-Docs-Team, Metrics Platform
cjming added a comment to T368612: MPIC: Frontend web testing.

thanks @zeljkofilipin ! i will pop by in QTE's next office hours on Monday

Aug 16 2024, 8:50 PM · Patch-For-Review, User-zeljkofilipin, Quality-and-Test-Engineering-Team, Metrics Platform
WDoranWMF moved T370183: [SPIKE] Establish a plan for internal Data Products team training on the data contract from Sprint Backlog to Paused on the Data Products (Data Products Sprint 17) board.
Aug 16 2024, 4:40 PM · Data Products (Data Products Sprint 22)
WDoranWMF moved T371031: Spike: Deep Dive on Growthbook data pipeline from Code Review / Tech Input to To Deploy on the Data Products (Data Products Sprint 17) board.
Aug 16 2024, 4:38 PM · Data Products (Data Products Sprint 17)
phuedx updated the task description for T369847: Setup basic send and receive wiring between a MW instance and a Statsig cloud instance.
Aug 16 2024, 3:15 PM · Data Products (Data products Sprint 18), Metrics Platform