[go: up one dir, main page]

Page MenuHomePhabricator

Missing rows in revision table of enwiki.labsdb (data integrity issue)
Closed, DuplicatePublic

Description

https://en.wikipedia.org/w/index.php?oldid=675414646 exists, but...

MariaDB [enwiki_p]> select * from revision where rev_id = 675414646;
Empty set (0.00 sec)

MariaDB [enwiki_p]> select * from revision_userindex where rev_id = 675414646;
Empty set (0.00 sec)

It looks like the enwiki.labsdb revision table on Labs is missing some rows.

This is the page that refers to that revision (i.e., page_latest: 675414646):

MariaDB [enwiki_p]> select * from page where page_id = 47486437\G
*************************** 1. row ***************************
           page_id: 47486437
    page_namespace: 0
        page_title: Minister_of_Posts,_Telegraphs_and_Telecommunications_(France)
 page_restrictions: 
      page_counter: 0
  page_is_redirect: 1
       page_is_new: 1
       page_random: 0.955717957143
      page_touched: 20150810125145
page_links_updated: 20150810125145
       page_latest: 675414646
          page_len: 59
page_content_model: wikitext
1 row in set (0.00 sec)

Event Timeline

MZMcBride raised the priority of this task from to High.
MZMcBride updated the task description. (Show Details)
MZMcBride added projects: Toolforge, DBA.
Restricted Application added a subscriber: Aklapper. · View Herald Transcript

At least the following page IDs are affected:

47486469
47486598
47486599
47486601
47486479
47486609
47486613
47486489
47486491
47486621
47486597
47486627
47486630
47486684
47486635
47486638
47486579
47486388
47486519
47486648
47486650
47486526
47486581
47486656
47486530
47486662
47486583
47486414
47486415
47486418
47486675
47486550
47486607
47486556
47486429
47486558
47486693
47486437
47486567
47486451
47486452
47486453
47486455
47486456
47486588
47486591

Despite being a mixture of old an recent pages, all missing edits seem to be from around the same dates.
This would point to a range of transactions missing. I will identify the exact range and backfill those revisions back to sanitarium, and then labs.