[go: up one dir, main page]

Page MenuHomePhabricator

TJones (Trey Jones)
Staff Computational Linguist, Search Platform Team

Projects

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Tuesday

  • Clear sailing ahead.

User Details

User Since
Jul 8 2015, 3:02 PM (481 w, 3 d)
Availability
Available
IRC Nick
Trey314159
LDAP User
Tjones
MediaWiki User
TJones (WMF) [ Global Accounts ]

I would have written a shorter comment, but I did not have the time.

I'm part of the Search Platform team and I spend my time working on search & relevance, trying to better support search in various languages, analyzing queries, and doing random mathy things. I tend to write long, detailed notes about my investigations (so as to improve the bus number of my work).

When I have to work on _GitHub,_ /‍‍/Phab,/‍‍/ and ''MediaWiki'' all on the same day, I sometimes suffer Severe Markup Incongruence Fatigue.

I � Unicode.

Recent Activity

Tue, Sep 24

TJones triaged T375565: Review hindi_normalization for Hindi analysis chain as Medium priority.
Tue, Sep 24, 9:58 PM · Discovery-Search
TJones triaged T375567: Review indic_normalization for other Indic languages/scripts as High priority.
Tue, Sep 24, 9:57 PM · Discovery-Search
TJones created T375567: Review indic_normalization for other Indic languages/scripts.
Tue, Sep 24, 9:57 PM · Discovery-Search
TJones created T375565: Review hindi_normalization for Hindi analysis chain.
Tue, Sep 24, 9:52 PM · Discovery-Search
TJones triaged T375561: Apply ICU folding to more languages as Medium priority.
Tue, Sep 24, 9:20 PM · Discovery-Search
TJones created T375561: Apply ICU folding to more languages.
Tue, Sep 24, 9:19 PM · Discovery-Search
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Tue, Sep 24, 8:16 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones created T375557: Reindex all wikis to enable folding harmonization and new functionality.
Tue, Sep 24, 8:13 PM · Discovery-Search (Current work)

Mon, Sep 23

TJones moved T332342: Standardize ASCII-folding/ICU-folding across analyzers from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mon, Sep 23, 3:13 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)
TJones moved T332342: Standardize ASCII-folding/ICU-folding across analyzers from To Be Deployed to Needs review on the Discovery-Search (Current work) board.
Mon, Sep 23, 3:06 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)
TJones moved T332342: Standardize ASCII-folding/ICU-folding across analyzers from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mon, Sep 23, 3:05 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)
TJones moved T332342: Standardize ASCII-folding/ICU-folding across analyzers from In Progress to Needs review on the Discovery-Search (Current work) board.
Mon, Sep 23, 3:05 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)

Fri, Sep 20

TJones added a comment to T332342: Standardize ASCII-folding/ICU-folding across analyzers.

A full write up with details of the 11 languages using Indic scripts (Marathi, Burmese, Malayalam, Telugu, Sinhala, Kannada, Gujarati, Nepali, Assamese, Punjabi, and Odia) that are configured in this last patch is on Mediawiki.

Fri, Sep 20, 8:49 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)

Thu, Sep 19

TJones updated the task description for T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 4:30 PM · Epic, Discovery-Search
TJones triaged T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.) as High priority.
Thu, Sep 19, 3:49 PM · Epic, Discovery-Search
TJones renamed T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.) from [EPIC] Create infrastructure to support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.) to [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 3:49 PM · Epic, Discovery-Search
TJones added a parent task for T138958: Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias: T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 3:47 PM · Discovery-Search, Russian-Sites, Discovery-ARCHIVED
TJones added a parent task for T127003: Transliterate Latin or Cyrillic script searches to Georgian script on Georgian wikis: T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 3:47 PM · Discovery-Search
TJones added a parent task for T155104: Detect "wrong keyboard" queries for Hebrew/American keyboards on EN/HE Wikipedias: T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 3:47 PM · Discovery-Search, Discovery-ARCHIVED
TJones added subtasks for T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.): T138958: Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias, T155104: Detect "wrong keyboard" queries for Hebrew/American keyboards on EN/HE Wikipedias, T127003: Transliterate Latin or Cyrillic script searches to Georgian script on Georgian wikis, T297761: Create a Latin-to-Devanagari transliteration second-chance search for Hindi wikis.
Thu, Sep 19, 3:47 PM · Epic, Discovery-Search
TJones added a parent task for T297761: Create a Latin-to-Devanagari transliteration second-chance search for Hindi wikis: T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 3:47 PM · Discovery-Search
TJones moved T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.) from needs triage to [epic] on the Discovery-Search board.
Thu, Sep 19, 3:46 PM · Epic, Discovery-Search
TJones created T375215: [EPIC] Support "second-try" transliteration or wrong-keyboard searches (aka N.O.R.M.).
Thu, Sep 19, 3:46 PM · Epic, Discovery-Search
TJones renamed T127003: Transliterate Latin or Cyrillic script searches to Georgian script on Georgian wikis from Inter language script detection in search to Transliterate Latin or Cyrillic script searches to Georgian script on Georgian wikis.
Thu, Sep 19, 2:54 PM · Discovery-Search
TJones raised the priority of T138958: Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias from Medium to High.
Thu, Sep 19, 2:53 PM · Discovery-Search, Russian-Sites, Discovery-ARCHIVED
TJones raised the priority of T155104: Detect "wrong keyboard" queries for Hebrew/American keyboards on EN/HE Wikipedias from Medium to High.
Thu, Sep 19, 2:52 PM · Discovery-Search, Discovery-ARCHIVED
TJones raised the priority of T318269: Test and analyze Kuromoji Japanese language analyzer from Medium to High.
Thu, Sep 19, 2:47 PM · Discovery-Search

Mon, Sep 16

TJones added a comment to T372932: Add raw numbers to Search Metrics dashboard.

The updates look excellent to me!

Mon, Sep 16, 5:06 PM · Discovery-Search (Current work)

Aug 27 2024

TJones added a comment to T332342: Standardize ASCII-folding/ICU-folding across analyzers.

Apparently this should have been an (oxymoronic) mini-epic.

Aug 27 2024, 8:50 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)
TJones triaged T373471: Expand homoglyph normalization to more scripts (esp. Greek) as High priority.
Aug 27 2024, 5:49 PM · Discovery-Search
TJones moved T373471: Expand homoglyph normalization to more scripts (esp. Greek) from needs triage to Language Stuff on the Discovery-Search board.

I wanted to move this out of my ever-languishing 10% project pile.

Aug 27 2024, 5:49 PM · Discovery-Search
TJones created T373471: Expand homoglyph normalization to more scripts (esp. Greek).
Aug 27 2024, 5:48 PM · Discovery-Search

Aug 20 2024

TJones added a comment to T332342: Standardize ASCII-folding/ICU-folding across analyzers.

Full Part 4—Refactoring & Analysis Notes are on Mediawiki.

Aug 20 2024, 10:32 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)

Aug 13 2024

TJones added a comment to T332342: Standardize ASCII-folding/ICU-folding across analyzers.

Full write up on Mediawiki. In summary:

Aug 13 2024, 10:06 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)

Aug 2 2024

TJones added a comment to T371709: Account recovery help needed for Developer account tjones.

Thanks!

Aug 2 2024, 4:31 PM · wikitech.wikimedia.org, Trust-and-Safety, cloud-services-team
TJones updated the task description for T371709: Account recovery help needed for Developer account tjones.
Aug 2 2024, 4:19 PM · wikitech.wikimedia.org, Trust-and-Safety, cloud-services-team
TJones created T371709: Account recovery help needed for Developer account tjones.
Aug 2 2024, 4:18 PM · wikitech.wikimedia.org, Trust-and-Safety, cloud-services-team

Jul 29 2024

TJones closed T59242: CirrusSearch: Problems on the Gujarati wikipedia that look like unicode normalization issues as Resolved.

This ticket seems to be about how lsearchd parses wikitext, which isn't really relevant anymore. I can't reproduce the unwanted parsing behavior with current on-wiki search, so I'm closing the ticket. If there is still a problem, please re-open with new examples, or open a new ticket.

Jul 29 2024, 4:02 PM · Discovery-Search, Gujarati-Sites, Discovery-ARCHIVED, CirrusSearch
TJones added a comment to T369632: High level plan of how to scale MoreLike.

@EBernhardson, your write-up looks good to me!

Jul 29 2024, 3:53 PM · FY2024-25 KR 3.1 Content Discovery, Web-Team-Backlog, Discovery-Search (Current work)

Jul 25 2024

TJones closed T231593: Improve Basque language processing for search as Resolved.

This got done as part of T283366: Unpack Basque, Catalan, Danish Elasticsearch Analyzers, and I even noted that Basque had an extra boost from the kind of issues this specific ticket was supposed to address.

Jul 25 2024, 8:00 PM · Discovery-Search
TJones closed T211824: Investigate a “rare-character” index as Declined.

I'm going to close this ticket because the problem is resolved for a lot of non-punctuation characters, thanks to changes to the Elastic tokenizers.

Jul 25 2024, 7:24 PM · Discovery-Search
TJones closed T95849: Search for unicode symbols like ★ is inconsistent and unpredictable as Resolved.

I'm going to close this ticket because all of the example queries now do reasonable things (or, the unreasonable parts have to do with regex searches timing out, not Unicode characters).

Jul 25 2024, 5:49 PM · Discovery-ARCHIVED, CirrusSearch

Jul 19 2024

TJones updated subscribers of T329834: Cannot search partial Javanese script titles.

This has been rolling around in my head for a while and something related came up today, so I wanted to jot down some notes to my future self, or to anyone else who may work on this.

Jul 19 2024, 3:59 PM · CirrusSearch, Discovery-Search, MediaWiki-Search

Jul 16 2024

TJones claimed T332342: Standardize ASCII-folding/ICU-folding across analyzers.
Jul 16 2024, 6:35 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)

Jul 15 2024

TJones updated the task description for T369632: High level plan of how to scale MoreLike.
Jul 15 2024, 3:25 PM · FY2024-25 KR 3.1 Content Discovery, Web-Team-Backlog, Discovery-Search (Current work)

Jul 8 2024

TJones added a comment to T368996: Entering "Palestine" on en.wp, search suggestions do not offer "State of Palestine".

Hi @34DSSDS. It is not an easy thing to manage individual search results. There are obvious knobs to turn to get the desired result here for this specific query, but it would be a big change and it would alter a lot of other suggestion results. Given that our current algorithm and configuration have been optimized to give the best overall results we could measure, dramatically changing it for one query will almost certainly lead to worse results in general.

Jul 8 2024, 7:11 PM · Discovery-Search, CirrusSearch
TJones edited projects for T332342: Standardize ASCII-folding/ICU-folding across analyzers, added: Discovery-Search (Current work); removed Discovery-Search.
Jul 8 2024, 3:19 PM · MW-1.43-notes (1.43.0-wmf.22; 2024-09-10), Discovery-Search (Current work)
TJones updated the task description for T219550: [EPIC] Harmonize language analysis across languages.
Jul 8 2024, 3:17 PM · MW-1.41-notes (1.41.0-wmf.20; 2023-08-01), Discovery-Search (Current work), Epic

Jul 1 2024

TJones added a comment to T368894: Cirrus search does not prioritise master pages on their subpages.

Other than @IKhitron's suggestion to add the main page to the list when suggesting the sub-page, is anything here that isn't covered by T159861: Add an is_subpage field to elasticsearch documents and use as a scoring feature, which explicitly takes the sub-page status into account? Should we merge the tickets?

Jul 1 2024, 5:21 PM · Discovery-Search, CirrusSearch

Apr 30 2024

TJones renamed T363734: Reindex all wikis to enable dotted I fix, Yiddish ligatures, maybe Arabic normalization from Reindex all wikis to enable dotted I fix, yiddish ligatures to Reindex all wikis to enable dotted I fix, Yiddish ligatures, maybe Arabic normalization.
Apr 30 2024, 9:25 PM · Discovery-Search (Current work)
TJones added a comment to T72899: Search box needs some normalization for Arabic Family languages.

I looked at all of these as best I could, and I decided on a general mapping to standard Arabic forms internally. Arabic does that to some degree, as does Persian! And for the languages without custom stemmers and stop word filters, the character used internally doesn't matter, as long as the desired words can find each other.

Apr 30 2024, 9:23 PM · MW-1.43-notes (1.43.0-wmf.4; 2024-05-07), Discovery-Search (Current work), Discovery-ARCHIVED, CirrusSearch, I18n, MediaWiki-Search
TJones moved T72899: Search box needs some normalization for Arabic Family languages from In Progress to Needs review on the Discovery-Search (Current work) board.
Apr 30 2024, 8:09 PM · MW-1.43-notes (1.43.0-wmf.4; 2024-05-07), Discovery-Search (Current work), Discovery-ARCHIVED, CirrusSearch, I18n, MediaWiki-Search
TJones updated the task description for T363734: Reindex all wikis to enable dotted I fix, Yiddish ligatures, maybe Arabic normalization.
Apr 30 2024, 4:03 PM · Discovery-Search (Current work)

Apr 29 2024

TJones changed the point value for T72899: Search box needs some normalization for Arabic Family languages from 3 to 5.
Apr 29 2024, 9:27 PM · MW-1.43-notes (1.43.0-wmf.4; 2024-05-07), Discovery-Search (Current work), Discovery-ARCHIVED, CirrusSearch, I18n, MediaWiki-Search
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Apr 29 2024, 4:46 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones added a project to T363734: Reindex all wikis to enable dotted I fix, Yiddish ligatures, maybe Arabic normalization: Discovery-Search.
Apr 29 2024, 4:43 PM · Discovery-Search (Current work)
TJones created T363734: Reindex all wikis to enable dotted I fix, Yiddish ligatures, maybe Arabic normalization.
Apr 29 2024, 4:43 PM · Discovery-Search (Current work)

Apr 24 2024

TJones claimed T72899: Search box needs some normalization for Arabic Family languages.
Apr 24 2024, 1:18 PM · MW-1.43-notes (1.43.0-wmf.4; 2024-05-07), Discovery-Search (Current work), Discovery-ARCHIVED, CirrusSearch, I18n, MediaWiki-Search
TJones moved T72899: Search box needs some normalization for Arabic Family languages from Incoming to In Progress on the Discovery-Search (Current work) board.
Apr 24 2024, 1:17 PM · MW-1.43-notes (1.43.0-wmf.4; 2024-05-07), Discovery-Search (Current work), Discovery-ARCHIVED, CirrusSearch, I18n, MediaWiki-Search
TJones moved T362501: וי (U+05D5 vav, U+05D9 yod) doesn't find ױ (U+05F1 Yiddish vav yod) from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Apr 24 2024, 1:14 PM · Discovery-Search (Current work), CirrusSearch

Apr 19 2024

TJones moved T362501: וי (U+05D5 vav, U+05D9 yod) doesn't find ױ (U+05F1 Yiddish vav yod) from In Progress to Needs review on the Discovery-Search (Current work) board.
Apr 19 2024, 9:49 PM · Discovery-Search (Current work), CirrusSearch
TJones added a comment to T362501: וי (U+05D5 vav, U+05D9 yod) doesn't find ױ (U+05F1 Yiddish vav yod).

In reading up on the ligatures, I found another ligature (yod-yod-patah ײַ) that has several variants, one using a ligature from above (double-yod + patah ײַ), one with separate characters (yod + yod + patah ייַ), and a less common variant with the patah in the middle (yod + patah + yod יַי). It looks like icu_normalizer already converts the single-character form (ײַ) to one using the double-yod ligature (ײַ).

Apr 19 2024, 8:40 PM · Discovery-Search (Current work), CirrusSearch

Apr 18 2024

TJones renamed T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping from 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping for other languages to 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping.
Apr 18 2024, 7:06 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones renamed T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping from Enable hiragana/katakana mapping for other languages to 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping for other languages.
Apr 18 2024, 7:02 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones claimed T362501: וי (U+05D5 vav, U+05D9 yod) doesn't find ױ (U+05F1 Yiddish vav yod).
Apr 18 2024, 1:59 PM · Discovery-Search (Current work), CirrusSearch
TJones moved T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping from In Progress to To Be Deployed on the Discovery-Search (Current work) board.
Apr 18 2024, 1:58 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch

Apr 17 2024

TJones added a comment to T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping.

While we had planned to expand the deployment of the hiragana-to-katakana mapping from English to most other languages (though not Japanese), testing revealed that doing the mapping pre-tokenization interfered with the new ICU tokenizer's ability to parse Japanese text (on non-Japanese wikis).

Apr 17 2024, 7:20 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch

Apr 16 2024

TJones merged T177876: Investigate changing ICU tokenization from whitelist to blacklist into T356643: Enable icu_tokenizer (almost) everywhere and update AnalysisConfigBuilder to use icu_token_repair.
Apr 16 2024, 2:45 PM · MW-1.42-notes (1.42.0-wmf.20; 2024-02-27), Discovery-Search (Current work)
TJones merged task T177876: Investigate changing ICU tokenization from whitelist to blacklist into T356643: Enable icu_tokenizer (almost) everywhere and update AnalysisConfigBuilder to use icu_token_repair.
Apr 16 2024, 2:45 PM · Discovery-Search

Apr 15 2024

TJones added a comment to T362442: The search In vector-2022 and minerva does not lead to the full destination of the redirect when searching for the exact name.

#REDIRECT [[Page#Anchor]] - going to it should still lead you to the Anchor, not to #top of that page - as is being done here.

Apr 15 2024, 6:38 PM · Web-Team-Backlog, Desktop Improvements (Vector 2022), Discovery-Search, CirrusSearch
TJones updated the task description for T362310: Implement global ratelimiting in our service mesh.
Apr 15 2024, 3:47 PM · serviceops, Patch-For-Review, Discovery-Search (Current work), CirrusSearch
TJones set the point value for T361950: Ensure that WDQS query throttling does not interfere with federation to 3.
Apr 15 2024, 3:44 PM · wmde-wikidata-tech, Discovery-Search (Current work), Wikidata
TJones updated the task description for T361950: Ensure that WDQS query throttling does not interfere with federation.
Apr 15 2024, 3:44 PM · wmde-wikidata-tech, Discovery-Search (Current work), Wikidata
TJones added a comment to T362442: The search In vector-2022 and minerva does not lead to the full destination of the redirect when searching for the exact name.

Showing the canonical page title in the suggestion was a design decision that was made for Vector 2022, though it can be confusing when the full title isn't obviously related to the redirect title. T303013 has some potential heuristics for deciding when to show the redirect info. Feel free to chime in over there if you don't feel like your use case would be covered (the specific artificial example here would be covered).

Apr 15 2024, 2:58 PM · Web-Team-Backlog, Desktop Improvements (Vector 2022), Discovery-Search, CirrusSearch
TJones merged T362442: The search In vector-2022 and minerva does not lead to the full destination of the redirect when searching for the exact name into T303013: Indicate when search results are from redirects (sometimes).
Apr 15 2024, 2:57 PM · Web-Team-Backlog, Design-System-Team, Codex, Desktop Improvements (Vector 2022)
TJones merged task T362442: The search In vector-2022 and minerva does not lead to the full destination of the redirect when searching for the exact name into T303013: Indicate when search results are from redirects (sometimes).
Apr 15 2024, 2:56 PM · Web-Team-Backlog, Desktop Improvements (Vector 2022), Discovery-Search, CirrusSearch
TJones added a comment to T362495: Commons search for galleries and categories shows code in the results.

Is this substantially different from T331389? ("...<nowiki> output in search result descriptions")

Apr 15 2024, 2:39 PM · CirrusSearch, Discovery-Search

Apr 11 2024

TJones moved T361377: Refactor CirrusSearch AnalysisConfigBuilder Tests & Fixtures from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Apr 11 2024, 4:21 PM · MW-1.43-notes (1.43.0-wmf.1; 2024-04-16), Discovery-Search (Current work)

Apr 10 2024

TJones added a comment to T358495: Enable dotted_I_fix (almost?) everywhere.
In T358495#9705136, @NMW03 wrote:

Not sure if this task fixes that, lowercasing I and dotted I (İ) returns different lowercase letters

Apr 10 2024, 9:44 PM · Patch-For-Review, Discovery-Search (Current work)

Apr 9 2024

TJones claimed T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping.
Apr 9 2024, 8:28 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones edited projects for T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping, added: Discovery-Search (Current work); removed Discovery-Search.
Apr 9 2024, 8:27 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch

Apr 5 2024

TJones moved T361377: Refactor CirrusSearch AnalysisConfigBuilder Tests & Fixtures from In Progress to Needs review on the Discovery-Search (Current work) board.
Apr 5 2024, 8:18 PM · MW-1.43-notes (1.43.0-wmf.1; 2024-04-16), Discovery-Search (Current work)

Apr 3 2024

TJones changed the point value for T361377: Refactor CirrusSearch AnalysisConfigBuilder Tests & Fixtures from 5 to 3.
Apr 3 2024, 7:20 PM · MW-1.43-notes (1.43.0-wmf.1; 2024-04-16), Discovery-Search (Current work)
TJones moved T361377: Refactor CirrusSearch AnalysisConfigBuilder Tests & Fixtures from Incoming to In Progress on the Discovery-Search (Current work) board.
Apr 3 2024, 5:01 PM · MW-1.43-notes (1.43.0-wmf.1; 2024-04-16), Discovery-Search (Current work)
TJones claimed T361377: Refactor CirrusSearch AnalysisConfigBuilder Tests & Fixtures.
Apr 3 2024, 5:01 PM · MW-1.43-notes (1.43.0-wmf.1; 2024-04-16), Discovery-Search (Current work)
TJones moved T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping from Language Stuff to needs triage on the Discovery-Search board.
Apr 3 2024, 5:00 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones placed T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping up for grabs.
Apr 3 2024, 4:59 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones moved T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping from In Progress to Incoming on the Discovery-Search (Current work) board.
Apr 3 2024, 4:58 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones added a comment to T358495: Enable dotted_I_fix (almost?) everywhere.

Not getting automated tags for some reason, but this is included in 1.42.0-wmf.25, so it will be deployed soon.

Apr 3 2024, 4:56 PM · Patch-For-Review, Discovery-Search (Current work)
TJones claimed T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping.
Apr 3 2024, 3:33 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones edited projects for T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping, added: Discovery-Search (Current work); removed Discovery-Search, Discovery-ARCHIVED.
Apr 3 2024, 3:31 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch
TJones moved T180387: 𝖤̶𝗇̶𝖺̶𝖻̶𝗅̶𝖾̶ Disable hiragana/katakana mapping from Incoming to In Progress on the Discovery-Search (Current work) board.
Apr 3 2024, 3:31 PM · MW-1.43-notes (1.43.0-wmf.2; 2024-04-23), Discovery-Search (Current work), CirrusSearch

Mar 29 2024

TJones created T361377: Refactor CirrusSearch AnalysisConfigBuilder Tests & Fixtures.
Mar 29 2024, 4:05 PM · MW-1.43-notes (1.43.0-wmf.1; 2024-04-16), Discovery-Search (Current work)
TJones triaged T359100: Analyze results of harmonization as High priority.
Mar 29 2024, 2:47 PM · Discovery-Search (Current work)

Mar 28 2024

TJones updated the task description for T219550: [EPIC] Harmonize language analysis across languages.
Mar 28 2024, 9:44 PM · MW-1.41-notes (1.41.0-wmf.20; 2023-08-01), Discovery-Search (Current work), Epic
TJones moved T359100: Analyze results of harmonization from In Progress to Needs Reporting on the Discovery-Search (Current work) board.

Full write-up (and it's a lot!) is on MediaWiki.

Mar 28 2024, 9:33 PM · Discovery-Search (Current work)
TJones moved T358495: Enable dotted_I_fix (almost?) everywhere from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mar 28 2024, 2:24 PM · Patch-For-Review, Discovery-Search (Current work)

Mar 21 2024

TJones moved T359100: Analyze results of harmonization from Incoming to In Progress on the Discovery-Search (Current work) board.
Mar 21 2024, 12:01 AM · Discovery-Search (Current work)

Mar 19 2024

TJones added a comment to T358495: Enable dotted_I_fix (almost?) everywhere.

The full write up is on MediaWiki.

Mar 19 2024, 9:29 PM · Patch-For-Review, Discovery-Search (Current work)
TJones claimed T353377: CirrusSearchIndexTooOld.

This is done for commons and wikidata for the production clusters (eqiad and codfw) as a result of T342444. (wikidata hasn't reindexed in cloudelastic yet, but it is in the queue.)

Mar 19 2024, 4:32 PM · Discovery-Search (Current work)