Global-search is showing duplicate results
Closed, ResolvedPublic

Description

Sometime in the past week (noticed today) the global-search tool is showing multiple duplicate results. Have we got two underlying databases in play here?

https://global-search.toolforge.org/?regex=1&ignorecase=1&q=%22mairie.biz%22

nl.wikipedia Brugheas "population sans doubles comptes" --> ==Externe links== * [https://www.mairie.biz/mairie-brugheas-03700.html Informatie over Brugheas] * {{Link INSEE|id=03044}}
nl.wikipedia Brugheas "population sans doubles comptes" --> ==Externe links== * [https://www.mairie.biz/mairie-brugheas-03700.html Informatie over Brugheas] * {{Link INSEE|id=03044}}
uk.wikipedia Енгем [http://www.mairie.biz/mairie-inghem-62129.html Мерія муніципалітету Енгем] {{Webarchive|url=https://web.archive.org/web/20101229080510/http://mairie.biz/mairie-inghem-62129
uk.wikipedia Енгем [http://www.mairie.biz/mairie-inghem-62129.html Мерія муніципалітету Енгем] {{Webarchive|url=https://web.archive.org/web/20101229080510/http://mairie.biz/mairie-inghem-62129

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript
bking claimed this task.
bking subscribed.

This has been fixed.

Cloudelastic has slightly different settings from our production hosts with regards to cross-cluster searches. For the curious, here's the curl command I used.

I'm closing this out, but please feel free to re-open if you're getting duplicate results. We're working on improved monitoring for this situation, see T358802 for more details.

We are in the process of deploying a new updater for CirrusSearch, with cloudelastic as the first destination cluster. Duplicates could be a result of that, and are good to report so we can get everything working great before moving on to the primary search clusters.

Unfortunately the two pages you pointed out are no longer showing duplicates, so it's a bit hard to track down why it was showing up. If you see any more please report them.