User talk:Mike Peel: Difference between revisions

From Wikidata
Jump to navigation Jump to search
Content deleted Content added
Line 39: Line 39:
::GND is CC0 and has a linked data service from which the date of birth can be obtained. [[User:MrProperLawAndOrder|MrProperLawAndOrder]] ([[User talk:MrProperLawAndOrder|<span class="signature-talk">{{int:Talkpagelinktext}}</span>]]) 14:19, 24 May 2020 (UTC)
::GND is CC0 and has a linked data service from which the date of birth can be obtained. [[User:MrProperLawAndOrder|MrProperLawAndOrder]] ([[User talk:MrProperLawAndOrder|<span class="signature-talk">{{int:Talkpagelinktext}}</span>]]) 14:19, 24 May 2020 (UTC)
::: {{ping|MrProperLawAndOrder}} Could you import them at the same time as creating the items? QS should support dates, you just need to be able to get the data into the right format to put into QS. Thanks. [[User:Mike Peel|Mike Peel]] ([[User talk:Mike Peel|<span class="signature-talk">{{int:Talkpagelinktext}}</span>]]) 14:21, 24 May 2020 (UTC)
::: {{ping|MrProperLawAndOrder}} Could you import them at the same time as creating the items? QS should support dates, you just need to be able to get the data into the right format to put into QS. Thanks. [[User:Mike Peel|Mike Peel]] ([[User talk:Mike Peel|<span class="signature-talk">{{int:Talkpagelinktext}}</span>]]) 14:21, 24 May 2020 (UTC)
:::: Not so easily. I imported based on deutsche-biographie.de but would prefer to take the time information directly from GND. I am in contact with [[User:Epìdosis]] and he with [[User:Bargioni]], the latter seems to have expertise with perl scripting and reading data from website, so that seems to be one good option to obtain the information. Anyway, the newly created items all have GND and can thus soon have a VIAF. Articles created in dewiki shall get a GND locally anyway and then it is easy to connect/merge them based on the GND. I don't know how often Pi bot works on articles that already have a GND. Last but not least, matching by GND - if the GND is correct - is much safer than by name+birthdate. An article creator can probably best tell which GND is the correct one if faced with multiple items having same name+birthdate. GND DB stores information about 12 mio+ humans, WD has not even 1 mio GND humans. Very soon, your tool may work much better than before, because the number of GND humans with rich and verified data will be increased a lot. Apart from [[User:Epìdosis]], [[User:Kolja21]] is involved in the DtBio centered GND human data upgrade. [[User:MrProperLawAndOrder|MrProperLawAndOrder]] ([[User talk:MrProperLawAndOrder|<span class="signature-talk">{{int:Talkpagelinktext}}</span>]]) 22:42, 24 May 2020 (UTC)

Revision as of 22:43, 24 May 2020

Hotkey for Wikidata Infobox in edit window

Hi, Mike Peel. I thought I'd let you know that I requested a hotkey for easy insertion of the Wikidata Infobox template on Commons and it has been implemented. To me it is an improvement and I think it will be useful for others as well. The hotkey only shows up in Category pages, I believe, which is of course appropriate. Greetings, Eissink (talk) 09:31, 16 May 2020 (UTC).[reply]

Matching existing wikidata objects with unconnected articles

Hello Mike,

in the last days wikidata objects for humans have been created, which currently only contain the name and the GND, see:

  • Will it be possible for Pi bot to match articles, which will be created in the future to these existing objects? Or does this now have to be done manually (as for example with all articles, which have been created in the user namespace and have been moved to the article namespace lateron or added information like categories, infoboxes, templates, ... at a later point in time, ...).
  • Which are the criteria for Pi bot to decide, if an object and an article describe the same person (name + date/year of birth + optional date/year of death?).
  • Is it possible to match objects and articles based on various IDs, like VIAF, GND, IMDb, LCCN, Transfermarkt, Weltfussball, ... which are included in the articles (and the objects)?

Thanks a lot! --M2k~dewiki (talk) 10:01, 24 May 2020 (UTC)[reply]

@M2k~dewiki: The script that creates new items for humans:
  1. Runs a search for the person's name
  2. skips cases where more than 5 matches are found
  3. excludes matches that already have a sitelink
  4. skips cases where it finds a potential match without a birth date
  5. excludes matches with a birth date is more than 3 years different
  6. adds the sitelink if the birth year matches
  7. only if there are no remaining matches, then it creates a new item
In this case, I think the script will stop at #4, since the newly created items don't have birth dates in them - those cases will have to be manually resolved.
I do have other scripts that add sitelinks based on identifiers, but they were only written for importing commons category sitelinks based on IDs on Commons. I could try repurposing these to look through identifiers in Wikipedias, but there would have to be a straightforward way to find the cases where the article doesn't have a sitelink and has an identifier (e.g., a tracking category that contains articles with identifiers that don't have Wikidata items, or have identifiers that don't match Wikidata). Thanks. Mike Peel (talk) 10:16, 24 May 2020 (UTC)[reply]
GND is CC0 and has a linked data service from which the date of birth can be obtained. MrProperLawAndOrder (talk) 14:19, 24 May 2020 (UTC)[reply]
@MrProperLawAndOrder: Could you import them at the same time as creating the items? QS should support dates, you just need to be able to get the data into the right format to put into QS. Thanks. Mike Peel (talk) 14:21, 24 May 2020 (UTC)[reply]
Not so easily. I imported based on deutsche-biographie.de but would prefer to take the time information directly from GND. I am in contact with User:Epìdosis and he with User:Bargioni, the latter seems to have expertise with perl scripting and reading data from website, so that seems to be one good option to obtain the information. Anyway, the newly created items all have GND and can thus soon have a VIAF. Articles created in dewiki shall get a GND locally anyway and then it is easy to connect/merge them based on the GND. I don't know how often Pi bot works on articles that already have a GND. Last but not least, matching by GND - if the GND is correct - is much safer than by name+birthdate. An article creator can probably best tell which GND is the correct one if faced with multiple items having same name+birthdate. GND DB stores information about 12 mio+ humans, WD has not even 1 mio GND humans. Very soon, your tool may work much better than before, because the number of GND humans with rich and verified data will be increased a lot. Apart from User:Epìdosis, User:Kolja21 is involved in the DtBio centered GND human data upgrade. MrProperLawAndOrder (talk) 22:42, 24 May 2020 (UTC)[reply]