User talk:Magnus Manske

Jump to navigation Jump to search

About this board

Previous discussion was archived at User talk:Magnus Manske/Archive 9 on 2024-01-01.

More problems with duplicate authors

3
ArthurPSmith (talkcontribs)

Hi - I've just been spending many hours dealing with another few hundred duplicate-authors actions; this tool is *really* not ready for prime time. Latest example: https://www.wikidata.org/w/index.php?title=Q4757048&diff=2133123022&oldid=2118661421 - how can you merge "Andrew G. White" and "Alex White"? There have been far too many similar merges like this that I've had to revert - and I'm concerned that somebody's going to go in again and re-merge them if the tool is recommending this. Also I'm finding many cases of incorrect merges where the issue is likely that the wrong person was assigned to a paper by the other author in Wikidata, so your tool thinks they're the same person because of that. I had a case yesterday where a political scientist with the same name as a biochemist was set as the author of some biochemistry papers, and your tool then was used to merge them. What actually would have been useful in a case like that was to point out that those biochem papers had been likely assigned to the wrong person as author. Can you adjust your tool to do that? In any case until some significant fixes are in place this needs to be turned off ASAP.

ArthurPSmith (talkcontribs)

By the way part of the problem (including the political scientist one) is that ORCID sometimes has these bad author assignments - some of the data in ORCID comes from services like Scopus that can be mistaken on things like this.

ArthurPSmith (talkcontribs)

Another common issue I'm seeing is two different authors with the same family name and same first initial co-authoring a paper. This often happens with husband-wife teams; also other family relationships can result in paper co-authoring, and even just being from the same region of the world area two individuals may be likely to share a surname. Your duplicate authors game will always try to merge these cases, because of course being co-authors on a paper means they are both co-authors with the other authors on that paper. Instead I think it should catch cases like this as a sign that they *are not* the same person. Having both wikidata id's as co-authors on the same work should be an indicator that they are distinct, not the same. I know there are exceptions where real duplicates exist, but at the least it should be handled differently from other cases where they are never co-authors.

Reply to "More problems with duplicate authors"

Mix'n'match [1.0] inserts ISNI in spaced format

15
CV213 (talkcontribs)
CV213 (talkcontribs)
CV213 (talkcontribs)
Anvilaquarius (talkcontribs)

Yes, it came from Mix-n-match. I try to correct them, but maybe I forget it sometimes. --Anvilaquarius (talk) 10:27, 20 February 2024 (UTC)

CV213 (talkcontribs)

Thank you. @Magnus Manske: It would be helpful if you could fix the problem at the source, by removing the spaces in any P213 stored there.

Magnus Manske (talkcontribs)

I have cleaned up non-standard ISNIs in Mix'n'match, but I don't have a good mechanism to prevent them from being added for new entries

CV213 (talkcontribs)

By new entries, do you mean new entries in Mix'n'match, created after the clean-up?

Magnus Manske (talkcontribs)

yes.

CV213 (talkcontribs)
CV213 (talkcontribs)
CV213 (talkcontribs)
CV213 (talkcontribs)
Epìdosis (talkcontribs)

ISNI is stored without spaces in identificativo SBN di un autore (P396). But as of now it seems that the catalogue of MnM has both the forms with and without spaces in the auxiliary data; the first should be removed.

CV213 (talkcontribs)

@Magnus Manske: can you have a look why 2024-02-24 and 2024-03-01 new items created contained "spaced ISNI and non-spaced ISNI"?

Epìdosis (talkcontribs)
Reply to "Mix'n'match [1.0] inserts ISNI in spaced format"

Nonsense data said to be imported from World Aquatics database

1
Mormegil (talkcontribs)

Hi, your bot imports nonsensical data about swimmers. Could you make it stop doing that (and ideally revert the wrong additions)?

Reply to "Nonsense data said to be imported from World Aquatics database"

New Archnet Mix'n'match catalogs with 0 entries for unknown reason

3
Marsupium (talkcontribs)

Hello, I have imported mixnmatch:6299 and mixnmatch:6300 for Archnet site ID (P7323) and Archnet authority IDs (which don't have a property yet) using the import form. Both ended up having 0 IDs though the example entries in the preview looked fine. Do you have an idea what might have gone wrong? Thanks a lot in advance for any help or recommendations!

Magnus Manske (talkcontribs)

I manually ran an update, should be all there now.

Marsupium (talkcontribs)

Vielen tausend Dank!

Reply to "New Archnet Mix'n'match catalogs with 0 entries for unknown reason"
Folengo (talkcontribs)

Listeria bot is currently not working after working very well for a week. Thanks.

Folengo (talkcontribs)

Listeria Bot is currently not updating since yesterday morning. Thanks.

Folengo (talkcontribs)

Listeria Bot is currently not working, Thanks.

160.78.149.61 (talkcontribs)

The bot is not working again. Thanks.

Folengo (talkcontribs)

Sorry, that's me above.

Folengo (talkcontribs)

The bot is currently not working. Thanks.

Folengo (talkcontribs)

The bot is currently not working. Thanks.

Folengo (talkcontribs)

LIsteria bot is not working again, since two days ago. Thanks,

Folengo (talkcontribs)

Listeria bot is currently not working. Thanks.

Reply to "Listeria Bot not working"

IDs for human on item about literary work

1
ISNIplus (talkcontribs)
Reply to "IDs for human on item about literary work"

Import of Authority Control data via https://ac2wd.toolforge.org

3
Kolja21 (talkcontribs)

Hallo Magnus, das Tools importiert weiterhin veraltete und fehlerhafte GNDs, selbst wenn diese bereits im Datenobjekt vermerkt sind. Beispiel: dein Editv vom 3. April. Ein nicht individualisierter Datensatz, der mit missbilligter Rang gekennzeichnet ist.

Magnus Manske (talkcontribs)

Ich glaube, ich habe das jetzt repariert. Kann es leider nicht an Deinem Beispiel ausprobieren, da hat wohl GND oder VIAF aufgeräumt?

Kolja21 (talkcontribs)

Danke für die schnelle Rückmeldung. Die nicht individualisierten Datensätze (Tn) sind seit Juli 2020 nicht mehr Bestandteil der GND und wurden bei VIAF gelöscht. Laut Versionsgeschichte des VIAF-Clusters 296429616 wurde der Tn dort bereits 2017 gestrichen und damals vermutlich zu einem anderen Cluster verschoben. Mein Eindruck ist, dass sich das Tool an dem Datenobjekt orientiert, wo der Tn als GND mit missbilligter Rang eingetragen war. Aus der ungültigen GND wurde die neue (identische) GND abgeleitet. Das gleiche passiert mit ehemaligen GNDs, die als Weiterleitungen im PICA-Feld 007N vermerkt sind. Ein Beispiel zu Testzwecken: PICA-Feld 007N in GND 118541579 für Günter Grass: https://d-nb.info/gnd/1020430370. Quelle: OGND.

Reply to "Import of Authority Control data via https://ac2wd.toolforge.org"

Please respond re duplicate authors - project chat

2
ArthurPSmith (talkcontribs)
ArthurPSmith (talkcontribs)

thanks

Paging or limit param on MnM jobs page

2
Summary by Solidest

Thanks!

Solidest (talkcontribs)

Hi! Could you please implement paging or limit=500 on https://mix-n-match.toolforge.org/#/jobs ? The queue is now full of errors ( perhaps they should also be moved below TODO), and it was also stuck for a week until today. I added several autoscrape jobs during the week, and now it's impossible to find these catalogues.

Magnus Manske (talkcontribs)

Pagination added.

Folengo (talkcontribs)
Magnus Manske (talkcontribs)

I know, working on it

95.244.120.237 (talkcontribs)

It briefly worked on the 15th, now not working again.


Thanks.

Folengo (talkcontribs)

Hello,


The bot has not been working for almost three days.


Thanks again.

Magnus Manske (talkcontribs)

Working on it, might take a few days.

Folengo (talkcontribs)

It has been working fine for two days, now it is not working again. Thanks.

Folengo (talkcontribs)

The bot is not working today. Thanks.

Solidest (talkcontribs)
Folengo (talkcontribs)

The bot is not working again. Thanks.

Folengo (talkcontribs)

The bot is not working again. Thanks.