User:RPI2026F1Bot/Task1
The main task of User:RPI2026F1Bot is to import information from anime and manga databases. This is a comprehensive list of what is being imported.
Bot Information[edit]
The bot's source code is located at https://github.com/PythonCoderAS/wikidata-anime-import.
The bot will scan all values of all properties in the list of scanned properties. When a value is found, the bot will create a statement for it if the property is on the list of properties the bot is allowed to create. All imported values that match existing statements will have a reference added to them, even if the bot will not create them.
Reference Spec[edit]
The bot adds these properties to the reference group created for every matched value:
Property | Value |
---|---|
retrieved (P813) | The current time in UTC when the bot went to create the reference |
stated in (P248) | The item of the database the bot imported the value from |
reference URL (P854) | The URL to the external item on the database. Should be the same value as the search formatter of the associated property. |
<property ID> | The external ID of the property it imported information from. |
To prevent duplicating references, the bot will not add a reference if there exists a reference with the same stated in (P248) value, if there is a reference with reference URL (P854) and the bot-generated reference URL is a substring of that URL, or if the external ID of the database's property is present.
Anime[edit]
Manga[edit]
Databased Imported From[edit]
Database | Property | Contributes External Links | Notes |
---|---|---|---|
MyAnimeList (Q4044680) | MyAnimeList manga ID (P4087) | Yes | |
AniList (Q86470198) | AniList manga ID (P8731) | Yes | |
MangaDex (Q110093307) | MangaDex title ID (P10589) | Yes | May be deleted soon. |
MangaUpdates (Q114730827) | MangaUpdates manga ID (P11149) | No | |
Anime-Planet (Q112180497) | Anime-Planet manga ID (P11348) | No | |
INKR Comics (Q115633593) | INKR title ID (P11315) | No |
Properties Imported[edit]
Non-external ID properties imported[edit]
Property | Creates Value | Shared Notes | Database | Notes |
---|---|---|---|---|
genre (P136) | No | Will try to simplify values wherever possible. For example, if both romance anime and manga (Q15637310) and comedy anime and manga (Q15286013) are present, it will store them as romantic comedy anime and manga (Q15712145). There is similar behavior for comedy drama anime and manga (Q15712927).
Note: The bot does not actually modify the existing statements, only the internal representation used for checking which statements to add references to. |
MyAnimeList (Q4044680) | |
No | AniList (Q86470198) | Uses the official genres rather than user-generated tags | ||
No | MangaDex (Q110093307) | |||
No | MangaUpdates (Q114730827) | |||
No | Anime-Planet (Q112180497) | |||
Yes | INKR Comics (Q115633593) | Imports are allowed as this is an official, legitimate source. | ||
intended public (P2360) | Yes | MyAnimeList (Q4044680) | ||
AniList (Q86470198) | ||||
MangaDex (Q110093307) | ||||
MangaUpdates (Q114730827) | ||||
Anime-Planet (Q112180497) | ||||
start time (P580) | No | This usually means the start of serialization for most manga. | MyAnimeList (Q4044680) | Imports precisions between year and day depending on the number of fields the API returns |
AniList (Q86470198) | Imports precisions between year and day depending on the number of fields the API returns | |||
MangaDex (Q110093307) | Imports precision of year only | |||
MangaUpdates (Q114730827) | Imports precision of year only | |||
Anime-Planet (Q112180497) | Imports precision of year only | |||
end time (P582) | No | This usually means the end of serialization for most manga. | MyAnimeList (Q4044680) | Imports precisions between year and day depending on the number of fields the API returns |
AniList (Q86470198) | Imports precisions between year and day depending on the number of fields the API returns | |||
Anime-Planet (Q112180497) | Imports precision of year only | |||
country of origin (P495) | No | AniList (Q86470198) | ||
MangaDex (Q110093307) | Inferred based on the value of language of work or name (P407) | |||
language of work or name (P407) | No | AniList (Q86470198) | Inferred based on the value of country of origin (P495) | |
MangaDex (Q110093307) | ||||
hashtag (P2572) | No | AniList (Q86470198) | ||
number of parts of this work (P2635) | Yes | Imports the number of volume (Q1238720). | MyAnimeList (Q4044680) | |
AniList (Q86470198) | ||||
MangaDex (Q110093307) | ||||
Anime-Planet (Q112180497) | ||||
title (P1476) | No | AniList (Q86470198) | Only imports titles for ja and en
|
External IDs[edit]
All external ID properties are always created.
Imported Specifically[edit]
Property | Shared Notes | Database | Notes |
---|---|---|---|
MyAnimeList manga ID (P4087) | AniList (Q86470198) | A lot of values of AniList manga ID (P8731) are equal to MyAnimeList manga ID (P4087) + 30000 since Anilist initially copied MyAnimeList's database. | |
MangaDex (Q110093307) | |||
AniList manga ID (P8731) | MangaDex (Q110093307) | ||
MangaUpdates manga ID (P11149) | MangaDex (Q110093307) | Contains both the old schema and new schema IDs. The bot will add a reference to the old schema if encountered. | |
BookWalker series ID (JP version) (P11259) | MangaDex (Q110093307) | ||
BookWalker series ID (global version) (P11363) | MangaDex (Q110093307) | ||
Anime-Planet manga ID (P11348) | MangaDex (Q110093307) | May contain redirects to the true ID | |
eBookJapan books ID (P11476) | MangaDex (Q110093307) |
Generally[edit]
Sources that have nonspecific external links (such as "related links", "link to raw", etc.) contribute towards a link pool. Then the bot uses regexes to extract IDs and sends the leftovers to described at URL (P973) (see #Adding described at URL (P973)).
Property | Regex | Notes |
---|---|---|
Niconico ID (P11176) | seiga\.nicovideo\.jp/comic/(\d+)
|
|
BookWalker series ID (JP version) (P11259) | (?:global\.)?bookwalker\.jp/(?:series|book)/(\d+)
|
|
BookWalker series ID (global version) (P11363) | ||
INKR title ID (P11315) | comics\.inkr\.com/title/(\d+)
|
|
Anime News Network manga ID (P1984) | animenewsnetwork\.com/encyclopedia/manga\.php\?id=(\d+)
|
|
Media Arts Database ID (P7886) | mediaarts-db\.bunka.go\.jp/id/C(\d+)
|
|
Bangumi subject ID (P5732) | bgm\.tv/subject/(\d+)
|
|
AnimeClick manga ID (P5849) | animeclick\.it/manga/(\d+)
|
|
eBookJapan books ID (P11476) | ebookjapan\.yahoo\.co\.jp/books/(\d+)
|
Other[edit]
Archive URLs[edit]
If the bot ever runs into a URL for a property that is an archive.org URL, it will extract the information, update the target URL to the real URL, and set qualifiers archive URL (P1065), archive date (P2960), and reason for deprecated rank (P2241)link rot (Q1193907) as well as set the rank of the statement to deprecated.
Adding described at URL (P973)[edit]
A lot of anime and manga databases contain a collection of external links. Some of these links have properties associated with them, but many don't. Links that have no property associated with them will be added under the property described at URL (P973) if the property is whitelisted.
URL Blacklist[edit]
The bot will not add any URLs that match these domains/regex under described at URL (P973):
URL/Domain/Regex | Reason |
---|---|
twitter.com | Social Media Site, not applicable for an anime or manga |
youtube.com | Social Media Site, not applicable for an anime or manga |
instagram.com | Social Media Site, not applicable for an anime or manga |
pixiv.com | Social Media Site, not applicable for an anime or manga |
pixiv.net | Social Media Site, not applicable for an anime or manga |