Property talk:P1065

From Wikidata
Jump to navigation Jump to search

Documentation

archive URL
URL to the archived web page specified with URL property
DescriptionThe URL in the same reference but archived on Archive.org or some other archive service (webcite.org, archive.is).
Representsarchive URL (Q105809893)
Data typeURL
Template parameteren:Template:Cite field "archiveurl"
Domainany item having an URL property (note: this should be moved to the property statements)
Allowed values
According to this template: any link to any web page
According to statements in the property:
https?:\/\/(web\.archive\.org\/web\/[0-9if_]{14,17}|archive\.org\/(details|download)|archive\.(today|is|fo|li|ph|vn)\/[0-9]{4}[.-]?[0-9]{2}[.-]?[0-9]{2}[.-]?[0-9]{2}[.-]?[0-9]{2}[.-]?[0-9]{2}|webrecorder\.io|(www\.)?webcitation\.org|perma\.cc|perma\-archives\.org|([^\/]+\.)?megalodon\.jp|warp\.da\.ndl\.go\.jp|archive\.wikiwix\.com|(webarchive|yourarchives)\.nationalarchives\.gov\.uk|(pandora|trove|webarchive|content\.webarchive)\.nla\.gov\.au|webarchive\.loc\.gov|media\.digitalarkivet\.no|arquivo\.pt|(swap|sul-swap-prod)\.stanford\.edu)\/.+|https?:\/\/archive\.(today|is|fo|li|md|ph|vn)\/[a-zA-Z0-9]{4,}
When possible, data should only be stored as statements
ExampleDisturbed's Draiman on Band's Hiatus: 'It's the Right Time to Step Away' (Q67086087) → https://web.archive.org/web/20130312103925/http://www.billboard.com/articles/news/468770/disturbeds-draiman-on-bands-hiatus-its-the-right-time-to-step-away
Lizzy Yarnold (Q658596) → https://web.archive.org/web/20160107080433/http://www.thebbsa.co.uk/the-team/directory/lizzy-yarnold
Sourceany archived url on Wikipedia sources (note: this information should be moved to a property statement; use property source website for the property (P1896))
Robot and gadget jobsMaybe if a bot can crawl archive.org we can add them in mass
See alsofull work available at URL (P953), archive date (P2960), reference URL (P854), filename in archive (P7793)
Lists
Proposal discussionProposal discussion
Current uses
Total307,883
Main statement2,0950.7% of uses
Qualifier130,18642.3% of uses
Reference175,60257% of uses
Search for values
[create Create a translatable help page (preferably in English) for this property to be included here]
Scope is as qualifier (Q54828449), as reference (Q54828450): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P1065#Scope, SPARQL
Format “https?:\/\/(web\.archive\.org\/web\/[0-9if_]{14,17}|archive\.org\/(details|download)|archive\.(today|is|fo|li|ph|vn)\/[0-9]{4}[.-]?[0-9]{2}[.-]?[0-9]{2}[.-]?[0-9]{2}[.-]?[0-9]{2}[.-]?[0-9]{2}|webrecorder\.io|(www\.)?webcitation\.org|perma\.cc|perma\-archives\.org|([^\/]+\.)?megalodon\.jp|warp\.da\.ndl\.go\.jp|archive\.wikiwix\.com|(webarchive|yourarchives)\.nationalarchives\.gov\.uk|(pandora|trove|webarchive|content\.webarchive)\.nla\.gov\.au|webarchive\.loc\.gov|media\.digitalarkivet\.no|arquivo\.pt|(swap|sul-swap-prod)\.stanford\.edu)\/.+|https?:\/\/archive\.(today|is|fo|li|md|ph|vn)\/[a-zA-Z0-9]{4,}: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303). Known exceptions: Shina Dictionary (Q113649384)
List of violations of this constraint: Database reports/Constraint violations/P1065#Format, SPARQL
This property is being used by:

Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.)

Discussion label change[edit]

I am cleaning https://www.wikidata.org/wiki/Wikidata:Database_reports/Constraint_violations/P1065#.22Qualifier.22_violations and notice many uses of 'archive URL' (P1064) where the contributor intention was to provide link to a webpage with the full text of the work in question. I am adding 'full text available at' (P953) with the information but if someone has a suggestion about how make the label of P1064 state the use as a link to an archived version of another webpage more clearly I would appreciate. Carlos Porto (talk) 01:05, 16 October 2015 (UTC)[reply]

I give 'archived at' alias to full work available at URL (P953) so people that search for the 'archive' string can find that option, and I think it would be better to change 'URL archived' from alias to label of this property and keep current label 'archive URL' as an alias. Carlos Porto (talk) 02:38, 16 October 2015 (UTC)[reply]

Using archive URL under references?[edit]

The documentation/talk page of this says this can only be used as a property. I am confused if this can be used under references? For example reference URL (Property:P854) is added as a reference to a statement. Then along with it is archive URL containing the archived link. (The Talk page of Talk page of full text available at (Property:P953) says its Domain must be "creative work (intended for songs, albums, and artists)", so I doubt I will use this because the type of item I am trying to add the archive URL is a place). Sanglahi86 (talk) 08:58, 13 October 2016 (UTC)[reply]

New property for dead links[edit]

Is there a boolean property alongside this to specify that the URL is dead? Not all URLs with an archive version specified are dead. --Valerio Bozzolan (talk) 22:58, 1 December 2016 (UTC)[reply]

There was some discussion about this at Wikidata:Property proposal/dead-url. Seems too much text to figure out what is the outcome.
--- Jura 23:02, 1 December 2016 (UTC)[reply]

Restricted to qualifiers?[edit]

Please pay attention to the archive date (P2960); it is limited to Source section. d1g (talk) 10:09, 2 January 2017 (UTC)[reply]

Which version?[edit]

If I am looking to the web archive, which version should I use? Point out that some webs are updated in time, so if I ad P1065 now, and nobody will update it in the future, the archived web page might provide an only old version of the page. --Juandev (talk) 11:25, 19 September 2019 (UTC)[reply]

Change that restricts this property to a few archives[edit]

I wonder about the rationale for changing this to only be valid for a handful of archives, P1793 is added here [1] by Jc86035, this makes it fail for a lot of national archives. I would say the constraint is wrong and should be removed. Jeblad (talk) 15:31, 12 January 2020 (UTC)[reply]

@Jeblad: This is the discussion where I justified adding it. National archives are barely used compared to the Wayback Machine (which now accounts for the vast majority of archive links used across the Wikimedia projects), so I think I thought it was an acceptable compromise to use a whitelist. (I've changed it to a suggestion constraint for now.) By all means, change the constraint if you think it's worth improving it. Jc86035 (talk) 16:27, 12 January 2020 (UTC)[reply]
@Jc86035: It should not be changed, it should be removed. Create a new property for web archives if you find it useful, don't mess with an existing property. As it is now this property is pretty much useless for archives, which is not the same as a web archive. Jeblad (talk) 16:32, 12 January 2020 (UTC)[reply]
@Jeblad: Not all online archives are appropriate to link to, especially for the Wikipedias (e.g. self-hosted and self-published content). Properties and their constraints are changed all the time, and there isn't a Wikidata policy that restricts doing so (though it's possible that that should be changed). There are fewer than 200 format constraint violations right now, so the property is still at the level where these can be fixed by hand (either by updating the property constraints or by updating the items).
The national archives that I originally included in the format constraint were just the few that were the most commonly used on the English Wikipedia. The main reason I didn't add any more was that I didn't think it was necessary to get to zero constraint violations (there are still a few dozen that could be fixed trivially). I've added the Norwegian government's archive site for now, but more of them should definitely be added. Jc86035 (talk) 16:50, 12 January 2020 (UTC)[reply]
@Jc86035: This will create a nightmare to maintain, and will not work. Jeblad (talk) 17:02, 12 January 2020 (UTC)[reply]
@Jeblad: Which is probably more or less why I didn't add all of the archives in the first place (I don't remember). The filtering makes it actually possible to check issues with some of the statements, which is more or less the exact purpose of a suggestion constraint. It shouldn't prevent users from adding archives that aren't listed in the constraint. Jc86035 (talk) 17:11, 12 January 2020 (UTC)[reply]
@Jc86035: Don't add constraints that isn't actionable, that only create noise. Jeblad (talk) 17:36, 12 January 2020 (UTC)[reply]

Can someone fix the regex and constraint?[edit]

This is a valid Wayback Machine URL for an archived blog: https://web.archive.org/web/*/https://blog.numerade.com/

But it is getting a warning that it doesn't conform to the regex. Can someone revise the regex and constraint so that a URL like this is correct? I think the issue is the asterisk in the URL? UWashPrincipalCataloger (talk) 01:34, 11 November 2020 (UTC)[reply]

Guidance on usage?[edit]

Hiya. I'm adding Property:P7014 and Property:P7101 to many items, both of which have Property:P1065 as required restraints. I've tried to read up on each property and searched existing usage to understand how to use it correctly.

Is the idea that when we add a privacy policy, the person adding it will make a quick stop over at the Wayback Machine to capture it, and copy that snapshot URL into the archive URL qualifier?

Please walk me through how it works, so I may remove flags and warnings when I add these properties. ^_^

maiki (talk) 23:20, 11 January 2022 (UTC)[reply]

@Trade who added this at one of them: [2]. --- Jura 23:29, 11 January 2022 (UTC)[reply]

url-status property[edit]

I'm not sure where this discussion should go. There needs to be a Wikidata property that can be fed to en.Wikipedia's w:Template:Cite Q so that the |archive-url, |archive-date, and |url-status parameters can be read consistently and automatically from Wikidata, rather than requiring manual maintenance (possibly of a big number of pages). For the moment, |url-status needs to be added manually, and choosing to add it only to references that already have archives risks creating a lot of extra work on the Wikipedia side as the status of a URL evolves. Boud (talk) 15:25, 17 October 2023 (UTC)[reply]