Wikidata talk:WikiProject Tropical cyclones

From Wikidata
Jump to navigation Jump to search

Launch of WikiProject Wikidata for research[edit]

Hi, this is to let you know that we've launched WikiProject Wikidata for research in order to stimulate a closer interaction between Wikidata and research, both on a technical and a community level. As a first activity, we are drafting a research proposal on the matter (cf. blog post). It would be great if you would see room for interaction! Thanks, --Daniel Mietchen (talk) 01:37, 9 December 2014 (UTC)[reply]

Wikimania 2016[edit]

Only this week left for comments: Wikidata:Wikimania 2016 (Thank you for translating this message). --Tobias1984 (talk) 12:00, 25 November 2015 (UTC)[reply]

Change of properties for cyclone items[edit]

I'm working to fill infobox tropical cyclone related (in cawiki) with WD information.

  1. There is no property for "category", because when it was proposed, people rejected it assuming that can be calculated via maximum sustained winds (P2895).
    I checked how present infoboxes calculate the category and, in addition to the speed wind, could be necessary the basin (Atlantic, NW Pacific, NE Indian, etc...) and, in some of these basins, the duration and wind gust. It is, we need more than one new property to calculate just one rejected property. Even more, there is no only one categoritzation category, but one (or more) for each basin.
    In any case, even knowing the formula, the category attribution is finally certified by en:Regional Specialized Meteorological Center of each zone.
  2. The subclasses were poorly structured.
  3. The instance of (P31) of events are mainly focused in one of the generic names: tropical cyclone (Q8092), North Atlantic tropical cyclone (Q2547976), typhoon (Q140588).

To be able to have the category without a new property, I have done:

With this elements, my proposal for each event is:

  • Exchange the P31 value by the specific category + qualifier with the corresponding scale. : Ex.:
  • For those cases with two classifications because the cyclone cross from one basin to another, the P31 could be multivalue and should incorporate P518 with the basin:
  • For subtropical or extratropical cyclons, in addition to its classifications, add a P31 to indicate this complementary characteristic:
  • Add part of the series (P179) with the previous and next event of same category scale. For those cases with two classifications because the cyclone cross from one basin to another, the P179 could be multivalue. Ex:
, to avoid a P179 constrain

I'm preparing a quickstatement to make this changes gradually.

I'll appreciate any suggestions. Thanks, Amadalvarez (talk) 21:50, 15 April 2019 (UTC)[reply]

This scheme needs to be discarded almost entirely (sorry, I really honestly believe that about your classification scheme) due to numerous problems, some ontological and some related to computer science. The main problem is that it is not in Boyce–Codd normal form (Q2460153), leading to much redundancy (and constraint violations; BCNF is useful for preventing constraint violations in the first place). Other problems are:

  • part of the series (P179) should be used to indicate the season, not storms of a given category or type; part of (P361) could be used instead. The current linked list (Q7003418) data structure (Q175263) is not resilient at all and is very cumbersome for when a storm needs to be retroactively added to or removed from a category (as is expected from the Atlantic hurricane reanalysis project (Q4816630) or any scale definition change). It is also inefficient for random access. Also, linked lists require an order to be defined on the set; what order is chosen? (formation date? date the storm first reached that category? etc.).
  • located in/on physical feature (P706) should not be used here; instead, we should create a new property called "tropical cyclone basin". These basins, unlike things like "south Atlantic ocean", have very strictly defined boundaries, unlike the terrain features used. In addition, especially in the western Pacific, the specific terrain feature is often different from the main ocean (for example, the South China Sea (Q37660) as a subset of the western Pacific basin).
  • instance of (P31) emphatically should not be used to indicate storm category. This is a blatant violation of BCNF, and is essentially circumventing the consensus that the category should be computed from the peak intensity value. It should simply take on values of "tropical cyclone" or "subtropical cyclone".
  • maximum sustained winds (P2895) needs to be qualified by agency, especially outside the central/eastern Pacific and Atlantic basins where there are usually many agencies. The Regional Specialized Meteorological Center (Q1326212) value should of course take primary rank. It also needs to be qualified by duration of sustained winds; this could be a new property called "sustained wind time", and the values are time, either 1-, 3-, or 10-minute. In particular, in all basins not monitored by the Central Pacific or National Hurricane Centers, we will have both 1-minute and either 3- or 10-minute.

Enhancements we need are:

  • Only limited data on each tropical cyclone is kept currently. We can keep far more information. For each item to be more useful, we should import the entire best track into the storm's Wikidata item. This would allow nice things such as automatic computation of duration at a given category, automatic calculation of Accumulated cyclone energy (Q339687), and what not. To do this, we need to create a property called "best track entry" that will store a given entry of a best track file; each entry contains the time, location, and intensity of the storm, as well as many other data fields, and could be imported automatically (for example HURDAT). I propose the following format:
  • We need a new radius of wind property to indicate 34-knot (gale), 50-knot (storm), and 64-knot (hurricane) force wind radii to act as a qualifier to the above best track property. Its value will be a scalar, and it will be qualified by direction (P560) for which quadrant (northeast, northwest, southwest, southeast), either maximum sustained winds (P2895) or a new property to specify which wind type (gale, storm, or hurricane).
  • As an exception to BCNF, we should keep maximum sustained winds (P2895) and location (P276) as computing it from the best track claims can be cumbersome for such a frequent operation; both should be qualified by agency.
  • There should be a property "Fujiwhara interaction with" to hold information about Fujiwhara effect (Q1376614)'s between storms.
    • An "interaction result" qualifier can be added to indicate whether the storm absorbed or was absorbed by the other storm, or merely altered course.

Besides meteorological data, we should also store casualty counts (number of deaths (P1120)) and damage totals on each item. Since many storms have multiple names (today, mostly because PAGASA (Q747963) assigns additional names to storms already named by the Japan Meteorological Agency (Q860935)), we should have a tropical cyclone name property, with value not string but rather, item; since many storms have the same name, we also need an item for each name, on which we can then also indicate information as to where (which basin) the name is used, whether it has been retired, who contributed it, and the storms it was used on (each of which should be a new property). The tropical cyclone name property should be qualified by agency given the frequency of multiple-name storms.

This is just for tropical cyclone items. Season items have to also be considered. The value of image (P18) for them should be the season summary map (for example what I just added to 2015 Pacific hurricane season (Q18604546)). Most season data such as number of named storms, number of major hurricanes, and casualties can be inferred by summing the individual storms' data, so most seasons should just include follows (P155) (for previous season), followed by (P156) (for next season), and a has part(s) (Property:P527) for each storm.

I welcome comments from my fellow English Wikipedia tropical cyclone project members @Cyclonebiskit, Juliancolton, TheAustinMan, TropicalAnalystwx13, Titoxd, JasonRees: etc. to comment on this proposal.--Jasper Deng (talk) 01:28, 5 August 2019 (UTC)[reply]

I agree with the principal qualms Jasper notes above, and agree with the proposals forwarded for maximum sustained winds (P2895) and associating a "sustained wind time" property with them. Also agree with some of the proposals forwarded by Jasper. Would support importing International Best Track Archive for Climate Stewardship (Q43269466) to all instances of tropical cyclone (Q8092) with associated properties per time (see IBTrACS), but disagree with adding a "best track status" boolean property given that is hard to determine (because of Atlantic hurricane reanalysis project (Q4816630), Atlantic storms out to 1989 are technically "provisional" which is misleading). Disagree with a "Fujiwhara interaction with" property as that is often not well-established in any sort of verifiable manner. — the•austin•man (works) 02:20, 5 August 2019 (UTC)[reply]
We should drop the "applies to" for similar reasons; that doesn't seem to be a very natural or clear way to indicate basin.--Jasper Deng (talk) 04:37, 5 August 2019 (UTC)[reply]
Should unnamed tropical cyclones without popularized names but with associated Wikipedia articles be assigned those articles names or using some other designation? There are storms like 1916 Texas hurricane (Q199003) which do not have any typically used names by popular press or publications, but are so-named simply to describe them on-wiki. the•austin•man (works) 00:19, 13 August 2019 (UTC)[reply]
@TheAustinMan: We should use the Wikipedia-used names if no better alternative is available. Machines should be able to still lookup tropical cyclones by official identifies like AL052019.
In fact I think we should even create an item for each storm name. The name item can give its etymology, its pronunciation, whether it was retired, which name it replaced, and what not.--Jasper Deng (talk) 07:14, 15 September 2019 (UTC)[reply]
@Jasper Deng, TheAustinMan: Once you finish with all the definitions, new properties and changes in the current information I already uploaded, please make me a ping in order I can apply the changes in my accesses. It's a pity you appear after I made thousands of uploads of statements, but if you see clear and are willing to go ahead, I'll be waiting that you end all the work. Thanks, Amadalvarez (talk) 16:02, 2 November 2019 (UTC)[reply]

Wikidata item for each class of storm[edit]

Hello. Based on en:Tropical cyclone#Intensity classifications, I went ahead and created the missing Wikidata items for the remaining storm intensities: See here for overview. Individual storms should ideally be tagged with such (i.e. instance of). Obviously, the table can be expanded to include other scales (i.e. Taiwan), and should ideally be saved on this Wikiproject. Thoughts welcome. Cheers, Rehman 12:54, 6 June 2020 (UTC)[reply]