Wikidata:WikiFactMine/New dictionary walkthrough

From Wikidata
Jump to navigation Jump to search

Interested in having a dictionary added to WikiFactMine search? Please follow these steps.

  1. Specify a dictionary, i.e. determine what should be listed in it.
  2. If you need assistance with the detailed compilation work, please look the form provided.
  3. Create your dictionary via the aaraa tool.
  4. Download the JSON file to your computer, as the final step of the creation process.
  5. Mail it to tom@contentmine.org with a covering note. You will receive a reply giving

details of how to access WikiFactMine search results for your dictionary.

Support

WikiFactMine wishes to work with those supplying dictionaries, to improve our search results and the understanding we have of dictionaries. The aaraa tool allows additions to existing dictionaries. We anticipate that users will wish to review and tweak them.

Documentation

The aaraa tool is documented at Wikidata:WikiFactMine/New_dictionaries. Dictionaries can be reviewed and searched in this tool.

Technical notes

Dictionary names should be one word of lower-case letters only. Please note that there is a size limitation of 10K entries (about 1 Mb of data) for dictionaries created by this route. If this is troublesome for your particular dataset, please explain the issue to us.

Optionally, dictionaries can include Wikidata aliases as well as the standard English-language labels. Aliases are a significant search asset, and may add up to 400% to the length of a list. They are currently opt-in, and you should say if you wish WikiFactMine to include them.

Process

Dictionaries will have various processes applied to them before being put to use. In particular a standard list of “stop words” will be removed.

You will probably yourself want to follow a transparent process in creating and updating dictionaries.