User talk:Jberkel/lists/wanted/20220320/mul

Simplification
The list contains virtually no false positives among the names of species (or of subspecies, varieties, forms, etc.). The one-part names, by contrast, were overwhelmingly false positives. It might be possible to make a useful, though probably short, list of genera by searching for redlinked strings of the form [A-Z][a-z]+. DCDuring (talk) 15:31, 6 April 2022 (UTC)
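
A rough sketch of the filter suggested above, in Python. The function name and the sample strings are made up for illustration, and it assumes the redlinked strings are already available as a plain list; it is not how the wanted list is actually generated.

```python
import re

# One capitalized word made only of ASCII letters, i.e. [A-Z][a-z]+
GENUS_LIKE = re.compile(r"[A-Z][a-z]+")

def genus_candidates(redlinks):
    """Return the sorted, de-duplicated strings that look like one-part genus names."""
    # fullmatch() requires the whole string to fit the pattern, so
    # multi-word names and all-caps strings are rejected.
    return sorted({s for s in redlinks if GENUS_LIKE.fullmatch(s)})

if __name__ == "__main__":
    sample = ["Quercus", "Quercus alba", "quercus", "Homo", "Quercus", "DNA"]
    print(genus_candidates(sample))
```

The pattern will still let through ordinary capitalized words, so the output would need checking against a taxonomic source before being treated as a list of genera.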

Spelling etc. check
It would be useful to compare various classes of Wiktionary lists with the names in the Catalogue of Life ("COL") (genus and subgeneric names only). For one thing, we would almost certainly find a lot of misspellings, as well as obsolete taxonomic names, many of which are probably not appropriately labelled. We would probably also find newer names that have not been accepted.

The lists I would compare with COL are:
 * 1) All of our taxonomic entries (for genera and lower ranks)
 * 2) All of the names enclosed in  (for genera and lower ranks)
 * 3) All of the items in any list of probable taxonomic names like this wanted list (for genera and lower ranks).

The first is the most important because those names are what normal users encounter. The other lists will necessarily first be seen by a contributor.

I would hope to include onesies, which are highly likely to include many more misspellings and entries that need correction.

It would be useful to have separate runs for items that have COL status "Accepted" and "Synonym", but also "Ambiguous synonym", "Provisionally Accepted", and "Misapplied". The first two are the most numerous and probably the most important.
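
To illustrate what separate runs per status could look like, here is a minimal Python sketch. It assumes the COL data has already been reduced to a mapping from name to status string; that mapping, the function name, and the "Not in COL" label are all hypothetical and only for illustration.

```python
from collections import defaultdict

def bucket_by_status(names, col_status):
    """Group Wiktionary names by their COL status.

    col_status is assumed to map a name to a status string such as
    "Accepted" or "Synonym". Names missing from the mapping go into a
    separate "Not in COL" bucket (a label invented for this sketch).
    """
    buckets = defaultdict(list)
    for name in names:
        buckets[col_status.get(name, "Not in COL")].append(name)
    return dict(buckets)

if __name__ == "__main__":
    # Toy data, not real COL records.
    statuses = {"Aus bus": "Accepted", "Aus cus": "Synonym"}
    for status, group in bucket_by_status(["Aus bus", "Aus cus", "Aus dus"], statuses).items():
        print(status, group)
```

Each bucket could then be written out as its own list, so that the "Accepted" and "Synonym" runs can be reviewed separately from the rarer statuses.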

One important thing is to generate lists that are actionable and not overwhelming. For example, outright misspellings should be relatively few and not too hard to correct.
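
One possible way to keep the misspelling list short is to report only names that are absent from COL but very close to a name that is present. The sketch below uses Python's standard difflib for this; the function name, the cutoff value, and the sample names are assumptions for illustration, and a real run over the full COL name list would probably need something faster.

```python
import difflib

def likely_misspellings(names, col_names, cutoff=0.85):
    """Map each name not found in COL to its closest COL name, if one is close enough."""
    known = set(col_names)
    candidates = sorted(known)
    suggestions = {}
    for name in names:
        if name in known:
            continue  # exact match, nothing to fix
        # get_close_matches returns at most n candidates whose similarity
        # ratio is at least cutoff, best match first.
        close = difflib.get_close_matches(name, candidates, n=1, cutoff=cutoff)
        if close:
            suggestions[name] = close[0]
    return suggestions

if __name__ == "__main__":
    print(likely_misspellings(["Quercus", "Qercus", "Xyzzy"], ["Quercus", "Homo"]))
```

Names with no close match are left out on purpose, since those are more likely to be obsolete or unaccepted names than simple typos.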

Let me know your thoughts on this. DCDuring (talk) 15:31, 6 April 2022 (UTC)