User:ŠtambukBot

This will be a bot operated by User:Ivan Štambuk. It is currently under development: it is based on the DotNetWikiBot Framework, written in C# and F#, and uses the MediaWiki API for all its functions.

Its primary function is to provide completely autonomous creation of Serbo-Croatian inflected forms.
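As a rough illustration of what generating inflected forms involves, the sketch below declines a regular feminine a-stem noun by attaching case endings to the stem. It is written in Python for brevity and is not the bot's actual C#/F# code; the names `CASES`, `A_STEM_ENDINGS` and `decline_a_stem` are hypothetical. It deliberately ignores stem alternations (for example, consonant changes in the dative singular or the inserted vowel in some genitive plurals), accent, and irregular vocatives, all of which a real generator would have to handle.

```python
# Hypothetical sketch: regular feminine a-stem declension only.
CASES = [
    "nominative", "genitive", "dative", "accusative",
    "vocative", "locative", "instrumental",
]

# Endings in the same order as CASES.
A_STEM_ENDINGS = {
    "singular": ["a", "e", "i", "u", "o", "i", "om"],
    "plural": ["e", "a", "ama", "e", "e", "ama", "ama"],
}


def decline_a_stem(lemma):
    """Return {number: {case: form}} for a regular noun ending in -a."""
    if not lemma.endswith("a"):
        raise ValueError("expected a lemma ending in -a")
    stem = lemma[:-1]
    return {
        number: {case: stem + ending for case, ending in zip(CASES, endings)}
        for number, endings in A_STEM_ENDINGS.items()
    }


forms = decline_a_stem("žena")
print(forms["singular"]["instrumental"])  # ženom
print(forms["plural"]["dative"])          # ženama
```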

Its other function is to verify consistency of existing Serbo-Croatian entries and report anomalies. In particular:
 * to report usage of obsolete or generic templates
 * to verify precise mirroring of Cyrillic and Latin script entries
 * to verify inflected forms against HJP & HEL databases, as well as its own internal heuristics
 * to verify that existing inflected forms reflect inflectional tables of lemma entries and vice versa
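
The script-mirroring check in the list above can be sketched as a Latin-to-Cyrillic transliteration followed by a comparison. This is a minimal Python illustration, not the bot's own code; `latin_to_cyrillic` and `is_mirrored` are hypothetical names. It handles lowercase input only and always treats "lj", "nj" and "dž" as digraphs, which is wrong for the few words where those letters belong to separate morphemes, so a real check would need an exception list.

```python
# Hypothetical sketch: lowercase Serbo-Croatian Latin -> Cyrillic.
DIGRAPHS = {"lj": "љ", "nj": "њ", "dž": "џ"}

SINGLE = {
    "a": "а", "b": "б", "c": "ц", "č": "ч", "ć": "ћ", "d": "д",
    "đ": "ђ", "e": "е", "f": "ф", "g": "г", "h": "х", "i": "и",
    "j": "ј", "k": "к", "l": "л", "m": "м", "n": "н", "o": "о",
    "p": "п", "r": "р", "s": "с", "š": "ш", "t": "т", "u": "у",
    "v": "в", "z": "з", "ž": "ж",
}


def latin_to_cyrillic(word):
    """Transliterate a lowercase Latin-script word to Cyrillic."""
    out = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in DIGRAPHS:
            # Digraphs map to a single Cyrillic letter.
            out.append(DIGRAPHS[pair])
            i += 2
        else:
            # Unknown characters (hyphens, spaces) pass through unchanged.
            out.append(SINGLE.get(word[i], word[i]))
            i += 1
    return "".join(out)


def is_mirrored(latin_title, cyrillic_title):
    """True if the Cyrillic title is the transliteration of the Latin one."""
    return latin_to_cyrillic(latin_title) == cyrillic_title


print(latin_to_cyrillic("ljubav"))   # љубав
print(is_mirrored("žena", "жена"))   # True
```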

The bot will also be used for various trivial edits to Serbo-Croatian sections. In particular:
 * to generate morphological etymologies
 * to synchronize derived terms, related terms, and various *nyms
 * to generate missing pronunciations
 * to generate references

When run, the bot operates on the live XML dump of all the entries inside Category:Serbo-Croatian language. It can append to existing SH entries, including those with multiple or shared etymologies and multiple or shared pronunciations.
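
One possible way to enumerate the entries in that category through the MediaWiki API is the `categorymembers` list module. The Python sketch below only builds the request URL and makes no network call; the endpoint `https://en.wiktionary.org/w/api.php` and the function name `category_members_url` are assumptions for illustration, and how the bot actually obtains its dump is not described here.

```python
from urllib.parse import urlencode

# Assumed endpoint; the page does not state which wiki's API is used.
API_ENDPOINT = "https://en.wiktionary.org/w/api.php"


def category_members_url(category, continue_token=None, limit=500):
    """Build a list=categorymembers query URL for one page of results."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmlimit": str(limit),
        "format": "json",
    }
    if continue_token:
        # The API returns a cmcontinue value when more results remain.
        params["cmcontinue"] = continue_token
    return API_ENDPOINT + "?" + urlencode(params)


url = category_members_url("Category:Serbo-Croatian language")
print(url)
```

A caller would fetch this URL, read the member titles from the JSON response, and repeat with the returned `cmcontinue` value until none is given.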


 * User:ŠtambukBot/Report - analysis of anomalous SH entries requiring cleanup or attention
 * User:ŠtambukBot/Statistics - statistics of existing SH entries
 * User:ŠtambukBot/Log - detailed bot activity log
 * User:ŠtambukBot/Missing - lists of missing SH lemmata, checked against comprehensive predefined lists of 70k lemmata extracted from Vladimir Anić's Veliki Rječnik hrvatskoga jezika