Thread:User talk:Yair rand/Translations from an Xml dump/reply (2)

Hi, we are still developing the extraction of RDF from the Wiktionaries. We made some progress, but it is a difficult process. Our idea is to use templates (like simplified regexes) for scraping data out of the Wiktionaries. The templates can be configured for each language according to Entry_layout_explained. It will need one more month (or two) though until we are finished and then some more testing is needed. But it will be able to cover most languages, not just English... After that we can start working on the sense ids again.SebastianHellmann 18:20, 23 March 2011 (UTC)