Rosetta Code:Village Pump/WiktionaryDumps: Difference between revisions

From Rosetta Code
Content added Content deleted
(added Vptopic wiki code)
(ideas in sub-thread)
Line 8: Line 8:
The complete dump is a big bz2'ed XML file of about 800MB so it's quite interesting to see how to handle it!<br>
The complete dump is a big bz2'ed XML file of about 800MB so it's quite interesting to see how to handle it!<br>
but I lack imagination right now to find an interesting task, so I post here in hope to get more inspired people :)<br>
but I lack imagination right now to find an interesting task, so I post here in hope to get more inspired people :)<br>
[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 17:44, 30 April 2019 (UTC)
Maybe a simple translation program, create a database with English and French words for example and a utility to get translations from a given word.<br>
:Maybe a simple translation program, create a database with English and French words for example and a utility to get translations from a given word.<br>
[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]])
:[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 17:44, 30 April 2019 (UTC)

:Maybe recreate the kind of files like "/usr/share/dict/french" or [https://users.cs.duke.edu/~ola/ap/linuxwords linuxwords]
:[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 17:44, 30 April 2019 (UTC)

Revision as of 17:45, 30 April 2019

WiktionaryDumps
This is a particular discussion thread among many which consider Rosetta Code.

Summary

Trying to do something with Wiktionary dumps

Discussion

I think it would be nice to do some tasks with the wiktionary dumps:
https://dumps.wikimedia.org/enwiktionary/20190420/
The complete dump is a big bz2'ed XML file of about 800MB so it's quite interesting to see how to handle it!
but I lack imagination right now to find an interesting task, so I post here in hope to get more inspired people :)
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)

Maybe a simple translation program, create a database with English and French words for example and a utility to get translations from a given word.
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)
Maybe recreate the kind of files like "/usr/share/dict/french" or linuxwords
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)