WiktionaryDumps to words: Difference between revisions
m
→{{header|Phix}}: oops, just mullered the tag
m (→{{header|Phix}}: fixed syntax colouring hiccup ("block" as a keyword, which it isn't)) |
m (→{{header|Phix}}: oops, just mullered the tag) |
||
Line 486:
Does not rely on wget/bzcat etc. Downloads in 16K or so blocks, unpacks one block at a time in memory, terminates properly when 5 or more words are found.<br>
Tested on Windows, should be fine on Linux as long as you can provide a suitable bz2.so
<!--<lang Phix>(
<span style="color: #008080;">constant</span> <span style="color: #000000;">url</span> <span style="color: #0000FF;">=</span> <span style="color: #008000;">"https://dumps.wikimedia.org/enwiktionary/latest/enwiktionary-latest-pages-articles.xml.bz2"</span>
|