WiktionaryDumps to words: Difference between revisions
Content added Content deleted
m (→{{header|Phix}}: fixed syntax colouring hiccup ("block" as a keyword, which it isn't)) |
m (→{{header|Phix}}: oops, just mullered the tag) |
||
Line 486: | Line 486: | ||
Does not rely on wget/bzcat etc. Downloads in 16K or so blocks, unpacks one block at a time in memory, terminates properly when 5 or more words are found.<br> |
Does not rely on wget/bzcat etc. Downloads in 16K or so blocks, unpacks one block at a time in memory, terminates properly when 5 or more words are found.<br> |
||
Tested on Windows, should be fine on Linux as long as you can provide a suitable bz2.so |
Tested on Windows, should be fine on Linux as long as you can provide a suitable bz2.so |
||
<!--<lang Phix>( |
<!--<lang Phix>(notonline)--> |
||
<span style="color: #008080;">constant</span> <span style="color: #000000;">url</span> <span style="color: #0000FF;">=</span> <span style="color: #008000;">"https://dumps.wikimedia.org/enwiktionary/latest/enwiktionary-latest-pages-articles.xml.bz2"</span> |
<span style="color: #008080;">constant</span> <span style="color: #000000;">url</span> <span style="color: #0000FF;">=</span> <span style="color: #008000;">"https://dumps.wikimedia.org/enwiktionary/latest/enwiktionary-latest-pages-articles.xml.bz2"</span> |
||