Talk:WiktionaryDumps to words: Difference between revisions
Content added Content deleted
(no need to download it all) |
|||
Line 12: | Line 12: | ||
::: I too have some questions. |
::: I too have some questions. |
||
::# What does wiktionary have to do with the task? Would any XML encoded word list do? If so, why does the task name include wiktionary? |
::# What does wiktionary have to do with the task? Would any XML encoded word list do? If so, why does the task name include wiktionary? |
||
:::: Because I found it interesting to do something with the wiktionary, as I explained on the [https://rosettacode.org/wiki/Rosetta_Code:Village_Pump/WiktionaryDumps Village Pump page]. - [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 09:18, 10 December 2020 (UTC) |
|||
::# Is the task supposed to show how to download and extract a large file in your particular language? The reference implementation just shells out and uses other tools. |
::# Is the task supposed to show how to download and extract a large file in your particular language? The reference implementation just shells out and uses other tools. |
||
::: The task is still a draft, if you think the download and uncompressed parts should be in the language, we can update the task. (and I will updated the ocaml too.) - [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 09:18, 10 December 2020 (UTC) |
|||
::# If the task is just extract a certain group of entries from an XML file, how does it differ significantly from [[XML/XPath]]? |
::# If the task is just extract a certain group of entries from an XML file, how does it differ significantly from [[XML/XPath]]? |
||
::: --[[User:Thundergnat|Thundergnat]] ([[User talk:Thundergnat|talk]]) 21:57, 9 December 2020 (UTC) |
::: --[[User:Thundergnat|Thundergnat]] ([[User talk:Thundergnat|talk]]) 21:57, 9 December 2020 (UTC) |
||
:::: Because we can not use the DOM method to parse 800MB of XML, we need to use the SAX method then. [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 09:18, 10 December 2020 (UTC) |
Revision as of 09:18, 10 December 2020
Too vague
"Demonstrate how your language can handle this dump"? How?
You need to write a task where all examples are doing one shared thing that is comparable as a feature of those languages implementation of the task. If you mean to highlight one type of XML handling over another then this doesn't do it, for example. --Paddy3118 (talk) 10:00, 9 December 2020 (UTC)
- The task, as explained, is to create a file equivalent than "/usr/share/dict/french" (output), using the wiktionary dump as input. Blue Prawn (talk) 19:27, 9 December 2020 (UTC)
- I have no desire to download an 800 megabyte compressed file for a Rosetta Code task that is who-knows-how-large uncompressed. Surely the task doesn't need to use a file that large. --Chunes (talk) 20:41, 9 December 2020 (UTC)
- You don't need to do so. Please see the OCaml example that only donwloads the first 1 or 2 megas. Blue Prawn (talk) 09:13, 10 December 2020 (UTC)
- I have no desire to download an 800 megabyte compressed file for a Rosetta Code task that is who-knows-how-large uncompressed. Surely the task doesn't need to use a file that large. --Chunes (talk) 20:41, 9 December 2020 (UTC)
- I too have some questions.
- What does wiktionary have to do with the task? Would any XML encoded word list do? If so, why does the task name include wiktionary?
- Because I found it interesting to do something with the wiktionary, as I explained on the Village Pump page. - Blue Prawn (talk) 09:18, 10 December 2020 (UTC)
- Is the task supposed to show how to download and extract a large file in your particular language? The reference implementation just shells out and uses other tools.
- The task is still a draft, if you think the download and uncompressed parts should be in the language, we can update the task. (and I will updated the ocaml too.) - Blue Prawn (talk) 09:18, 10 December 2020 (UTC)
- If the task is just extract a certain group of entries from an XML file, how does it differ significantly from XML/XPath?
- --Thundergnat (talk) 21:57, 9 December 2020 (UTC)
- Because we can not use the DOM method to parse 800MB of XML, we need to use the SAX method then. Blue Prawn (talk) 09:18, 10 December 2020 (UTC)