Talk:XML/Input: Difference between revisions

→‎Interpreting XML?: Task description needs to be updated.
(→‎Interpreting XML?: Handling HTML numeric codes should be a separate task.)
(→‎Interpreting XML?: Task description needs to be updated.)
Line 8:
:::I added a <del>entity</del> numeric character reference, since XML processors in general need to be able to handle &amp; and the full character set. --[[User:Kevin Reid|Kevin Reid]] 00:44, 2 June 2009 (UTC)
::::Are you suggesting that the program should convert HTML entities and numeric references into some character encoding? I think that should be a separate task. And, AFAIK, it is HTML specific, not XML. --[[User:PauliKL|PauliKL]] 09:03, 2 June 2009 (UTC)
 
::Donal, the problem is that AWK implementation does not interpret the structure at all. It is quite possible to do some parsing even if there are no ready-made library routines for that. But that does not mean that we should implement a full XML parser. The task should be kept relatively simple.
::I notice that the XML input file has now been changed. But the the task description needs to be changed, too. --[[User:PauliKL|PauliKL]] 09:14, 2 June 2009 (UTC)
Anonymous user