Talk:URL decoding: Difference between revisions
::::Ok added it as a test case. I know it breaks the Awk code. I left a note saying where to find working gawk code, but it lists every potential UTF-8 character so it's large (and given the possibilities not even complete). I suspect other languages could have similar problems. -- [[User:3havj7t3nps8z8wij3g9|3havj7t3nps8z8wij3g9]] ([[User talk:3havj7t3nps8z8wij3g9|talk]]) 00:47, 27 May 2015 (UTC)
:::::I had no serious problem with the existing awk implementation on your new example. I did have two minor issues I needed to deal with:
:::::# The URL being decoded is hardcoded into the example. I dealt with this by replacing the hardcoded URL. A more general solution might read the URL from stdin.
:::::# I was using LC_ALL=C, which prevented display of the text as UTF-8. I dealt with this by unsetting that environment variable. (LC_CTYPE and LANG might have similar effects, but I was not using them.)
:::::I suspect that if you were encountering issues, they might be similar. --[[User:Rdm|Rdm]] ([[User talk:Rdm|talk]]) 04:31, 27 May 2015 (UTC)
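:::::To illustrate the two points above, here is a minimal sketch in Python rather than awk. It is not the code from the task page. The names <code>decode_url</code> and <code>decode_stream</code> are hypothetical, and the URL used in the usage note is a made-up example, not the test case discussed here. The sketch assumes that percent-decoding should yield raw bytes, and that whether those bytes display as UTF-8 is a separate question decided by the terminal and locale settings.

```python
import sys
from urllib.parse import unquote_to_bytes


def decode_url(url: str) -> bytes:
    """Percent-decode a URL into raw bytes, assuming nothing about the charset."""
    return unquote_to_bytes(url)


def decode_stream(inp=sys.stdin, out=sys.stdout.buffer) -> None:
    """Decode one URL per input line instead of hardcoding the URL."""
    for line in inp:
        # Write the decoded bytes unchanged; how they are rendered
        # (UTF-8 text or something else) is up to the terminal and locale.
        out.write(decode_url(line.strip()) + b"\n")

# To use this as a filter, call decode_stream() and pipe URLs in on stdin.
```

:::::For the made-up input <code>http://example.com/caf%C3%A9</code>, the decoded bytes end in <code>0xC3 0xA9</code>. Read as UTF-8 they form "café"; read one byte per character, as a Latin-1 display would, they show up as "cafÃ©". The decoding is the same in both cases, only the display differs.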