Talk:Determine if a string has all the same characters: Difference between revisions

m
(→‎What is a character?: Added a comment.)
Line 69:
 
::In the case of Go, a character (or 'rune' as we prefer to call it) is simply a unicode code point expressed as a 4 byte integer. String literals are encoded as UTF-8 and are not normalized by default (though there is a supplemental package which can do this). Consequently, an accented character is not the same as the corresponding unaccented character plus the accent. Also, unlike Perl 6, there appears to be no easy way to deal with emoji ZWJ sequences at the present time. I've therefore had to be careful in the Go examples to only use emojis which are complete in themselves. --[[User:PureFox|PureFox]] ([[User talk:PureFox|talk]]) 17:11, 31 October 2019 (UTC)
:::Ok, I'm fine with that. It means that different program will give different results for the same input, but it seems to be the consensus, and we are not going to reimplement ICU, not to dumb down languages which are able to deal with Unicode. By the way, the langages I use (Python, R, Stata mostly) don't normalize either by default. [[User:Eoraptor|Eoraptor]] ([[User talk:Eoraptor|talk]]) 18:23, 31 October 2019 (UTC)
1,336

edits