Revision as of 17:40, 26 November 2019 (view source) Anton (talk \| contribs) (→‎{{header\|F_Sharp\|F#}}: inserted Forth section after F#) ← Older edit		Revision as of 17:49, 26 November 2019 (view source) Anton (talk \| contribs) (→‎{{header\|Forth}}) Newer edit →
Line 490: {{works with\|gforth\|0.7.9_20191121}} {{works with\|lxf\|1.6-982-823}} <lang forth>: showbytes ( c-addr u -- ) ~~: showbytes ( c-addr u -- )~~ over + swap ?do i c@ 3 .r loop ; Line 512 ⟶ 511: 𝄞 1D11E F0 9D 84 9E </pre> If you also want to see the implementation of <code>xc!+</code> and <code>xc@+</code>, here it is (<code>u8!+</code> is the UTF-8 implementation of <code>xc!+</code>, and likewise for <code>u8@+</code>): <lang forth>-77 Constant UTF-8-err $80 Constant max-single-byte : u8@+ ( u8addr -- u8addr' u ) count dup max-single-byte u< ?EXIT \ special case ASCII dup $C2 u< IF UTF-8-err throw THEN \ malformed character $7F and $40 >r BEGIN dup r@ and WHILE r@ xor 6 lshift r> 5 lshift >r >r count dup $C0 and $80 <> IF UTF-8-err throw THEN $3F and r> or REPEAT rdrop ; : u8!+ ( u u8addr -- u8addr' ) over max-single-byte u< IF tuck c! 1+ EXIT THEN \ special case ASCII >r 0 swap $3F BEGIN 2dup u> WHILE 2/ >r dup $3F and $80 or swap 6 rshift r> REPEAT $7F xor 2* or r> BEGIN over $80 u>= WHILE tuck c! 1+ REPEAT nip ; </lang> =={{header\|Go}}==

UTF-8 encode and decode: Difference between revisions

UTF-8 encode and decode (view source)