What Unicode sounds like
lame -r -m m -s 16 –bitwidth 8 ~/Downloads/UnicodeData-5.2.0d2.txt unicodedata-520d2txt
lame -r -m m -s 16 –bitwidth 8 ~/Downloads/UnicodeData-5.2.0d2.txt unicodedata-520d2txt
So, Colin Percival has posted a UTF-8 strlen which improves on my previous post. While his code runs slightly slower than mine on my PC, I assume that’s because his code is aimed at a 64-bit architecture. With 32-bits (reading 4 bytes at a time, instead of 8 ) it doesn’t quite get the same [...]