‘Counting Characters in UTF-8 Strings Is Fast’ by Kragen Sitaker shows several ways to count the characters in a UTF-8 string, using both assembly and C. But with a few assumptions, we can go faster. Assumption One: We are dealing with a valid UTF-8 string. Making this assumption means that once we hit the start of a multi-byte character [...]
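The excerpt is cut off here, so the following is only a minimal sketch of one common way to exploit the validity assumption, not necessarily the approach the post goes on to describe. If the string is valid UTF-8, the lead byte of each character tells us how many bytes the character occupies, so the continuation bytes can be skipped without being inspected. The function name `utf8_count_skip` is hypothetical, and the code assumes a NUL-terminated string.

```c
#include <stddef.h>

/* Count the characters (code points) in a NUL-terminated string.
 * ASSUMPTION: the input is valid UTF-8. On invalid input this may
 * read past the terminating NUL, so it must not be used on
 * untrusted data. */
size_t utf8_count_skip(const char *s)
{
    const unsigned char *p = (const unsigned char *)s;
    size_t count = 0;

    while (*p != 0) {
        if (*p < 0x80) {
            p += 1;             /* 0xxxxxxx: single-byte (ASCII) */
        } else if ((*p & 0xE0) == 0xC0) {
            p += 2;             /* 110xxxxx: two-byte sequence */
        } else if ((*p & 0xF0) == 0xE0) {
            p += 3;             /* 1110xxxx: three-byte sequence */
        } else {
            p += 4;             /* 11110xxx: four-byte sequence */
        }
        count++;
    }
    return count;
}
```

For example, `utf8_count_skip("h\xC3\xA9llo")` counts five characters in a six-byte string, because the two bytes of `é` are consumed in a single step.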
