Truncating a UTF-8 codepoint is not fine because most software is not tested wit...

saagarjha · on July 15, 2024

Morally I view “what do I do with my truncated string” to be a separate issue from “how do I truncate the string” as described in the article. Like, yes, you absolutely should not concatenate after doing this operation. But maybe you shouldn’t be showing the user a truncated string either even if it’s all ASCII. The question of “did you make an unparseable UTF-8 string” is answered with “no” and the more complicated but also more interesting question of “did you actually want this” remains unanswered.

Levitating · on July 15, 2024

This is fair, the article takes truncating a string to fit in a status bar as an example.

actionfromafar · on July 15, 2024

Also consider Unicode is not only international characters, but superscripts and other stuff ♥ᵃ

a: there was a list somewhere over which characters hackernews allows?