How fast can you validate UTF-8 strings in JavaScript?
Skip to content
When you recover textual content from the disk or from the network, you may expect it to be a Unicode string in UTF-8. It is the most common format. Unfortunately, not all sequences of bytes are valid UTF-8 and accepting invalid UTF-8 without validating it is a security risk.
How might you validate a UTF-8 string in a JavaScript runtime?
You might use the valid-8 module:
import valid8 from "valid-8";
if(!valid8(file_content)) { console.log("not UTF-8"); }
Another recommended appr...
Read more at lemire.me