TIL Why Charset Matters
TIL Encoding Differentials: Why Charset Matters
Another, less common way to indicate the character encoding is the Byte-Order Mark. This is a specific Unicode character (
U+FEFF
) that can be placed in front of a string to indicate the byte endianness and character encoding. It is mainly used in files, but since these might be served via a web server, modern browsers support it. A Byte-Order Mark at the beginning of an HTML document even takes precedence over acharset
attribute in theContent-Type
header and the<meta>
tag.