RFC 3629 — UTF-8, a Transformation Format of ISO 10646 by node

UTF-8 encodes the Unicode character set as a sequence of bytes — every code point as one to four bytes, ASCII unchanged as single bytes, and self-synchronising. It is the dominant text encoding of the Internet.