UnicodeCharacter.com is a reference for the 154,998 characters of Unicode 16.0, the version of the standard published by the Unicode Consortium on September 10, 2024. Every character, every block, every category — documented one page at a time, with the kind of detail you need when you are deciding whether to use an en dash or a hyphen, whether your form will accept a name with a combining acute accent, or which byte sequence to feed a parser that does not yet speak UTF-8.
What this site is
This is a reference book in browser form. Each character has its own page with its codepoint in hex and decimal, its official name, its general category and bidi class, its UTF-8 and UTF-16 byte sequences, its HTML entity and CSS escape, related characters, and a few paragraphs of editorial context — origin, history, common confusions, typographic notes. Each block has a page explaining why it exists and who maintains the underlying script. Each tool does one thing well and runs entirely in your browser.
What this site is not
This is not an encyclopedia of every writing system in human history — that is what Omniglot and the Unicode Consortium's own pages are for. This is not a tutorial site that will hold your hand through making your first webpage. This is not a chatroom, a forum, a comment thread, or a place to argue about whether ZWJ sequences are emoji or something else. There are no comments, no tracking pixels, no popups, no newsletter modal, and no AI chatbot that will hallucinate the meaning of U+202E at you.
Who it is for
Designers picking a typographically correct dash. Developers debugging mojibake. Typographers comparing the Latin extended ranges. Localization engineers double-checking bidi behavior. Font designers looking up combining-mark categories. Linguists tracing a rare script's history. Students writing an essay on writing systems. And anyone who has typed a character into a Google search box and ended up on a low-effort SEO page that did not answer the question.
Editorial approach
The editorial standard here is simple: no fluff, no thin content, no padded paragraphs to game search engines. Every page is supposed to teach you something specific. If a page does not have at least one real fact you could not have guessed from the character itself, it should not exist. The site is read by humans, even when search engines are how those humans arrive — so the writing is for humans, plain and edited.
The data
Everything you see comes from the Unicode Character Database, the canonical set of files published by the Unicode Consortium for each version of the standard. The data is refreshed quarterly and after each new Unicode release; the changelog records each update. The next planned refresh is Unicode 17.0, expected in September 2025.
One person, one project
This is a single-author project. If you spot an error, have a correction, or want to suggest a character page that should exist, write to me via the contact page. Replies may take a week — I do this in the time around a day job — but I read every message and corrections to the data are folded into the next quarterly refresh.