Emoji are now the most common non-BMP characters by far. 😂, otherwise known as U+1F602 FACE WITH TEARS OF JOY, is the most common one on Twitter’s public stream. It occurs more frequently than the tilde!
More Related Contents:
- What’s the complete range for Chinese characters in Unicode?
- JavaScript strings outside of the BMP
- Java charAt used with characters that have two code units
- Why is Unicode restricted to 0x10FFFF?
- What’s the difference between UTF-8 and UTF-8 without BOM?
- What is the difference between UTF-8 and Unicode?
- Using awk to remove the Byte-order mark
- How can I convert surrogate pairs to normal string in Python?
- What is the proper way to URL encode Unicode characters?
- UTF-8, UTF-16, and UTF-32
- Manually converting unicode codepoints into UTF-8 and UTF-16
- UTF-8 file output in R
- Really Good, Bad UTF-8 example test data [closed]
- How does UTF-8 “variable-width encoding” work?
- How many characters can be mapped with Unicode?
- Unicode Identifiers and Source Code in C++11?
- Should I use accented characters in URLs?
- In Unicode, why are there two representations for the Arabic digits?
- How to use unicode in Android resource?
- How can I get the Unicode code point(s) of a Character?
- python 3.0, how to make print() output unicode?
- iconv: Converting from Windows ANSI to UTF-8 with BOM
- complete, monospaced Unicode font? [closed]
- Is there a Unicode glyph that looks like a “key” icon? [closed]
- What does u’\ufe0f’ in an emoji mean? Is it the same if I delete it?
- How to determine if a character is a Chinese character
- Chinese unicode fonts in PyGame
- How can I use Unicode characters on the Windows command line?
- Input unicode string with pyautogui
- unicode().decode(‘utf-8’, ‘ignore’) raising UnicodeEncodeError