View text source at Wikipedia
The regional indicator symbols are a set of 26 alphabetic Unicode characters (AโZ) intended to be used to encode ISO 3166-1 alpha-2 two-letter country codes in a way that allows optional special treatment.
These were defined by as part of the Unicode 6.0 support for emoji, as an alternative to encoding separate characters for each country flag. Although they can be displayed as Roman letters, it is intended that implementations may choose to display them in other ways, such as by using national flags.[1][2] The Unicode FAQ indicates that this mechanism should be used and that symbols for national flags will not be directly encoded.[3]
They are encoded in the range U+1F1E6 🇦 REGIONAL INDICATOR SYMBOL LETTER A to U+1F1FF 🇿 REGIONAL INDICATOR SYMBOL LETTER Z within the Enclosed Alphanumeric Supplement block in the Supplementary Multilingual Plane.[4]
A pair of regional indicator symbols is referred to as an emoji flag sequence (although it represents a specific region, not a specific flag for that region).[6]
Out of the 676 possible pairs of regional indicator symbols (26 ร 26), only 270 are considered valid Unicode region codes. These are a subset of the region sequences in the Common Locale Data Repository (CLDR):[6][7][8]
A separate mechanism (emoji tag sequences) is used for regional flags, such as England 🏴, Scotland 🏴, Wales 🏴, Texas 🏴 or California 🏴.[12] It uses U+1F3F4 🏴 WAVING BLACK FLAG and formatting tag characters instead of regional indicator symbols. It is based on ISO 3166-2 regions with hyphen removed and lowercase, e.g. GB-ENG โ gbeng, terminating with U+E007F CANCEL TAG. Flag of England is therefore represented by a sequence U+1F3F4, U+E0067, U+E0062, U+E0065, U+E006E, U+E0067, U+E007F. In the tenth revision the Unicode consortium was considering U+1F3F3 🏳 WAVING WHITE FLAG instead,[13] but from eleventh onwards it is black.[14] Some vendors choose to include custom zero-width joiner sequences that only show up on their platform, such as WhatsApp and their Refugee Nation Flag ๐ณ๏ธโ๐งโโฌ๏ธโ๐ง.[15]
Regional indicator symbols subset of Enclosed Alphanumeric Supplement[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
... | (U+1F100–U+1F1E5 omitted) | |||||||||||||||
U+1F1Ex | 🇦 | 🇧 | 🇨 | 🇩 | 🇪 | 🇫 | 🇬 | 🇭 | 🇮 | 🇯 | ||||||
U+1F1Fx | 🇰 | 🇱 | 🇲 | 🇳 | 🇴 | 🇵 | 🇶 | 🇷 | 🇸 | 🇹 | 🇺 | 🇻 | 🇼 | 🇽 | 🇾 | 🇿 |
Notes
|
In 2007 a draft proposal was presented to the Unicode Technical Committee to encode emoji symbols, specifically those in widespread use on mobile phones by Japanese telecommunications companies DoCoMo, KDDI, and SoftBank.[16] The proposed symbols included ten national flags:[17] China (๐จ๐ณ), Germany (๐ฉ๐ช), Spain (๐ช๐ธ), France (๐ซ๐ท), the UK (๐ฌ๐ง), Italy (๐ฎ๐น), Japan (๐ฏ๐ต), South Korea (๐ฐ๐ท), Russia (๐ท๐บ), and the United States (๐บ๐ธ). Encoding these flags but not other countries' flags was considered, by some, as prejudicial.[18] One rejected solution was to encode the ten flags but call them "EMOJI COMPATIBILITY SYMBOL-n" and represent them visually in the Standard as "EC n" instead of showing the flags they represent.[19] Another rejected solution would have allocated 676 codepoints (26ร26) for each possible two letter combination of AโZ. They would represent political entities based on ISO 3166 such as "JP" for Japan or Internet ccTLDs (country code top-level domains) such as "EU" for the European Union.[20]
The accepted solution was to add 26 characters for letters used for the representation of regional indicators, which used in pairs would represent the ten national flags and possible future extensions.[2] Per the Unicode Standard "the main purpose of such [regional indicator symbol] pairs is to provide unambiguous roundtrip mappings to certain characters used in the emoji core sets"[21] specifically the ten national flags:[22] ๐จ๐ณ, ๐ฉ๐ช, ๐ช๐ธ, ๐ซ๐ท, ๐ฌ๐ง, ๐ฎ๐น, ๐ฏ๐ต, ๐ฐ๐ท, ๐ท๐บ, and ๐บ๐ธ.