Matches in DBpedia 2016-04 for { <http://dbpedia.org/resource/CESU-8> ?p ?o }
Showing triples 1 to 34 of
34
with 100 triples per page.
- CESU-8 abstract "The Compatibility Encoding Scheme for UTF-16: 8-Bit (CESU-8) is a variant of UTF-8 that is described in Unicode Technical Report #26 [1]. A Unicode code point from the Basic Multilingual Plane (BMP), i.e. a code point in the range U+0000 to U+FFFF, is encoded in the same way as in UTF-8. A Unicode supplementary character, i.e. a code point in the range U+10000 to U+10FFFF, is first represented as a surrogate pair, like in UTF-16, and then each surrogate code point is encoded in UTF-8. Therefore, CESU-8 needs six bytes (3 bytes per surrogate) for each Unicode supplementary character while UTF-8 needs only four. Each CESU-8 character code (1, 2, or 3 bytes) can be converted to exactly one UTF-16 code unit (2 bytes).The encoding of Unicode supplementary characters works out to 11101101 1010yyyy 10xxxxxx 11101101 1011xxxx 10xxxxxx (yyyy represents the top five bits of the character minus one i.e. U+10**** becomes 1111, U+01**** becomes 0000, x represents the remaining bits of the character).CESU-8 is not an official part of the Unicode Standard, because Unicode Technical Reports are informative documents only. It should be used exclusively for internal processing and never for external data exchange.CESU-8 is similar to Java's Modified UTF-8 but does not have the special encoding of the NUL character (U+0000).The Oracle database actually uses CESU-8 for its \"UTF8\" character set. Standard UTF-8 can be obtained using the character set \"AL32UTF8\" (since Oracle version 9.0).".
- CESU-8 wikiPageExternalLink modified_utf_8_strings.
- CESU-8 wikiPageExternalLink convexp?conv=CESU-8.
- CESU-8 wikiPageExternalLink tr26.
- CESU-8 wikiPageID "2232502".
- CESU-8 wikiPageLength "3459".
- CESU-8 wikiPageOutDegree "8".
- CESU-8 wikiPageRevisionID "686047385".
- CESU-8 wikiPageWikiLink Category:Character_encoding.
- CESU-8 wikiPageWikiLink Category:Unicode_Transformation_Formats.
- CESU-8 wikiPageWikiLink Oracle_Database.
- CESU-8 wikiPageWikiLink Plane_(Unicode).
- CESU-8 wikiPageWikiLink UTF-16.
- CESU-8 wikiPageWikiLink UTF-8.
- CESU-8 wikiPageWikiLink Unicode.
- CESU-8 wikiPageWikiLinkText "CESU-8".
- CESU-8 wikiPageUsesTemplate Template:Character_encoding.
- CESU-8 wikiPageUsesTemplate Template:Reflist.
- CESU-8 wikiPageUsesTemplate Template:Unicode_navigation.
- CESU-8 subject Category:Character_encoding.
- CESU-8 subject Category:Unicode_Transformation_Formats.
- CESU-8 hypernym Variant.
- CESU-8 type Datum.
- CESU-8 type Encoding.
- CESU-8 comment "The Compatibility Encoding Scheme for UTF-16: 8-Bit (CESU-8) is a variant of UTF-8 that is described in Unicode Technical Report #26 [1]. A Unicode code point from the Basic Multilingual Plane (BMP), i.e. a code point in the range U+0000 to U+FFFF, is encoded in the same way as in UTF-8. A Unicode supplementary character, i.e. a code point in the range U+10000 to U+10FFFF, is first represented as a surrogate pair, like in UTF-16, and then each surrogate code point is encoded in UTF-8.".
- CESU-8 label "CESU-8".
- CESU-8 sameAs Q455180.
- CESU-8 sameAs CESU-8.
- CESU-8 sameAs CESU-8.
- CESU-8 sameAs CESU-8.
- CESU-8 sameAs m.06xxyj.
- CESU-8 sameAs Q455180.
- CESU-8 wasDerivedFrom CESU-8?oldid=686047385.
- CESU-8 isPrimaryTopicOf CESU-8.