+++ /dev/null
-To generate or modify mapping headers\r
--------------------------------------\r
-Mapping headers are imported from CJKCodecs as pre-generated form.\r
-If you need to tweak or add something on it, please look at tools/\r
-subdirectory of CJKCodecs' distribution.\r
-\r
-\r
-\r
-Notes on implmentation characteristics of each codecs\r
------------------------------------------------------\r
-\r
-1) Big5 codec\r
-\r
- The big5 codec maps the following characters as cp950 does rather\r
- than conforming Unicode.org's that maps to 0xFFFD.\r
-\r
- BIG5 Unicode Description\r
-\r
- 0xA15A 0x2574 SPACING UNDERSCORE\r
- 0xA1C3 0xFFE3 SPACING HEAVY OVERSCORE\r
- 0xA1C5 0x02CD SPACING HEAVY UNDERSCORE\r
- 0xA1FE 0xFF0F LT DIAG UP RIGHT TO LOW LEFT\r
- 0xA240 0xFF3C LT DIAG UP LEFT TO LOW RIGHT\r
- 0xA2CC 0x5341 HANGZHOU NUMERAL TEN\r
- 0xA2CE 0x5345 HANGZHOU NUMERAL THIRTY\r
-\r
- Because unicode 0x5341, 0x5345, 0xFF0F, 0xFF3C is mapped to another\r
- big5 codes already, a roundtrip compatibility is not guaranteed for\r
- them.\r
-\r
-\r
-2) cp932 codec\r
-\r
- To conform to Windows's real mapping, cp932 codec maps the following\r
- codepoints in addition of the official cp932 mapping.\r
-\r
- CP932 Unicode Description\r
-\r
- 0x80 0x80 UNDEFINED\r
- 0xA0 0xF8F0 UNDEFINED\r
- 0xFD 0xF8F1 UNDEFINED\r
- 0xFE 0xF8F2 UNDEFINED\r
- 0xFF 0xF8F3 UNDEFINED\r
-\r
-\r
-3) euc-jisx0213 codec\r
-\r
- The euc-jisx0213 codec maps JIS X 0213 Plane 1 code 0x2140 into\r
- unicode U+FF3C instead of U+005C as on unicode.org's mapping.\r
- Because euc-jisx0213 has REVERSE SOLIDUS on 0x5c already and A140\r
- is shown as a full width character, mapping to U+FF3C can make\r
- more sense.\r
-\r
- The euc-jisx0213 codec is enabled to decode JIS X 0212 codes on\r
- codeset 2. Because JIS X 0212 and JIS X 0213 Plane 2 don't have\r
- overlapped by each other, it doesn't bother standard conformations\r
- (and JIS X 0213 Plane 2 is intended to use so.) On encoding\r
- sessions, the codec will try to encode kanji characters in this\r
- order:\r
-\r
- JIS X 0213 Plane 1 -> JIS X 0213 Plane 2 -> JIS X 0212\r
-\r
-\r
-4) euc-jp codec\r
-\r
- The euc-jp codec is a compatibility instance on these points:\r
- - U+FF3C FULLWIDTH REVERSE SOLIDUS is mapped to EUC-JP A1C0 (vice versa)\r
- - U+00A5 YEN SIGN is mapped to EUC-JP 0x5c. (one way)\r
- - U+203E OVERLINE is mapped to EUC-JP 0x7e. (one way)\r
-\r
-\r
-5) shift-jis codec\r
-\r
- The shift-jis codec is mapping 0x20-0x7e area to U+20-U+7E directly\r
- instead of using JIS X 0201 for compatibility. The differences are:\r
- - U+005C REVERSE SOLIDUS is mapped to SHIFT-JIS 0x5c.\r
- - U+007E TILDE is mapped to SHIFT-JIS 0x7e.\r
- - U+FF3C FULL-WIDTH REVERSE SOLIDUS is mapped to SHIFT-JIS 815f.\r
-\r