X-Git-Url: http://www.wagner.pp.ru/gitweb/?p=oss%2Fcatdoc.git;a=blobdiff_plain;f=charsets%2Fmac-cyrillic.txt;h=d569c2cca57a2dfd6bc8e8e4991faa70a943eef9;hp=22573a2d023f558fc748a60117d96061c941d004;hb=e2020c47c14a8a03f44734a6c0adc48fb5129c2f;hpb=7de410970cd2916482178d730b64aef013b0567a diff --git a/charsets/mac-cyrillic.txt b/charsets/mac-cyrillic.txt index 22573a2..d569c2c 100644 --- a/charsets/mac-cyrillic.txt +++ b/charsets/mac-cyrillic.txt @@ -1,24 +1,32 @@ #======================================================================= -# FTP file name: CYRILLIC.TXT +# File name: CYRILLIC.TXT # -# Contents: Map (external version) from Mac OS Cyrillic -# character set to Unicode 2.0 +# Contents: Map (external version) from Mac OS Cyrillic +# character set to Unicode 2.1 and later. # -# Copyright: (c) 1995-1998 by Apple Computer, Inc., all rights -# reserved. +# Copyright: (c) 1995-2002, 2005 by Apple Computer, Inc., all rights +# reserved. # -# Contacts: Peter Edberg -# Julio Gonzalez -# John Jenkins +# Contact: charsets@apple.com # # Changes: # +# c03 2005-Apr-05 Update header comments. Matches internal xml +# and Text Encoding Converter 2.0. +# b3,c1 2002-Dec-19 Update URLs, notes. Matches internal +# utom. +# b02 1999-Sep-22 Encoding changed for Mac OS 9.0 to merge +# with Mac OS Ukrainian and support EURO SIGN; +# Change mappings for 0xA2, 0xB6, and 0xFF. +# Update contact e-mail address. Matches +# internal utom, ufrm, and Text +# Encoding Converter version 1.5. # n05 1998-Feb-05 Update header comments to new format; no -# mapping changes. Matches internal utom<3>, -# ufrm<13>, and Text Encoding Converter +# mapping changes. Matches internal utom, +# ufrm, and Text Encoding Converter # version 1.3. # n03 1995-Apr-15 First version (after fixing some typos). -# Matches internal ufrm<5>. +# Matches internal ufrm. # # Standard header: # ---------------- @@ -30,18 +38,17 @@ # Macintosh computers and "Unicode" can be used to refer to the # Unicode standard. # -# Apple makes no warranty or representation, either express or -# implied, with respect to these tables, their quality, accuracy, or -# fitness for a particular purpose. In no event will Apple be liable -# for direct, indirect, special, incidental, or consequential damages -# resulting from any defect or inaccuracy in this document or the -# accompanying tables. +# Apple Computer, Inc. ("Apple") makes no warranty or representation, +# either express or implied, with respect to this document and the +# included data, its quality, accuracy, or fitness for a particular +# purpose. In no event will Apple be liable for direct, indirect, +# special, incidental, or consequential damages resulting from any +# defect or inaccuracy in this document or the included data. # # These mapping tables and character lists are subject to change. # The latest tables should be available from the following: # -# -# +# # # For general information about Mac OS encodings and these mapping # tables, see the file "README.TXT". @@ -65,15 +72,35 @@ # Notes on Mac OS Cyrillic: # ------------------------- # -# The Mac Cyrillic encoding is used for most Cyrillic localized -# system software except Ukrainian. A variant of Mac OS Cyrillic -# is used for Ukrainian and for the Cyrillic Language Kit; it is -# covered by a separate table. +# This is a legacy Mac OS encoding; in the Mac OS X Carbon and Cocoa +# environments, it is only supported directly in programming +# interfaces for QuickDraw Text, the Script Manager, and related +# Text Utilities. For other purposes it is supported via transcoding +# to and from Unicode. # -# The Mac OS Cyrillic encoding has the same Cyrillic letter -# repertoire as ISO 8859-5 (although not at the same code points). -# This covers most of the Slavic languages written in Cyrillic -# script. +# This is the "Euro sign" version of Mac Cyrillic for Mac OS 9.0 and +# later. Before Mac OS 9.0, there were two separate Slavic Cyrillic +# encodings: +# +# 1. The Cyrillic currency sign variant (used for localized Russian +# and Bulgarian systems), which had the following: +# 0xA2 U+00A2 CENT SIGN +# 0xB6 U+2202 PARTIAL DIFFERENTIAL +# 0xFF U+00A4 CURRENCY SIGN +# +# 2. The Ukrainian currency sign variant (used for localized Ukrainian +# systems and the pre-9.0 Cyrillic Language Kit), which had the +# following: +# 0xA2 U+0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN +# 0xB6 U+0491 CYRILLIC SMALL LETTER GHE WITH UPTURN +# 0xFF U+00A4 CURRENCY SIGN +# +# This new Cyrillic Euro sign version is based on the old Ukrainian +# currency sign variant, with 0xFF changed to be EURO SIGN. +# +# The Mac OS Cyrillic encoding includes the Cyrillic letter repertoire +# of ISO 8859-5 (although not at the same code points). This covers +# most of the Slavic languages written in Cyrillic script. # # The Mac OS Cyrillic encoding also includes a number of characters # needed for the Mac OS user interface and localization (e.g. @@ -81,46 +108,24 @@ # Cyrillic that are also in the Mac OS Roman encoding are at the # same code point in both; this improves application compatibility. # +# Note: There is a common Ukrainian glyph variation in which the glyph +# for CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I may or may not +# have a dot above. +# # Unicode mapping issues and notes: # --------------------------------- # # Details of mapping changes in each version: # ------------------------------------------- # +# Changes from version n05 to version b02: +# +# - Encoding changed for Mac OS 9.0 to merge with Mac OS Ukrainian and +# support EURO SIGN. 0xA2 changed from U+00A2 to U+0490; 0xB6 changed +# from U+2202 to U+0491; 0xFF changed from U+00A4 to U+20AC. +# ################## -0x00 0x0000 #NULL -0x01 0x0001 #START OF HEADING -0x02 0x0002 #START OF TEXT -0x03 0x0003 #END OF TEXT -0x04 0x0004 #END OF TRANSMISSION -0x05 0x0005 #ENQUIRY -0x06 0x0006 #ACKNOWLEDGE -0x07 0x0007 #BELL -0x08 0x0008 #BACKSPACE -0x09 0x0009 #HORIZONTAL TABULATION -0x0A 0x000A #LINE FEED -0x0B 0x000B #VERTICAL TABULATION -0x0C 0x000C #FORM FEED -0x0D 0x000D #CARRIAGE RETURN -0x0E 0x000E #SHIFT OUT -0x0F 0x000F #SHIFT IN -0x10 0x0010 #DATA LINK ESCAPE -0x11 0x0011 #DEVICE CONTROL ONE -0x12 0x0012 #DEVICE CONTROL TWO -0x13 0x0013 #DEVICE CONTROL THREE -0x14 0x0014 #DEVICE CONTROL FOUR -0x15 0x0015 #NEGATIVE ACKNOWLEDGE -0x16 0x0016 #SYNCHRONOUS IDLE -0x17 0x0017 #END OF TRANSMISSION BLOCK -0x18 0x0018 #CANCEL -0x19 0x0019 #END OF MEDIUM -0x1A 0x001A #SUBSTITUTE -0x1B 0x001B #ESCAPE -0x1C 0x001C #FILE SEPARATOR -0x1D 0x001D #GROUP SEPARATOR -0x1E 0x001E #RECORD SEPARATOR -0x1F 0x001F #UNIT SEPARATOR 0x20 0x0020 # SPACE 0x21 0x0021 # EXCLAMATION MARK 0x22 0x0022 # QUOTATION MARK @@ -251,7 +256,7 @@ 0x9F 0x042F # CYRILLIC CAPITAL LETTER YA 0xA0 0x2020 # DAGGER 0xA1 0x00B0 # DEGREE SIGN -0xA2 0x00A2 # CENT SIGN +0xA2 0x0490 # CYRILLIC CAPITAL LETTER GHE WITH UPTURN 0xA3 0x00A3 # POUND SIGN 0xA4 0x00A7 # SECTION SIGN 0xA5 0x2022 # BULLET @@ -271,7 +276,7 @@ 0xB3 0x2265 # GREATER-THAN OR EQUAL TO 0xB4 0x0456 # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I 0xB5 0x00B5 # MICRO SIGN -0xB6 0x2202 # PARTIAL DIFFERENTIAL +0xB6 0x0491 # CYRILLIC SMALL LETTER GHE WITH UPTURN 0xB7 0x0408 # CYRILLIC CAPITAL LETTER JE 0xB8 0x0404 # CYRILLIC CAPITAL LETTER UKRAINIAN IE 0xB9 0x0454 # CYRILLIC SMALL LETTER UKRAINIAN IE @@ -344,4 +349,4 @@ 0xFC 0x044C # CYRILLIC SMALL LETTER SOFT SIGN 0xFD 0x044D # CYRILLIC SMALL LETTER E 0xFE 0x044E # CYRILLIC SMALL LETTER YU -0xFF 0x00A4 # CURRENCY SIGN +0xFF 0x20AC # EURO SIGN