Chinese in Mac OS X 10.3 Panther

Installation

Under the Language tab in System Preferences... International, you will find a list of languages supported by OS X 10.3. You don't need to do anything here, since Chinese is installed by default. However, adjustments to this list affect text and font behavior in applications that use Apple's built-in text engine, like Mail and Safari. The language at the top of the list is the default, used by the Finder and applications whenever possible. The "Edit..." option may be used to remove languages from the list altogether, or add new ones.

In the Finder and most Unicode-based applications, Chinese text is sorted by Unicode order (by radical, then number of strokes). To control Chinese sort order in WorldScript-savvy applications, use the Customize Sorting... option.

Troubleshooting:

  • Don't use Chinese to name the hard drive that contains your System folder. Some applications won't be able to start up.
  • You may need to use the File Name Encoding Repair utility to see Chinese file and folder names created in OS 9 and earlier.

Fonts

Five groups of Chinese fonts are installed in Panther:

  • Five GB 18030 fonts:
    • In the /System/Library/Fonts folder: 华文黑体 ST (SinoType) Hei Regular and 华文细黑 ST Hei Light. They both appear in the Font panel under STHeiti.
    • In the /Library/Fonts folder: 华文楷体 ST Kai Regular, 华文宋体 ST Song Regular, and 华文仿宋 ST FangSong Regular.
  • Four GB 2312 fonts:
    • In the /System/Library/Fonts folder: Hei.
    • In the /Library/Fonts folder: Song, Kai, and FangSong.
  • Two Big Five fonts that support the Big-5E and HKSCS-2001 extensions to Big Five:
    • In the /System/Library/Fonts folder: LiHei 儷黑 Pro.
    • In the /Library/Fonts folder: LiSong 儷宋 Pro.
  • Three standard Big Five fonts:
    • In the /System/Library/Fonts folder: Apple LiGothic.
    • In the /Library/Fonts folder: Apple LiSung and BiauKai.
  • A GB 18030 bitmap font is installed in the /Library/Fonts folder.

OS X also makes the fonts in your OS 9/Classic System folder available. Thus, if you install the Chinese Language Kit, the old Macintosh system fonts Taipei and Beijing will be available. You can also install fonts directly into OS X, but it really is not necessary if you already have them installed in OS 9/Classic.

If your OS X installation is missing any Chinese fonts, you can easily add them at any time by inserting Install Disc 2 and opening "AdditionalAsianFonts.pkg" in the Packages folder. After you run the installer, the fonts will appear. You can also install additional fonts for Arabic, Greek, Hebrew, Hindi, and so on (unlike the East Asian fonts, these are not installed with the system, unless you do a custom install) by inserting Install Disc 3 and opening "AdditionalFonts.pkg" in the Packages folder.

For more information and a complete list of the fonts that come with Panther, see: http://docs.info.apple.com/article.html?artnum=25710

Getting Started

Input Menu

Under the Input Menu tab in System Preferences... International, you will find check boxes that activate the Chinese input methods:

Make sure that the "Show input menu in menu bar" box is also checked. The Chinese input methods will appear right away in the Input menu, which appears on the right side of the Menu bar. You can also check the Character Palette box to make it appear as well:

Input menu

Keyboards and input methods are listed by script. Roman-script keyboards are listed first. Unicode-script keyboards like U.S. Extended are listed last. To activate a keyboard or input method, choose it from the menu. Its icon will appear in the Menu bar and it will have a check mark beside it in the menu.

There are two keyboard shortcuts for the Input menu:

  • Command-space toggles between the last two scripts selected in the Input menu. For example, if the U.S. keyboard is the active Roman-script keyboard and you use the the Input menu to switch to the Simplified Chinese input method, you can use this shortcut to jump back and forth between them. This is always on.
  • Command-option-space cycles through all of the keyboards and input methods in the Input menu. This can be turned off using the Options... panel in the Input Menu tab of System Preferences... International.

Another option (also on by default) that can be turned off using the Options... panel is the "Try to match keyboard with text" feature. With this on, when you insert the cursor into (or select) Chinese text in a WorldScript-savvy application, the Input menu jumps to the appropriate Chinese script.

The Input Mode Palette floats on the screen and provides direct access to the keyboards and input methods you have activated. You can turn it on and off in the Input menu.

Chinese Input

To input Chinese, you can use Apple's built-in Traditional Chinese input method (TCIM) or Simplified Chinese input method (SCIM). There are also other input methods available.

The Input window is the first step in entering Chinese characters and words. As you type the input keys for a character, they appear in a window with a line under them to indicate they are in the active input area. Inline input eliminates the Input window and causes the input area to appear in place in the text of your document. As you type the input keys for a character, they appear in the text with a line under them to indicate the active input area. Most applications support inline input by default. Some applications allow the user to turn it on or off.

If there is more than one possible match for the input keys that have been entered, then the Selection window appears:

Selection window

Characters are arranged in rows. Use the up or down arrow keys to move between rows and the right arrow or left arrow keys to move within a row. Press the return key to enter the selected character into text. There are two shortcuts to enter characters into text:

  • Type the number next to a character to selct and enter it.
  • Use the mouse to point and click on a character to select and enter it.

You can adjust the font and point size for the Selection window in the TCIM and SCIM Preferences.

Roman Input

You can easily switch to a Roman keyboard layout using the Input menu.

In addition, the Chinese input methods allow you to directly enter Roman characters as either single-byte (a.k.a. "half-width") or double-byte (a.k.a. "full-width") characters:

  • To enter single-byte Roman characters, press the caps lock key and type as you normally do.
  • Double-byte Roman characters align with Chinese text. This is a useful property in certain contexts, such as tables and forms. To enter double-byte Roman characters, choose "Use Two-Byte Roman Characters" from the Input menu.
  • To enter one lowercase double-byte Roman character, press the tilde (~) key, then press the key you want. Useful for numbers.

Character Palette

In Cocoa applications, the Character Palette is always accessible via Edit > Special Characters... There are four ways to view Chinese characters:

character_palette

  • Simplified Chinese (shown above) displays the GB 18030 character set. You can look for characters both by Radical (includes both Simplified and Traditional characters) and by Category (Unicode blocks), as well as in the Code Table, which has a pop-up menu where you can choose between GB2312 and GB18030. If you highlight a character and then pause the mouse over it, a little info panel will appear, giving the UTF-16, UTF-8, and GB code points.
  • Traditional Chinese allows you to look for characters both by Radical (Big Five only) and in the Code Table, which has a pop-up menu where you can choose between BIG-5E (17,005 characters) and HKSCS-2001 (17,428 characters). If you highlight an indivdual character and then pause the mouse over it, a little info panel will appear, giving the UTF-16, UTF-8, and Big-5E (or HKSCS-2001) code points.
  • Unicode can display the tens of thousands of Chinese characters defined in Unicode, including those beyond the Basic Multilingual Plane. You can look for characters in the Unicode Blocks tab or the Unicode Table tab, which displays a table of Unicode scalar values.
  • Glyph displays the complete contents of the selected font.

To input characters into text, just double-click on the character you want, or use the Insert button.

In the Character Info section (shown above), you will find a list of characters related to the selected character, along with the input key sequences for the Apple input methods. You can drag/copy any character from an application like TextEdit, Mail, or Safari and drop/paste it into the Character Info section to get information about that character.

In the Font Variation section, click on the triangle to see all available glyphs for the selected character in the different fonts on the system.

Hints:

  • There is always a Favorites tab where you can store characters for repeated access. You can arrange them however you like (hold the mouse down before you start to drag).

Utilities

The TCIMTool and the SCIMTool are stored in the /System/Library/Components folder.

Chinese Text Converter

Located in the /Applications/Utilities/Asia Text Extras folder. See Utilities.

Input Method Plug-in Converter

Located in the /Applications/Utilities/Asia Text Extras folder. Converts plain text (.txt) source files into Chinese input method plug-in data (.dat) files. Simply drag the source file onto the Input Method Plug-in Converter icon or an alias, or use File > Open..., and follow the on-screen instructions. To install an input method plug-in, place the .dat file into the "ChineseInputMethodPlug-in" folder, which is located in your Users/~/Library folder. You can also create a Library/ChineseInputMethodPlug-in folder, so all users will have access. Then logout and login to make them appear in the Input menu.

Mail 1.3

Mail 1.3 is fully Unicode-savvy and can be localized for Chinese. It automatically sets the encoding of outgoing messages based on content. The choices it makes are based on the order in the Language tab of System Preferences ... International. Mail simply goes down the list and chooses the first "language" (character set) that contains all the characters used in the message.

If your system is set to run in English, or anything other than Chinese or Japanese, the default order in the list is Japanese, Simplified Chinese, Traditional Chinese. This order causes problems when these character sets overlap. Traditional Chinese email is sometimes sent as Japanese, and if it is not sent as Japanese, then it is sent as Simplified Chinese.

  • If Traditional Chinese is listed before the others, then the default encoding for Traditional Chinese email is Big Five. But quite a few Japanese messages will be sent as HKSCS. Simplified Chinese messages will not be affected as long as they contain at least one simplified form.
  • If Simplified Chinese is listed before the others, then the default encoding for Simplified Chinese email within the limits of the GB 2312 character set is HZ. Simplified Chinese email with characters from beyond the GB 2312 character set, along with all Traditional Chinese and Japanese messages, are sent using the charset name GB2312. Messages that contain Chinese characters beyond the GBK character set are sent encoded as UTF-8.

In short, you should list the language you use the most before the others. If you choose Simplified Chinese, you will have to manually set the outgoing encoding for the others. If you choose Traditional Chinese, you will have to manually set the outgoing encoding for Japanese. Korean is not an issue since Hangul is not part of the Chinese or Japanese character sets, and it should be listed last.

Mail 1.3 allows you to manually set the encoding of an outgoing message in Message > Text Encoding. Note that this only changes the encoding of the body of the message. The encoding of the message subject does not change.

Safari 1.3

Safari 1.3 is Unicode-savvy. Page titles and bookmarks in Chinese are all displayed properly. To reset the encoding of a web page, use Text Encoding in the View menu.

The direct Google search window works well with Chinese (also Japanese and Korean). Just enter the Chinese text and Safari does the search correctly.

The default fonts Safari uses to display Chinese characters in web pages are determined by the order in the Language tab of System Preferences... International.

  • If Traditional Chinese is listed first, then in Simplified Chinese (GB) web pages all Traditional Chinese (Big Five) characters appear in the default font for Traditional Chinese. All other Chinese characters, including all simplified forms, appear in the default font for Simplified Chinese.
  • If Simplified Chinese is listed first, this display problem disappears, since the default font for Simplified Chinese includes all Traditional Chinese characters.
  • If Japanese is listed first, then all Chinese characters that are in the standard Japanese character set appear in the default font for Japanese. The display of the remaining Chinese characters depends on which of the two Chinese character sets is listed first.