Pinyin (Chinese: 拼音; pinyin: pīnyīn; Mandarin pronunciation: [pʰɪ́n jɪ́n]) is the official system to transcribe Chinese characters into the Roman alphabet to teach Mandarin Chinese in China, Hong Kong, Malaysia, Singapore and Taiwan.[1] It is also often used to spell Chinese names in foreign publications and used as an input method to enter Chinese characters (汉字 / 漢字, hànzì) into computers.
Letters
Pinyin differs from other romanizations in several aspects, such as the following:
Syllables starting with u are written as w in place of u (e.g. uan is written as wan). Standalone u is written as wu.
Syllables starting with i are written as y in place of i (e.g. ian is written as yan). Standalone i is written as yi.
Syllables starting with ü are written as yu in place of ü (e.g. üe is written as yue).
ü is written as u when there is no ambiguity (such as ju, qu, and xu), but written as ü when there are corresponding u syllables (such as lü and nü). In such situations where there are corresponding u syllables, it is often replaced with v on a computer, making it easier to type on a standard keyboard.
When preceded by a consonant, iou, uei, and uen are simplified as iu, ui, and un (which do not represent the actual pronunciation).
As in zhuyin, what are actually pronounced as buo, puo, muo, and fuo are given a separate representation: bo, po, mo, and fo.
The apostrophe (') is used before a syllable starting with a vowel (a, o, or e) in a multiple-syllable word when the syllable does not start the word (which is most commonly realized as [ɰ]), unless the syllable immediately follows a hyphen or other dash.[19] This is done to remove ambiguity that could arise, as in Xi'an, which consists of the two syllables xi ("西") and an ("安") as opposed to xian for such words as "先". (This ambiguity does not occur when tone marks are used: The two tone marks in "Xīān" unambiguously show that the word consists of two syllables. However, even with tone marks, the city is usually spelled with an apostrophe as "Xī'ān".)
Eh alone is written as ê; elsewhere as e. Schwa is always written as e.
zh, ch, and sh can be abbreviated as ẑ, ĉ, and ŝ (z, c, s with a circumflex). However, the shorthands are rarely used due to difficulty of entering them on computers, and are confined mainly to Esperanto keyboard layouts.
ng has the uncommon shorthand of ŋ.
The letter v is unused (except in spelling foreign languages, languages of minority nationalities, and some dialects), despite a conscious effort to distribute letters more evenly than in Western languages. However, sometimes, for ease of typing into a computer, the v is used to replace a ü.
Most of the above are used to avoid ambiguity when writing words of more than one syllable in pinyin. For example uenian is written as wenyan because it is not clear which syllables make up uenian; uen-ian, uen-i-an and u-en-i-an are all possible combinations whereas wenyan is unambiguous because we, nya, etc. do not exist in pinyin. See the pinyin table article for a summary of possible pinyin syllables (not including tones).