The Chinese language (汉语, 华语, or 中文) is a member of the Sino-Tibetan family of languages. Chinese is a tonal language related to Tibetan and Burmese, but unrelated to other neighbouring languages genetically, such as, Korean, Vietnamese, Thai or Japanese. However, these languages were strongly influenced by Chinese in the course of history, linguistically, and also extralinguistically. Korean and Japanese both have writing systems employing Chinese characters, which are called Hanja and Kanji respectively. Along with those two languages, Vietnamese also contains many Chinese loanwords and formerly used Chinese characters.
About one-fifth of the world speaks some form of Chinese as its native language, making it the most common language in the world. The Chinese language (spoken in its Mandarin form) is the official language of the People's Republic of China, Republic of China, one of four official languages of Singapore, and one of six official languages of the United Nations.
Spoken Chinese comprises many regional and mutually unintellegible variants. In the West, many people are familiar with the fact that the Romance languages all derive from Latin and so have many underlying features in common while being mutually unintelligible. Some people refer to the lesser variations within a single language, such as the regional variations within Spanish, as dialects. Chinese linguists speak of what we call Chinese as a member of the Sino-Tibetan "language family" (yu3 zu2). Within this language family there are two systems (xi4): the Sinitic (Han4 yu3 xi4) and the Tibeto-Burmese systems. Within the Sinitic system there are seven main groups of languages spoken by ethnic Han Chinese and four more related groups of languages spoken by minority groups. Among these seven main groups, seventeen languages (yu3) are distinguished, and each of these can show a further level of differentiation (fang1 yan2) and beyond that, a final level of differentiation is made (ci4 fang1 yan2) wherein are listed, e.g., Yunnan hua, Sichuan hua, Toisan hua, etc.
It will perhaps be easier to understand how these seventeen languages are associated with geographical areas of China by examining the maps above and to the right. Note that the languages named Bei-yu (represented on the map by lines drawn from Beijing), Jin, and Dungan (a Chinese language spoken by the Muslim population of Kyrghyzstan) are classified as members of the first of the seven main groups of languages. (Dungan is not shown on the right-hand map.) The Gan and Hakka languages are grouped together, as the second of the seven. And the five Min languages are grouped together as the third of the seven. The other four have no subdivisions so their names are carried in both the column for the seven main language groups and in the column for the seventeen languages. (An informative article written in Chinese may be found at [1].)
The seven (sometimes ten) languages spoken by the Han are normally referred to individually as "Chinese dialects." However, the use of the word "dialect" is disputed as these dialects are mostly mutually unintellegible and possess the same amount of variation as the different Romance languages, while varations within these dialects themselves are similar to variation among dialects in the Romance languages. If these dialects are to be referred to as "languages", then they are to be referred to as the Chinese languages as a whole (not to be confused by the various languages spoken by ethnic minority groups.
It is common for speakers of Chinese to be able to speak several variations of the language. Typically in southern China, a person will be able to speak the official Mandarin Chinese, the local dialect, and occasionally either speak or understand another regional dialect, such as Cantonese Chinese.
Chinese speakers will frequently code switch between Mandarin and the local dialect (depending on situation). Sometimes, the various dialects are mixed from other dialects, depending on geographical influence. A person living in Taiwan for example, will commonly mix pronunciations, phrases, and words from Mandarin and Min-nan, and this mixture is considered socially appropriate under many circumstances.
The Chinese written language employs the Han characterss (漢字 pinyin Hànzì), which are named after the Han culture to which it is largely attributed. In Japan and Korea, Han characters were adopted and integrated into their languages and became Kanji and Hanja, respectively. Japan still uses Kanji as an integral part of its writing system; however, Korea's use of Hanja has diminished (indeed, it is not used at all in North Korea). In the field of software and communications internationalization, CJK is a collective term for Chinese, Japanese, and Korean, all of which are double-byte languages, as they have more then 256 characters in their alphabet. The computerized processing of Chinese characters involves some special issues both in input and character encoding schemes, as the standard 100+ key keyboards of todays computers don't allow input of that many characters with one key-press.
The Chinese writing system is mostly logographic, i.e., each character expresses a monosyllabic word part, also known as a morpheme. This is helped by the fact that 90%+ of Chinese morphemes are monosyllabic. Multisyllabic words have a separate logogram for each syllable. Some, but not all, Han characters are ideographs, but most Han Chinese characters have forms that were based on their pronunciation rather than their meanings, so they do not directly express ideas.
Until the 20th century, most formal Chinese writing was done in classical Chinese, which was very different from any of the spoken varieties of Chinese in much the same way that Classical Latin is different from modern Romance languages. Chinese characters that are closer to the spoken language were used to write informal works such as colloquial novels.
Since the May Fourth Movement, the formal standard for written Chinese has been Vernacular Chinese, the grammar and vocabulary of which are similar, but not identical, to the grammar and vocabulary of modern spoken Mandarin.
Chinese characters are understood as morphemes which are independent of phonetic change. Thus, although the number one is "yi" in Mandarin, "yat" in Cantonese and "tsit" in Hokkien, they derive from a common ancient Chinese word and still share an identical character: 一. Nevertheless, the orthographies of Chinese dialects are not identical. The vocabularies used in the different dialects have also diverged. In addition, while literary vocabulary is often shared among all dialects (at least in orthography; the readings are different), colloquial vocabularies are often different.
The complex interaction between the Chinese written and spoken languages can be illustrated with Cantonese. There are two standards forms used in writing Cantonese: formal written Cantonese and colloquial written Cantonese. Formal written Cantonese is very similar to written Mandarin and can be read by a Mandarin speaker without much difficulty. However, formal written Cantonese is rather different from spoken Cantonese. Colloquial written Cantonese is more similar to spoken Cantonese but is largely unreadable by an untrained Mandarin speaker.
Cantonese is unique among non-Mandarin dialects in having a widely used written standard. The other dialects do not have alternative written standards, but many have local characters or use characters which are archaic in "bai hua".
Old Chinese, sometimes known as 'Archaic Chinese', was the language common during the early and middle Zhou Dynasty (11th to 7th centuries B.C.), whose texts include inscriptions on bronze artifacts, the poetry of the Shijing, the history of the Shujing, and portions of the Yijing (I Ching). Work on reconstructing Old Chinese started with Qing dynasty philologists. The pioneer of Western study of Old Chinese is the Swedish linguist Bernhard Karlgren, whose work is based on the forms of the characters and the rhymes of the 'Shijing'. The phonetic elements found in the majority of Chinese characters also provide hints to their Old Chinese pronunciations. Old Chinese was not wholly uninflected. It possessed a rich sound system in which aspiration or rough breathing differentiated the consonants.
Middle Chinese was the language used during the Sui, Tang, and Song dynasties (7th through 10th centuries A.D.). It can be divided into an early period, for which the 切韻 'Qieyun' rhyme table (A.D. 601) relates to, and a late period in the 10th, which the 廣韻 'Guangyun' rhyme table reflects. Bernhard Karlgren called this phase 'Ancient Chinese'. Linguists are confident in having a good reconstruction of which Middle Chinese sounded like. The evidence for the pronunciation of Middle Chinese comes from several sources: modern dialect variations, rhyming dictionaries, and foreign translations. Just as Proto-Indo-European can be reconstructed from modern Indo-European languages, so can Middle Chinese be reconstructed (very tentatively) from modern dialects. In addition, ancient Chinese philologists devoted great amount of effort in summarizing the Chinese phonetic system through "rhyming tables", and these tables serve as a basis for the work of modern linguists. Finally, Chinese phonetic translations of foreign words also provide plenty of clues about the nature of Middle Chinese phonetics.
The development of the spoken Chinese languages from early historical times to the present has been complex. The language tree shown here shows how the present main divisions of the Chinese language developed out of an early common language. Comparison with the map above will give some idea of the complexities that have been left out of the tree. For instance, the Min language that is centered in Fujian Province contains five subdivisions, and the so-called northern language (which is called Mandarin in the West), also contains named subdivisions such as Yun-nan hua, Si-chuan hua, etc.
Most Chinese living in northern China, in Sichuan, and, actually, in a broad arc from the north-east (Manchuria) to the south-west (Yun-nan), use various Mandarin dialects as their home language. (See the three regions colored yellow and brown in the map above.) The prevalence of Mandarin throughout northern China is largely the result of geography, namely the plains of north China. By contrast, the mountains and rivers of southern China have promoted linguistic diversity. The presence of Mandarin in Sichuan is largely due to a plague in the 12th century. This plague, which may have been related to the black death, depopulated the area, leading to later settlement from north China.
Until the mid-20th century, most Chinese living in southern China did not speak any Mandarin. However, despite the mix of officials and commoners speaking various Chinese dialects, Beijingese Mandarin became dominant at least during the officially Manchu-speaking Qing Empire. Since the 17th century, the Empire had set up Orthoepy Academies (正音書院 Zhengyin Shuyuan) in an attempt to make pronunciation conform to the Beijing standard. But these attempts had little success.
This situation changed with the creation (in both the PRC and the ROC) of an elementary school education system committed to teaching Mandarin. As a result, Mandarin is now spoken fluently by most people in Mainland China and in Taiwan. In Hong Kong, the language of education and formal speech remains Cantonese but Mandarin is becoming increasingly influential.
Chinese characters appear to have originated in the Shang dynasty as pictograms depicting concrete objects. Over the course of the Zhou and Han dynasties, the characters became more and more stylistic. In addition, characters were added for words based on the sound of the word.
Spoken Chinese
Main article: Chinese dialects
Written Chinese
Relationship between spoken and written Chinese
The relationship between the Chinese spoken and written languages is complex. This complexity is compounded by the fact that the numerous variations of spoken Chinese have gone through centuries of evolution since at least the late-Han dynasty. However, written Chinese has changed much less than the spoken language.Classification of writing styles
One can classify Chinese writings into four basic types:
Cantonese is unique in that it has a commonly used written character system that is different from "bai hua" or "wen yan". Colloquial Chinese usually involves the use of "dialectal characters".
As with other aspects of the Chinese language, the contrast between different written standards is not sharp and there can be a socially accepted continuum between the written standards. For example, in writing an informal love letter, one may use informal bai hua. In writing a newspaper article, the language used is different and begins to include aspects of wen yan. In writing a ceremonial document, one would use even more wen yan. The language used in the ceremonial document may be completely different from that of the love letter, but there is a socially accepted continuum existing between the two. Pure "wen yan", however, is rarely used.Character forms
There are currently two standards for printed Chinese characters. One is the Traditional system, used in Hong Kong, Macau, and Taiwan. Mainland China and Singapore use the Simplified system (developed by the PRC government in the 1950s), which uses simplified forms for many of the more complicated characters. In addition, most Chinese use some personal simplications.Development of Chinese
Related topics
References
External links