Han Characters : Soft.lk - A Premium Software Developers - Meet one of the most powerful and intuitive POS systems for retail in Sri Lanka

21 Aug, 2019
By, Wikipedia

Han Characters

Chinese characters are logographs used to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represent the only one that has remained in continuous use. Over a documented history spanning more than three millennia, the function, style, and means of writing characters have changed greatly. Unlike letters in alphabets that reflect the sounds of speech, Chinese characters generally represent morphemes, the units of meaning in a language. Writing all of the frequently used vocabulary in a language requires roughly 2000–3000 characters; as of 2024, nearly 100000 have been identified and included in The Unicode Standard. Characters are created according to several principles, where aspects of shape and pronunciation may be used to indicate the character's meaning.

The first attested characters are oracle bone inscriptions made during the 13th century BCE in what is now Anyang, Henan, as part of divinations conducted by the Shang dynasty royal house. Character forms were originally highly pictographic in style, but evolved as writing spread across China. Numerous attempts have been made to reform the script, including the promotion of small seal script by the Qin dynasty (221–206 BCE). Clerical script, which had matured by the early Han dynasty (202 BCE – 220 CE), abstracted the forms of characters—obscuring their pictographic origins in favour of making them easier to write. Following the Han, regular script emerged as the result of cursive influence on clerical script, and has been the primary style used for characters since. Informed by a long tradition of lexicography, states using Chinese characters have standardized their forms: broadly, simplified characters are used to write Chinese in mainland China, Singapore, and Malaysia, while traditional characters are used in Taiwan, Hong Kong, and Macau.

Where the use of characters spread beyond China, they were initially used to write Literary Chinese; they were then often adapted to write local languages spoken throughout the Sinosphere. In Japanese, Korean, and Vietnamese, Chinese characters are known as kanji, hanja, and chữ Hán respectively. Writing traditions also emerged for some of the other languages of China, like the sawndip script used to write the Zhuang languages of Guangxi. Each of these written vernaculars used existing characters to write the language's native vocabulary, as well as the loanwords it borrowed from Chinese. In addition, each invented characters for local use. In written Korean and Vietnamese, Chinese characters have largely been replaced with alphabets—leaving Japanese as the only major non-Chinese language still written using them, alongside the other elements of the Japanese writing system.

At the most basic level, characters are composed of strokes that are written in a fixed order. Historically, methods of writing characters have included inscribing stone, bone, or bronze; brushing ink onto silk, bamboo, or paper; and printing with woodblocks or moveable type. Technologies invented since the 19th century to facilitate the use of characters include telegraph codes and typewriters, as well as input methods and text encodings on computers.

Development

Chinese characters are accepted as representing one of four independent inventions of writing in human history. In each instance, writing evolved from a system using two distinct types of ideographs—either pictographs visually depicting objects or concepts, or fixed signs representing concepts only by shared convention. These systems are classified as proto-writing, because the techniques they used were insufficient to carry the meaning of spoken language by themselves.

Various innovations were required for Chinese characters to emerge from proto-writing. Firstly, pictographs became distinct from simple pictures in use and appearance: for example, the pictograph 大, meaning 'large', was originally a picture of a large man, but one would need to be aware of its specific meaning in order to interpret the sequence 大鹿 as signifying 'large deer', rather than being a picture of a large man and a deer next to one another. Due to this process of abstraction, as well as to make characters easier to write, pictographs gradually became more simplified and regularized—often to the extent that the original objects represented are no longer obvious.

This proto-writing system was limited to representing a relatively narrow range of ideas with a comparatively small library of symbols. This compelled innovations that allowed for symbols which indicated elements of spoken language directly. In each historical case, this was accomplished by some form of the rebus technique, where the symbol for a word is used to indicate a different word with a similar pronunciation, depending on context. This allowed for words that lacked a plausible pictographic representation to be written down for the first time. This technique preempted more sophisticated methods of character creation that would further expand the lexicon. The process whereby writing emerged from proto-writing took place over a long period; when the purely pictorial use of symbols disappeared, leaving only those representing spoken words, the process was complete.

Classification

Chinese characters have been used in several different writing systems throughout history. A writing system is most commonly defined to include the written symbols themselves, called graphemes—which may include characters, numerals, or punctuation—as well as the rules by which they are used to record language. Chinese characters are logographs, which are graphemes that represent units of meaning in a language. Specifically, characters represent a language's morphemes, its most basic units of meaning. Morphemes in Chinese—and therefore the characters used to write them—are nearly always a single syllable in length. In some special cases, characters may denote non-morphemic syllables as well; due to this, written Chinese is often characterized as morphosyllabic. Logographs may be contrasted with letters in an alphabet, which generally represent phonemes, the distinct units of sound used by speakers of a language. Despite their origins in picture-writing, Chinese characters are no longer ideographs capable of representing ideas directly; their comprehension relies on the reader's knowledge of the particular language being written.

The areas where Chinese characters were historically used—sometimes collectively termed the Sinosphere—have a long tradition of lexicography attempting to explain and refine their use; for most of history, analysis revolved around a model first popularized in the 2nd-century Shuowen Jiezi dictionary. More recent models have analysed the methods used to create characters, how characters are structured, and how they function in a given writing system.

Structural analysis

Most characters can be analysed structurally as compounds made of smaller components (部件; bùjiàn), which are often independent characters in their own right, adjusted to occupy a given position in the compound. Components within a character may serve a specific function: phonetic components provide a hint for the character's pronunciation, and semantic components indicate some element of the character's meaning. Components that serve neither function may be classified as pure signs with no particular meaning, other than their presence distinguishing one character from another.

A straightforward structural classification scheme may consist of three pure classes of semantographs, phonographs, and signs—having only semantic, phonetic, and form components respectively—as well as classes corresponding to each combination of component types. Of the 3500 characters that are frequently used in Standard Chinese, pure semantographs are estimated to be the rarest, accounting for about 5% of the lexicon, followed by pure signs with 18%, and semantic–form and phonetic–form compounds together accounting for 19%. The remaining 58% are phono-semantic compounds.

The 20th-century Chinese palaeographer Qiu Xigui presents three principles of character function adapted from earlier proposals by Tang Lan [zh] and Chen Mengjia, with semantographs describing all characters whose forms are wholly related to their meaning, regardless of the method by which the meaning was originally depicted, phonographs that include a phonetic component, and loangraphs encompassing existing characters that have been borrowed to write other words. Qiu also acknowledges the existence of character classes that fall outside of these principles, such as pure signs.

Semantographs

Pictographs

Most of the oldest characters are pictographs (象形; xiàngxíng), representational pictures of physical objects. Examples include 日 ('Sun'), 月 ('Moon'), and 木 ('tree'). Over time, the forms of pictographs have been simplified in order to make them easier to write. As a result, modern readers generally cannot deduce what many pictographs were originally meant to resemble; without knowing the context of their origin in picture-writing, they may be interpreted instead as pure signs. However, if a pictograph's use in compounds still reflects its original meaning, as with 日 in 晴 ('clear sky'), it can still be analysed as a semantic component.

Pictographs have often been extended from their original meanings to take on additional layers of metaphor and synecdoche, which sometimes displace the character's original sense. When this process results in excessive ambiguity between distinct senses written with the same character, it is usually resolved by new compounds being derived to represent particular senses.

Indicatives

Indicatives (指事; zhǐshì), also called simple ideographs or self-explanatory characters, are visual representations of abstract concepts that lack any tangible form. Examples include 上 ('up') and 下 ('down')—these characters were originally written as dots placed above and below a line, and later evolved into their present forms with less potential for graphical ambiguity in context. More complex indicatives include 凸 ('convex'), 凹 ('concave'), and 平 ('flat and level').

Compound ideographs

The compound character 好 illustrated as its component characters 女 and 子 positioned side by side

Compound ideographs (会意; 會意; huìyì)—also called logical aggregates, associative idea characters, or syssemantographs—combine other characters to convey a new, synthetic meaning. A canonical example is 明 ('bright'), interpreted as the juxtaposition of the two brightest objects in the sky: 日 'Sun' and 月 'Moon', together expressing their shared quality of brightness. Other examples include 休 ('rest'), composed of pictographs ⼈ 'MAN' and ⽊ 'TREE', and 好 ('good'), composed of ⼥ 'WOMAN' and ⼦ 'CHILD'.

Many traditional examples of compound ideographs are now believed to have actually originated as phono-semantic compounds, made obscure by subsequent changes in pronunciation. For example, the Shuowen Jiezi describes 信 ('trust') as an ideographic compound of ⼈ 'MAN' and ⾔ 'SPEECH', but modern analyses instead identify it as a phono-semantic compound—though with disagreement as to which component is phonetic. Peter A. Boodberg and William G. Boltz go so far as to deny that any compound ideographs were devised in antiquity, maintaining that secondary readings that are now lost are responsible for the apparent absence of phonetic indicators, but their arguments have been rejected by other scholars.

Phonographs

Phono-semantic compounds

Phono-semantic compounds (形声; 形聲; xíngshēng) are composed of at least one semantic component and one phonetic component. They may be formed by one of several methods, often by adding a phonetic component to disambiguate a loangraph, or by adding a semantic component to represent a specific extension of a character's meaning. Examples of phono-semantic compounds include 河 (hé; 'river'), 湖 (hú; 'lake'), 流 (liú; 'stream'), 沖 (chōng; 'surge'), and 滑 (huá; 'slippery'). Each of these characters have three short strokes on their left-hand side: 氵, a simplified combining form of ⽔ 'WATER'. This component serves a semantic function in each example, indicating the character has some meaning related to water. The remainder of each character is its phonetic component: 湖 (hú) is pronounced identically to 胡 (hú) in Standard Chinese, 河 (hé) is pronounced similarly to 可 (kě), and 沖 (chōng) is pronounced similarly to 中 (zhōng).

The phonetic components of most compounds may only provide an approximate pronunciation, even before subsequent sound shifts in the spoken language. Some characters may only have the same initial or final sound of a syllable in common with phonetic components. A phonetic series comprises all the characters created using the same phonetic component, which may have diverged significantly in their pronunciations over time. For example, 茶 (chá; caa4; 'tea') and 途 (tú; tou4; 'route') are characters in the phonetic series using 余 (yú; jyu4), a literary first-person pronoun. Their Old Chinese pronunciations were similar, but the phonetic component no longer serves as a useful hint for their pronunciation in modern varieties of Chinese due to subsequent sound shifts—demonstrated here in both their Mandarin and Cantonese readings.

Loangraphs

The phenomenon of existing characters being adapted to write other words with similar pronunciations was necessary in the initial development of Chinese writing, and has remained common throughout its subsequent history. Some loangraphs (假借; jiǎjiè; 'borrowing') are introduced to represent words previously lacking a written form—this is often the case with abstract grammatical particles such as 之 and 其. The process of characters being borrowed as loangraphs should not be conflated with the distinct process of semantic extension, where a word acquires additional senses, which often remain written with the same character. As both processes often result in a single character form being used to write several distinct meanings, loangraphs are often misidentified as being the result of semantic extension, and vice versa.

Loangraphs are also used to write words borrowed from other languages, such as the Buddhist terminology introduced to China in antiquity, as well as contemporary non-Chinese words and names. For example, each character in the name 加拿大 (Jiānádà; 'Canada') is often used as a loangraph for its respective syllable. However, the barrier between a character's pronunciation and meaning is never total: when transcribing into Chinese, loangraphs are often chosen deliberately as to create certain connotations. This is regularly done with corporate brand names: for example, Coca-Cola's Chinese name is 可口可乐; 可口可樂 (Kěkǒu Kělè; 'delicious enjoyable').

Signs

Some characters and components are pure signs, whose meaning merely derives from their having a fixed and distinct form. Basic examples of pure signs are found with the numerals beyond four, e.g. 五 ('five') and 八 ('eight'), whose forms do not give visual hints to the quantities they represent.

Traditional Shuowen Jiezi classification

The Shuowen Jiezi is a character dictionary authored c. 100 CE by the scholar Xu Shen. In its postface, Xu analyses what he sees as all the methods by which characters are created. Later authors iterated upon Xu's analysis, developing a categorization scheme known as the 'six writings' (六书; 六書; liùshū), which identifies every character with one of six categories that had previously been mentioned in the Shuowen Jiezi. For nearly two millennia, this scheme was the primary framework for character analysis used throughout the Sinosphere. Xu based most of his analysis on examples of Qin seal script that were written down several centuries before his time—these were usually the oldest specimens available to him, though he stated he was aware of the existence of even older forms. The first five categories are pictographs, indicatives, compound ideographs, phono-semantic compounds, and loangraphs. The sixth category is given by Xu as 轉注 (zhuǎnzhù; 'reversed and refocused'); however, its definition is unclear, and it is generally disregarded by modern scholars.

Modern scholars agree that the theory presented in the Shuowen Jiezi is problematic, failing to fully capture the nature of Chinese writing, both in the present, as well as at the time Xu was writing. Traditional Chinese lexicography as embodied in the Shuowen Jiezi has suggested implausible etymologies for some characters. Moreover, several categories are considered to be ill-defined: for example, it is unclear whether characters like 大 ('large') should be classified as pictographs or indicatives. However, awareness of the 'six writings' model has remained a common component of character literacy, and often serves as a tool for students memorizing characters.

History

The broadest trend in the evolution of Chinese characters over their history has been simplification, both in graphical shape (字形; zìxíng), the "external appearances of individual graphs", and in graphical form (字体; 字體; zìtǐ), "overall changes in the distinguishing features of graphic[al] shape and calligraphic style, ... in most cases refer[ring] to rather obvious and rather substantial changes". The traditional notion of an orderly procession of script styles, each suddenly appearing and displacing the one previous, has been disproven by later scholarship and archaeological work. Instead, scripts evolved gradually, with several distinct styles often coexisting within a given area.

Traditional invention narrative

Several of the Chinese classics indicate that knotted cords were used to keep records prior to the invention of writing. Works that reference the practice include chapter 80 of the Tao Te Ching and the "Xici II" commentary to the I Ching. According to one tradition, Chinese characters were invented during the 3rd millennium BCE by Cangjie, a scribe of the legendary Yellow Emperor. Cangjie is said to have invented symbols called 字 (zì) due to his frustration with the limitations of knotting, taking inspiration from his study of the tracks of animals, landscapes, and the stars in the sky. On the day that these first characters were created, grain rained down from the sky; that night, the people heard the wailing of ghosts and demons, lamenting that humans could no longer be cheated.

Neolithic precursors

Collections of graphs and pictures have been discovered at the sites of several Neolithic settlements throughout the Yellow River valley, including Jiahu (c. 6500 BCE), Dadiwan and Damaidi (6th millennium BCE), and Banpo (5th millennium BCE). Symbols at each site were inscribed or drawn onto artefacts, appearing one at a time and without indicating any greater context. Qiu concludes, "We simply possess no basis for saying that they were already being used to record language." A historical connection with the symbols used by the late Neolithic Dawenkou culture (c. 4300 – c. 2600 BCE) in Shandong has been deemed possible by palaeographers, with Qiu concluding that they "cannot be definitively treated as primitive writing, nevertheless they are symbols which resemble most the ancient pictographic script discovered thus far in China... They undoubtedly can be viewed as the forerunners of primitive writing."

Oracle bone script

Ox scapula inscribed with characters recording the result of divinations – dated c. 1200 BCE

The oldest attested Chinese writing comprises a body of inscriptions produced during the Late Shang period (c. 1250 – 1050 BCE), with the very earliest examples from the reign of Wu Ding dated between 1250 and 1200 BCE. Many of these inscriptions were made on oracle bones—usually either ox scapulae or turtle plastrons—and recorded official divinations carried out by the Shang royal house. Contemporaneous inscriptions in a related but distinct style were also made on ritual bronze vessels. This oracle bone script (甲骨文; jiǎgǔwén) was first documented in 1899, after specimens were discovered being sold as "dragon bones" for medicinal purposes, with the symbols carved into them identified as early character forms. By 1928, the source of the bones had been traced to a village near Anyang in Henan—discovered to be the site of Yin, the final Shang capital—which was excavated by a team led by Li Ji from the Academia Sinica between 1928 and 1937. To date, over 150000 oracle bone fragments have been found.

Oracle bone inscriptions recorded divinations undertaken to communicate with the spirits of royal ancestors. The inscriptions range from a few characters in length at their shortest, to several dozen at their longest. The Shang king would communicate with his ancestors by means of scapulimancy, inquiring about subjects such as the royal family, military success, and the weather. Inscriptions were made in the divination material itself before and after it had been cracked by exposure to heat; they generally include a record of the questions posed, as well as the answers as interpreted in the cracks. A minority of bones feature characters that were inked with a brush before their strokes were incised; the evidence of this also shows that the conventional stroke orders used by later calligraphers had already been established for many characters by this point.

Oracle bone script is the direct ancestor of later forms of written Chinese. The oldest known inscriptions already represent a well-developed writing system, which suggests an initial emergence predating the late 2nd millennium BCE. Although written Chinese is first attested in official divinations, it is widely believed that writing was also used for other purposes during the Shang, but that the media used in other contexts—likely bamboo and wooden slips—were less durable than bronzes or oracle bones, and have not been preserved.

Zhou scripts

As early as the Shang, the oracle bone script existed as a simplified form alongside another that was used in bamboo books, in addition to elaborate pictorial forms often used in clan emblems. These other forms have been preserved in bronze script (金文; jīnwén), where inscriptions were made using a stylus in a clay mould, which was then used to cast ritual bronzes. These differences in technique generally resulted in character forms that were less angular in appearance than their oracle bone script counterparts.

Study of these bronze inscriptions has revealed that the mainstream script underwent slow, gradual evolution during the late Shang, which continued during the Zhou dynasty (c. 1046 – 256 BCE) until assuming the form now known as small seal script (小篆; xiǎozhuàn) within the Zhou state of Qin. Other scripts in use during the late Zhou include the bird-worm seal script (鸟虫书; 鳥蟲書; niǎochóngshū), as well as the regional forms used in non-Qin states. Examples of these styles were preserved as variants in the Shuowen Jiezi. Historically, Zhou forms were collectively known as large seal script (大篆; dàzhuàn), though Qiu refrains from using this term due to its lack of precision.

Character

Stroke


Japanese writing
Components
Kanji Stroke order Radicals Jōyō kanji list Kyōiku kanji Tōyō kanji Jinmeiyō kanji Hyōgai kanji
Kana Hiragana Hentaigana Katakana Man'yōgana Sōgana Gojūon
Typographic symbols Japanese punctuation Iteration mark
Uses
Syllabograms Furigana Okurigana Braille
Transliteration
Rōmaji Hepburn Kunrei-shiki / ISO 3602 Nihon-shiki JSL Wāpuro (keyboard input)
Cyrillization Polivanov system
v t e

Example Korean dictionary listings
Hanja	Hangul	Gloss
Native translation	Sino-Korean
水	물; mul	수; su	'water'
人	사람; saram	인; in	'person'
大	큰; keun	대; dae	'big'
小	작을; jakeul	소; so	'small'
下	아래; arae	하; ha	'down'
父	아비; abi	부; bu	'father'

Hanja

Hangul

Gloss

Native translation

Sino-Korean

水

물; mul

수; su

'water'

人

사람; saram

인; in