A Chinese whole character, or whole Chinese character (), is a complete Chinese character. It lies at the final level of the stroke-component-character Chinese character composition.
According to their structures, Chinese characters can be divided into undecomposable characters () and decomposable characters (). An undecomposable character is formed by one primitive component and is also called a single-component character, a decomposable character can be decomposed into two or more components and is also called a multi-component character.
An undecomposable character is directly formed by strokes, cannot be decomposed into smaller components, though may be a component of a decomposable character. For example, 人 is an undecomposable characters formed by strokes ã and ãÂÂ, and is used to form character ä¸Â.
The following are some lists of undecomposable characters created by different authors.
Chinese Character Information Dictionary (æ¼¢åÂÂä¿¡æÂ¯åÂÂå ¸) contains a total of 7,785 standardized characters in the China Mainland. According to static statistics, there are 323 undecomposable characters, accounting for 4.149%. According to dynamic statistics, undecomposable characters account for 25.910% of the corpus. Because many single-component characters are frequently used characters. The list of undecomposable characters is as follows (in Pinyin order): å¹堫巴ç½ç¾åÂÂÃ¥ÂÂè´ÂæÂŒÂÂå¸Âå¿ åÂÂä¸Âç§ÂÃ¥ÂÂä¸ÂæÂ¥æÂÂÃ¥ÂÂÃ¥ÂÂ产é¿åÂÂ车è£辰æÂÂä¸ÂæÂ¿å°ºå½³æÂ¥èµ¤è«ä¸Âå·Âå·Â串åÂÂÃ¥ÂÂ寸大æÂ¹ä¸¹åÂÂæ°ÂçµåÂÂä¸Â丢ä¸ÂæÂÂå¾ä¸Âå¿èÂÂè³åÂÂä¹Âå¡æÂ¹é£ÂéÂÂ丰夫å¼Âç«éÂÂçÂÂä¸Âå¹²çÂÂæÂÂé©个è®æÂ´å·¥å¼ÂçÂÂ广é¾ÂæÂÂ亥禾ä¹ÂæÂኼÂå¹»å¥Âç«ä¸Â乩åÂÂå 己夹ç²æÂÂå ¼æÂ¬è§ÂÃ¥ÂÂå·¾æÂ¤ä»Âå ÂäºÂä¹Âä¹ éÂÂæÂ§è¼巨åÂÂå¼ÂÃ¥ÂÂå£æÂ¥ä¹ÂèÂÂ礼éÂÂÃ¥ÂÂÃ¥ÂÂé¶è¯两äºÂãÂÂ令é¾Âçª乱马买æ¯ÂçÂÂä¹Âé¨米èÂÂä¸Âå Âé¢çÂÂæ°Âç¿æÂ«æ¯ÂæÂ¨ç®ä¹Âå 年廿ä¹ÂçÂÂÃ¥ÂÂ女丬ä¹Âç®å¹çÂÂæ°Âä¹Âå¹³ä¸Â妻æ°ÂÃ¥ÂÂç¾Âä¸Âä¸Âæ±Âè¾²çÂŒÂÂ人壬åÂÂæÂ¥ä¹³å ¥å ä¼Â丧山ä¸Âåºå°Âç³身çÂÂÃ¥ÂÂçÂÂ尸失è±åÂÂç³å²è±Â士æ°Âä¸ÂäºÂæÂÂ书æÂ¯æÂÂæÂÂç©水å¶å¸巳èÂÂ太天头å¸åÂÂå Âå½Â屯ä¹Âç¦丸ä¸Â亡çÂÂé¦为åÂÂå«æÂªæÂÂä¹ÂæÂ æ¯ÂäºÂå Âå¿æÂÂå¤Â西习系ä¸Â乡象å°Âå¿ÂæÂÂè¡Â丫çÂÂè½§äºÂ严央ç¾Âä¹Âå¤Âä¹Âä¸Â页æÂ³ä¸Â夷ä¹Â已以ä¹Âä¹Âå¼Â亦尹å¼Âæ°¸ç¨尤尢ç±é ÂÃ¥ÂÂäºÂäºÂè¾禺é¨禹çÂÂè¿æÂ°æÂÂÃ¥ÂÂæÂÂæÂÂä¹Âä¸Âçªå ÂäºÂæÂ£ä¹ÂæÂ¢è±¸ä¸ÂéÂÂèÂÂå·ÂæÂ±ç«¹ä¸»ä¸Âé¹åÂÂèª
Su segmented 7,000 commonly used characters and obtained 233 undecomposable characters, accounting for 3.4%. The list of undecomposable characters is as follows (in stroke-based order): ä¸Âä¹ÂäºÂÃ¥ÂÂä¸ÂÃ¥ÂÂä¸ÂÃ¥ÂÂ堫人堥ä¹Âå¿ä¹ÂÃ¥ÂÂå åÂÂäºÂä¹ÂÃ¥ÂÂÃ¥ÂÂÃ¥ÂÂä¹Âä¸Âå¹²äºÂäºÂäºÂ士åÂÂå·¥æÂÂä¸Â寸ä¸Â大å Âä¸Âå¼Âä¸Âå°Âå£山巾åÂÂå·Â彳个ä¹Â习丸å¤Âå¡åÂÂ广亡é¨丫ä¹Âä¹Â尸已巳å¼Âå·±å«åÂÂÃ¥ÂÂÃ¥ÂÂä¹Â女é£ÂÃ¥ÂÂä¹ åÂÂ乡ä¹Â丰çÂÂäºÂå¼Â夫天æÂ é¦ä¸Âä¸Â廿æÂ¨äºÂå ä¸Â太ç¾¹尤车çÂÂ屯æÂÂäºÂç¦æÂ¢å°ÂæÂ°æÂ¥ä¸Âå 水æÂÂçÂÂæ¯Âæ°Â壬åÂÂå¤Âé¿çÂÂå¸Âçªä¹ÂæÂÂæ°Âå¿丹ç«为æÂ·å¿Â尹尺夬ä¸Âç¿巴书æ¯ÂçÂÂæÂªæÂ«æÂÂæÂ£çÂÂä¸ÂæÂ¾Â¯ä¸Âé¾ÂæÂÂå¹³çÂÂä¸Âå¸ä¸Âç®ä¸Âç²ç³çµç°ç±央å²åÂÂç¿å¹åÂÂçÂÂ失ç¢禾ä¸Âç½æÂ¥çÂÂä¹Âç¨ç©æ°Âä¹ÂÃ¥ÂÂ主åÂÂå¿ æ°¸å¼Âåºæ¯ÂèÂÂè³äºÂè£åÂÂÃ¥ÂÂ西夹夷æÂ³è«æÂ²æÂ±ä¸¢ä¹Âä¹Âè¼èªè¡ÂçªèÂÂç±³å·Âè¿严ç«æÂ´æÂÂ两é ÂæÂ¥éÂÂ串æÂÂ身系äºÂ鍿ÂÂÃ¥ÂÂç§Âè¾æÂŽÂÂ禺éÂÂ禹
In the Specification of the Undecomposable Characters Commonly Used in the Modern Chinese (ç°代常ç¨ç¬ä½ÂÃ¥ÂÂè§ÂèÂÂ), 256 modern commonly used undecomposable characters have been identified within the scope of modern Chinese characters, forming the "List of Modern Commonly Used Undecomposable Characters". The list of undecomposable characters is as follows (in stroke-based order): ä¸Âä¹ÂäºÂÃ¥ÂÂä¸ÂÃ¥ÂÂä¸ÂÃ¥ÂÂ堫人堥å¿åÂÂå ä¹ÂÃ¥ÂÂäºÂÃ¥ÂÂÃ¥ÂÂä¹ÂÃ¥ÂÂä¸Âå¹²äºÂå·¥åÂÂ士æÂÂä¸Â寸大ä¸Âä¸Âä¸Âä¸Âå°Âå£山巾åÂÂå·Â个æÂ¹ä¹ ä¹Âå¡丸åÂÂ广亡é¨丫ä¹Âä¹Â尸己已巳å¼ÂÃ¥ÂÂå«ä¹Â女åÂÂé£Âä¹ åÂÂ马乡丰çÂÂå¼ÂäºÂ天夫æÂ äºÂä¸Âä¸ÂæÂ¨äºÂä¸Âç¬太æÂ¹å°¤è½¦å·¨çÂÂ屯æÂÂäºÂç¦æÂ¢å°ÂæÂ°æÂ¥ä¸Âè´Âå 水è§ÂÃ¥ÂÂçÂÂæÂÂæ°Âæ¯Â壬åÂÂå¤Âé¿çÂÂæÂ¤çªç¶æÂÂæ°Âå¿丹é¸Âå ÂæÂÂæÂ¹ç«为æÂÂæÂ·å¿Âå°ºä¸Âå·´åÂÂäºÂ书çÂÂæÂªæÂ«åÂȾ£çÂÂä¸ÂæÂ¾Â¯ä¸Âç³æÂÂé¾Âå¹³ä¸Âå¡å¸ä¸ÂæÂ¨ä¸Âç²ç³çµç°ç±å²央åÂÂç¿å¹åÂÂçÂÂç¢失ä¹Â禾ä¸Âç½æÂ¥çÂÂä¹Âç¨ç©ä¹ÂÃ¥ÂÂÃ¥ÂÂé¸Â主ç«ÂÃ¥ÂÂ头忠永æ°Âå¼ÂåºçÂÂæ¯Âè³äºÂè£åÂÂÃ¥ÂÂ西ç¾èÂÂ页夹夷è«æÂ²èÂÂå¹´æÂ±è¼èªè¡Âå¤èÂÂ亦衣产亥ç¾Âç±³å·ÂÃ¥ÂÂ严æ±Âç«æÂ´æÂÂ两é ÂæÂ¥å¤éÂÂ串æÂÂ身å±è¨Âç¾Âå¼ÂäºÂ鍿ÂÂÃ¥ÂÂç§ÂèÂÂé¶æÂ¿é©æÂŽÂ¢éÂÂ鬼禹é¦Â堼象é¼Â
The List of Commonly Used Character Components in the Specification of Common Modern Chinese Character Components and Component Names (ç°代常ç¨åÂÂé¨件åÂÂé¨件åÂÂç§°è§ÂèÂÂ) includes a total of 311 commonly used character components, i.e., undecomposable characters. The list of undecomposable characters is as follows (in Pinyin order): å¹堫巴ç½ç¾åÂÂÃ¥ÂÂÃ¥ÂÂè´ÂæÂŒÂÂå¿ ä¸Âç§ÂÃ¥ÂÂä¸ÂæÂÂÃ¥ÂÂÃ¥ÂÂ产é¿åÂÂ车è£辰æÂ¿å°ºæÂ¥èµ¤è«ä¸Âåºå·Â串åÂÂÃ¥ÂÂå±寸大æÂ¹ä¸¹åÂÂÃ¥ÂÂå¼ÂçµåÂÂä¸Âé¼Âä¸Âå ÂæÂÂè±Âå¿èÂÂè³äºÂÃ¥ÂÂå¡æÂ¹é£ÂéÂÂ丰å¤夫å¼Âç«ç¶ä¸Âå¹²çÂÂé«ÂæÂÂé©个æÂ´å·¥å¼Âè°·çÂÂ广é¾Â鬼æÂÂ亥禾é»ÂÃ¥ÂÂä¹ÂäºÂæÂ·é»Âç«åÂȌÂÂå 己夹ç²堼æÂ¬è§Âè§Âå·¾æÂ¤ä»ÂéÂÂ京äºÂä¹Âä¹ éÂÂè¼巨堷å¡å¼Âå£æÂ¥èÂÂä¹ÂéÂÂÃ¥ÂÂç«ÂÃ¥ÂÂ丽é¶两äºÂå Âé¾Âå¤鹿åÂ愯Â马æ¯ÂçÂÂå¯ä¹Âé¨米å Â颿°Âç¿æÂ«æ¯ÂæÂ¨ç®ä¹ÂÃ¥ÂÂå 年é¸ÂçÂÂÃ¥ÂÂ女ä¹Âç®çÂÂä¹Âå¹³ä¸Â妻堶æ°ÂÃ¥ÂÂ欠ä¸Âä¸Âæ±ÂæÂ²åÂȍÂŒÂÂ人壬åÂÂæÂ¥èÂÂå ¥ä¸Âä¼Âè²山ä¸Âåºå°ÂèÂÂç³身åÂÂçÂÂ尸失åÂÂç³é£Âå²ç¢士æ°Â示ä¸ÂäºÂæÂÂé¦Â书鼠æÂ¯æÂÂç©永å¸ä¸Âå·³åÂÂèÂÂ太天ç°头å¸åÂÂ屯ç¦丸ä¸Â亡çÂÂå«为æÂªæÂÂæÂÂä¹ÂæÂ äºÂÃ¥ÂÂå¿æÂÂå¤Â西习ä¸Â乡åÂÂ象å°Âå¿Âè¾ÂæÂÂç©´è¡ÂçÂÂ丫çÂÂäºÂ严è¨Â央ç¾Âå¤Âä¹Âä¸Â页ä¸Â衣夷ä¹Âå·²ä¹Â亦庸永ç¨尤ç±é ÂÃ¥ÂÂäºÂé±¼ä¸ÂäºÂé¨禹çÂÂæÂ°æÂÂäºÂÃ¥ÂÂä¹Âä¸Âçªå ÂäºÂæÂ£ä¹Âç´æÂ¢è³ä¸ÂéÂÂèÂÂå·ÂæÂ±ç«¹ä¸»ä¸ÂÃ¥ÂÂèª
Based on the above experimental results, it is estimated that the number of undecomposable characters in modern Chinese characters approximately account for 4%.
Since each experimenting family has slightly different understandings of components and often uses different character sets, there are differences in the number of undecomposable characters obtained. But generally speaking, the results are quite similar, ranging between over 200 to over 300.
A decomposable character can be decomposed into more than one component. For example, "Ã¥ÂÂ" (character) is formed by two components (å® + Ã¥ÂÂ).
There are two frequently used modes of component combination in the study of Chinese character structures: first-level component combination and primitive component combination.
The first-level component combination mode or pattern is what people often call the structures of Chinese characters. According to this analysis, the structures of decomposable characters can be divided into 4 major categories and 13 subcategories:
Full surround:
Surrounded from three sides:
Surrounded from two sides:
The following data is excerpted from Chinese Character Information Dictionary, with 7,785 Chinese standard characters.
According to statistics from the Chinese Character Information Dictionary, there are a total of 49 characters of overlaid (or nested, including fully surrounded) structure among the 7,785 Chinese standard characters in the dictionary:
Ã¥ÂÂè¤Âä¹Âå±å¤åºå½裹åÂÂå°åÂÂÃ¥ÂÂÃ¥ÂÂè¡°ç½åÂÂç°å¾å¢å´巫å 幽åÂÂÃ¥ÂÂè¡·å©åÂÂè¤å Â袤亵è£ÂÃ¥ÂÂå¡åµå«å¹å¿åÂÂÃ¥ÂÂÃ¥ÂÂÃ¥ÂÂè±³å®åÂጴÂÃ¥ÂÂãÂÂ¥
If the full surrounded characters are moved to the surrounding category, the overlaid characters will be even less.
According to the planar analysis by primitive components, Chinese character structures include the following modes or patterns:
The level to which Chinese character components should be divided must be determined based on specific needs. For example, Chinese character teaching often uses a coarser level of analysis in order to be concise, while component-encoding Chinese character input methods often use relatively detailed analysis in order to reduce coding elements.
The following data is excerpted from Chinese Character Information Dictionary. The components here refer to primitive components.
The static distribution is mainly concentrated in the number of components 2, 3, and 4; the dynamic distribution is mainly concentrated in the number of components 1, 2, and 3. In both dynamic and static distribution statistics, more than 99% of the characters have less than 5 primitive components. Note that the static component counts for component numbers 1 and 5 are the same, but their dynamic component counts are very different.