In contemporary Japanese writing, foreign-language loanwords and foreign names are normally written in the katakana script, which is one component of the Japanese writing system. As far as possible, sounds in the source language are matched to the nearest sounds in the Japanese language, and the result is transcribed using standard katakana characters, each of which represents one syllable (strictly mora). For example, America is written ã¢ã¡ãªã« (A-me-ri-ka). To accommodate various foreign-language sounds not present in Japanese, a system of extended katakana has also developed to augment standard katakana.
A much less common form of transcription, Ateji, uses kanji characters for their phonetic values.
Since Japanese has few closed syllables, syllable-final consonants in the source language are often represented using the -u (or sometimes -o or -i) kanas with implicitly silent vowels â though this vowel often is pronounced in Japanese â or the syllable coda is not represented at all. For example, the name Jim is written ã¸ã (Ji-mu). A similar principle applies to consonant clusters; for example spring would be transcribed as ã¹ãÂÂãªã³ã° (su-pu-ri-n-gu), and scratch would be transcribed as ã¹ã¯ã©ãÂÂã (su-ku-ra-tchi).
Japanese has only five native vowel sounds, each a pure vowel (monophthong) with a long and short form, and some degree of approximation is necessary when representing vowels from, for example, English. Diphthongs are represented by vowel sequences, as in ãÂÂã©ã¦ã³ Bu-ra-u-n "Brown", ãÂÂã¤ã¹ na-i-su "nice", ãÂÂã£ã¢ di-a "dear/deer", ã‹¢ re-a "rare". etc. The English spelling <ore> (phonologically /ÃÂÃÂ/ (RP) or /ÃÂÃÂr/ (GA)) is usually "diphthongized" as o-a in Japanese (e.g. ã³ã¢ ko-a "core"), possibly because it is also pronounced as a diphthong (/oÃÂ/) in some accents of English. English /eê/ is transcribed to either e-e (ã¨ã¼ã¹ e-e-su "ace") or e-i (ã¹ãÂÂã¤ã³ Su-pe-i-n "Spain"); similarly, /ÃÂÃÂ/ is transcribed to either o-o (ã·ã§ã¼ sho-o "show") or o-u (ã·ã£ãÂÂ㦠sha-do-u "shadow").
Long vowels are generally written with ã¼ to indicate lengthening, as in ã³ã¼ã© kà Âra (cola), rather than writing a distinct vowel ÃÂã³ã¦ã© *koura. There are two irregularities of note here. Firstly, lengthening of the final vowel may be ambiguous, and vary over time or between users. For example, in present Japan, "computer" is generally represented as ã³ã³ãÂÂãÂ¥ã¼ã¿ã¼ konpyà «tà(long final), but in some cases, such as the computer industry, following Japanese Industrial Standards, it is represented as ã³ã³ãÂÂãÂ¥ã¼ã¿ konpyà «ta (short final). Secondly, in modern Chinese loanwords, notably food names, in careful transcription diphthongs are represented by separate vowels, even if in Japanese they would appear to be a long vowel; this is particularly common with òu, especially in è± dòu "(soy) bean", usually rendered as ãÂÂã¦. Further, long vowels in the Japanese transcription need not reflect Chinese pronunciation. For example, the dish æÂ±å¡è "Dongpo pork", in pinyin dà Ângpà Âròu (dà Âng÷pà Â÷ròu), is represented in Japanese as ãÂÂã³ãÂÂã¼ãÂÂ㦠donpà Ârou, or more commonly ãÂÂã³ãÂÂã¼ãÂÂ㦠tonpà Ârou. Note that in Chinese pinyin à  represents a high tone, while in Japanese à  represents a long vowel, and /d/ is pronounced differently (Chinese /d/ is similar to Japanese or English /t/). This distinction is not always followed, and varies by term: the spelling ãÂÂã³ãÂÂã¼ãÂÂã¼ tonpà Ârà  is also common; and in terms such as Ã¥ÂÂéÂÂè twice cooked pork, the spelling ãÂÂã¤ã³ã¼ãÂÂã¼ is more common, despite representing diphthongs.
Although the diphthong /au/ across languages is usually transcribed as ã¢ã¦ a-u, local reading transcriptions of the same sequence from Mandarin, represented in both WadeâÂÂGiles and Pinyin as ao are represented as ã¢ãª a-o instead, again in more of a manner of transliteration based on these systems - e.g. ãÂÂãªãÂȋÂÂã©ã¼ãÂÂã³ ma-o tso-o-to-n (Mao Zedong).
Japanese does not have separate l and r sounds, and l- is normally transcribed using the kana that are perceived as representing r-. For example, London becomes ãÂÂã³ãÂÂã³ (Ro-n-do-n). Other sounds not present in Japanese may be converted to the nearest Japanese equivalent; for example, the name Smith is written ã¹ãÂÂã¹ (Su-mi-su). Foreign sounds can be difficult to express in Japanese, resulting in spellings such as ãÂÂã«ã·ãÂÂã§ã Furushichofu (Khrushchev), ã¢ãªã¼ãÂȋÂÂã¼ã¡ãÂÂã¤ã¼ Arë HÃÂmeneë (Ali Khamenei) and ã¤ãÂÂãÂÂã¯ãÂȋÂÂã¼ã«ãÂÂã³ Itsuhaku PÃÂruman or ã¤ãÂÂã¡ã¼ã¯ãÂȋÂÂã¼ã«ãÂÂã³ ItsÃÂku PÃÂruman (Itzhak Perlman).
The phoneme /v/ in various languages is transcribed either to b or v, although it is unknown whether there is such an equivalent phoneme /v/ in Japanese. For example, ãÂÂãÂÂãÂÂ㢠Benechia / ã´ã§ãÂÂãÂÂã£ã¢ Ve-ne-tsi-a "Venezia" (Italian for "Venice"), ãªã¼ãÂÂã¼ o-o-ba-a "over", ã©ã ra-bu / ã©ã´ ravu "love".
Wa is usually written as ã¯, although ã¦ã¡ is sometimes used in transcriptions from Ancient Greek or Latin (e.g. ãÂÂãÂÂã«ã¦ã¡ Mi-ne-ru-wa "Minerva").
Geminated consonants are typically transcribed consistently and faithfully, as gemination is also featured in Japanese. The only notable exceptions are /rr/ and /òò/, although /ll/ and /ÃÂÃÂ/ are still transcribed. Examples: is ã¢ãÂÂã©ã¼ã A-r-ra-a-fu; Italian Donatello is ãÂÂãÂÂãÂÂãÂÂã Do-na-te-r-ro; Italian degli is ãÂÂãÂÂ㪠de-r-ri; but Italian Verrocchio is simply ã´ã§ãÂÂãÂÂãÂÂ㪠Ve-ro-k-ki-o, not *Ve-r-ro-k-ki-o. Italian /òò/ may be transcribed as the lengthened portion of the preceding vowel and a sequence of /nj/. For example, Sardegna is ãµã«ãÂÂã¼ãÂÂ㣠Sa-ru-de-e-nya.
Plain short consonants may be transcribed as geminated consonants to reflect the laxness of the preceding vowel, although this is not universal and there are plenty of exceptions. For example: English kick is ãÂÂãÂÂ㯠ki-k-ku and castle is ãÂÂã£ãÂÂã¹ã« kya-s-su-ru, but extra is ã¨ã¯ã¹ãÂÂã© e-ku-su-to-ra and battle is ãÂÂãÂÂã« ba-to-ru. This practice expands to almost all English obstruents regardless of their voicing (/k/, /á/, /s/, /z/, /f/, etc.), also to German/Scots /x/, occasionally to /n/ and /m/ (as pseudo-geminated consonant sequences /nn/ or /nm/). For example: English bag is ãÂÂ(ãÂÂ)ã° ba-(g)-gu; English Anna is ã¢ã³ã A-n-na; English gamma is ㋳ã ga-n-ma; English shuffle is ã·ã£ãÂÂãÂÂã« sha-f-fu-ru; German Mach is ãÂÂãÂÂã ma-h-ha, Masoch is ãÂÂã¾ãÂÂã Ma-zo-h-ho.
German [x] is transcribed roughly as h-h, accordingly to its preceding vowel, if it's not followed by a vowel (e.g. ãÂÂãÂÂã ma-h-ha "Mach", ãÂÂãÂÂã Ba-h-ha "Bach", ãÂÂã¾ãÂÂã Ma-zo-h-ho "Masoch"); [ç], its allophone occurring only after high vowels and consonants, are as h if followed by a vowel (e.g. ã¡ã«ãÂÂã³ me-ru-hen "Märchen"), or as hi if not (e.g. ãªãÂÂã¿ã¼ Ri-hi-ta-a "Richter"). Russian /x/ is transcribed as fu if not followed by a vowel (e.g. ã«ã¶ãÂÂã¹ã¿ã³ Ka-za-fu-su-ta-n "Kazakhstan"). Mandarin [ÃÂ] (in pinyin x(i)) is transcribed as sh (e.g. ã·ã£ãª shao from å° xiÃÂo "little").
Similar to the way speakers of English say Italian words, Japanese does not usually transcribe the Italian glide /j/ to reflect its true nature, but as /i/, perhaps for consistency and convenience. For example, Venezia is ã´ã§ãÂÂãÂÂã£ã¢ Ve-ne-tsi-a, Sicilia is ã·ãÂÂãªã¢ Shi-chi-ri-a. Contemporary transcriptions of palatalized consonants from Slavic languages, however, are made using yà Âon, e.g.: Russian ãÂÂã£ãÂÂã´ã«ã¹ã¯ Pya-chi-go-ru-su-ku (Pyatigorsk), Polish ãÂÂã§ã«ã¹ã³=ãÂÂã£ã¯ Bye-ru-su-ko=bya-wa (Bielsko-Biaà Âa).
In modern times, an extended katakana system has developed to cater for foreign sounds not present in Japanese. Most of these novel katakana forms are digraphs, composed of standard katakana characters, but in digraph combinations not found in native words. For example, the word photo is transcribed as ãÂÂã©ã (fo-to), where the novel digraph ãÂÂã© (fo) is made up from ã (normally fu) plus a novel small combining form of 㪠(normally o). In other cases novel diacritics may be applied to create new sounds, such as ã´ for vu, which consists of 㦠(u) combined with a dakuten to indicate a voiced pronunciation.
Japanese is written without spaces between words, and, to aid understanding, foreign phrases and names are sometimes transliterated with an interpunct separating the words, called a ; for example, (Bill Gates). When it is assumed that the reader knows the separate gairaigo words in the phrase, the middle dot is omitted, especially for wasei eigo. For example, the phrase konpyà «tàgÃÂmu ("computer game") contains two well-known gairaigo, and therefore is not written with a middle dot; the same principle is applied for panti sutokkingu ("pantyhose", lit. "panty stocking"), Japanese coinage.
Because Japanese is written with relatively complex Kanji characters, Japanese text must generally be written larger for legibility. Furthermore, as both Kanji and Kana are traditionally of equal width and height, Japanese characters are generally much larger than Latin characters. As Kanji are logographic and Kana encode entire syllables (or rather, morae), the higher information density of Japanese writing usually evens out with the larger text so that Japanese and English texts take about the same amount of space, but challenges arise with foreign consonant clusters incompatible with Japanese phonotactics and the Kana system. For example, the word remote control becomes the cumbersome ãªã¢ã¼ãÂÂã³ã³ãÂÂãÂÂã¼ã« (ri-mà Â-to-ko-n-to-rà Â-ru) in Japanese. Here, additional vowels are added between and , between and , and after at the word's end, and the vowels of mo and ro have been lengthened to mimic the English pronunciation. These additional sounds not only add to the word's length when spoken, but it also severely bloats the word when written. As such, the word is typically shortened to simply ãªã¢ã³ã³ (ri-mo-ko-n) in modern Japanese speech and writing.
The English schwa /ÃÂ/ is variously "transcribed" to a, e, o, depending on the English spelling (this is more of transliteration than it is transcription). For example, ãÂÂãÂ¥ã¢ã« dyu-a-ru "dual", ãÂÂãÂ¥ã¨ã« dyu-e-ru "duel", ãÂÂã¹ã¿ã¡ã³ã Te-su-ta-me-n-to "Testament", ãÂÂã³ãÂÂã³ Ro-n-do-n "London". There are no definite rules when it comes to the schwa, however; e.g. ã©ã³ãÂÂã ra-n-da-mu "random", ãªã¼ãÂÂã³ o-o-pu-n "open", ã¶ za "the". The British /ÃÂ/ which is equivalent to the North American /ÃÂ/ is transcribed to a(-a); e.g. ã³ã³ãÂÂãÂ¥ã¼ã¿(ã¼) ko-n-pyu-u-ta(-a) "computer", ã¢ã¼ã¿ã¼ mo-o-ta-a "motor". On the other hand, the French schwa is transcribed to u or o (e.g. ã½ã ãªã¨ so-mu-ri-e "sommelier", ã do "de") similarly to instances where there's a lack of vowels, and the German schwa is almost always transcribed to e (e.g. ã¢ã«ãÂÂã«ã A-ru-be-ru-to "Albert", ã¦ã³ãÂÂã£ã¼ã un-di-i-ne "undine").
English /æ/ is typically transcribed to a; e.g. . The sequences /kæ/ and /áæ/ are sometimes transcribed to kya and gya respectively; e.g. , .
The older English suffix -age /-êdÃÂ/ is always transcribed to -e-e-ji as if it were pronounced as /eêdÃÂ/ as in "age" or "rage"; e.g. ã¡ãÂÂãÂȋ¼ã¸ me-s-se-e-ji "message", ãÂÂãÂÂã±ã¼ã¸ pa-k-ke-e-ji "package". The more recent -age /-ÃÂÃÂÃÂ/ is more "properly" transcribed to -a-a-ju; e.g. ãÂÂã©ã¼ã¸ãÂÂ¥ mi-ra-a-ju "mirage". However, "garage" /gÃÂÃÂrÃÂÃÂÃÂ/ is more commonly transcribed to ã‹‹¼ã¸ ga-re-e-ji as it also has /ÃÂgærêdÃÂ/ as an alternative pronunciation in British English.
English /ti(ÃÂ)/ and /tê/ is typically transcribed to ã chi (e.g. ãÂÂã¼ã chëmu "team"), but ãÂÂ㣠ti is also used (ãÂÂã£ã¢ tia "tear"). The suffix -tic can be transcribed to either ãÂÂãÂÂ㯠-chikku or ãÂÂã£ãÂÂ㯠-tikku. However, -ty is almost always transcribed to ãÂÂã£(ã¼) -ti(i), not *ãÂÂ(ã¼) *-chi(i) (e.g. ãÂÂã¼ãÂÂã£ã¼ pÃÂtë "party", ã¤ã³ãÂÂã£ãÂÂãÂÂ㣠infiniti "infinity").
The English voiceless labialized velar approximant /hw/ (orthographically wh), which is a distinct phoneme from /w/ in some varieties of English, can be transcribed as ho(w)-. For example, White is ãÂÂã¯ã¤ã Howaito, whale is ãÂÂã¨ã¼ã« hoÃÂru.
Modern English compounds are usually transcribed in a way that reflects the independent pronunciations of the individual components. That is to say, there is no phonetic linking between components. For example, "overall" is transcribed as ãªã¼ãÂÂã¼ãªã¼ã« o-o-ba-a-o-o-ru, not *o-o-ba-a-ro-o-ru as it is pronounced in English. However, there are a few exceptions, such as "pineapple", which is transcribed as ãÂÂã¤ãÂÂãÂÂãÂÂã« pa-i-na-p-pu-ru, or "double-u", as ãÂÂãÂÂãªã¥㼠da-bu-ryu-u.
French vowels are usually phonemically transcribed, but non-phonemic stressed vowels (utterance-final) are sometimes also transcribed as long vowels. Compare the examples of ã¡ã¾ã³ me-zo-n "maison" and ã«ã‹¼ ka-re-e "Calais", in which the same vowel /ÃÂ/ is transcribed as e and e-e depending on whether it is stressed or not. The French schwa is ignored altogether: words are usually transcribed as if there were no schwa at all. For example, the word "le" is transcribed as ã« ru, as is the single sound /l/ in "cheval" > ã·ãÂ¥ã´ã¡ã« shuvaru.
French /w/ is typically transcribed as u, but the sequence /wa/ is as o-(w)a (e.g. ãÂÂã¢ã Po-a-ro "Poirot").
Although a syllable-final /n/ is typically transcribed using the moraic ã³ n, ã³ is used in French to transcribe nasalized vowels, so French words with a final /n/ often use ã nu instead for distinction, e.g. ãÂÂãÂÂ㋼ã Ma-do-re-e-nu "Madeleine". This is especially the case when the masculine and feminine of a word are distinct in French, e.g. bon --> ãÂÂã³ bo-n, vs. bonne --> ãÂÂã³ã bo-n-nu (the n is sometimes doubled, especially when the French orthography uses two n, even if it has no consequence in the French pronunciation).
The German /v/ (orthographically w) can be transcribed in several ways. In long-established words, it is generally w. E.g.: Walküre "valkyrie" > ã¯ã«ãÂÂãÂ¥ã¼ã¬ wa-ru-kyu-u-re. In newer transcriptions, it can also be v. E.g.: Schwestern "sisters" > ã·ãÂ¥ã´ã§ã¹ã¿ã³ shu-ve-su-tan. The schwa , spelt e, is transcribed as e, as in the aforementioned wa-ru-kyu-u-re.
The following tables give the Hepburn romanization and an approximate IPA transcription for katakana as used in contemporary Japanese. Their use in transcription is, of course, in the inverse direction.
The following katakana have been developed or proposed specifically for the purposes of transcribing foreign words. Examples such as ãÂÂãÂÂ¥(tu) in ã«ã¼ãÂÂãÂ¥ã¼ã³(cartoon), ãÂÂã£(ti) in ãÂÂã¼ãÂÂã£(party), ãÂÂã¡ (tsa) in ã¢ã¼ãÂÂã¡ã«ãÂÂ(Mozart) are found mostly in foreign words.