Tamil phonology is characterised by the presence of "true-subapical" retroflex consonants and multiple rhotic consonants. Its script does not distinguish between voiced and unvoiced consonants; phonetically, voice is assigned depending on a consonant's position in a word, voiced intervocalically and after nasals except when geminated. Tamil phonology permits few consonant clusters, which can never be word initial.
The vowels are called ' ('life letter'). The vowels are classified into short and long (five of each type) and two diphthongs.
The long (netil) vowels are about twice as long as the short (kuá¹Âil) vowels. The diphthongs are usually pronounced about 1.5 times as long as the short vowels, though most grammatical texts place them with the long vowels.
Tamil has two diphthongs: and , the latter of which is restricted to a few lexical items. Some like Krishnamurti consider the diphthongs as clusters of /a/ + /j, ÃÂ/ as they pattern with other VC. The way some words are written also varies e.g. avvai as (avvai), (auvai) or (avvay) (first one most common). Word final is pronounced as , it is called a (kuá¹Âá¹Âiyalukaram) "short u" (as it has only half a sound unit, compared to 1, 1.5 or 2 of other vowels) in tolkÃÂppiyam and it is unrounded even in literary Tamil; in spoken Tamil it can occur medially as well in some words after the first syllable. Word final [u] occurs in some names, chiefly nicknames like rÃÂjÃÂndraá¹ as rÃÂju but also some other terms of the form (C)VCu as in karu, puḻu. There is a minimal pair naá¹Âu "center", naá¹Âà  "to plant". In declensions the full u isnt deleted like the half u like naá¹Âuvillai - naá¹Âillai.
Colloquially, an initial or may have a onglide; likewise, an initial or may have a onglide, e.g. and . In Karnatakan dialects short versions of them may further become [a, ÃÂa], eg. eá¹Âá¹Âu, utai /ÃÂnnÃÂ, ÃÂÃÂdêiki/. This is very light or doesn't happen in Sri Lankan dialects.
Indian Colloquial Tamil also has nasalized vowels formed from word final vowel + nasal cluster (except for /Vó/ where an epenthetic u is added after it). Long vowel + nasal just nasalizes the vowel, short vowel + nasal may also change the quality, for example, /an/ gets fronted to [ÃÂÃÂ] becomes [aÃÂÃÂÃÂ] ([aÃÂæÃÂ] for some speakers), /am/ gets rounded to [õ] becomes , becomes , becomes , the remaining vowels only get nasalized. There is word final nasals in loanwords like á¹Âaim, à Âm, sÃÂmÃÂn. Karnataka's dialects have [ÃÂÃÂ] for /an/ and -m is just deleted, eg. maram [mÃÂrÃÂ].
In spoken Tamil sometimes an epenthetic vowel u is added to words ending in consonants, e.g. nil > nillu, ÃÂḷ > ÃÂḷu, nÃÂḷ > nÃÂḷu (nàin some dialects), vayal > vayalu etc. If another word is joined at the end, it is deleted.
Colloquially, the high short vowels , are lowered to and when next to a short consonant and . For example, becomes ; and becomes . This is an instance of raising umlaut. It doesn't happen in pronouns and some other words e.g. ivaá¹ and evaá¹ are different words. also monophthongises to an but it causes the lowering of before it, e.g. ilai > ele/ela. Additionally, the front long vowels and are subject to retraction when present in the first syllable of a bisyllabic word and followed by a retroflex consonant. As such, "house" becomes , but its inflected form remains . Likewise, "search!" becomes , but "(he) searched" remains . The presence and degree of retraction for each vowel may be different; it varies between dialects and even individual speakers. Almost all words end with vowels in spoken Tamil.
For some speakers in spoken Tamil the front vowels /i(:), e(:)/ get rounded to their corresponding rounded back vowels when they are after a labial consonant /m, p, ÃÂ/ and before a retroflex consonant, some words with it are quite acceptable like à ®ªà ¯Âà ®£à ¯ /peó/ > à ®ªà ¯Âà ®£à ¯Â/à ®ªà ¯Âà ®£à ¯Âà ®£à ¯ [poó~poó:ï] but others like à ®µà ¯Âà ®Âà ¯ /ÃÂi:ÃÂu/ > à ®µà ¯Âà ®Âà ¯ [ÃÂu:ÃÂï] are less accepted and may even be considered vulgar.
Another change in spoken Tamil is vowel harmony, where vowels change their height to be more similar to nearby vowels: e.g. literary Tamil /koÃÂu/ > spoken Tamil [kuÃÂï].
The consonants are known as ' ('body letters'). The consonants are classified into three categories with six in each category: valliá¹Âam ('hard'), melliá¹Âam ('soft' or nasal), and iá¹Âayiá¹Âam ('medium'). Tamil has very restricted consonant clusters (for example, there are no word-initial clusters). There are well defined rules for voicing stops in the written form of Tamil, Centamiḻ (the period of Tamil history before Sanskrit words were borrowed). Stops are voiceless when at the start of a word, in a consonant cluster with another stop and when geminated. They are voiced otherwise.
Tamil is characterized by its use of more than one type of coronal consonants: like many of the other languages of India, it contains a series of retroflex consonants. Notably, the Tamil retroflex series includes the retroflex approximant () (example Tamiḻ; often transcribed 'zh'). Among the other Dravidian languages, the retroflex approximant also occurs in Malayalam, old Badaga, old Telugu and old Kannada. In most dialects of colloquial Tamil, this consonant is seen as shifting to the retroflex lateral approximant in the south and palatal approximant /j/ in the north.
The proto-Dravidian alveolar stop *ṯ developed into an alveolar trill /r/ in the Southern and South Central Dravidian languages while *ṯṯ and *á¹Âṯ remained (modern á¹Âá¹Â, á¹Âá¹Â).
[n] and [nê] are mostly in complementary distribution and are predictable; namely, [nê] occurs word initially and before /tê/, while [n] occurs everywhere else. [nê] occurs in other places from compounding of words starting with it and geminates from sandhi, eg. annÃÂram (a-nÃÂram) , ainà «á¹Âu (ai-nà «á¹Âu) but speakers might use the alveolar nasal there instead.
/ò/ is rare word initially and is mostly only found before /táÃÂ/ word medially; it occurs in geminated form as in ÃÂññai or maññai, mostly from Sanskritic loans with jñ or sandhi where the 2nd word starts with ñ aññÃÂá¹Âam, in singular form in some rare words pà «Ã±ai, añal and in compounds like aá¹Âiñaá¹Â. Only around 5 words have doubled intervocalic [à Â], all are different forms of the word aá¹ á¹ aá¹Âam "that manner", apart from that [à Â] only occurs before /k/.
A chart of the Tamil consonant phonemes in the International Phonetic Alphabet follows:
The voiceless consonants are voiced in different positions.
In modern Tamil, however, voiced plosives occur initially in loanwords. Geminate stops get simplified to singleton unvoiced stops after long vowels, suggesting the primary cue is now voicing (cf. kà «á¹Âá¹Âam-kà «á¹Âam becoming kà «á¹Âam-kà «á¸Âam in modern speakers). Altogether, we see a shift in progress towards phonemic voicing, more advanced in some dialects than others.
Historically [j] was a possible allophone of medial -c- now the terms with [j] have solidified, compare Kannada which only had [s] as the medial allophone, Tamil ñÃÂyiá¹Âu, Kannada nÃÂsaru. In some cases both remained as in ucir, uyir. There are also cases where the opposite happened due to hypercorrection, eg. Tamil kayiá¹Âu, Madurai Tamil kacaru, kacuru, kaciru even though the word didn't originally have a -c-. There are also cases where it became t mutalai/mutaḷai/mucali, Kannada mosaḷe and disappeared after lengthening the previous vowel nilÃÂ, Kodava nelaci.
Old Tamil had a phoneme called the ÃÂytam, which was written as âÂÂ'. Tamil grammarians of the time classified it as a dependent phoneme (or restricted phoneme) ('). The rules of pronunciation given in the TolkÃÂppiyam, a text on the grammar of old Tamil, says that the ÃÂytam in old Tamil patterned with semivowels and it occurred after a short vowel and before a stop; it either lengthened the previous vowel, geminated the stop or was lost if the following segment is phonetically voiced in the environment. It is said to be the descendant of Proto Dravidian laryngeal *H. The ÃÂytam in modern Tamil is used to transcribe foreign phones like à ®Âà ®ªà ¯ (à ®Âp) for [f], à ®Âà ® (à ®Âj) for [z], à ®Âà ®¸ (à ®Âs) for [z, ÃÂ] and à ®Âà ® (à ®Âk) for [x], similar to a nuqta.
Unlike most Indic scripts, Tamil does not have distinct letters for aspirated consonants and they are found as allophones of the normal stops. The Tamil script also lacks distinct letters for voiced and unvoiced stops as their pronunciations depend on their location in a word. For example, the voiceless stop occurs at the beginning of words while the voiced stop cannot. In the middle of words, voiceless stops commonly occur as a geminated pair like -pp-, while voiced stops do not. Only voiced stops can appear medially and after a corresponding nasal. Thus both the voiced and voiceless stops can be represented by the same script in Tamil without ambiguity, the script denoting only the place and broad manner of articulation (stop, nasal, etc.). The TolkÃÂppiyam cites detailed rules as to when a letter is to be pronounced with voice and when it is to be pronounced unvoiced. The only exceptions to these rules are the letters à ® and à ®± as they are pronounced medially as and respectively.
Some loan words are pronounced in Tamil as they were in the source language, even if this means that consonants which should be unvoiced according to the TolkÃÂppiyam are voiced.
Elision is the reduction in the duration of sound of a phoneme when preceded by or followed by certain other sounds. There are well-defined rules for elision in Tamil. They are categorised into different classes based on the phoneme which undergoes elision.
1. Kuá¹Âá¹Âiyal ukaram refers to the vowel turning into the close back unrounded vowel at the end of words (e.g.: âÂÂà ®Âà ®±à ¯Ââ (meaning âÂÂsixâÂÂ) will be pronounced ).
2. Kuá¹Âá¹Âiyal ikaram refers to the short which is formed when a word ending with kuá¹Âá¹Âiyalukaram is joined with a word starting with the consonant .
The following text is Article 1 of the Universal Declaration of Human Rights.
All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood.
à ®®à ®©à ®¿à ®¤à ®ªà ¯ à ®ªà ®¿à ®±à ®¿à ®µà ®¿à ®¯à ®¿à ®©à ®°à ¯ à ®Âà ®Âà ®²à ®°à ¯Âà ®®à ¯ à ®Âà ¯Âà ®¤à ®¨à ¯Âà ®¤à ®¿à ®°à ®®à ®¾à ®Âà ®µà ¯ à ®ªà ®¿à ®±à ®Âà ¯Âà ®Âà ®¿à ®©à ¯Âà ®±à ®©à ®°à ¯Â; à ® à ®µà ®°à ¯Âà ®Âà ®³à ¯ à ®®à ®¤à ®¿à ®ªà ¯Âà ®ªà ®¿à ®²à ¯Âà ®®à ¯Â, à ®Âà ®°à ®¿à ®®à ¯Âà ®Âà ®³à ®¿à ®²à ¯Âà ®®à ¯ à ®Âà ®®à ®®à ®¾à ®©à ®µà ®°à ¯Âà ®Âà ®³à ¯Â, à ® à ®µà ®°à ¯Âà ®Âà ®³à ¯ à ®¨à ®¿à ®¯à ®¾à ®¯à ®¤à ¯Âà ®¤à ¯Âà ®¯à ¯Âà ®®à ¯ à ®®à ®©à ®Âà ¯Âà ®Âà ®¾à ®Âà ¯Âà ®Âà ®¿à ®¯à ¯Âà ®¯à ¯Âà ®®à ¯ à ®Âà ®¯à ®±à ¯Âà ®ªà ®£à ¯Âà ®ªà ®¾à ®Âà ®ªà ¯ à ®ªà ¯Âà ®±à ¯Âà ®±à ®µà ®°à ¯Âà ®Âà ®³à ¯Â. à ® à ®µà ®°à ¯Âà ®Âà ®³à ¯ à ®Âà ®°à ¯Âà ®µà ®°à ¯Âà ®Âà ®©à ¯Âà ®°à ¯Âà ®µà ®°à ¯ à ®Âà ®Âà ¯Âà ®¤à ®° à ®Âà ®£à ®°à ¯Âà ®µà ¯Âà ®ªà ¯ à ®ªà ®¾à ®Âà ¯Âà ®Âà ®¿à ®²à ¯ à ®¨à ®Âà ®¨à ¯Âà ®¤à ¯Âà ®Âà ¯Âà ®³à ¯Âà ®³à ®²à ¯ à ®µà ¯Âà ®£à ¯Âà ®Âà ¯Âà ®®à ¯Â.
maá¹Âitap piá¹Âiviyiá¹Âar cakalarum cutantiramÃÂkavàpiá¹Âakkiá¹Âá¹Âaá¹Âar; avarkaḷ matippilum, urimaikaḷilum camamÃÂá¹Âavarkaḷ, avarkaḷ niyÃÂyattaiyum maá¹ÂaccÃÂá¹Âciyaiyum iyaá¹Âpaá¹ÂpÃÂkap peá¹Âá¹Âavarkaḷ. Avarkaḷ oruvaruá¹Âaá¹Âoruvar cakà Âtara uá¹Âarvup pÃÂá¹ kil naá¹Âantukoḷḷal vÃÂá¹Âá¹Âum.
/manitêapâ¿piriÃÂijinaþ sakalaþum sutêantêiþamaÃÂkaÃÂeàpirakkinranaþ àaÃÂaþkaàmatêippilum uþimaiïkaÃÂilum samamaÃÂnaÃÂaþkaàaÃÂaþkaànijaÃÂjatêtêaiïjum manatáÃÂtáÃÂaÃÂÃÂtáÃÂijum ijarpaópaÃÂkapâ¿perraÃÂaþkaààaÃÂaþkaàoþuÃÂaþuÃÂanoþuÃÂaþ sakoÃÂtêaþa uóaþÃÂupâ¿paÃÂà Âkil naÃÂantêukoÃÂÃÂal ÃÂeÃÂóÃÂum/