The Pashto alphabet () is the right-to-left abjad-based alphabet developed from the Perso-Arabic script, used for the Pashto language in Afghanistan and Northwestern Parts of Pakistan.It originated in the 16th century through the works of Pir Roshan.
Pashto is written in the Arabic Naskh. Pashto uses all 28 letters of the Arabic alphabet, and shares 3 letters (<span style="font-size:140%;">ÃÂ</span>, <span style="font-size:140%;">þ</span>, and <span style="font-size:140%;">ÃÂ</span>) with Persian in the additional letters.
Pashto has several letters which do not appear in the Persian alphabet, which are shown in the table below:
All the additional characters are derived from existing Arabic letters by adding diacritics; for example, the consonants xÃÂën/á¹£ÃÂën and õe/áºÂÃÂe look like Arabic's sën and re respectively with a dot above and beneath. Similarly, the letters representing retroflex consonants are written with a small circle (known as a "paná¸Âak", "ÃÂaá¹Âwanday" or "skÃÂá¹Âay") attached underneath the corresponding dental consonants.
The consonant is written as either <span style="font-size:140%;">ë</span> or <span style="font-size:140%;">ï</span>.
In addition to Persian vowels, Pashto has <span style="font-size:140%;">æ</span>, <span style="font-size:140%;">ÃÂ</span>, <span style="font-size:140%;">ÃÂ</span>, and <span style="font-size:140%;">ÃÂ</span> for additional vowels and diphthongs.
Pashto employs stress: this can change the aspect of the verb and the meaning of the word. The Arabic alphabet does not show stress placement, but in transliteration it is indicated by the use of acute accent diactric: over the vowel.
Pashto has 45 letters and 4 diacritic marks. The Southeastern (SE) and Southwestern (SW), Northeastern (NE) and Northwestern (NW) dialects of Pashto are included.
The superscribed element of the letter <span style="font-size:140%;"></span> in earlier varieties was not hamza-shaped, but was very similar to little kÃÂf of the letter <span style="font-size:140%;"></span>. Such shape of the upper element of the letter is hard to find in modern fonts.
Since the time of Bayazid Pir Roshan, <span style="font-size:140%;"></span> (dÃÂl with subscript dot) was used for , which was still used in the Diwan of Mirza written in 1690 CE, but this sign was later replaced by <span style="font-size:140%;"></span>.
Another rare glyph for is <span style="font-size:140%;"></span>, a ì with the same dot about harakat.
The four diacritic marks are used:
Notes
Notes
There are broadly two standards for Pashto orthography, the Afghan orthography, which is regulated by the Academy of Sciences of Afghanistan, and the Peshawar orthography of the Pashto Academy in Peshawar. They used to be very similar in the past, until the orthography reforms were introduced in 1970s and 80s in Afghanistan. Both of them use additional letters: . The Afghan standard is currently dominant due to the lack and negative treatment of Pashto education in Pakistan. Most writers use mixed orthography combining elements of both standards. In Pakistan, Pashto speakers who are not literate in their mother tongue often use Urdu letters.
The main differences between the two are as follows:
Word-final -y sound is denoted by letter in Pakistan and dotless letter in Afghanistan. Word-final -i sound is denoted by letter in both Pakistan and Afghanistan. Pre-reform Afghan orthography used for both cases, and some writers still often confuse them.
Word-final -a sound is denoted by in Peshawar orthography, while the -ÃÂ sound is denoted by . Afghan orthography uses for both sounds.
The letters and for g are considered variants of the same character. Both are widely used, but the Afghan official materials prefer the form, while the Pakistani orthography sets a specific glyph for which looks like with a circle below. Most Arabic script fonts, however, only implement a form of that looks like with a circle.
Both standards prescribe the usage of for k. In practice, however, even the official sources often use the form. Historically, the two are calligraphic variants of the same character, is more common in modern Arabic, and is more common in Persian and Urdu. In Unicode they are split into two separate glyphs.
The y- sound before a -letter is written as in the Pakistani orthography and as in the Afghan orthography. Pre-reform Afghan orthography also used .
Pakistani orthography uses for the postposition kxÃÂe "in". Afghan standard prefers . In most dialects, this postposition is pronounced ke or ki, but the historical pronunciation, also found as a variant in some Southern Pashto dialects, is ká¹£ÃÂe. The verbal prefix (as in kenastÃÂl or ká¹£ÃÂenastÃÂl "to sit down") is still pronounced ká¹£ÃÂe- in Southern Pashto and ke- in Northern Pashto, but some Afghan authors may also spell it like . On the other hand, words with combination, like nÃÂxá¹£ÃÂa "mark, sign", bÃÂxá¹£ÃÂÃÂl "forgive, pardon", are written identically according to both standards, but some authors speaking Northern Pashto may write them according to their pronunciation: nÃÂxa, bÃÂxÃÂl.
In some auxiliary words like pronouns and particles, as well as in plural and oblique singular forms of feminine nouns, the Pakistani orthography uses , while the Afghan orthography often uses . It reflects the pronunciation of unstressed word-final -e in some Afghan dialects, particularly the Kandahari accent. Note also that the pronoun "you" is usually written tÃÂso in Pakistan, reflecting the local dialects. In Afghanistan, this pronoun is written tÃÂsi or tÃÂso. In verbal prefixes like pre-, ká¹£ÃÂe-/ke-, both standards use .
The auxiliary verb in passive constructions is often written without a space with the copula in the Afghan orthography. E.g., likÃÂle à ¡ÃÂwe da "is (fem.) written" may be spelled by some authors.
The potential/optative participles are written with -ÃÂy in Afghanistan (e.g. likÃÂlÃÂy "able to write"), and with -ay in Pakistan ( likÃÂlay). These participles are pronounced with -ÃÂy in Southern Pashto of Kandahar, but even the Kabuli writers who pronounce them with -ay use -ÃÂy to distinguish them from the past participles (\ likÃÂlay "written").
In both modern orthographies, matres lectionis ( for o and u, for i) should always be written in native Pashto words. Words like tÃÂruõmÃÂy "darkness, dark night", wrusta "after, behind" etc used to be and still sometimes are written as and . The borrowed words should be written the way they were in the original languages: bulbul "nightingale", or gul "flower".
The phrase pÃÂ xayr "welcome", lit. "well, successfully" is written in two words in Afghanistan (), but often as a single word in Pakistan ().
The Afghan orthography does not use a space in compound and suffixed words, while in Peshawar standard the letters should be disconnected without a space. The zero-width non-joiner is used in such cases.
The archaic orthography may also be used in certain texts, before standardisation.
Peshawar and Afghan standards also differ in the way they spell Western loanwords. Afghan spellings are influenced by Persian/Dari orthography, and through it often borrows French and German forms of the words, while Pakistani orthography is influenced by Urdu spellings of English words.
In the 16th century, Bayazid Pir Roshan from Waziristan Pakhtunkhwa invented the Roshani script to write Pashto. It had 41 letters:
28 of his letters came from the Arabic alphabet. He introduced 13 new letters into the Pashto alphabet. Most of the new letters he introduced i.e. <span style="font-size:140%;"></span> and <span style="font-size:140%;">ü</span> are still written in the same form and are pronounced almost in the same way in modern Pashto. The sound system of the southern dialect of modern Pashto preserves the distinction between all the consonant phonemes of his orthography.
Pir Roshan also introduced the letter <span style="font-size:140%;">ÃÂ</span> (ràwith dot below and dot above) to represent , like the â¨sâ© in pleasure, for which modern Pashto uses <span style="font-size:140%;">ÃÂ</span> instead. Modern Pashto uses the letter <span style="font-size:140%;">ÃÂ</span> to represent the sound (northern dialect: ), but for that sound, Pir Roshan used a letter looking like <span style="font-size:140%;">֕</span> (dÃÂl with central dot). His letter <span style="font-size:140%;">ÃÂ</span> (dÃÂl with dot below) to represent has been replaced by <span style="font-size:140%;">ÃÂ</span> in modern Pashto. He also used <span style="font-size:140%;">ÃÂ</span> (sën with three dots below), an obsolete letter from the medieval Nastaÿlëq script, to denote the letter <span style="font-size:140%;">ó</span> (representing ) only in the isolated form. The Arabic ligature <span style="font-size:140%;">ï»»</span> (lÃÂm-alif) was also used. Two of his letters, <span style="font-size:140%;">þ</span> and <span style="font-size:140%;">ÃÂ</span>, were borrowed from the Persian alphabet.
The following table (read from left to right) gives the letters' isolated forms, along with possible Latin equivalents and typical IPA values:
Waziristani has the following vowels:
These can potentially be romanised as:
In the Marwat dialect and in the KarlÃÂá¹Âi dialects presence of nasalised vowels has been noted. As such the nasalised vowels be transcribed in the following ways:
It can also be transcribed as: