Centre for Internet & Society

The Centre for Internet and Society (CIS) is doing a project on developing a text-to-speech software in 15 Indian languages. This blog post captures the key updates of the work done by CIS team for Bengali language.

Indian Bengali was added to eSpeak by the Indian team.  The language was added and improved based on user testing and review.

  1. Conjunct consonant pronunciation when (ba) is conjunct was corrected.
  2. INHERENT vowel PRONUNCIEATION SOME TIMES NEED TO BE SUPPRESSED.  It was made possible programmatically.
  3. Changed phoneme rules for য dZV modified to ্) যা :&
  4. sissue with য়/ Oj eg; desired pronunciation /Poi'sa but was being pronounced as 'payosa'
  5. Modified the [phoneme to jV and phone,me rules C)য় (C j
  6. Issue with 'ja'fala, that should double the pronunciation of the preceeding consonant, was rectified. Eg; Onnay was pronounced as 'onyai' as against the desired pronunciation
  7. The Bengali files sent by the developer on 20th December and Third January had been updated in the version.
  8. 71 Bengali Exception words had been included with their pronunciation rules - where inherent vowel even being between two consonant is not pronounced.
  9. Changed phoneme rules for য dZV modified to ্) যা :&
  10. issue with য়/ Oj was corrected. eg; desired pronunciation /Poi'sa but was being pronounced as 'payosa'
  11. Issue with 'ja'fala, that should double the pronunciation of the preceeding consonant was rectified. Eg; Onnay was pronounced as 'onyai' as against the desired pronunciation, The Bengali files sent by the developer on, 20th December and Third January had been updated in the version.
  12. 4/Auguest/2014Few exception words had been added  দশটা  , তিরিশটা, চল্লিশটা, পঞ্চাশটা, ষাটটা, সত্তরটা, করবে, বলবে, চলবে, শুনবে, শুনব, মানকচু, ওলকপি, ফটকিরি, চরণপদ্ম.
  13. In eSpeak  Indian test build 1.11, the following exception words were added:
  1. অমরসঙ্গী     O:mor'SongI
  2. চুপচাপ       cup'c,ap
  3. পাগলা        pa',g'la
  4. পাগলামো      pa',g'la'mo
  5. মর্যাদা       mO:rJ,ada
  6. আমপাতা      'ampa,ta
  7. আকাশবানী     akaS'banI
  8. বিদঘুটে       bi,dg#Ut.e
  9. দেবদেবী      deb'debI
  10. অথচ         O:t#oco
  11. আমরাই       am,rai
  12. খেলা         k#&'la
  13. খেলবো        k#el'bo
  14. চারটি        car,t.i
  15. একটি        ekt.i
  16. একটা      &kt.a:
  17. দরকার     dO:r'kar'
  18. দেখছি      dek#'c#i
  19. বলছি       bo,lc#i
  20. করছি       ko'rc#i
  21. শুনছি        Sun'c#i
  22. ভাবছি       b#a'bc#i
  23. ঘুরছি         g#U,rc#i
  24. দিনরাত      din'rat.
  25. সবরকম     SO:b,rO:kom
  26. কাজকর্ম     kaJ,kO:'rmo
  27. দুঃখ         duk#'k#o
  28. তিনটে        tin't.e
  29. চারটে        car't.e
  30. পাঁচটা        p~ac't.a
  31. সাতটা        Sat't.a
  32. আটটা        a:t.'t.a
  33. দশটা         dOS't.:a
  34. একুশটা       ekuS't.:a
  35. বাইশটা       ba:iS't.:a
  36. তেইশটা       tei'St.:a
  37. চব্বিশটা      co'b'biS't.:a
  38. পঁচিশটা       p~o'ciS't.:a
  39. ছাব্বিশটা      ch:a'b'biS't.:a
  40. সাতাশটা       Sat.aS't.:a
  41. আঠাশটা      a:t.#a:S't.:a
  42. উনত্রিশটা      un'triS't.:a
  43. তিরিশটা       tiriS't.:a
  44. একত্রিশটা     ek'triS't.:a
  45. বত্রিশটা      bot:riS't.:a
  46. তেত্রিশটা      tetriS't.:a
  47. চৌঁত্রিশটা      c~ou:triS't.:a
  48. পঁয়ত্রিশটা     p~oj'triS't.:a
  49. ছত্রিশটা      chot:riS't.:a
  50. সাঁইত্রিশটা     S~ai'triS't.:a
  51. আটত্রিশটা    a:t.triS't.:a
  52. উনচল্লিশটা    uno'col'l:iS't.:a
  53. চল্লিশটা      col'l:iS't.:a
Filed under: , ,
The views and opinions expressed on this page are those of their individual authors. Unless the opposite is explicitly stated, or unless the opposite may be reasonably inferred, CIS does not subscribe to these views and opinions which belong to their individual authors. CIS does not accept any responsibility, legal or otherwise, for the views and opinions of these individual authors. For an official statement from CIS on a particular issue, please contact us directly.