22
Portable Text to Speech for Indonesian Language (Bahasa) Ilham Ari Elbaith Zaeni 安安安 DA220207 Presented on Seminar Class March 18, 20014 Dept of Electrical Engineering Southern Taiwan University of Science and Technolog

Portable Text to Speech for Indonesian Language (Bahasa)

  • Upload
    osanna

  • View
    42

  • Download
    2

Embed Size (px)

DESCRIPTION

Portable Text to Speech for Indonesian Language (Bahasa). Ilham Ari Elbaith Zaeni 安啓聖 DA220207. Presented on Seminar Class March 18, 20014 Dept of Electrical Engineering Southern Taiwan University of Science and Technology. Outline. Definition Background System Design Algorithm - PowerPoint PPT Presentation

Citation preview

Page 1: Portable Text to Speech for Indonesian Language (Bahasa)

Portable Text to Speech for Indonesian Language (Bahasa)

Ilham Ari Elbaith Zaeni安啓聖

DA220207

Presented on Seminar ClassMarch 18, 20014

Dept of Electrical EngineeringSouthern Taiwan University of Science and Technology

Page 2: Portable Text to Speech for Indonesian Language (Bahasa)

Outline

• Definition• Background• System Design• Algorithm• Hardware Implementation• Result

Page 3: Portable Text to Speech for Indonesian Language (Bahasa)

Definition

• A text-to-speech (TTS): system converts normal language text into speech [wikipedia]

• Syllable : a unit of pronunciation having one vowel sound, with or without surrounding consonants, forming the whole or a part of a word

• Language of Indonesia– Indonesia have >700 languages. – The official language is Indonesian (known as Bahasa

Indonesia), a variant of Malay + other languages. [wikipedia]

Page 4: Portable Text to Speech for Indonesian Language (Bahasa)

LanguageNumber

(millions)Year

surveyedMain areas where spoken

Indonesian/Malay 210 2010 throughout Indonesia

Javanese 84.32000

(census)Northern Banten, Northern West Java, Yogyakarta, Central Java andEast Java

Sundanese 342000

(census) West Java, Banten

Madurese 13.62000

(census) Madura Island (East Java)Minangkabau 5.5 2007 West Sumatra, RiauMusi (Palembang Malay)[4] 3.9

2000 (census) South Sumatra

Manado Malay (Minahasan) 3.8 2001 Minahasa, North SulawesiBugis 3.5 1991 South Sulawesi

Banjarese 3.52000

(census)South Kalimantan, East Kalimantan, Central Kalimantan

Acehnese 3.52000

(census) Aceh

Balinese 3.32000

(census) Bali Island and Lombok IslandBetawi 2.7 1993 Jakarta etc

Page 5: Portable Text to Speech for Indonesian Language (Bahasa)

English one two water person house dog coconut

Kutainese satu due ranam urang rumah koyok nyiurIndonesian/

Malay satu dua air orang rumah anjing kelapa

Javanese siji loro banyu uwong omah asu kambil

Sundanese hiji dua cai/ci jalma imah anjing kalapa

Madurese settong dhuwa' âên oreng roma pate' nyior

Minangkabau cie' duo aie urang rumah anjiang karambia

Palembang Malay sikok duo banyu wong rumah anjing kelapo

Page 6: Portable Text to Speech for Indonesian Language (Bahasa)

Loan words of Sanskrit Origin◦ भा�षा� bahasa (language),◦ का�च kaca (glass, mirror), ◦ रा�ज- raja (king), ◦ मनु ष्य manusia (mankind), ◦ भा�मिम bumi (earth/ world), ◦ आगम agama (religion), ◦ स्त्री� Istri (wife/woman),

Page 7: Portable Text to Speech for Indonesian Language (Bahasa)

Loan words of Arabic Origin◦ selamat ( السالم salaam = peace)◦ dunia (دنياdunya = the present world), ◦ Sabtu (السبت as-sabt =Saturday), ◦ kabar (خبر ḵabar = news), ◦ ijazah (إجازة ijāza = vacation), ◦ kitab (كتاب kitāb = book), ◦ tertib (ترتيب tartīb = orderly) ◦ kamus(قاموس qāmūs = dictionary)

Page 8: Portable Text to Speech for Indonesian Language (Bahasa)

Loan words of Chinese Origin◦ pisau ( 匕首 bǐshǒu  – knife), ◦ loteng, ( 樓 /層 = lóu/céng – [upper] floor/ level), ◦mie ( 麵 > 面 Hokkien mī – noodles),◦ lumpia ( 潤餅 (Hokkien = lūn-piáⁿ) – springroll), ◦ cawan ( 茶碗 cháwǎn – teacup), ◦ teko ( 茶壺 > 茶壶 = cháhú [Mandarin], teh-ko

[Hokkien] = teapot), ◦ 苦力 kuli = 苦 khu (bitter) and 力 li (energy)

Page 9: Portable Text to Speech for Indonesian Language (Bahasa)

Loan words of Portuguese Origin◦meja (from mesa = table), ◦ boneka (from boneca = doll), ◦ jendela (from janela = window), ◦ gereja (from igreja = church), ◦ bendera (from bandeira = flag),◦ sepatu (from sapato = shoes), ◦ keju (from queijo = cheese),◦mentega (from manteiga = butter), ◦Minggu (from domingo = Sunday)

Page 10: Portable Text to Speech for Indonesian Language (Bahasa)

Loan words of Dutch Origin◦ polisi (from politie = police), ◦ kualitas (from kwaliteit = quality), ◦ rokok (from roken = smoking cigarettes), ◦ korupsi (from corruptie = corruption), ◦ kantor (from kantoor = office), ◦ resleting (from ritssluiting = zipper) ◦ gratis (from gratis = free)

Page 11: Portable Text to Speech for Indonesian Language (Bahasa)

Background

• Developed on 2005 for – LCEN competition– Bachelor degree final project on Dept of Electrical

Engineering Brawijaya University.• To help people who cannot speak • It should be portable• Why not applied on the smartphone?

– Iphone released on 2007.– First Android (HTC Dream) released on 2008

[wikipedia_Smartphone].

Page 12: Portable Text to Speech for Indonesian Language (Bahasa)

System Design

• Process that should do:– User can type a text using keypad/keyboard.– Text stored into memory– Text converted into syllable– Play sound that is corresponding the syllable.

Page 13: Portable Text to Speech for Indonesian Language (Bahasa)

System Design

• What is needed by the system– Text Editor

• Input > 26 button Using 2 keypad 4x4• Display LCD 2x16• Processor Microcontroller AT89S8252

– Memory for the Sound • ISD25120 Chipcorder from Windbond • Audio Amplifier

Page 14: Portable Text to Speech for Indonesian Language (Bahasa)

Algorithm

Page 15: Portable Text to Speech for Indonesian Language (Bahasa)

Syllable spelling (on Bahasa)Bahasa have :

vowel (a, e, i, o, u)consonant (b, c, d, f, g, h, j, k, l, m, n, p, q, r,s, t, v, w, x, y, z)

Diphthong (ai, au, oi)Consonants combination (kh, ng, ny, sy, etc)

Page 16: Portable Text to Speech for Indonesian Language (Bahasa)

Rule of spelling (on Bahasa)If there are two vowel in sequence, the words

should be separated between two vowels.Ex: ma-in (play), bu-ah (fruit)

If there is one consonant between two vowel, the words should be separated before the consonantEx: sa-ya (i), ka-mu (you)

Page 17: Portable Text to Speech for Indonesian Language (Bahasa)

Rule of spelling (on Bahasa)If there is two consonant between two vowel,

the words should be separated between the consonantEx: man-di (take a bath), makh-luk (creature)

If there is three consonant between two vowel, the words should be separated after the first consonantEx: ul-tra, in-fra [wikisource]

Page 18: Portable Text to Speech for Indonesian Language (Bahasa)

Hardware Implementation

Page 19: Portable Text to Speech for Indonesian Language (Bahasa)

Hardware Implementation

Page 20: Portable Text to Speech for Indonesian Language (Bahasa)

Hardware Implementation

Page 21: Portable Text to Speech for Indonesian Language (Bahasa)

ResultNo. Input Text Meaning

The expected output

Output Correct/Wrong

1 saya i /sa/ /ya/ /sa/ /ya/ correct

2 anda you /an/ /da/ /an/ /da/ correct

3 dia he/she /di/ /a/ /di/ /a/ correct

4 intra intra /in/ /tra/ - wrong

5 mereka they /me/ /re/ /ka/ /me/ /re/ /ka/ correct

6 keluar out /ke/ /lu/ /ar/ /ke/ /lu/ /ar/ correct

7 alias alias /a/ /li/ /as/ /a/ /li/ /as/ correct

8 anak child /a/ /nak/ /a/ /na/ /ak/ wrong

9 tragedi tragedy /tra/ /ge/ /di/ /us/ /ra/ /ge/ /di/ wrong

10 buka open /bu/ /ka. /bu/ /ka. correct

11 angka number /ang/ /ka/ /ang/ /ka/ correct

12 pramuka scout /pra/ /mu/ /ka/ /pra/ /mu/ /ka/ correct

13 syair poem /sya/ /ir/ /sya/ /ir/ correct

14 karena because /ka/ /re/ /na/ /ka/ /re/ /na/ correct

15 artikulasi articulation /ar/ /ti/ /ku/ /la/ /si/ /ar/ /ti/ /ku/ /la/ /si/ correct

Page 22: Portable Text to Speech for Indonesian Language (Bahasa)

Reference

• http://en.wikipedia.org/wiki/Smartphone• http://en.wikipedia.org/wiki/Text_to_Speech• http://en.wikipedia.org/wiki/

Languages_of_Indonesia• http://en.wikipedia.org/wiki/

Indonesian_language• http://id.wikisource.org/wiki/

Pedoman_Umum_Ejaan_Bahasa_Indonesia_yang_Disempurnakan_(1987)/Bab_I