
Speech Recognition & Synthesis

From Wikipedia

Speech Recognition & Synthesis, formerly known as Speech Services,[1] is a screen reader application developed by Google for the Android operating system. It enables applications to read aloud (speak) the text on the screen, with support for many languages. Text-to-Speech may be used by apps such as Google Play Books to read books aloud, by dictionary apps to read definitions aloud so that users can hear how words are pronounced, by Google TalkBack and other spoken-feedback accessibility applications, and by third-party apps. Users must install voice data for each language.

Supported languages


Some app developers have begun adapting their Android Auto apps to include Text-to-Speech, such as Hyundai in 2015.[2] Apps such as textPlus and WhatsApp use Text-to-Speech to read notifications aloud and to provide voice-reply functionality.

Google Cloud Text-to-Speech is powered by WaveNet,[3] software created by DeepMind, Google's UK-based AI subsidiary, which Google acquired in 2014. The service aims to set itself apart from its competitors, Amazon and Microsoft.[4]

DeepMind's AI speech technology is highly advanced and produces realistic results. Most voice synthesizers (including Apple's Siri) use concatenative synthesis,[3] in which a program stores individual phonemes and then pieces them together to form words and sentences.
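The concatenative approach described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the names `tone`, `UNITS`, `LEXICON`, and `synthesize` are hypothetical, short sine tones stand in for recorded phoneme units, and the sample rate is simply borrowed from the WaveNet figure quoted in this article. A production synthesizer stores recorded speech and smooths the joins between units, which this sketch does not attempt.

```python
import math

SAMPLE_RATE = 24_000  # samples per second (assumed; borrowed from the WaveNet figure)


def tone(freq_hz, duration_ms, sample_rate=SAMPLE_RATE):
    """Return a sine tone as a list of floats in [-1, 1].

    Stand-in for a recorded speech unit; a real system would load audio.
    """
    n = duration_ms * sample_rate // 1000
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


# Hypothetical unit inventory: one stored waveform per phoneme label.
UNITS = {
    "HH": tone(300, 50),
    "EH": tone(500, 100),
    "L": tone(350, 80),
    "OW": tone(450, 120),
}

# Hypothetical pronunciation lexicon mapping a word to its phoneme sequence.
LEXICON = {"hello": ["HH", "EH", "L", "OW"]}


def synthesize(word):
    """Look up the word's phonemes and join their stored units end to end."""
    samples = []
    for phoneme in LEXICON[word]:
        samples.extend(UNITS[phoneme])
    return samples


audio = synthesize("hello")
print(len(audio) / SAMPLE_RATE, "seconds of audio")
```

Because the units are joined without any smoothing, the waveform jumps abruptly at each boundary, which is one reason simple concatenation can sound less natural than a model that generates the whole waveform.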

WaveNet synthesizes more natural-sounding speech than other text-to-speech systems. It produces speech with more human-like emphasis and inflection on syllables, phonemes, and words. On average, WaveNet produces speech audio that people prefer over other text-to-speech technologies. Unlike most other text-to-speech systems, a WaveNet model creates raw audio waveforms from scratch. The model uses a neural network that has been trained on a large volume of speech samples. During training, the network extracts the underlying structure of the speech, such as which tones follow one another and what a realistic speech waveform looks like. When given text input, the trained WaveNet model can generate the corresponding speech waveform from scratch, one sample at a time, at up to 24,000 samples per second and with seamless transitions between the individual sounds.[3]
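The sample-by-sample generation described above can be sketched as a loop in which every new sample is computed from the samples already produced. This is only a toy under stated assumptions: `next_sample` and `generate` are hypothetical names, and a fixed two-term linear rule that traces a sine wave stands in for the trained neural network. It ignores the text input entirely and shows only the shape of the generation loop and the sample-count arithmetic.

```python
import math

SAMPLE_RATE = 24_000  # samples per second, the upper figure quoted above


def next_sample(history, freq_hz):
    """Toy stand-in for the trained network (an assumption, not WaveNet).

    Predicts the next sample from the two previous ones with a fixed
    linear rule whose output follows a sine wave of the given frequency.
    """
    w = 2 * math.pi * freq_hz / SAMPLE_RATE
    return 2 * math.cos(w) * history[-1] - history[-2]


def generate(duration_ms, freq_hz=440.0):
    """Generate audio one sample at a time, feeding each output back in."""
    n = duration_ms * SAMPLE_RATE // 1000  # number of samples to produce
    w = 2 * math.pi * freq_hz / SAMPLE_RATE
    samples = [0.0, math.sin(w)]  # two seed samples to start the recurrence
    while len(samples) < n:
        samples.append(next_sample(samples, freq_hz))
    return samples[:n]


audio = generate(500)  # half a second of audio
print(len(audio), "samples generated")
```

At 24,000 samples per second, half a second of audio takes 12,000 sequential steps, which gives a sense of why generating a waveform one sample at a time is computationally demanding.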

The service was renamed Speech Recognition & Synthesis in 2023.[citation needed]

  • Speech conversion
  • Voice
  • Transcription
  1. Wang, Jules (November 8, 2021). "You'll never guess the latest Google app to cross 10 billion installs (seriously)". Android Police. Archived from the original on November 8, 2021. Retrieved November 18, 2021.
  2. "Google, Hyundai show off new third-party Android Auto apps". CNET. CBS Interactive. Retrieved 17 January 2015.
  3. "WaveNet". www.deepmind.com (in English). Retrieved 2023-06-22.
  4. "Text-to-Speech AI: Lifelike Speech Synthesis". Google Cloud (in English). Retrieved 2023-06-22.