Thus, there is a need to provide improved text-to-speech processing that can present contextual information to listeners. As a result, today's documents can have many different formats and contain various different document elements, including links, images, headings, tables, captions, footnotes, etc., which makes text-to-speech processing more challenging. Further, in recent years, documents have become more complex and more diversified. Regardless of the reasons for listening to speech associated with a document, conventional text-to-speech processing is often not able to impart to the user (listener) contextual information about the text that is being spoken. For users with impaired vision, listening to the resulting speech for a document is particularly important. A user desiring to review a document can then listen to the resulting speech instead of having to read through the document. embedding inaudible data in the audio signalįor various reasons, documents have been converted to speech (spoken text) using conventional text-to-speech processing. in vocoders Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
#Text to speech synthesizer online pitch control pdf#
Google Patents Intelligent text-to-speech conversionÄownload PDF Info Publication number US9626955B2 US9626955B2 US15/090,544 US201615090544A US9626955B2 US 9626955 B2 US9626955 B2 US 9626955B2 US 201615090544 A US201615090544 A US 201615090544A US 9626955 B2 US9626955 B2 US 9626955B2 Authority US United States Prior art keywords text document audio speech spoken Prior art date Legal status (The legal status is an assumption and is not a legal conclusion.
Google Patents US9626955B2 - Intelligent text-to-speech conversion US9626955B2 - Intelligent text-to-speech conversion