Skip to main navigation Skip to search Skip to main content

Accessing documents via audio: An extensible transcoder for HTML to VoiceXML conversion

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

5 Scopus citations

Abstract

Increasing proliferation of hand-held devices as well as the need for delivering information to visually impaired persons have caused the need for transcoding web information into documents that can be delivered as audio. Web information is typically represented as HTML (Hyper Text Mark-up Language) documents. Audio delivery of web documents is done using VoiceXML. Due to this difference in mark-up notation, much of the web is inaccessible via audio. One way to solve this accessibility problem is to automatically transcode HTML documents to VoiceXML. In this paper, we describe such an automatic transcoder that converts HTML into VoiceXML. The transcoder is compositional and is realized in two phases: The parsing phase where the input HTML file is converted to HTML node tree, and the semantic mapping phase where each node in the HTML tree is compositionally mapped to its equivalent VoiceXML node. Our transcoder is extensible in the sense that: (i) it can be upgraded easily by users to accommodate modifications to and extensions of HTML; (ii) it provides means for the user to modify the translation logic while dealing with certain HTML tags. The translator is being publicly distributed.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsKlaus Miesenberger, Joachim Klaus, Wolfgang Zagler, Dominique Burger
PublisherSpringer Verlag
Pages339-346
Number of pages8
ISBN (Print)3540223347, 9783540223344
DOIs
StatePublished - 2004

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3118

Fingerprint

Dive into the research topics of 'Accessing documents via audio: An extensible transcoder for HTML to VoiceXML conversion'. Together they form a unique fingerprint.

Cite this