This Issue


International Journal of Foundations of Computer Science: Vol. 16, No. 03
Print ISSN: 0129-0541
Online ISSN: 1793-6373

 
writing-guides   ealerts
Connect with WS
 


THE DESIGN PRINCIPLES AND ALGORITHMS OF A WEIGHTED GRAMMAR LIBRARY

CYRIL ALLAUZEN

AT&T Labs - Research, 180 Park Avenue, Florham Park, NJ 07932, USA

MEHRYAR MOHRI

Courant Institute of Mathematical Sciences, New York University, 719 Broadway, 12th Floor, New York, NY 10003, USA

BRIAN ROARK

Center for Spoken Language Understanding, OGI School of Science & Engineering, Oregon Health & Science University, 20000 NW Walker Road, Beaverton, Oregon 97006, USA

Received: 30 November 2004
Accepted: 21 February 2005

We present the software design principles, algorithms, and utilities of a general weighted grammar library, the GRM Library, that can be used in a variety of applications in text, speech, and biosequence processing. Several of the algorithms and utilities of this library are described, including in some cases their pseudocodes and pointers to their use in applications. The algorithms and the utilities were designed to support a wide variety of semirings and the representation and use of large grammars and automata of several hundred million rules or transitions.

Cited by (4):
, , . (2012) Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing 20:8, 2341-2351. Online publication date: 1-Oct-2012. [CrossRef]
. (2010) Rewriting the orthography of SMS messages. Natural Language Engineering 16:02, 133. Online publication date: 1-Apr-2010. [CrossRef]
, , . (2009) Integrating morphology into automatic speech recognition. 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 354-358. [CrossRef]
, . (2008) A Maximum Entropy Model of Phonotactics and Phonotactic Learning. Linguistic Inquiry 39:3, 379-440. Online publication date: 1-Jul-2008. [CrossRef]