An Efficient Compiler for Weighted Rewri...

Approximating Context-Free Grammars with a Finite-State Calculus

November 11, 1997

85% Match

Edmund Grimley-Evans

Computation and Language

Although adequate models of human language for syntactic analysis and semantic interpretation are of at least context-free complexity, for applications such as speech processing in which speed is important finite-state models are often preferred. These requirements may be reconciled by using the more complex grammar to automatically derive a finite-state approximation which can then be used as a filter to guide speech recognition or to reject many hypotheses at an early stage...

Find SimilarView on arXiv

Parallel Composition of Weighted Finite-State Transducers

October 6, 2021

85% Match

Shubho Sengupta, Vineel Pratap, Awni Hannun

Computation and Language

Finite-state transducers (FSTs) are frequently used in speech recognition. Transducer composition is an essential operation for combining different sources of information at different granularities. However, composition is also one of the more computationally expensive operations. Due to the heterogeneous structure of FSTs, parallel algorithms for composition are suboptimal in efficiency, generality, or both. We propose an algorithm for parallel composition and implement it o...

Find SimilarView on arXiv

Multilingual Text Analysis for Text-to-Speech Synthesis

August 19, 1996

84% Match

Richard Sproat

Computation and Language

We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, ...

Find SimilarView on arXiv

Selected Operations, Algorithms, and Applications of n-Tape Weighted Finite-State Machines

April 28, 2011

84% Match

André Kempe

Formal Languages and Automat...

Computation and Language

A weighted finite-state machine with n tapes (n-WFSM) defines a rational relation on n strings. It is a generalization of weighted acceptors (one tape) and transducers (two tapes). After recalling some basic definitions about n-ary weighted rational relations and n-WFSMs, we summarize some central operations on these relations and machines, such as join and auto-intersection. Unfortunately, due to Post's Correspondence Problem, a fully general join or auto-intersection algo...

Find SimilarView on arXiv

Compiling a Partition-Based Two-Level Formalism

May 2, 1996

84% Match

Edmund University of Cambridge Grimley-Evans, George Anton University of Cambridge Kiraz, Stephen G. University of Cambridge Pulman

Computation and Language

This paper describes an algorithm for the compilation of a two (or more) level orthographic or phonological rule notation into finite state transducers. The notation is an alternative to the standard one deriving from Koskenniemi's work: it is believed to have some practical descriptive advantages, and is quite widely used, but has a different interpretation. Efficient interpreters exist for the notation, but until now it has not been clear how to compile to equivalent automa...

Find SimilarView on arXiv

Algorithms for Weighted Pushdown Automata

October 13, 2022

83% Match

Alexandra Butoi, Brian DuSell, Tim Vieira, ... , Chiang David

Computation and Language

Weighted pushdown automata (WPDAs) are at the core of many natural language processing tasks, like syntax-based statistical machine translation and transition-based dependency parsing. As most existing dynamic programming algorithms are designed for context-free grammars (CFGs), algorithms for PDAs often resort to a PDA-to-CFG conversion. In this paper, we develop novel algorithms that operate directly on WPDAs. Our algorithms are inspired by Lang's algorithm, but use a more ...

Find SimilarView on arXiv

Finite-State Non-Concatenative Morphotactics

June 30, 2000

83% Match

Kenneth R. Beesley, Lauri Karttunen

Computation and Language

Finite-state morphology in the general tradition of the Two-Level and Xerox implementations has proved very successful in the production of robust morphological analyzer-generators, including many large-scale commercial systems. However, it has long been recognized that these implementations have serious limitations in handling non-concatenative phenomena. We describe a new technique for constructing finite-state transducers that involves reapplying the regular-expression com...

Find SimilarView on arXiv

Composing Finite State Transducers on GPUs

May 16, 2018

83% Match

Arturo Argueta, David Chiang

Computation and Language

Distributed, Parallel, and C...

Weighted finite-state transducers (FSTs) are frequently used in language processing to handle tasks such as part-of-speech tagging and speech recognition. There has been previous work using multiple CPU cores to accelerate finite state algorithms, but limited attention has been given to parallel graphics processing unit (GPU) implementations. In this paper, we introduce the first (to our knowledge) GPU implementation of the FST composition operation, and we also discuss the o...

Find SimilarView on arXiv

On the Disambiguation of Weighted Automata

May 2, 2014

83% Match

Mehryar Mohri, Michael D. Riley

Formal Languages and Automat...

We present a disambiguation algorithm for weighted automata. The algorithm admits two main stages: a pre-disambiguation stage followed by a transition removal stage. We give a detailed description of the algorithm and the proof of its correctness. The algorithm is not applicable to all weighted automata but we prove sufficient conditions for its applicability in the case of the tropical semiring by introducing the *weak twins property*. In particular, the algorithm can be use...

Find SimilarView on arXiv

3-Way Composition of Weighted Finite-State Transducers

February 11, 2008

83% Match

Cyril Allauzen, Mehryar Mohri

Computational Complexity

Composition of weighted transducers is a fundamental algorithm used in many applications, including for computing complex edit-distances between automata, or string kernels in machine learning, or to combine different components of a speech recognition, speech synthesis, or information extraction system. We present a generalization of the composition of weighted transducers, 3-way composition, which is dramatically faster in practice than the standard composition algorithm wh...

Find SimilarView on arXiv

An Efficient Compiler for Weighted Rewrite Rules

Approximating Context-Free Grammars with a Finite-State Calculus

Parallel Composition of Weighted Finite-State Transducers

Multilingual Text Analysis for Text-to-Speech Synthesis

Selected Operations, Algorithms, and Applications of n-Tape Weighted Finite-State Machines

Compiling a Partition-Based Two-Level Formalism

Algorithms for Weighted Pushdown Automata

Finite-State Non-Concatenative Morphotactics

Composing Finite State Transducers on GPUs

On the Disambiguation of Weighted Automata

3-Way Composition of Weighted Finite-State Transducers