February 26, 2013
So far, a very large amount of work in Natural Language Processing (NLP) relies on trees as the core mathematical structure for representing linguistic information (e.g. in Chomsky's work). However, some linguistic phenomena do not fit properly into trees. In a previous paper, we showed the benefit of encoding linguistic structures as graphs and of using graph rewriting rules to compute on those structures. Justified by linguistic considerations, graph rewriting is characteriz...
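To make the idea concrete, here is a toy sketch of a graph rewrite rule applied to a dependency graph; the graph encoding, the rule, and the UD-style labels are ours for illustration, not the paper's formalism:

```python
# Toy dependency graph for "the apple was eaten", as labeled edges
# (head, label, dependent); labels follow UD-style conventions.
graph = {("eaten", "aux:pass", "was"), ("eaten", "nsubj:pass", "apple")}

def rewrite_passive(g):
    """One illustrative rewrite rule: if a verb heads a passive clause,
    relabel its surface subject as a deep object."""
    out = set(g)
    for head, label, dep in g:
        is_passive = any(h == head and l == "aux:pass" for h, l, _ in g)
        if label == "nsubj:pass" and is_passive:
            out.discard((head, label, dep))
            out.add((head, "obj", dep))
    return out

# The nsubj:pass edge is relabeled to obj; aux:pass is kept.
print(rewrite_passive(graph))
```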
January 17, 2023
Weighted finite-state automata (WFSAs) are commonly used in NLP. Failure transitions are a useful extension for compactly representing backoffs or interpolation in $n$-gram models and CRFs, which are special cases of WFSAs. The pathsum in ordinary acyclic WFSAs is efficiently computed by the backward algorithm in time $O(|E|)$, where $E$ is the set of transitions. However, this algorithm does not allow failure transitions, and preprocessing the WFSA to eliminate failure transitions cou...
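As a point of reference, here is a minimal sketch of the $O(|E|)$ backward algorithm the abstract refers to, for an ordinary acyclic WFSA in the log semiring (no failure transitions); the automaton representation is an assumption:

```python
import math
from collections import defaultdict

def logaddexp(a, b):
    """log(e^a + e^b), the semiring 'plus' in log space."""
    if a == float('-inf'): return b
    if b == float('-inf'): return a
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))

def backward_pathsum(num_states, edges, final, initial=0):
    """Pathsum of an acyclic WFSA in one O(|E|) backward pass.
    Assumes states are numbered in topological order (src < dst).
    edges: (src, dst, log_weight) triples; final: {state: final log_weight}.
    """
    out = defaultdict(list)
    for src, dst, w in edges:
        out[src].append((dst, w))
    # beta[q] = log total weight of all accepting paths starting at q
    beta = [final.get(q, float('-inf')) for q in range(num_states)]
    for q in reversed(range(num_states)):
        for dst, w in out[q]:
            beta[q] = logaddexp(beta[q], w + beta[dst])
    return beta[initial]

# Two paths 0->1->2 and 0->2, each with total log-weight 0.0,
# so the pathsum is log(e^0 + e^0) = log 2.
print(backward_pathsum(3, [(0, 1, 0.0), (0, 2, 0.0), (1, 2, 0.0)], {2: 0.0}))
```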
November 1, 2018
Weighted Finite State Transducers (WFSTs) are versatile data structures that can model a great number of problems, ranging from Automatic Speech Recognition to DNA sequencing. Traditional computer science algorithms are employed when working with these structures in order to optimise not only their size but also the runtime of decoding algorithms. However, these algorithms are not unified under a common framework that would allow for their treatment as a whole. Moreover, the inherent...
June 27, 2022
Recurrent Neural Networks (RNNs) have achieved tremendous success in sequential data processing. However, it is quite challenging to interpret and verify RNNs' behaviors directly. To this end, many efforts have been made to extract finite automata from RNNs. Existing approaches such as exact learning are effective in extracting finite-state models to characterize the state dynamics of RNNs for formal languages, but are limited in their scalability to natural languages. ...
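For intuition, the following is a simplified sketch of one common extraction strategy (clustering hidden states and reading transitions off sample runs); it is not necessarily the method of this paper:

```python
import numpy as np
from sklearn.cluster import KMeans

def extract_automaton(rnn_step, h0, sequences, n_states=5):
    """Abstract an RNN into a finite automaton: run it on sample strings,
    cluster the hidden states into abstract states, and collect the
    transitions observed between clusters.
    rnn_step(h, sym) -> next hidden state as a 1-D numpy array."""
    traces = []  # (h_before, sym, h_after) triples collected from runs
    for seq in sequences:
        h = h0
        for sym in seq:
            h_next = rnn_step(h, sym)
            traces.append((h, sym, h_next))
            h = h_next
    km = KMeans(n_clusters=n_states, n_init=10)
    km.fit(np.array([h for h, _, _ in traces] + [traces[-1][2]]))
    # Each transition maps (cluster of h, symbol) -> cluster of h_next.
    return {(int(km.predict(h.reshape(1, -1))[0]), sym,
             int(km.predict(h2.reshape(1, -1))[0]))
            for h, sym, h2 in traces}
```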
July 17, 1997
This paper describes the conversion of a Hidden Markov Model into a sequential transducer that closely approximates the behavior of the stochastic model. This transformation is especially advantageous for part-of-speech tagging because the resulting transducer can be composed with other transducers that encode correction rules for the most frequent tagging errors. The speed of tagging is also improved. The described methods have been implemented and successfully tested on six...
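As a rough illustration of what such a sequentialization can look like, here is a toy approximation in which each (previous tag, word) pair is deterministically mapped to its locally most probable tag; the construction and all names are illustrative, not the paper's exact algorithm:

```python
def approximate_hmm(tags, words, trans_p, emit_p):
    """Precompute, for each (previous tag, word), the locally most
    probable tag; the table is the transition function of a
    deterministic (sequential) transducer. trans_p[t1][t2] and
    emit_p[t][w] are HMM parameters; `tags` must include "<s>"."""
    return {(t_prev, w): max(tags,
                             key=lambda t: trans_p.get(t_prev, {}).get(t, 0.0)
                                           * emit_p.get(t, {}).get(w, 0.0))
            for t_prev in tags for w in words}

def tag(table, sentence, start="<s>"):
    """One deterministic left-to-right pass, one lookup per word."""
    t, output = start, []
    for w in sentence:
        t = table[(t, w)]
        output.append(t)
    return output
```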
August 5, 2020
Glushkov's construction has many interesting properties, and they become even more evident when applied to transducers. This article strives to show the vast range of possible extensions and optimisations for this algorithm. A special flavour of regular expressions is introduced, which can be efficiently converted to $\epsilon$-free functional subsequential weighted finite-state transducers. The produced automata are very compact, as they contain only one state for each symbol (from...
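For reference, below is a compact sketch of the core (unweighted) Glushkov construction, producing an $\epsilon$-free automaton with one state per symbol occurrence plus a single initial state; the weighted transducer extensions discussed in the article are omitted:

```python
from dataclasses import dataclass

@dataclass
class Sym:
    ch: str
    pos: int = -1

@dataclass
class Cat:
    l: object
    r: object

@dataclass
class Alt:
    l: object
    r: object

@dataclass
class Star:
    e: object

def linearize(n, syms):
    """Give every symbol occurrence a unique position (= Glushkov state)."""
    if isinstance(n, Sym):
        n.pos = len(syms)
        syms.append(n.ch)
    elif isinstance(n, Star):
        linearize(n.e, syms)
    else:
        linearize(n.l, syms)
        linearize(n.r, syms)

def nullable(n):
    if isinstance(n, Sym):
        return False
    if isinstance(n, Star):
        return True
    if isinstance(n, Cat):
        return nullable(n.l) and nullable(n.r)
    return nullable(n.l) or nullable(n.r)

def first(n):
    if isinstance(n, Sym):
        return {n.pos}
    if isinstance(n, Star):
        return first(n.e)
    if isinstance(n, Alt):
        return first(n.l) | first(n.r)
    return first(n.l) | (first(n.r) if nullable(n.l) else set())

def last(n):
    if isinstance(n, Sym):
        return {n.pos}
    if isinstance(n, Star):
        return last(n.e)
    if isinstance(n, Alt):
        return last(n.l) | last(n.r)
    return last(n.r) | (last(n.l) if nullable(n.r) else set())

def follow(n, fol):
    if isinstance(n, Cat):
        follow(n.l, fol)
        follow(n.r, fol)
        for p in last(n.l):
            fol[p] |= first(n.r)
    elif isinstance(n, Alt):
        follow(n.l, fol)
        follow(n.r, fol)
    elif isinstance(n, Star):
        follow(n.e, fol)
        for p in last(n.e):
            fol[p] |= first(n.e)

def glushkov(ast):
    """Epsilon-free NFA: state 0 is initial; state p+1 <-> position p."""
    syms = []
    linearize(ast, syms)
    fol = [set() for _ in syms]
    follow(ast, fol)
    delta = [(0, syms[p], p + 1) for p in first(ast)]
    delta += [(p + 1, syms[q], q + 1)
              for p in range(len(syms)) for q in fol[p]]
    finals = {p + 1 for p in last(ast)} | ({0} if nullable(ast) else set())
    return delta, finals

# (ab)*a -- accepts "a", "aba", "ababa", ...
delta, finals = glushkov(Cat(Star(Cat(Sym('a'), Sym('b'))), Sym('a')))
```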
March 18, 2023
This paper presents a framework based on Weighted Finite-State Transducers (WFST) to simplify the development of modifications for RNN-Transducer (RNN-T) loss. Existing implementations of RNN-T use CUDA-related code, which is hard to extend and debug. WFSTs are easy to construct and extend, and allow debugging through visualization. We introduce two WFST-powered RNN-T implementations: (1) "Compose-Transducer", based on a composition of the WFST graphs from acoustic and textua...
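For background, composition itself is the standard product construction; a minimal epsilon-free, unweighted sketch is below (start/final-state and weight bookkeeping omitted; this is not the paper's implementation):

```python
from collections import deque

def compose(A, B):
    """Product construction for epsilon-free, unweighted transducers:
    pair up states of A and B, matching A's output with B's input.
    A, B: dicts mapping state -> list of (in_sym, out_sym, next_state).
    """
    start = (0, 0)
    edges, seen, todo = [], {start}, deque([start])
    while todo:
        qa, qb = todo.popleft()
        for ia, oa, na in A.get(qa, []):
            for ib, ob, nb in B.get(qb, []):
                if oa == ib:  # A's output symbol feeds B's input
                    nxt = (na, nb)
                    edges.append(((qa, qb), ia, ob, nxt))
                    if nxt not in seen:
                        seen.add(nxt)
                        todo.append(nxt)
    return edges
```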
October 23, 2000
This article describes an algorithm for reducing the intermediate alphabets in cascades of finite-state transducers (FSTs). Although the method modifies the component FSTs, there is no change in the overall relation described by the whole cascade. No additional information or special algorithm that could slow down the processing of input is required at runtime. Two examples from Natural Language Processing are used to illustrate the effect of the algorithm on the sizes of ...
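A heavily simplified sketch of the idea follows: merge intermediate symbols whose usage in the two component FSTs is indistinguishable. The merging criterion here is much cruder than the article's algorithm and is only meant to convey the flavour:

```python
from collections import defaultdict

def reduce_intermediate_alphabet(t1_edges, t2_edges):
    """Merge intermediate symbols with identical usage in the cascade
    T1 .o. T2: symbols occurring on exactly the same T1 transitions and
    the same T2 transitions can share one label without changing the
    overall relation.
    t1_edges: (src, in_sym, mid_sym, dst); t2_edges: (src, mid_sym, out_sym, dst).
    """
    sig = defaultdict(lambda: (set(), set()))
    for s, i, m, d in t1_edges:
        sig[m][0].add((s, i, d))
    for s, m, o, d in t2_edges:
        sig[m][1].add((s, o, d))
    # Group symbols by identical signatures; keep one representative each.
    groups = defaultdict(list)
    for m, (a, b) in sig.items():
        groups[(frozenset(a), frozenset(b))].append(m)
    relabel = {m: grp[0] for grp in groups.values() for m in grp}
    t1_new = [(s, i, relabel[m], d) for s, i, m, d in t1_edges]
    t2_new = [(s, relabel[m], o, d) for s, m, o, d in t2_edges]
    return t1_new, t2_new
```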
October 2, 2020
We introduce a framework for automatic differentiation with weighted finite-state transducers (WFSTs) allowing them to be used dynamically at training time. Through the separation of graphs from operations on graphs, this framework enables the exploration of new structured loss functions which in turn eases the encoding of prior knowledge into learning algorithms. We show how the framework can combine pruning and back-off in transition models with various sequence-level loss ...
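The underlying idea can be sketched in a few lines of plain PyTorch (this only illustrates differentiating through a WFST pathsum, not the paper's framework):

```python
import torch

# Edges of a tiny acyclic WFST, (src, dst), with one learnable
# log-weight per edge; states are topologically ordered.
edges = [(0, 1), (0, 2), (1, 2)]
log_w = torch.zeros(len(edges), requires_grad=True)

def pathsum(edges, log_w, num_states, start=0, final=2):
    """Forward algorithm in the log semiring; differentiable end to end."""
    alpha = [torch.tensor(float('-inf'))] * num_states
    alpha[start] = torch.tensor(0.0)
    for (src, dst), w in zip(edges, log_w):
        alpha[dst] = torch.logaddexp(alpha[dst], alpha[src] + w)
    return alpha[final]

# CRF-style loss: negative gold-path score plus the pathsum.
# The gold path 0 -> 1 -> 2 uses edges 0 and 2.
loss = -(log_w[0] + log_w[2]) + pathsum(edges, log_w, num_states=3)
loss.backward()
print(log_w.grad)  # edge marginals minus the gold-path indicator
```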
October 10, 1997
This paper addresses issues in part-of-speech disambiguation using finite-state transducers and presents two main contributions to the field. One of them is the use of finite-state machines for part-of-speech tagging. Linguistic and statistical information is represented in terms of weights on transitions in weighted finite-state transducers. Another contribution is the successful combination of techniques -- linguistic and statistical -- for word disambiguation, compounded w...