ID: cs/0204026

Querying Databases of Annotated Speech

April 11, 2002

Steve Cassidy, Steven Bird
Computer Science
Computation and Language
Databases

Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic 'transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough challenges for representation and query. The temporal nature of the data adds a further layer of complexity. This paper presents and harmonises two independent efforts to model annotated speech databases, one at Macquarie University and one at the University of Pennsylvania. Various query languages are described, along with illustrative applications to a variety of analytical problems. The research reported here forms a part of several ongoing projects to develop platform-independent open-source tools for creating, browsing, searching, querying and transforming linguistic databases, and to disseminate large linguistic databases over the internet.

Similar papers

Towards a query language for annotation graphs

July 13, 2000

92% Match
Steven Bird, Peter Buneman, Wang-Chiew Tan
Computation and Language
Databases

The multidimensional, heterogeneous, and temporal nature of speech databases raises interesting challenges for representation and query. Recently, annotation graphs have been proposed as a general-purpose representational framework for speech databases. Typical queries on annotation graphs require path expressions similar to those used in semistructured query languages. However, the underlying model is rather different from the customary graph models for semistructured data: ...

ATLAS: A flexible and extensible architecture for linguistic annotation

July 13, 2000

91% Match
Steven Bird, David Day, John Garofolo, John Henderson, ..., Mark Liberman
Computation and Language

We describe a formal model for annotating linguistic artifacts, from which we derive an application programming interface (API) to a suite of tools for manipulating these annotations. The abstract logical model provides for a range of storage formats and promotes the reuse of tools that interact through this API. We focus first on "Annotation Graphs," a graph model for annotations on linear signals (such as text and speech) indexed by intervals, for which efficient database...

A Formal Framework for Linguistic Annotation

March 2, 1999

90% Match
Steven Bird (University of Pennsylvania), Mark Liberman (University of Pennsylvania)
Computation and Language

'Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions (audio, video and/or physiological recordings) or it may be textual. The added notations may include transcriptions of all sorts (from phonetic features to discourse structures), part-of-speech and sense tagging, syntactic analysis, 'named entity' identification, co-reference annotation, and so on. While there are several o...

A Formal Framework for Linguistic Annotation (revised version)

October 26, 2000

90% Match
Steven Bird, Mark Liberman
Computation and Language
Databases
Data Structures and Algorith...

'Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions (audio, video and/or physiological recordings) or it may be textual. The added notations may include transcriptions of all sorts (from phonetic features to discourse structures), part-of-speech and sense tagging, syntactic analysis, 'named entity' identification, co-reference annotation, and so on. While there are several ong...

Annotation graphs as a framework for multidimensional linguistic data analysis

July 5, 1999

88% Match
Steven Bird, Mark Liberman
Computation and Language

In recent work we have presented a formal framework for linguistic annotation based on labeled acyclic digraphs. These 'annotation graphs' offer a simple yet powerful method for representing complex annotation structures incorporating hierarchy and overlap. Here, we motivate and illustrate our approach using discourse-level annotations of text and speech data drawn from the CALLHOME, COCONUT, MUC-7, DAMSL and TRAINS annotation schemes. With the help of domain specialists, we ...

Praaline: Integrating Tools for Speech Corpus Research

February 8, 2018

88% Match
George Christodoulides (ILC)
Computation and Language
Databases

This paper presents Praaline, an open-source software system for managing, annotating, analysing and visualising speech corpora. Researchers working with speech corpora are often faced with multiple tools and formats, and they need to work with ever-increasing amounts of data in a collaborative way. Praaline integrates and extends existing time-proven tools for spoken corpora analysis (Praat, Sonic Visualiser and a bridge to the R statistical package) in a modular system, fac...

A Framework for Natural Language Interfaces to Temporal Databases

November 25, 1996

88% Match
I. Androutsopoulos (Microsoft Research Institute, Macquarie University, Sydney), G. D. Ritchie (Dept. of Artificial Intelligence, University of Edinburgh), P. Thanisch (Dept. of Computer Science, University of Edinburgh)
Computation and Language

Over the past thirty years, there has been considerable progress in the design of natural language interfaces to databases. Most of this work has concerned snapshot databases, in which there are only limited facilities for manipulating time-varying information. The database community is becoming increasingly interested in temporal databases, databases with special support for time-dependent entries. We have developed a framework for constructing natural language interfaces to...

Annotation Graphs and Servers and Multi-Modal Resources: Infrastructure for Interdisciplinary Education, Research and Development

April 10, 2002

87% Match
Christopher Cieri, Steven Bird
Computation and Language

Annotation graphs and annotation servers offer infrastructure to support the analysis of human language resources in the form of time-series data such as text, audio and video. This paper outlines areas of common need among empirical linguists and computational linguists. After reviewing examples of data and tools used or under development for each of several areas, it proposes a common framework for future tool development, data annotation and resource sharing based upon ann...

TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit

April 3, 2002

87% Match
Steven Bird, Kazuaki Maeda, Xiaoyi Ma, Haejoong Lee, ..., Salim Zayat
Computation and Language
Sound

Four diverse tools built on the Annotation Graph Toolkit are described. Each tool associates linguistic codes and structures with time-series data. All are based on the same software library and tool architecture. TableTrans is for observational coding, using a spreadsheet whose rows are aligned to a signal. MultiTrans is for transcribing multi-party communicative interactions recorded using multi-channel signals. InterTrans is for creating interlinear text aligned to audio. ...

Many uses, many annotations for large speech corpora: Switchboard and TDT as case studies

July 13, 2000

87% Match
David Graff, Steven Bird
Computation and Language

This paper discusses the challenges that arise when large speech corpora receive an ever-broadening range of diverse and distinct annotations. Two case studies of this process are presented: the Switchboard Corpus of telephone conversations and the TDT2 corpus of broadcast news. Switchboard has undergone two independent transcriptions and various types of additional annotation, all carried out as separate projects that were dispersed both geographically and chronologically. T...