Building Knowledge Bases for the Generation of Software Documentation

July 25, 1996

Cecile ITRI, Univ. of Brighton Paris, Keith Vander ITRI, Univ. of Brighton Linden

Computer Science

Computation and Language

Automated text generation requires a underlying knowledge base from which to generate, which is often difficult to produce. Software documentation is one domain in which parts of this knowledge base may be derived automatically. In this paper, we describe \drafter, an authoring support tool for generating user-centred software documentation, and in particular, we describe how parts of its required knowledge base can be obtained automatically.

Automatic Generation of Technical Documentation

November 29, 1994

91% Match

Ehud CoGenTex, Ithaca, USA Reiter, Chris University of Edinburgh, UK Mellish, John University of Edinburgh, UK Levine

Computation and Language

Natural-language generation (NLG) techniques can be used to automatically produce technical documentation from a domain knowledge base and linguistic and contextual models. We discuss this application of NLG technology from both a technical and a usefulness (costs and benefits) perspective. This discussion is based largely on our experiences with the IDAS documentation-generation project, and the reactions various interested people from industry have had to IDAS. We hope that...

Find SimilarView on arXiv

Toward Human-Like Summaries Generated from Heterogeneous Software Artefacts

May 6, 2019

88% Match

Mahfouth Alghamdi, Christoph Treude, Markus Wagner

Software Engineering

Automatic text summarisation has drawn considerable interest in the field of software engineering. It can improve the efficiency of software developers, enhance the quality of products, and ensure timely delivery. In this paper, we present our initial work towards automatically generating human-like multi-document summaries from heterogeneous software artefacts. Our analysis of the text properties of 545 human-written summaries from 15 software engineering projects will ultim...

Find SimilarView on arXiv

Generating Multilingual Documents from a Knowledge Base: The TECHDOC Project

July 21, 1994

88% Match

Dietmar FAW Ulm, Ulm, Germany Rösner, Manfred University of Toronto and FAW Ulm Stede

Computation and Language

TECHDOC is an implemented system demonstrating the feasibility of generating multilingual technical documents on the basis of a language-independent knowledge base. Its application domain is user and maintenance instructions, which are produced from underlying plan structures representing the activities, the participating objects with their properties, relations, and so on. This paper gives a brief outline of the system architecture and discusses some recent developments in t...

Find SimilarView on arXiv

Acquiring Correct Knowledge for Natural Language Generation

June 26, 2011

88% Match

E. Reiter, R. Robertson, S. G. Sripada

Computation and Language

Natural language generation (NLG) systems are computer software systems that produce texts in English and other human languages, often from non-linguistic input data. NLG systems, like most AI systems, need substantial amounts of knowledge. However, our experience in two NLG projects suggests that it is difficult to acquire correct knowledge for NLG systems; indeed, every knowledge acquisition (KA) technique we tried had significant problems. In general terms, these problems ...

Find SimilarView on arXiv

Towards a Workbench for Acquisition of Domain Knowledge from Natural Language

April 30, 1996

88% Match

Andrei HCRC, Edinburgh University Mikheev, Steven Thomson Technical Labs, Rockville Maryland Finch

Computation and Language

In this paper we describe an architecture and functionality of main components of a workbench for an acquisition of domain knowledge from large text corpora. The workbench supports an incremental process of corpus analysis starting from a rough automatic extraction and organization of lexico-semantic regularities and ending with a computer supported analysis of extracted data and a semi-automatic refinement of obtained hypotheses. For doing this the workbench employs methods ...

Find SimilarView on arXiv

Generating Software for Well-Understood Domains

February 1, 2023

88% Match

Jacques Carette, Spencer Smith, Jason Balaci

Software Engineering

Programming Languages

Current software development is often quite code-centric and aimed at short-term deliverables, due to various contextual forces (such as the need for new revenue streams from many individual buyers). We're interested in software where different forces drive the development. \textbf{Well understood domains} and \textbf{long-lived software} provide one such context. A crucial observation is that software artifacts that are currently handwritten contain considerable duplicatio...

Find SimilarView on arXiv

Natural Language Generation for Non-Expert Users

September 18, 2019

87% Match

Van Duc New Mexico State University Nguyen, Tran Cao New Mexico State University Son, Enrico New Mexico State University Pontelli

Artificial Intelligence

Computation and Language

Motivated by the difficulty in presenting computational results, especially when the results are a collection of atoms in a logical language, to users, who are not proficient in computer programming and/or the logical representation of the results, we propose a system for automatic generation of natural language descriptions for applications targeting mainstream users. Differently from many earlier systems with the same aim, the proposed system does not employ templates for t...

Find SimilarView on arXiv

Verifiable Source Code Documentation in Controlled Natural Language

November 12, 2013

87% Match

Tobias Kuhn, Alexandre Bergel

Software Engineering

Artificial Intelligence

Computation and Language

Human-Computer Interaction

Logic in Computer Science

Writing documentation about software internals is rarely considered a rewarding activity. It is highly time-consuming and the resulting documentation is fragile when the software is continuously evolving in a multi-developer setting. Unfortunately, traditional programming environments poorly support the writing and maintenance of documentation. Consequences are severe as the lack of documentation on software structure negatively impacts the overall quality of the software pro...

Find SimilarView on arXiv

Supporting Software Maintenance with Dynamically Generated Document Hierarchies

August 11, 2024

87% Match

Katherine R. Dearstyne, Alberto D. Rodriguez, Jane Cleland-Huang

Software Engineering

Software documentation supports a broad set of software maintenance tasks; however, creating and maintaining high-quality, multi-level software documentation can be incredibly time-consuming and therefore many code bases suffer from a lack of adequate documentation. We address this problem through presenting HGEN, a fully automated pipeline that leverages LLMs to transform source code through a series of six stages into a well-organized hierarchy of formatted documents. We ev...

Find SimilarView on arXiv

N-Gram Cluster Identification During Empirical Knowledge Representation Generation

December 5, 1994

87% Match

Robin Department of Computer Science, University of Sheffield, England Collier

Computation and Language

This paper presents an overview of current research concerning knowledge extraction from technical texts. In particular, the use of empirical techniques during the identification and generation of a semantic representation is considered. A key step is the discovery of useful n-grams and correlations between clusters of these n-grams.

Find SimilarView on arXiv