Building Knowledge Bases for the Generation of Software Documentation

Generating Natural Language Descriptions from OWL Ontologies: the NaturalOWL System

April 24, 2014

86% Match

Ion Androutsopoulos, Gerasimos Lampouras, Dimitrios Galanis

Computation and Language

Artificial Intelligence

We present NaturalOWL, a natural language generation system that produces texts describing individuals or classes of OWL ontologies. Unlike simpler OWL verbalizers, which typically express a single axiom at a time in controlled, often not entirely fluent natural language primarily for the benefit of domain experts, we aim to generate fluent and coherent multi-sentence texts for end-users. With a system like NaturalOWL, one can publish information in OWL on the Web, along with...

A Parallel Evaluation Data Set of Software Documentation with Document Structure Annotation

August 11, 2020

86% Match

Bianka Buschbeck, Miriam Exel

Computation and Language

This paper accompanies the software documentation data set for machine translation, a parallel evaluation data set of data originating from the SAP Help Portal, which we released to the machine translation community for research purposes. It offers the possibility to tune and evaluate machine translation systems in the domain of corporate software documentation and contributes to the availability of a wider range of evaluation scenarios. The data set comprises the language ...

Towards Automatically Extracting UML Class Diagrams from Natural Language Specifications

October 26, 2022

86% Match

Song Yang, Houari Sahraoui

Software Engineering

Information Retrieval

In model-driven engineering (MDE), UML class diagrams serve as a way to plan and communicate between developers. However, it is complex and resource-consuming. We propose an automated approach for the extraction of UML class diagrams from natural language software specifications. To develop our approach, we create a dataset of UML class diagrams and their English specifications with the help of volunteers. Our approach is a pipeline of steps consisting of the segmentation of ...

Human-Like Summaries from Heterogeneous and Time-Windowed Software Development Artefacts

April 28, 2020

85% Match

Mahfouth Alghamdi, Christoph Treude, Markus Wagner

Software Engineering

Automatic text summarisation has drawn considerable interest in the area of software engineering. It is challenging to summarise the activities related to a software project, (1) because of the volume and heterogeneity of involved software artefacts, and (2) because it is unclear what information a developer seeks in such a multi-document summary. We present the first framework for summarising multi-document software artefacts containing heterogeneous data within a given time...

Model-based generation of natural language specifications

December 6, 2016

85% Match

Phan Vo Thu Nhat, Maria Spichkova

Software Engineering

The application of formal models provides many benefits for software and system development; however, the learning curve of formal languages could be a critical factor for an industrial project. Thus, a natural language specification that reflects all the aspects of the formal model might help to understand the model and be especially useful for the stakeholders who do not know the corresponding formal language. Moreover, an automated generation of the documentation from the ...

Interactive Duplicate Search in Software Documentation

August 22, 2019

85% Match

D. V. Luciv, D. V. Koznov, A. A. Shelikhovskii, K. Yu. Romanovsky, G. A. Chernishev, A. N. Terekhov, D. A. Grigoriev, A. N. Smirnova, ..., A. I. Vasenina

Software Engineering

Data Structures and Algorith...

Various software features such as classes, methods, requirements, and tests often have similar functionality. This can lead to the emergence of duplicates in their descriptive documentation. Uncontrolled duplicates created via copy/paste hinder the process of documentation maintenance. Therefore, the task of duplicate detection in software documentation is of importance. Solving it makes planned reuse possible, as well as creating and using templates for unification and automatic...

Autonomous Intelligent Software Development

August 12, 2022

85% Match

Mark Alan Matties

Artificial Intelligence

Software Engineering

We present an overview of the design and first proof-of-concept implementation for AIDA, an autonomous intelligent developer agent that develops software from scratch. AIDA takes a software requirements specification and uses reasoning over a semantic knowledge graph to interpret the requirements, then designs and writes software to satisfy them. AIDA uses both declarative and procedural knowledge in the core domains of data, algorithms, and code, plus some general knowledge....

Documentation Generation as Information Visualization

November 11, 2020

85% Match

Will Crichton

Software Engineering

Human-Computer Interaction

Automatic documentation generation tools, or auto docs, are widely used to visualize information about APIs. However, each auto doc tool comes with its own unique representation of API information. In this paper, I use an information visualization analysis of auto docs to generate potential design principles for improving their usability. Developers use auto docs as a reference by looking up relevant API primitives given partial information, or leads, about its name, type, or...

Towards a Flexible System Architecture for Automated Knowledge Base Construction Frameworks

February 4, 2021

85% Match

Osman Din

Information Retrieval

Although knowledge bases play an important role in many domains (including in archives, where they are sometimes used for entity extraction and semantic annotation tasks), it is challenging to build knowledge bases by hand. This is owing to a number of factors: Knowledge bases must be accurate, up-to-date, comprehensive, and as flexible and as efficient as possible. These requirements mean a large undertaking, in the form of extensive work by subject matter experts (such as s...

A Semantic Approach for Automatic Structuring and Analysis of Software Process Patterns

October 2, 2012

85% Match

Nahla Jlaiel, Khouloud Madhbouh, Mohamed Ben Ahmed

Artificial Intelligence

Computation and Language

The main contribution of this paper is a novel semantic approach, based on a Natural Language Processing technique, that ensures the semantic unification of unstructured process patterns expressed not only in different formats but also in different forms. The approach is implemented using the GATE text engineering framework and then evaluated, yielding high-quality results that motivate us to continue in this direction.
