Similar papers
November 19, 2015
Methods for learning word representations using large text corpora have received much attention lately due to their impressive performance in numerous natural language processing (NLP) tasks such as semantic similarity measurement and word analogy detection. Despite their success, these data-driven word representation learning methods do not consider the rich semantic relational structure between words in a co-occurring context. On the other hand, much manual effort...
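For concreteness, a minimal sketch of the two tasks named in this abstract, semantic similarity and word analogy, using a tiny hypothetical embedding table rather than vectors actually trained on a large corpus:

```python
# Minimal sketch: cosine similarity and vector-offset analogy over word embeddings.
# The tiny "emb" table is hypothetical; real systems load vectors trained by
# word2vec, GloVe, or similar methods on a large corpus.
import numpy as np

emb = {
    "king":  np.array([0.8, 0.3, 0.1]),
    "queen": np.array([0.7, 0.9, 0.1]),
    "man":   np.array([0.9, 0.2, 0.0]),
    "woman": np.array([0.8, 0.8, 0.0]),
}

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def analogy(a, b, c):
    """Return the word closest to b - a + c (excluding the query words)."""
    target = emb[b] - emb[a] + emb[c]
    scores = {w: cosine(target, v) for w, v in emb.items() if w not in (a, b, c)}
    return max(scores, key=scores.get)

print(cosine(emb["king"], emb["queen"]))   # semantic similarity
print(analogy("man", "king", "woman"))     # expected: "queen"
```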
December 15, 2011
Different network models have been suggested for the topology underlying complex interactions in natural systems. These models are aimed at replicating specific statistical features encountered in real-world networks. However, it is rarely considered to what degree the results obtained for one particular network class can be extrapolated to real-world networks. We address this issue by comparing different classical and more recently developed network models with respect to t...
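A minimal sketch of the kind of comparison this abstract describes: generate two classical network models of the same size and compare a few statistics. The choice of models (Erdős–Rényi vs. Barabási–Albert) and of the statistics is illustrative, not the paper's exact protocol:

```python
# Compare two classical network models on basic statistics.
import networkx as nx

n, avg_deg = 1000, 6
er = nx.gnp_random_graph(n, avg_deg / (n - 1), seed=0)   # random graph
ba = nx.barabasi_albert_graph(n, avg_deg // 2, seed=0)    # preferential attachment

for name, g in [("ER", er), ("BA", ba)]:
    print(name,
          "clustering=%.3f" % nx.average_clustering(g),
          "max_degree=%d" % max(dict(g.degree()).values()))
```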
May 26, 2017
Word similarities affect language acquisition and use in a multi-relational way barely accounted for in the literature. We propose a multiplex network representation of the mental lexicon of word similarities as a natural framework for investigating large-scale cognitive patterns. Our representation accounts for semantic, taxonomic, and phonological interactions and it identifies a cluster of words which are used with greater frequency, are identified, memorised, and learned...
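A minimal sketch of a multiplex lexical representation in the spirit of this abstract: one shared node set (words) and several layers of edges (semantic, taxonomic, phonological). The toy edges are hypothetical; a real lexicon would be built from resources such as free-association norms, WordNet, and phonological neighbourhoods:

```python
# Multiplex lexicon: same nodes on every layer, different edge sets per layer.
import networkx as nx

words = ["cat", "dog", "hat", "pet", "bat"]
layers = {
    "semantic":     [("cat", "dog"), ("cat", "pet"), ("dog", "pet")],
    "taxonomic":    [("cat", "pet"), ("dog", "pet")],
    "phonological": [("cat", "hat"), ("cat", "bat"), ("hat", "bat")],
}

multiplex = {}
for name, edges in layers.items():
    g = nx.Graph()
    g.add_nodes_from(words)      # identical node set on every layer
    g.add_edges_from(edges)
    multiplex[name] = g

# Degree aggregated across layers: a simple multiplex centrality.
overlap_degree = {w: sum(g.degree(w) for g in multiplex.values()) for w in words}
print(sorted(overlap_degree.items(), key=lambda kv: -kv[1]))
```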
February 14, 2011
This paper studies the effect of linguistic constraints on the large scale organization of language. It describes the properties of linguistic networks built using texts of written language with the words randomized. These properties are compared to those obtained for a network built over the text in natural order. It is observed that the "random" networks also exhibit small-world and scale-free characteristics. They also show a high degree of clustering. This is indeed a surp...
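A minimal sketch of the comparison this abstract describes: build a word co-occurrence network from a text in natural order and from the same words shuffled, then compare a small-world indicator (clustering). The toy text and window size are hypothetical choices, not the paper's corpus or exact construction:

```python
# Co-occurrence networks from natural vs. randomized word order.
import random
import networkx as nx

def cooccurrence_graph(tokens, window=2):
    g = nx.Graph()
    for i, w in enumerate(tokens):
        for v in tokens[i + 1:i + window]:
            if v != w:
                g.add_edge(w, v)
    return g

tokens = "the cat sat on the mat and the dog sat on the rug".split()
natural = cooccurrence_graph(tokens)

shuffled_tokens = tokens[:]
random.seed(0)
random.shuffle(shuffled_tokens)
shuffled = cooccurrence_graph(shuffled_tokens)

print("natural clustering :", nx.average_clustering(natural))
print("shuffled clustering:", nx.average_clustering(shuffled))
```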
March 8, 2012
The ability to mimic human notions of semantic distance has widespread applications. Some measures rely only on raw text (distributional measures) and some rely on knowledge sources such as WordNet. Although extensive studies have been performed to compare WordNet-based measures with human judgment, the use of distributional measures as proxies to estimate semantic distance has received little attention. Even though they have traditionally performed poorly when compared to Wo...
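A minimal sketch contrasting the two families of measures this abstract compares: a WordNet-based path measure versus a distributional (corpus co-occurrence) cosine measure. The co-occurrence counts below are hypothetical; real studies derive them from large corpora:

```python
# Knowledge-based vs. distributional semantic distance (toy example).
import numpy as np
import nltk
from nltk.corpus import wordnet as wn

nltk.download("wordnet", quiet=True)

# Knowledge-based: path similarity between the first noun senses.
car, truck = wn.synsets("car", "n")[0], wn.synsets("truck", "n")[0]
print("WordNet path similarity:", car.path_similarity(truck))

# Distributional: cosine of co-occurrence count vectors
# (hypothetical counts over context words such as road, engine, driver, banana).
car_vec   = np.array([4.0, 5.0, 3.0, 0.0])
truck_vec = np.array([5.0, 4.0, 2.0, 0.0])
cos = car_vec @ truck_vec / (np.linalg.norm(car_vec) * np.linalg.norm(truck_vec))
print("Distributional cosine :", cos)
```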
August 16, 2018
We present path2vec, a new approach for learning graph embeddings that relies on structural measures of pairwise node similarities. The model learns representations for nodes in a dense space that approximate a given user-defined graph distance measure, such as the shortest path distance or distance measures that take information beyond the graph structure into account. Evaluation of the proposed model on semantic similarity and word sense disambiguation tasks, using var...
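A minimal sketch of the idea behind this abstract: learn dense node vectors whose dot products approximate a user-defined pairwise graph similarity (here, inverse shortest-path distance on a toy graph). This is a plain least-squares fit by gradient descent, not the authors' exact objective or training scheme:

```python
# Fit node embeddings so that dot products approximate a graph similarity.
import numpy as np
import networkx as nx

g = nx.path_graph(6)                      # toy graph: 0-1-2-3-4-5
nodes = list(g.nodes())
n, dim, lr = len(nodes), 4, 0.05

# Target similarity: 1 / (1 + shortest path length)
d = dict(nx.all_pairs_shortest_path_length(g))
S = np.array([[1.0 / (1 + d[i][j]) for j in nodes] for i in nodes])

rng = np.random.default_rng(0)
E = rng.normal(scale=0.1, size=(n, dim))  # node embeddings to be learned

for step in range(2000):
    diff = E @ E.T - S                    # residual between dot products and targets
    grad = 2 * diff @ E                   # gradient of the squared error w.r.t. E
    E -= lr * grad / n

print("target  sim(0, 5):", S[0, 5])
print("learned sim(0, 5):", float(E[0] @ E[5]))
```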
May 16, 2023
In this paper we make an attempt to increase our understanding of the urban scaling phenomenon. We investigate how superlinear scaling emerges as a network increases in size and how this scaling depends on the occurrence of elements that constitute the network. To this end we consider a city as a complex network and simulate it using the network of publications of a university. In this simulation the publications take the role of the city inhabitants and the concepts (terms) i...
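A minimal sketch of how superlinear scaling is usually quantified in this setting: fit the exponent beta in C ~ N**beta from (system size, element count) pairs by least squares in log-log space. The data points below are hypothetical, not the paper's measurements:

```python
# Estimate a scaling exponent by log-log regression.
import numpy as np

N = np.array([100, 300, 1000, 3000, 10000])     # e.g. publications ("inhabitants")
C = np.array([450, 1700, 7200, 26000, 105000])  # e.g. distinct concepts produced

beta, _ = np.polyfit(np.log(N), np.log(C), 1)
print("scaling exponent beta = %.2f" % beta)    # beta > 1 indicates superlinear scaling
```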
May 25, 2009
There is a great deal of work in cognitive psychology, linguistics, and computer science about using word (or phrase) frequencies in context in text corpora to develop measures for word similarity or word association, going back to at least the 1960s. The goal of this chapter is to introduce the normalized web distance, a general way to tap the amorphous low-grade knowledge available for free on the Internet, typed in by local users aiming at personal gratification of diverse objectives...
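A minimal sketch of the normalized web distance referred to above, computed from page (or document) counts: f(x) and f(y) are counts of pages containing each term, f(x, y) the count containing both, and N the total number of indexed pages. The counts used below are hypothetical:

```python
# Normalized web distance from (hypothetical) page counts.
from math import log

def nwd(fx, fy, fxy, n):
    return (max(log(fx), log(fy)) - log(fxy)) / (log(n) - min(log(fx), log(fy)))

print(nwd(fx=9_000_000, fy=8_000_000, fxy=6_000_000, n=10**10))  # related terms: small distance
print(nwd(fx=9_000_000, fy=8_000_000, fxy=3_000,     n=10**10))  # unrelated terms: large distance
```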
October 6, 2020
It has become a de facto standard to represent words as elements of a vector space (word2vec, GloVe). While this approach is convenient, it is unnatural for language: words form a graph with a latent hierarchical structure, and this structure has to be revealed and encoded by word embeddings. We introduce GraphGlove: unsupervised graph word representations which are learned end-to-end. In our setting, each word is a node in a weighted graph and the distance between words is t...
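A minimal sketch of the graph-based distance this abstract describes: words are nodes in a weighted graph and the distance between two words is the length of the shortest path between them. The toy graph and weights below are hypothetical; in GraphGlove the graph structure and weights are learned end-to-end:

```python
# Word distance as a weighted shortest path in a word graph.
import networkx as nx

g = nx.Graph()
g.add_weighted_edges_from([
    ("cat", "feline", 0.2), ("feline", "animal", 0.3),
    ("dog", "canine", 0.2), ("canine", "animal", 0.3),
    ("cat", "pet", 0.4),    ("dog", "pet", 0.4),
])

dist = nx.shortest_path_length(g, "cat", "dog", weight="weight")
print("d(cat, dog) =", dist)   # 0.8 via "pet", shorter than 1.0 via "animal"
```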
June 3, 2015
The shortest path between two concepts in a taxonomic ontology is commonly used to represent the semantic distance between concepts in edge-based semantic similarity measures. In the past, edge counting was considered the default method for path computation, being simple, intuitive, and of low computational complexity. However, a large lexical taxonomy such as WordNet has irregular densities of links between concepts due to its broad domain but...
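A minimal sketch of the edge-based measure this abstract discusses: semantic similarity from the shortest path (counted in edges) between two concepts in the WordNet taxonomy, with sim = 1 / (1 + path_length). nltk's path_similarity implements essentially this edge-counting scheme:

```python
# Edge-counting semantic similarity over the WordNet taxonomy.
import nltk
from nltk.corpus import wordnet as wn

nltk.download("wordnet", quiet=True)

cat, dog = wn.synset("cat.n.01"), wn.synset("dog.n.01")
path_len = cat.shortest_path_distance(dog)        # number of edges between the concepts
print("edges between concepts:", path_len)
print("edge-based similarity :", 1.0 / (1 + path_len))
print("nltk path_similarity  :", cat.path_similarity(dog))
```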