Universal Cross-Lingual Text Classificat...

How Do Multilingual Encoders Learn Cross-lingual Representation?

July 12, 2022

91% Match

Shijie Wu

Computation and Language

NLP systems typically require support for more than one language. As different languages have different amounts of supervision, cross-lingual transfer benefits languages with little to no training data by transferring from other languages. From an engineering perspective, multilingual NLP benefits development and maintenance by serving multiple languages with a single system. Both cross-lingual transfer and multilingual NLP rely on cross-lingual representations serving as the...

Universal Language Model Fine-tuning for Text Classification

January 18, 2018

91% Match

Jeremy Howard, Sebastian Ruder

Computation and Language

Machine Learning

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the e...

A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning

October 18, 2022

90% Match

Kunbo Ding, Weijie Liu, Yuejian Fang, Weiquan Mao, Zhe Zhao, Tao Zhu, Haoyan Liu, ... , Yiren Chen

Computation and Language

Existing zero-shot cross-lingual transfer methods rely on parallel corpora or bilingual dictionaries, which are expensive and impractical for low-resource languages. To disengage from these dependencies, researchers have explored training multilingual models on English-only resources and transferring them to low-resource languages. However, its effect is limited by the gap between embedding clusters of different languages. To address this issue, we propose Embedding-Push, Att...

Multilingual Hierarchical Attention Networks for Document Classification

July 4, 2017

90% Match

Nikolaos Pappas, Andrei Popescu-Belis

Computation and Language

Hierarchical attention networks have recently achieved remarkable performance for document classification in a given language. However, when multilingual document collections are considered, training such models separately for each language entails linear parameter growth and lack of cross-language transfer. Learning a single multilingual model with fewer parameters is therefore a challenging but potentially beneficial objective. To this end, we propose multilingual hierarchi...

Cross-lingual Dataless Classification for Languages with Small Wikipedia Presence

November 13, 2016

90% Match

Yangqiu Song, Stephen Mayhew, Dan Roth

Computation and Language

This paper presents an approach to classify documents in any language into an English topical label space, without any text categorization training data. The approach, Cross-Lingual Dataless Document Classification (CLDDC), relies on mapping the English labels or short category description into a Wikipedia-based semantic representation, and on the use of the target language Wikipedia. Consequently, performance could suffer when Wikipedia in the target language is small. In thi...

Unsupervised Cross-lingual Representation Learning at Scale

November 5, 2019

90% Match

Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, ... , Veselin Stoyanov

Computation and Language

This paper shows that pretraining multilingual language models at scale leads to significant performance gains for a wide range of cross-lingual transfer tasks. We train a Transformer-based masked language model on one hundred languages, using more than two terabytes of filtered CommonCrawl data. Our model, dubbed XLM-R, significantly outperforms multilingual BERT (mBERT) on a variety of cross-lingual benchmarks, including +14.6% average accuracy on XNLI, +13% average F1 scor...

Soft Language Clustering for Multilingual Model Pre-training

June 13, 2023

90% Match

Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, ... , Jie Zhou

Computation and Language

Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities; however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size. In this paper, we propose XLM-P, which contextually retrieves prompts as flexible guidance for encoding instances conditionally. Our XLM-P enables (1) lightweight modeling of language-invariant and language-spec...

A Multilingual Bag-of-Entities Model for Zero-Shot Cross-Lingual Text Classification

October 15, 2021

90% Match

Sosuke Nishikawa, Ikuya Yamada, ... , Isao Echizen

Computation and Language

We present a multilingual bag-of-entities model that effectively boosts the performance of zero-shot cross-lingual text classification by extending a multilingual pre-trained language model (e.g., M-BERT). It leverages the multilingual nature of Wikidata: entities in multiple languages representing the same concept are defined with a unique identifier. This enables entities described in multiple languages to be represented using shared embeddings. A model trained on entity fe...

Using Machine Translation to Augment Multilingual Classification

May 9, 2024

90% Match

Adam King

Computation and Language

An all-too-present bottleneck for text classification model development is the need to annotate training data and this need is multiplied for multilingual classifiers. Fortunately, contemporary machine translation models are both easily accessible and have dependable translation quality, making it possible to translate labeled training data from one language into another. Here, we explore the effects of using machine translation to fine-tune a multilingual model for a classif...

Low-Resource Sequence Labeling via Unsupervised Multilingual Contextualized Representations

October 24, 2019

90% Match

Zuyi Bao, Rui Huang, ... , Kenny Q. Zhu

Computation and Language

Previous work on cross-lingual sequence labeling tasks either requires parallel data or bridges the two languages through word-by-word matching. Such requirements and assumptions are infeasible for most languages, especially for languages with large linguistic distances, e.g., English and Chinese. In this work, we propose a Multilingual Language Model with deep semantic Alignment (MLMA) to generate language-independent representations for cross-lingual sequence labeling. Our m...

Universal Cross-Lingual Text Classification
