Similar papers
October 7, 2023
Although pre-trained language models have recently achieved great success on multilingual NLP (Natural Language Processing) tasks, the lack of training data for many tasks in low-resource languages still limits their performance. One effective way to address this problem is to transfer knowledge from rich-resource languages to low-resource languages. However, many previous works on cross-lingual transfer rely heavily on parallel corpora or translation models, which are oft...
May 11, 2023
Massively multilingual pretrained transformers (MMTs) have tremendously pushed the state of the art in multilingual NLP, and in cross-lingual transfer of NLP models in particular. While a large body of work has leveraged MMTs to mine parallel data and induce bilingual document embeddings, much less effort has been devoted to training a general-purpose (massively) multilingual document encoder that can be used for both supervised and unsupervised document-level tasks. In this work, we p...
May 1, 2020
Massively multilingual transformers pretrained with language modeling objectives (e.g., mBERT, XLM-R) have become a de facto default transfer paradigm for zero-shot cross-lingual transfer in NLP, offering unmatched transfer performance. Current downstream evaluations, however, verify their efficacy predominantly in transfer settings involving languages with sufficient amounts of pretraining data, and with lexically and typologically close languages. In this work, we analyze t...
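To make the transfer setting concrete, here is a minimal sketch of zero-shot cross-lingual transfer with XLM-R: fine-tune on English task data only, then evaluate directly on a target language. The dataset variables (english_train, target_lang_test) are assumptions for illustration, not artifacts of the paper.

    # Assumes english_train / target_lang_test are datasets.Dataset objects
    # with "text" and "label" columns (hypothetical names).
    from transformers import (
        AutoModelForSequenceClassification,
        AutoTokenizer,
        Trainer,
        TrainingArguments,
    )

    tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
    model = AutoModelForSequenceClassification.from_pretrained(
        "xlm-roberta-base", num_labels=3
    )

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True, max_length=128)

    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="out", num_train_epochs=3),
        train_dataset=english_train.map(tokenize, batched=True),
        tokenizer=tokenizer,  # enables padding via the default data collator
    )
    trainer.train()  # English-only fine-tuning

    # Zero-shot: no target-language labels were seen during training.
    metrics = trainer.evaluate(target_lang_test.map(tokenize, batched=True))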
June 2, 2024
As conventional topic models rely on word co-occurrence to infer latent topics, topic modeling for short texts has been a long-standing challenge. Large Language Models (LLMs) can potentially overcome this challenge by contextually learning the semantics of words via pretraining. This paper studies two approaches, parallel prompting and sequential prompting, to use LLMs for topic modeling. Due to input length limitations, LLMs cannot process many texts at once. By splitti...
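The batching idea behind parallel prompting can be sketched in a few lines: prompt the LLM once per batch of short texts, then ask it to merge the per-batch topic lists. The complete() callable here is a stand-in for any LLM completion API, not the paper's actual interface.

    from typing import Callable, List

    def parallel_topic_prompting(
        docs: List[str],
        complete: Callable[[str], str],  # stand-in for an LLM call (assumption)
        batch_size: int = 20,
    ) -> List[str]:
        # First pass: extract topics from each context-window-sized batch.
        batch_topics: List[str] = []
        for i in range(0, len(docs), batch_size):
            batch = "\n".join(docs[i : i + batch_size])
            prompt = f"List the main topics in these short texts:\n{batch}\nTopics:"
            batch_topics.append(complete(prompt))
        # Second pass: merge the per-batch topic lists into one deduplicated set.
        merge_prompt = (
            "Merge these topic lists into a single deduplicated list:\n"
            + "\n".join(batch_topics)
        )
        return complete(merge_prompt).splitlines()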
December 29, 2021
Transformers are responsible for the vast majority of recent advances in natural language processing, and most practical applications of these models are enabled through transfer learning. This paper studies whether corpus-specific tokenization used for fine-tuning improves the resulting performance of the model. Through a series of experiments, we demonstrate that such tokenization, combined with the initialization and fine-tuning strat...
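As a rough illustration of corpus-specific tokenization (a sketch assuming corpus is an iterable of in-domain strings; not the paper's exact recipe): retrain the tokenizer vocabulary on the fine-tuning corpus, then resize the model's embedding matrix to match, at which point the initialization of the new embedding rows becomes the key design choice.

    from transformers import AutoModel, AutoTokenizer

    base_tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    # Retrain the (fast) tokenizer's vocabulary on the downstream corpus.
    new_tokenizer = base_tokenizer.train_new_from_iterator(corpus, vocab_size=30_000)

    model = AutoModel.from_pretrained("bert-base-uncased")
    # Align the embedding matrix with the new vocabulary size; how the new
    # rows are initialized is the kind of strategy such experiments vary.
    model.resize_token_embeddings(len(new_tokenizer))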
February 28, 2022
Multilingual pre-trained language models (MPLMs) can not only handle tasks in different languages but also exhibit surprising zero-shot cross-lingual transferability. However, MPLMs usually cannot match the supervised performance of state-of-the-art monolingual pre-trained models on rich-resource languages. In this paper, we aim to improve the multilingual model's supervised and zero-shot performance simultaneously only with the resources from s...
December 15, 2023
Topic modeling is a widely used technique for revealing underlying thematic structures within textual data. However, existing models have certain limitations, particularly when dealing with short text datasets that lack co-occurring words. Moreover, these models often neglect sentence-level semantics, focusing primarily on token-level semantics. In this paper, we propose PromptTopic, a novel topic modeling approach that harnesses the advanced language understanding of large l...
December 3, 2019
Transfer learning between different language pairs has shown its effectiveness for Neural Machine Translation (NMT) in low-resource scenarios. However, existing transfer methods involving a common target language are far from successful in the extreme scenario of zero-shot translation, due to the language-space mismatch between the transferor (the parent model) and the transferee (the child model) on the source side. To address this challenge, we propose an effective transfer lea...
October 30, 2018
A significant roadblock in multilingual neural language modeling is the lack of labeled non-English data. One potential way to overcome this issue is to learn cross-lingual text representations that transfer performance from training on English tasks to non-English tasks, despite little to no task-specific non-English data. In this paper, we explore a natural setup for learning cross-lingual sentence representations: the dual-encoder. We provide a com...
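A minimal PyTorch sketch of the dual-encoder setup (illustrative, not the paper's exact architecture): both sentences are embedded into a shared space, and training uses an in-batch softmax where every other target in the batch acts as a negative.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class DualEncoder(nn.Module):
        def __init__(self, vocab_size: int, dim: int = 256):
            super().__init__()
            # Mean-pooled bag-of-embeddings stands in for a real sentence encoder.
            self.embed = nn.EmbeddingBag(vocab_size, dim)

        def forward(self, src_ids, tgt_ids):
            # Encode both sides into the same space and L2-normalize.
            src = F.normalize(self.embed(src_ids), dim=-1)
            tgt = F.normalize(self.embed(tgt_ids), dim=-1)
            return src, tgt

    def in_batch_loss(src, tgt, scale: float = 20.0):
        logits = scale * src @ tgt.t()      # pairwise similarity scores
        labels = torch.arange(src.size(0))  # i-th source matches i-th target
        return F.cross_entropy(logits, labels)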
September 1, 2019
The recently proposed massively multilingual neural machine translation (NMT) system has been shown to be capable of translating over 100 languages to and from English within a single model. Its improved translation performance on low-resource languages hints at potential cross-lingual transfer capability for downstream tasks. In this paper, we evaluate the cross-lingual effectiveness of representations from the encoder of a massively multilingual NMT model on 5 downstream cl...
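One common way to reuse such an encoder for downstream classification, sketched here with M2M-100 standing in for a massively multilingual NMT system (an assumption, not the paper's model), is to mean-pool the encoder's hidden states into fixed sentence embeddings and train a light classifier on top.

    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("facebook/m2m100_418M")
    tokenizer.src_lang = "en"  # tell the tokenizer the input language
    encoder = AutoModelForSeq2SeqLM.from_pretrained(
        "facebook/m2m100_418M"
    ).get_encoder()

    def sentence_embedding(text: str) -> torch.Tensor:
        inputs = tokenizer(text, return_tensors="pt")
        with torch.no_grad():
            states = encoder(**inputs).last_hidden_state  # (1, seq_len, hidden)
        return states.mean(dim=1).squeeze(0)  # mean pooling over tokens

    # These frozen embeddings then serve as features for any downstream
    # classifier, in any language the NMT encoder was trained on.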