Multilingual Few-Shot Learning via Language Model Retrieval

Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks

February 21, 2024

Minju Seo, Jinheon Baek, ... , Sung Ju Hwang

Computation and Language

Artificial Intelligence

Machine Learning

Despite large successes of recent language models on diverse tasks, they suffer from severe performance degeneration in low-resource settings with limited training data available. Many existing works tackle this problem by generating synthetic data from the training data and then training models on them, recently using Large Language Models (LLMs). However, in low-resource settings, the amount of seed data samples to use for data augmentation is very small, which makes genera...

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

May 29, 2022

Xiang Chen, Lei Li, Ningyu Zhang, Xiaozhuan Liang, Shumin Deng, Chuanqi Tan, Fei Huang, ... , Huajun Chen

Computation and Language

Artificial Intelligence

Information Retrieval

Machine Learning

Prompt learning approaches have made waves in natural language processing by inducing better few-shot performance while they still follow a parametric-based learning paradigm; the oblivion and rote memorization problems in learning may encounter unstable generalization issues. Specifically, vanilla prompt learning may struggle to utilize atypical instances by rote during fully-supervised training or overfit shallow patterns with low-shot data. To alleviate such limitations, w...

MultiFiT: Efficient Multi-lingual Language Model Fine-tuning

September 10, 2019

Julian Martin Eisenschlos, Sebastian Ruder, Piotr Czapla, Marcin Kardas, ... , Jeremy Howard

Computation and Language

Machine Learning

Pretrained language models are promising particularly for low-resource languages as they only require unlabelled data. However, training existing models requires huge amounts of compute, while pretrained cross-lingual models often underperform on low-resource languages. We propose Multi-lingual language model Fine-Tuning (MultiFiT) to enable practitioners to train and fine-tune language models efficiently in their own language. In addition, we propose a zero-shot method using...

Towards a Common Understanding of Contributing Factors for Cross-Lingual Transfer in Multilingual Language Models: A Review

May 26, 2023

Fred Philippy, Siwen Guo, Shohreh Haddadan

Computation and Language

Artificial Intelligence

In recent years, pre-trained Multilingual Language Models (MLLMs) have shown a strong ability to transfer knowledge across different languages. However, given that the aspiration for such an ability has not been explicitly incorporated in the design of the majority of MLLMs, it is challenging to obtain a unique and straightforward explanation for its emergence. In this review paper, we survey literature that investigates different factors contributing to the capacity of MLLMs...

Model Selection for Cross-Lingual Transfer

October 13, 2020

Yang Chen, Alan Ritter

Computation and Language

Machine Learning

Transformers that are pre-trained on multilingual corpora, such as mBERT and XLM-RoBERTa, have achieved impressive cross-lingual transfer capabilities. In the zero-shot transfer setting, only English training data is used, and the fine-tuned model is evaluated on another target language. While this works surprisingly well, substantial variance has been observed in target language performance between different fine-tuning runs, and in the zero-shot setup, no target-language d...

LiST: Lite Prompted Self-training Makes Parameter-Efficient Few-shot Learners

October 12, 2021

Yaqing Wang, Subhabrata Mukherjee, Xiaodong Liu, Jing Gao, ... , Jianfeng Gao

Computation and Language

We present a new method, LiST (short for Lite Prompted Self-Training), for parameter-efficient fine-tuning of large pre-trained language models (PLMs) for few-shot learning. LiST improves over recent methods that adopt prompt-based fine-tuning (FN) using two key techniques. The first is the use of self-training to leverage large amounts of unlabeled data for prompt-based FN in few-shot settings. We use self-training in conjunction with meta-learning for re-weighting noisy pseu...

Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning

April 3, 2023

Lifu Tu, Jin Qu, Semih Yavuz, Shafiq Joty, Wenhao Liu, ... , Yingbo Zhou

Computation and Language

Artificial Intelligence

Cross-lingual transfer of language models trained on high-resource languages like English has been widely studied for many NLP tasks, but focus on conversational tasks has been rather limited. This is partly due to the high cost of obtaining non-English conversational data, which results in limited coverage. In this work, we introduce XSGD for cross-lingual alignment pretraining, a parallel and large-scale multilingual conversation dataset that we created by translating the E...

Unsupervised Cross-Task Generalization via Retrieval Augmentation

April 17, 2022

Bill Yuchen Lin, Kangmin Tan, Chris Miller, ... , Xiang Ren

Computation and Language

Artificial Intelligence

Machine Learning

Humans can perform unseen tasks by recalling relevant skills acquired previously and then generalizing them to the target tasks, even if there is no supervision at all. In this paper, we aim to improve this kind of cross-task generalization ability of massive multi-task language models, such as T0 and FLAN, in an unsupervised setting. We propose a retrieval-augmentation method named ReCross that takes a few unlabelled examples as queries to retrieve a small subset of upstream...

Multi-task Retrieval for Knowledge-Intensive Tasks

January 1, 2021

Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oğuz, ... , Gargi Ghosh

Computation and Language

Retrieving relevant contexts from a large corpus is a crucial step for tasks such as open-domain question answering and fact checking. Although neural retrieval outperforms traditional methods like tf-idf and BM25, its performance degrades considerably when applied to out-of-domain data. Driven by the question of whether a neural retrieval model can be universal and perform robustly on a wide variety of problems, we propose a multi-task trained model. Our approach not only ...

Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models

June 25, 2021

Robert L. Logan IV, Ivana Balažević, Eric Wallace, Fabio Petroni, ... , Sebastian Riedel

Computation and Language

Machine Learning

Prompting language models (LMs) with training examples and task descriptions has been seen as critical to recent successes in few-shot learning. In this work, we show that finetuning LMs in the few-shot setting can considerably reduce the need for prompt engineering. In fact, one can use null prompts, prompts that contain neither task-specific templates nor training examples, and achieve competitive accuracy to manually-tuned prompts across a wide range of tasks. While finetu...
