DRAFT: Dense Retrieval Augmented Few-sho...

SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation

May 16, 2024

91% Match

Abhishek Divekar, Greg Durrett

Computation and Language

Artificial Intelligence

Machine Learning

Large language models (LLMs) are versatile and can address many tasks, but for computational efficiency, it is often desirable to distill their capabilities into smaller student models. One way to do this for classification tasks is via dataset synthesis, which can be accomplished by generating examples of each label from the LLM. Prior approaches to synthesis use few-shot prompting, which relies on the LLM's parametric knowledge to generate usable examples. However, this lea...

Find SimilarView on arXiv

A Thorough Examination on Zero-shot Dense Retrieval

April 27, 2022

90% Match

Ruiyang Ren, Yingqi Qu, Jing Liu, Wayne Xin Zhao, Qifei Wu, Yuchen Ding, Hua Wu, ... , Wen Ji-Rong

Computation and Language

Information Retrieval

Recent years have witnessed the significant advance in dense retrieval (DR) based on powerful pre-trained language models (PLM). DR models have achieved excellent performance in several benchmark datasets, while they are shown to be not as competitive as traditional sparse retrieval models (e.g., BM25) in a zero-shot retrieval setting. However, in the related literature, there still lacks a detailed and comprehensive study on zero-shot retrieval. In this paper, we present the...

Find SimilarView on arXiv

Precise Zero-Shot Dense Retrieval without Relevance Labels

December 20, 2022

90% Match

Luyu Gao, Xueguang Ma, ... , Callan Jamie

Information Retrieval

Computation and Language

While dense retrieval has been shown effective and efficient across tasks and languages, it remains difficult to create effective fully zero-shot dense retrieval systems when no relevance label is available. In this paper, we recognize the difficulty of zero-shot learning and encoding relevance. Instead, we propose to pivot through Hypothetical Document Embeddings~(HyDE). Given a query, HyDE first zero-shot instructs an instruction-following language model (e.g. InstructGPT) ...

Find SimilarView on arXiv

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

February 15, 2023

90% Match

Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, ... , Chen Xilun

Information Retrieval

Computation and Language

Various techniques have been developed in recent years to improve dense retrieval (DR), such as unsupervised contrastive learning and pseudo-query generation. Existing DRs, however, often suffer from effectiveness tradeoffs between supervised and zero-shot retrieval, which some argue was due to the limited model capacity. We contradict this hypothesis and show that a generalizable DR can be trained to achieve high accuracy in both supervised and zero-shot retrieval without in...

Find SimilarView on arXiv

KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

July 4, 2023

90% Match

Weijie Xu, Xiaoyu Jiang, Jay Desai, Bin Han, ... , Iannacci Francis

Computation and Language

Artificial Intelligence

In text classification tasks, fine tuning pretrained language models like BERT and GPT-3 yields competitive accuracy; however, both methods require pretraining on large text datasets. In contrast, general topic modeling methods possess the advantage of analyzing documents to extract meaningful patterns of words without the need of pretraining. To leverage topic modeling's unsupervised insights extraction on text classification tasks, we develop the Knowledge Distillation Semi...

Find SimilarView on arXiv

In-Context Learning for Text Classification with Many Labels

September 19, 2023

90% Match

Aristides Milios, Siva Reddy, Dzmitry Bahdanau

Computation and Language

Machine Learning

In-context learning (ICL) using large language models for tasks with many labels is challenging due to the limited context window, which makes it difficult to fit a sufficient number of examples in the prompt. In this paper, we use a pre-trained dense retrieval model to bypass this limitation, giving the model only a partial view of the full label space for each inference call. Testing with recent open-source LLMs (OPT, LLaMA), we set new state of the art performance in few-s...

Find SimilarView on arXiv

Automated Few-shot Classification with Instruction-Finetuned Language Models

May 21, 2023

90% Match

Rami Aly, Xingjian Shi, Kaixiang Lin, ... , Wilson Andrew Gordon

Computation and Language

A particularly successful class of approaches for few-shot learning combines language models with prompts -- hand-crafted task descriptions that complement data samples. However, designing prompts by hand for each task commonly requires domain knowledge and substantial guesswork. We observe, in the context of classification tasks, that instruction finetuned language models exhibit remarkable prompt robustness, and we subsequently propose a simple method to eliminate the need ...

Find SimilarView on arXiv

Mask-guided BERT for Few Shot Text Classification

February 21, 2023

90% Match

Wenxiong Liao, Zhengliang Liu, Haixing Dai, Zihao Wu, Yiyang Zhang, Xiaoke Huang, Yuzhong Chen, Xi Jiang, Wei Liu, Dajiang Zhu, Tianming Liu, Sheng Li, ... , Cai Hongmin

Computation and Language

Artificial Intelligence

Transformer-based language models have achieved significant success in various domains. However, the data-intensive nature of the transformer architecture requires much labeled data, which is challenging in low-resource scenarios (i.e., few-shot learning (FSL)). The main challenge of FSL is the difficulty of training robust models on small amounts of samples, which frequently leads to overfitting. Here we present Mask-BERT, a simple and modular framework to help BERT-based ar...

Find SimilarView on arXiv

Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models

July 17, 2023

90% Match

Zhiyuan Peng, Xuyang Wu, Yi Fang

Information Retrieval

Artificial Intelligence

Computation and Language

Machine Learning

Dense retrieval (DR) converts queries and documents into dense embeddings and measures the similarity between queries and documents in vector space. One of the challenges in DR is the lack of domain-specific training data. While DR models can learn from large-scale public datasets like MS MARCO through transfer learning, evidence shows that not all DR models and domains can benefit from transfer learning equally. Recently, some researchers have resorted to large language mode...

Find SimilarView on arXiv

Leveraging LLMs for Unsupervised Dense Retriever Ranking

February 7, 2024

90% Match

Ekaterina Khramtsova, Shengyao Zhuang, ... , Zuccon Guido

Information Retrieval

This paper introduces a novel unsupervised technique that utilizes large language models (LLMs) to determine the most suitable dense retriever for a specific test(target) corpus. Selecting the appropriate dense retriever is vital for numerous IR applications that employ these retrievers, trained on public datasets, to encode or conduct searches within a new private target corpus. The effectiveness of a dense retriever can significantly diminish when applied to a target corpus...

Find SimilarView on arXiv

DRAFT: Dense Retrieval Augmented Few-shot Topic classifier Framework

SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation

A Thorough Examination on Zero-shot Dense Retrieval

Precise Zero-Shot Dense Retrieval without Relevance Labels

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

In-Context Learning for Text Classification with Many Labels

Automated Few-shot Classification with Instruction-Finetuned Language Models

Mask-guided BERT for Few Shot Text Classification

Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models

Leveraging LLMs for Unsupervised Dense Retriever Ranking