Late Chunking: Contextual Chunk Embeddin...

Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models

January 28, 2025

92% Match

Minghan Li, Eric Gaussier, Guodong Zhou

Information Retrieval

In recent years, large language models (LLMs) have demonstrated exceptional power in various domains, including information retrieval. Most of the previous practices involve leveraging these models to create a single embedding for each query, each passage, or each document individually, a strategy exemplified and used by the Retrieval-Augmented Generation (RAG) framework. While this method has proven effective, we argue that it falls short in fully capturing the nuanced intri...


Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

February 12, 2024

92% Match

Jon Saad-Falcon, Daniel Y. Fu, Simran Arora, ... , Christopher Ré

Information Retrieval

Machine Learning

Retrieval pipelines, an integral component of many machine learning systems, perform poorly in domains where documents are long (e.g., 10K tokens or more) and where identifying the relevant document requires synthesizing information across the entire text. Developing long-context retrieval encoders suitable for these domains raises three challenges: (1) how to evaluate long-context retrieval performance, (2) how to pretrain a base language model to represent both short contexts...


SeDR: Segment Representation Learning for Long Documents Dense Retrieval

November 20, 2022

92% Match

Junying Chen, Qingcai Chen, ... , Yutao Huang

Information Retrieval

Computation and Language

Recently, Dense Retrieval (DR) has become a promising solution to document retrieval, where document representations are used to perform effective and efficient semantic search. However, DR remains challenging on long documents, due to the quadratic complexity of its Transformer-based encoder and the finite capacity of a low-dimension embedding. Current DR models apply suboptimal strategies such as truncation or splitting-and-pooling to long documents, leading to poor utilizatio...


ChuLo: Chunk-Level Key Information Representation for Long Document Processing

October 14, 2024

92% Match

Yan Li, Soyeon Caren Han, ... , Feiqi Cao

Computation and Language

Transformer-based models have achieved remarkable success in various Natural Language Processing (NLP) tasks, yet their ability to handle long documents is constrained by computational limitations. Traditional approaches, such as truncating inputs, sparse self-attention, and chunking, attempt to mitigate these issues, but they often lead to information loss and hinder the model's ability to capture long-range dependencies. In this paper, we introduce ChuLo, a novel chunk repr...


Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers

August 25, 2023

92% Match

Jiawen Xie, Pengyu Cheng, Xiao Liang, ... , Nan Du

Computation and Language

Artificial Intelligence

Although dominant in natural language processing, transformer-based models remain challenged by the task of long-sequence processing, because the computational cost of self-attention operations in transformers swells quadratically with the input sequence length. To alleviate the complexity of long-sequence processing, we propose a simple framework to enable the off-the-shelf pre-trained transformers to process much longer sequences, while the computation and memory costs remai...


BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

February 18, 2024

91% Match

Kun Luo, Zheng Liu, ... , Kang Liu

Computation and Language

Large language models (LLMs) call for extension of context to handle many critical applications. However, the existing approaches are prone to expensive costs and inferior quality of context extension. In this work, we propose Extensible Embedding, which realizes high-quality extension of LLM's context with strong flexibility and cost-effectiveness. Extensible embedding stands as an enhancement of typical token embedding, which represents the information for an extensible scope...


Hierarchical Re-ranker Retriever (HRR)

March 4, 2025

91% Match

Ashish Singh, Priti Mohapatra

Information Retrieval

Computation and Language

Retrieving the right level of context for a given query is a perennial challenge in information retrieval: too large a chunk dilutes semantic specificity, while chunks that are too small lack broader context. This paper introduces the Hierarchical Re-ranker Retriever (HRR), a framework designed to achieve both fine-grained and high-level context retrieval for large language model (LLM) applications. In HRR, documents are split into sentence-level and intermediate-level (512 ...


A Survey on Long Text Modeling with Transformers

February 28, 2023

91% Match

Zican Dong, Tianyi Tang, ... , Wayne Xin Zhao

Computation and Language

Modeling long texts has been an essential technique in the field of natural language processing (NLP). With the ever-growing number of long documents, it is important to develop effective modeling methods that can process and analyze such texts. However, long texts pose important research challenges for existing text models, with more complex semantics and special characteristics. In this paper, we provide an overview of the recent advances on long text modeling based on Tra...


Transformer Based Language Models for Similar Text Retrieval and Ranking

May 10, 2020

91% Match

Javed Qadrud-Din, Ashraf Bah Rabiou, Ryan Walker, Ravi Soni, Martin Gajek, ... , Akhil Rangaraj

Information Retrieval

Computation and Language

Most approaches for similar text retrieval and ranking with long natural language queries rely at some level on queries and responses having words in common with each other. Recent applications of transformer-based neural language models to text retrieval and ranking problems have been very promising, but still involve a two-step process in which result candidates are first obtained through bag-of-words-based approaches, and then reranked by a neural transformer. In this pape...


Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

November 1, 2024

91% Match

Lixiao Yang, Mengyang Xu, Weimao Ke

Information Retrieval

Computation and Language

Machine Learning

Question-answering (QA) is an important application of Information Retrieval (IR) and language models, and the latest trend is toward pre-trained large neural networks with embedding parameters. Augmenting QA performances with these LLMs requires intensive computational resources for fine-tuning. We propose an innovative approach to improve QA task performances by integrating optimized vector retrievals and instruction methodologies. Based on retrieval augmentation, the proce...


Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models
