Contextual Memory Reweaving in Large Lan...

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

September 9, 2024

91% Match

Hongjin Qian, Peitian Zhang, Zheng Liu, ... , Dou Zhicheng

Computation and Language

Artificial Intelligence

Retrieval-Augmented Generation (RAG) leverages retrieval tools to access external databases, thereby enhancing the generation quality of large language models (LLMs) through optimized context. However, the existing retrieval methods are constrained inherently, as they can only perform relevance matching between explicitly stated queries and well-formed knowledge, but unable to handle tasks involving ambiguous information needs or unstructured knowledge. Consequently, existing...

Find SimilarView on arXiv

ACER: Automatic Language Model Context Extension via Retrieval

October 11, 2024

91% Match

Luyu Gao, Yunyi Zhang, Jamie Callan

Computation and Language

Artificial Intelligence

Information Retrieval

Machine Learning

Long-context modeling is one of the critical capabilities of language AI for digesting and reasoning over complex information pieces. In practice, long-context capabilities are typically built into a pre-trained language model~(LM) through a carefully designed context extension stage, with the goal of producing generalist long-context capabilities. In our preliminary experiments, however, we discovered that the current open-weight generalist long-context models are still lack...

Find SimilarView on arXiv

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

May 24, 2023

91% Match

Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, ... , Chen Weizhu

Computation and Language

Large language models are powerful text processors and reasoners, but are still subject to limitations including outdated knowledge and hallucinations, which necessitates connecting them to the world. Retrieval-augmented large language models have raised extensive attention for grounding model generation on external knowledge. However, retrievers struggle to capture relevance, especially for queries with complex information needs. Recent work has proposed to improve relevance...

Find SimilarView on arXiv

Boosting Long-Context Management via Query-Guided Activation Refilling

December 17, 2024

91% Match

Hongjin Qian, Zheng Liu, Peitian Zhang, ... , Lian Defu

Computation and Language

Artificial Intelligence

Information Retrieval

Processing long contexts poses a significant challenge for large language models (LLMs) due to their inherent context-window limitations and the computational burden of extensive key-value (KV) activations, which severely impact efficiency. For information-seeking tasks, full context perception is often unnecessary, as a query's information needs can dynamically range from localized details to a global perspective, depending on its complexity. However, existing methods strugg...

Find SimilarView on arXiv

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

June 14, 2024

91% Match

Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Ivan Rodkin, Dmitry Sorokin, ... , Burtsev Mikhail

Computation and Language

Artificial Intelligence

In recent years, the input context sizes of large language models (LLMs) have increased dramatically. However, existing evaluation methods have not kept pace, failing to comprehensively assess the efficiency of models in handling long contexts. To bridge this gap, we introduce the BABILong benchmark, designed to test language models' ability to reason across facts distributed in extremely long documents. BABILong includes a diverse set of 20 reasoning tasks, including fact ch...

Find SimilarView on arXiv

Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding

June 18, 2024

91% Match

Weizhi Fei, Xueyan Niu, Guoqing Xie, Yanhua Zhang, Bo Bai, ... , Han Wei

Computation and Language

Artificial Intelligence

Current Large Language Models (LLMs) face inherent limitations due to their pre-defined context lengths, which impede their capacity for multi-hop reasoning within extensive textual contexts. While existing techniques like Retrieval-Augmented Generation (RAG) have attempted to bridge this gap by sourcing external information, they fall short when direct answers are not readily available. We introduce a novel approach that re-imagines information retrieval through dynamic in-c...

Find SimilarView on arXiv

Does RAG Really Perform Bad For Long-Context Processing?

February 17, 2025

91% Match

Kun Luo, Zheng Liu, Peitian Zhang, Hongjin Qian, ... , Liu Kang

Computation and Language

The efficient processing of long context poses a serious challenge for large language models (LLMs). Recently, retrieval-augmented generation (RAG) has emerged as a promising strategy for this problem, as it enables LLMs to make selective use of the long context for efficient computation. However, existing RAG approaches lag behind other long-context processing methods due to inherent limitations on inaccurate retrieval and fragmented contexts. To address these challenges, we...

Find SimilarView on arXiv

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

May 22, 2020

91% Match

Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, ... , Kiela Douwe

Computation and Language

Machine Learning

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems...

Find SimilarView on arXiv

Disentangling Memory and Reasoning Ability in Large Language Models

November 20, 2024

91% Match

Mingyu Jin, Weidi Luo, Sitao Cheng, Xinyi Wang, Wenyue Hua, Ruixiang Tang, ... , Zhang Yongfeng

Computation and Language

Large Language Models (LLMs) have demonstrated strong performance in handling complex tasks requiring both extensive knowledge and reasoning abilities. However, the existing LLM inference pipeline operates as an opaque process without explicit separation between knowledge retrieval and reasoning steps, making the model's decision-making process unclear and disorganized. This ambiguity can lead to issues such as hallucinations and knowledge forgetting, which significantly impa...

Find SimilarView on arXiv

Augmenting Language Models with Long-Term Memory

June 12, 2023

91% Match

Weizhi Wang, Li Dong, Hao Cheng, Xiaodong Liu, Xifeng Yan, ... , Wei Furu

Computation and Language

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models Augmented with Long-Term Memory (LongMem), which enables LLMs to memorize long history. We design a novel decoupled network architecture with the original backbone LLM frozen as a memory encoder and an adaptive residual side-network as a memo...

Find SimilarView on arXiv

Contextual Memory Reweaving in Large Language Models Using Layered Latent State Reconstruction

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

ACER: Automatic Language Model Context Extension via Retrieval

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Boosting Long-Context Management via Query-Guided Activation Refilling

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding

Does RAG Really Perform Bad For Long-Context Processing?

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Disentangling Memory and Reasoning Ability in Large Language Models

Augmenting Language Models with Long-Term Memory