PLANNER: Generating Diversified Paragrap...

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise

December 22, 2022

93% Match

Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, ... , Chen Weizhu

Computation and Language

Machine Learning

In this paper, we introduce a novel dIffusion language modEl pre-training framework for text generation, which we call GENIE. GENIE is a large-scale pretrained diffusion language model that consists of an encoder and a diffusion-based decoder, which can generate text by gradually transforming a random noise sequence into a coherent text sequence. To pre-train GENIE on a large-scale language corpus, we design a new continuous paragraph denoise objective, which encourages the d...

Find SimilarView on arXiv

Enforcing Paraphrase Generation via Controllable Latent Diffusion

April 13, 2024

92% Match

Wei Zou, Ziyuan Zhuang, Shujian Huang, ... , Chen Jiajun

Computation and Language

Paraphrase generation aims to produce high-quality and diverse utterances of a given text. Though state-of-the-art generation via the diffusion model reconciles generation quality and diversity, textual diffusion suffers from a truncation issue that hinders efficiency and quality control. In this work, we propose \textit{L}atent \textit{D}iffusion \textit{P}araphraser~(LDP), a novel paraphrase generation by modeling a controllable diffusion process given a learned latent spac...

Find SimilarView on arXiv

Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models

December 15, 2024

92% Match

Xiaochen Zhu, Georgi Karadzhov, ... , Vlachos Andreas

Computation and Language

Artificial Intelligence

Diffusion models have shown promise in text generation but often struggle with generating long, coherent, and contextually accurate text. Token-level diffusion overlooks word-order dependencies and enforces short output windows, while passage-level diffusion struggles with learning robust representation for long-form text. To address these challenges, we propose Segment-Level Diffusion (SLD), a framework that enhances diffusion-based text generation through text segmentation,...

Find SimilarView on arXiv

Latent Diffusion for Language Generation

December 19, 2022

92% Match

Justin Lovelace, Varsha Kishore, Chao Wan, ... , Weinberger Kilian Q.

Computation and Language

Machine Learning

Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have presented diffusion as an alternative to existing pretrained language models. We view diffusion and existing language models as complementary. We demonstrate that encoder-decoder language models can be utilized to efficiently learn high-quality l...

Find SimilarView on arXiv

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

May 16, 2023

92% Match

Tong Wu, Zhihao Fan, Xiao Liu, Yeyun Gong, Yelong Shen, Jian Jiao, Hai-Tao Zheng, Juntao Li, Zhongyu Wei, Jian Guo, ... , Chen Weizhu

Computation and Language

Diffusion models have gained significant attention in the realm of image generation due to their exceptional performance. Their success has been recently expanded to text generation via generating all tokens within a sequence concurrently. However, natural language exhibits a far more pronounced sequential dependency in comparison to images, and the majority of existing language models are trained with a left-to-right auto-regressive approach. To account for the inherent sequ...

Find SimilarView on arXiv

PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling

November 10, 2024

91% Match

Hyukhun Koh, Minha Jhang, Dohyung Kim, ... , Jung Kyomin

Computation and Language

Artificial Intelligence

Recently, discrete diffusion language models have demonstrated promising results in NLP. However, there has been limited research on integrating Pretrained Language Models (PLMs) into discrete diffusion models, resulting in underwhelming performance in downstream NLP generation tasks. This integration is particularly challenging because of the discrepancy between step-wise denoising strategy of diffusion models and single-step mask prediction approach of MLM-based PLMs. In th...

Find SimilarView on arXiv

Diffusion Models for Non-autoregressive Text Generation: A Survey

March 12, 2023

91% Match

Yifan Li, Kun Zhou, ... , Wen Ji-Rong

Computation and Language

Non-autoregressive (NAR) text generation has attracted much attention in the field of natural language processing, which greatly reduces the inference latency but has to sacrifice the generation accuracy. Recently, diffusion models, a class of latent variable generative models, have been introduced into NAR text generation, showing an improved text generation quality. In this survey, we review the recent progress in diffusion models for NAR text generation. As the background,...

Find SimilarView on arXiv

DiffusER: Discrete Diffusion via Edit-based Reconstruction

October 30, 2022

91% Match

Machel Reid, Vincent J. Hellendoorn, Graham Neubig

Computation and Language

Machine Learning

In text generation, models that generate text from scratch one token at a time are currently the dominant paradigm. Despite being performant, these models lack the ability to revise existing text, which limits their usability in many practical scenarios. We look to address this, with DiffusER (Diffusion via Edit-based Reconstruction), a new edit-based generative model for text based on denoising diffusion models -- a class of models that use a Markov chain of denoising steps ...

Find SimilarView on arXiv

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

October 17, 2022

91% Match

Shansan Gong, Mukai Li, Jiangtao Feng, ... , Kong Lingpeng

Computation and Language

Machine Learning

Recently, diffusion models have emerged as a new paradigm for generative models. Despite the success in domains using continuous signals such as vision and audio, adapting diffusion models to natural language is under-explored due to the discrete nature of texts, especially for conditional generation. We tackle this challenge by proposing DiffuSeq: a diffusion model designed for sequence-to-sequence (Seq2Seq) text generation tasks. Upon extensive evaluation over a wide range ...

Find SimilarView on arXiv

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control

October 31, 2022

91% Match

Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov

Computation and Language

Machine Learning

Despite the growing success of diffusion models in continuous-valued domains (e.g., images), similar efforts for discrete domains such as text have yet to match the performance of autoregressive language models. In this work, we present SSD-LM -- a diffusion-based language model with two key design choices. First, SSD-LM is semi-autoregressive, iteratively generating blocks of text, allowing for flexible output length at decoding time while enabling local bidirectional contex...

Find SimilarView on arXiv

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise

Enforcing Paraphrase Generation via Controllable Latent Diffusion

Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models

Latent Diffusion for Language Generation

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling

Diffusion Models for Non-autoregressive Text Generation: A Survey

DiffusER: Discrete Diffusion via Edit-based Reconstruction

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control