Analysis of Titles and Readers For Title...

DTATG: An Automatic Title Generator based on Dependency Trees

October 1, 2017

89% Match

Liqun Shao, Jie Wang

Information Retrieval

Computation and Language

We study automatic title generation for a given block of text and present a method called DTATG to generate titles. DTATG first extracts a small number of central sentences that convey the main meanings of the text and are in a suitable structure for conversion into a title. DTATG then constructs a dependency tree for each of these sentences and removes certain branches using a Dependency Tree Compression Model we devise. We also devise a title test to determine if a sentence...

Find SimilarView on arXiv

Development of an Extractive Title Generation System Using Titles of Papers of Top Conferences for Intermediate English Students

October 8, 2021

88% Match

Kento Kaku, Masato Kikuchi, ... , Shintani Toramatsu

Computation and Language

The formulation of good academic paper titles in English is challenging for intermediate English authors (particularly students). This is because such authors are not aware of the type of titles that are generally in use. We aim to realize a support system for formulating more effective English titles for intermediate English and beginner authors. This study develops an extractive title generation system that formulates titles from keywords extracted from an abstract. Moreove...

Find SimilarView on arXiv

Examining the Role of Clickbait Headlines to Engage Readers with Reliable Health-related Information

November 25, 2019

88% Match

Sima Bhowmik, Md Main Uddin Rony, Md Mahfuzul Haque, ... , Hassan Naeemul

Computation and Language

Computers and Society

Information Retrieval

Clickbait headlines are frequently used to attract readers to read articles. Although this headline type has turned out to be a technique to engage readers with misleading items, it is still unknown whether the technique can be used to attract readers to reliable pieces. This study takes the opportunity to test its efficacy to engage readers with reliable health articles. A set of online surveys would be conducted to test readers' engagement with and perception about clickbai...

Find SimilarView on arXiv

Headline Generation: Learning from Decomposable Document Titles

April 17, 2019

88% Match

Oleg Vasilyev, Tom Grek, John Bohannon

Computation and Language

We propose a novel method for generating titles for unstructured text documents. We reframe the problem as a sequential question-answering task. A deep neural network is trained on document-title pairs with decomposable titles, meaning that the vocabulary of the title is a subset of the vocabulary of the document. To train the model we use a corpus of millions of publicly available document-title pairs: news articles and headlines. We present the results of a randomized doubl...

Find SimilarView on arXiv

Domain Controlled Title Generation with Human Evaluation

March 8, 2021

87% Match

Abdul Waheed, Muskan Goyal, ... , Gupta Deepak

Computation and Language

We study automatic title generation and present a method for generating domain-controlled titles for scientific articles. A good title allows you to get the attention that your research deserves. A title can be interpreted as a high-compression description of a document containing information on the implemented process. For domain-controlled titles, we used the pre-trained text-to-text transformer model and the additional token technique. Title tokens are sampled from a local...

Find SimilarView on arXiv

Systematic improvement of user engagement with academic titles using computational linguistics

June 23, 2019

87% Match

Nim Dvir, Ruti Gafni

Computation and Language

Digital Libraries

Human-Computer Interaction

This paper describes a novel approach to systematically improve information interactions based solely on its wording. Following an interdisciplinary literature review, we recognized three key attributes of words that drive user engagement: (1) Novelty (2) Familiarity (3) Emotionality. Based on these attributes, we developed a model to systematically improve a given content using computational linguistics, natural language processing (NLP) and text analysis (word frequency, se...

Find SimilarView on arXiv

Improving Scientific Article Visibility by Neural Title Simplification

April 5, 2019

87% Match

Alexander Shvets

Information Retrieval

Computation and Language

Machine Learning

The rapidly growing amount of data that scientific content providers should deliver to a user makes them create effective recommendation tools. A title of an article is often the only shown element to attract people's attention. We offer an approach to automatic generating titles with various levels of informativeness to benefit from different categories of users. Statistics from ResearchGate used to bias train datasets and specially designed post-processing step applied to n...

Find SimilarView on arXiv

KERT: Automatic Extraction and Ranking of Topical Keyphrases from Content-Representative Document Titles

June 3, 2013

86% Match

Marina Danilevsky, Chi Wang, Nihit Desai, ... , Han Jiawei

Machine Learning

Information Retrieval

We introduce KERT (Keyphrase Extraction and Ranking by Topic), a framework for topical keyphrase generation and ranking. By shifting from the unigram-centric traditional methods of unsupervised keyphrase extraction to a phrase-centric approach, we are able to directly compare and rank phrases of different lengths. We construct a topical keyphrase ranking function which implements the four criteria that represent high quality topical keyphrases (coverage, purity, phraseness, a...

Find SimilarView on arXiv

RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

May 21, 2024

86% Match

Yuelyu Ji, Zhuochun Li, Rui Meng, Sonish Sivarajkumar, Yanshan Wang, Zeshui Yu, Hui Ji, Yushui Han, ... , He Daqing

Computation and Language

This paper introduces the RAG-RLRC-LaySum framework, designed to make complex biomedical research understandable to laymen through advanced Natural Language Processing (NLP) techniques. Our Retrieval Augmented Generation (RAG) solution, enhanced by a reranking method, utilizes multiple knowledge sources to ensure the precision and pertinence of lay summaries. Additionally, our Reinforcement Learning for Readability Control (RLRC) strategy improves readability, making scientif...

Find SimilarView on arXiv

Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation

January 21, 2025

86% Match

Junhong Lian, Xiang Ao, Xinyu Liu, ... , He Qing

Computation and Language

Artificial Intelligence

Personalized news headline generation aims to provide users with attention-grabbing headlines that are tailored to their preferences. Prevailing methods focus on user-oriented content preferences, but most of them overlook the fact that diverse stylistic preferences are integral to users' panoramic interests, leading to suboptimal personalization. In view of this, we propose a novel Stylistic-Content Aware Personalized Headline Generation (SCAPE) framework. SCAPE extracts bot...

Find SimilarView on arXiv

Analysis of Titles and Readers For Title Generation Centered on the Readers

DTATG: An Automatic Title Generator based on Dependency Trees

Development of an Extractive Title Generation System Using Titles of Papers of Top Conferences for Intermediate English Students

Examining the Role of Clickbait Headlines to Engage Readers with Reliable Health-related Information

Headline Generation: Learning from Decomposable Document Titles

Domain Controlled Title Generation with Human Evaluation

Systematic improvement of user engagement with academic titles using computational linguistics

Improving Scientific Article Visibility by Neural Title Simplification

KERT: Automatic Extraction and Ranking of Topical Keyphrases from Content-Representative Document Titles

RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation