ID: cs/0207003

Analysis of Titles and Readers For Title Generation Centered on the Readers

July 2, 2002

View on ArXiv

Similar papers 4

Learning to Determine the Quality of News Headlines

November 26, 2019

84% Match
Amin Omidvar, Hossein Poormodheji, ... , Edall Gordon
Information Retrieval
Computation and Language
Machine Learning

Today, most newsreaders read the online version of news articles rather than traditional paper-based newspapers. Also, news media publishers rely heavily on the income generated from subscriptions and website visits made by newsreaders. Thus, online user engagement is a very important issue for online newspapers. Much effort has been spent on writing interesting headlines to catch the attention of online users. On the other hand, headlines should not be misleading (e.g., clic...

Find SimilarView on arXiv

The Style-Content Duality of Attractiveness: Learning to Write Eye-Catching Headlines via Disentanglement

December 14, 2020

84% Match
Mingzhe Li, Xiuying Chen, Min Yang, Shen Gao, ... , Yan Rui
Computation and Language

Eye-catching headlines function as the first device to trigger more clicks, bringing reciprocal effect between producers and viewers. Producers can obtain more traffic and profits, and readers can have access to outstanding articles. When generating attractive headlines, it is important to not only capture the attractive content but also follow an eye-catching written style. In this paper, we propose a Disentanglement-based Attractive Headline Generator (DAHG) that generates ...

Find SimilarView on arXiv

TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation

June 2, 2022

84% Match
Sajad Sotudeh, Nazli Goharian
Computation and Language

Many scientific papers such as those in arXiv and PubMed data collections have abstracts with varying lengths of 50-1000 words and average length of approximately 200 words, where longer abstracts typically convey more information about the source paper. Up to recently, scientific summarization research has typically focused on generating short, abstract-like summaries following the existing datasets used for scientific summarization. In domains where the source text is relat...

Find SimilarView on arXiv

Zero-shot topic generation

April 29, 2020

84% Match
Oleg Vasilyev, Kathryn Evans, ... , Bohannon John
Computation and Language

We present an approach to generating topics using a model trained only for document title generation, with zero examples of topics given during training. We leverage features that capture the relevance of a candidate span in a document for the generation of a title for that document. The output is a weighted collection of the phrases that are most relevant for describing the document and distinguishing it within a corpus, without requiring access to the rest of the corpus. We...

Find SimilarView on arXiv

Extraction of Keyphrases from Text: Evaluation of Four Algorithms

December 8, 2002

84% Match
Peter D. National Research Council of Canada Turney
Machine Learning
Information Retrieval

This report presents an empirical evaluation of four algorithms for automatically extracting keywords and keyphrases from documents. The four algorithms are compared using five different collections of documents. For each document, we have a target set of keyphrases, which were generated by hand. The target keyphrases were generated for human readers; they were not tailored for any of the four keyphrase extraction algorithms. Each of the algorithms was evaluated by the degree...

Find SimilarView on arXiv

Marvista: Exploring the Design of a Human-AI Collaborative News Reading Tool

July 18, 2022

84% Match
Xiang 'Anthony' Chen, Chien-Sheng Wu, Lidiya Murakhovs'ka, Philippe Laban, Tong Niu, ... , Xiong Caiming
Human-Computer Interaction

We explore the design of Marvista -- a human-AI collaborative tool that employs a suite of natural language processing models to provide end-to-end support for reading online news articles. Before reading an article, Marvista helps a user plan what to read by filtering text based on how much time one can spend and what questions one is interested to find out from the article. During reading, Marvista helps the user reflect on their understanding of each paragraph with AI-gene...

Find SimilarView on arXiv

Breaking the News: First Impressions Matter on Online News

March 26, 2015

84% Match
Julio Reis, Fabrıcio Benevenuto, Melo Pedro O. S. Vaz de, Raquel Prates, ... , An Jisun
Computers and Society
Computation and Language

A growing number of people are changing the way they consume news, replacing the traditional physical newspapers and magazines by their virtual online versions or/and weblogs. The interactivity and immediacy present in online news are changing the way news are being produced and exposed by media corporations. News websites have to create effective strategies to catch people's attention and attract their clicks. In this paper we investigate possible strategies used by online n...

Find SimilarView on arXiv

SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation

March 26, 2024

84% Match
Dongqi Pu, Yifan Wang, ... , Demberg Vera
Computation and Language
Artificial Intelligence
Machine Learning

Scientific news reports serve as a bridge, adeptly translating complex research articles into reports that resonate with the broader public. The automated generation of such narratives enhances the accessibility of scholarly insights. In this paper, we present a new corpus to facilitate this paradigm development. Our corpus comprises a parallel compilation of academic publications and their corresponding scientific news reports across nine disciplines. To demonstrate the util...

Find SimilarView on arXiv

The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

March 25, 2023

84% Match
Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Zuylen Madeleine van, Lucy Lu Wang, Christopher Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, ... , Weld Daniel S.
Human-Computer Interaction
Artificial Intelligence
Computation and Language

Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has changed little in decades. The PDF format for sharing research papers is widely used...

Find SimilarView on arXiv

This Just In: Fake News Packs a Lot in Title, Uses Simpler, Repetitive Content in Text Body, More Similar to Satire than Real News

March 28, 2017

84% Match
Benjamin D. Horne, Sibel Adali
Social and Information Netwo...
Computation and Language

The problem of fake news has gained a lot of attention as it is claimed to have had a significant impact on 2016 US Presidential Elections. Fake news is not a new problem and its spread in social networks is well-studied. Often an underlying assumption in fake news discussion is that it is written to look like real news, fooling the reader who does not check for reliability of the sources or the arguments in its content. Through a unique study of three data sets and features ...

Find SimilarView on arXiv