Emotional Images: Assessing Emotions in Images and Potential Biases in Generative Models

November 8, 2024

Maneet Mehta, Cody Buntain

Computer Science

Computers and Society

Computer Vision and Pattern ...

This paper examines potential biases and inconsistencies in the emotions evoked by images produced by generative artificial intelligence (AI) models, in particular a potential bias toward negative emotions. We assess this bias by comparing the emotions evoked by an AI-produced image to the emotions evoked by the prompt used to create that image. As a first step, the study evaluates three approaches for identifying emotions in images -- traditional supervised learning, zero-shot learning with vision-language models, and cross-modal auto-captioning -- using EmoSet, a large dataset of image-emotion annotations that categorizes images across eight emotional types. Results show that fine-tuned models, particularly Google's Vision Transformer (ViT), significantly outperform zero-shot and caption-based methods in recognizing emotions in images. For a cross-modality comparison, we then analyze the differences between the emotions in text prompts -- identified via existing text-based emotion-recognition models -- and the emotions evoked by the resulting images. Findings indicate that AI-generated images frequently lean toward negative emotional content, regardless of the original prompt. This emotional skew in generative models could amplify negative affective content in digital spaces, perpetuating its prevalence and impact. The study advocates for a multidisciplinary approach to better align AI emotion recognition with psychological insights and address potential biases in generative AI outputs across digital media.
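The cross-modality comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes each prompt and each generated image has already been assigned one of eight emotion labels by some classifier, and it assumes those labels split into four positive and four negative categories. The function name `negative_shift`, the label sets, and the toy pairs are all hypothetical, chosen only for illustration.

```python
# Assumption: eight emotion labels, grouped into positive and negative valence.
# The grouping below is an illustrative assumption, not taken from the paper.
POSITIVE = {"amusement", "awe", "contentment", "excitement"}
NEGATIVE = {"anger", "disgust", "fear", "sadness"}


def valence(label):
    """Map an emotion label to 'positive' or 'negative'."""
    if label in POSITIVE:
        return "positive"
    if label in NEGATIVE:
        return "negative"
    raise ValueError(f"unknown emotion label: {label!r}")


def negative_shift(pairs):
    """Summarize how often images are more negative than their prompts.

    `pairs` is an iterable of (prompt_emotion, image_emotion) labels,
    one pair per generated image (hypothetical input format).
    """
    pairs = list(pairs)
    if not pairs:
        raise ValueError("need at least one (prompt, image) pair")
    n = len(pairs)
    prompt_neg = sum(valence(p) == "negative" for p, _ in pairs)
    image_neg = sum(valence(i) == "negative" for _, i in pairs)
    flipped = sum(
        valence(p) == "positive" and valence(i) == "negative"
        for p, i in pairs
    )
    return {
        "prompt_negative_rate": prompt_neg / n,
        "image_negative_rate": image_neg / n,
        "positive_to_negative_rate": flipped / n,
    }


# Toy, made-up labels purely to exercise the function.
toy_pairs = [
    ("awe", "fear"),
    ("amusement", "amusement"),
    ("sadness", "sadness"),
    ("contentment", "anger"),
]
summary = negative_shift(toy_pairs)
```

In this sketch, an `image_negative_rate` well above the `prompt_negative_rate` would be consistent with the kind of negative skew the abstract reports. A fuller analysis would compare complete label distributions rather than a binary valence split.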

The Good, The Bad, and Why: Unveiling Emotions in Generative AI

December 18, 2023

93% Match

Cheng Li, Jindong Wang, Yixuan Zhang, Kaijie Zhu, Xinyi Wang, Wenxin Hou, Jianxun Lian, Fang Luo, ... , Xing Xie

Artificial Intelligence

Computation and Language

Human-Computer Interaction

Emotion significantly impacts our daily behaviors and interactions. While recent generative AI models, such as large language models, have shown impressive performance in various tasks, it remains unclear whether they truly comprehend emotions. This paper aims to address this gap by incorporating psychological theories to gain a holistic understanding of emotions in generative AI models. Specifically, we propose three approaches: 1) EmotionPrompt to enhance AI model performan...

Affect-Conditioned Image Generation

February 20, 2023

93% Match

Francisco Ibarrola, Rohan Lulham, Kazjon Grace

Artificial Intelligence

Human-Computer Interaction

In creativity support and computational co-creativity contexts, the task of discovering appropriate prompts for use with text-to-image generative models remains difficult. In many cases the creator wishes to evoke a certain impression with the image, but the task of conferring that succinctly in a text prompt poses a challenge: affective language is nuanced, complex, and model-specific. In this work we introduce a method for generating images conditioned on desired affect, qu...

MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis

November 18, 2024

92% Match

Yingjie Zhou, Zicheng Zhang, Jiezhang Cao, Jun Jia, Yanwei Jiang, Farong Wen, Xiaohong Liu, ... , Guangtao Zhai

Computation and Language

Artificial Intelligence

Artificial Intelligence (AI) has demonstrated significant capabilities in various fields, and in areas such as human-computer interaction (HCI), embodied intelligence, and the design and animation of virtual digital humans, both practitioners and users are increasingly concerned with AI's ability to understand and express emotion. Consequently, the question of whether AI can accurately interpret human emotions remains a critical challenge. To date, two primary classes of AI m...

EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models

January 9, 2024

92% Match

Jingyuan Yang, Jiawei Feng, Hui Huang

Computer Vision and Pattern ...

Recent years have witnessed remarkable progress in image generation, where users can create visually astonishing, high-quality images. However, existing text-to-image diffusion models are proficient in generating concrete concepts (dogs) but encounter challenges with more abstract ones (emotions). Several efforts have been made to modify image emotions with color and style adjustments, but they are limited in how effectively they can convey emotions with fixed image contents. In...

EmoAttack: Emotion-to-Image Diffusion Models for Emotional Backdoor Generation

June 22, 2024

92% Match

Tianyu Wei, Shanmin Pang, Qi Guo, ... , Qing Guo

Computer Vision and Pattern ...

Text-to-image diffusion models can create realistic images based on input texts. Users can describe an object to convey their opinions visually. In this work, we unveil a previously unrecognized and latent risk of using diffusion models to generate images; we utilize emotion in the input texts to introduce negative contents, potentially eliciting unfavorable emotions in users. Emotions play a crucial role in expressing personal opinions in our daily interactions, and the incl...

Bias in Emotion Recognition with ChatGPT

October 18, 2023

91% Match

Naoki Wake, Atsushi Kanehira, Kazuhiro Sasabuchi, ... , Katsushi Ikeuchi

Robotics

Computation and Language

This technical report explores ChatGPT's ability to recognize emotions from text, which can be the basis of various applications like interactive chatbots, data annotation, and mental health analysis. While prior research has shown ChatGPT's basic ability in sentiment analysis, its performance in more nuanced emotion recognition has not yet been explored. Here, we conducted experiments to evaluate its emotion-recognition performance across different datasets and emotion l...

Evaluating Vision-Language Models for Emotion Recognition

February 8, 2025

91% Match

Sree Bhattacharyya, James Z. Wang

Computer Vision and Pattern ...

Computation and Language

Large Vision-Language Models (VLMs) have achieved unprecedented success in several objective multimodal reasoning tasks. However, to further enhance their capabilities of empathetic and effective communication with humans, improving how VLMs process and understand emotions is crucial. Despite significant research attention on improving affective understanding, there is a lack of detailed evaluations of VLMs for emotion-related tasks, which can potentially help inform downstre...

AffectGAN: Affect-Based Generative Art Driven by Semantics

September 30, 2021

91% Match

Theodoros Galanos, Antonios Liapis, Georgios N. Yannakakis

Computer Vision and Pattern ...

Machine Learning

This paper introduces a novel method for generating artistic images that express particular affective states. Leveraging state-of-the-art deep learning methods for visual generation (through generative adversarial networks), semantic models from OpenAI, and the annotated dataset of the visual art encyclopedia WikiArt, our AffectGAN model is able to generate images based on specific or broad semantic prompts and intended affective outcomes. A small dataset of 32 images generat...

EmoEdit: Evoking Emotions through Image Manipulation

May 21, 2024

91% Match

Jingyuan Yang, Jiawei Feng, Weibin Luo, Dani Lischinski, ... , Hui Huang

Computer Vision and Pattern ...

Affective Image Manipulation (AIM) seeks to modify user-provided images to evoke specific emotional responses. This task is inherently complex due to its twofold objective: significantly evoking the intended emotion, while preserving the original image composition. Existing AIM methods primarily adjust color and style, often failing to elicit precise and profound emotional shifts. Drawing on psychological insights, we extend AIM by incorporating content modifications to enhan...

A Sociotechnical Lens for Evaluating Computer Vision Models: A Case Study on Detecting and Reasoning about Gender and Emotion

June 12, 2024

91% Match

Sha Luo, Sang Jung Kim, ... , Kaiping Chen

Computer Vision and Pattern ...

Artificial Intelligence

Computers and Society

Human-Computer Interaction

In the evolving landscape of computer vision (CV) technologies, the automatic detection and interpretation of gender and emotion in images is a critical area of study. This paper investigates social biases in CV models, emphasizing the limitations of traditional evaluation metrics such as precision, recall, and accuracy. These metrics often fall short in capturing the complexities of gender and emotion, which are fluid and culturally nuanced constructs. Our study proposes a s...
