From Explicit CoT to Implicit CoT: Learn...

Implicit Chain of Thought Reasoning via Knowledge Distillation

November 2, 2023

93% Match

Yuntian Deng, Kiran Prasad, Roland Fernandez, Paul Smolensky, ... , Shieber Stuart

Computation and Language

Artificial Intelligence

Machine Learning

To augment language models with the ability to reason, researchers usually prompt or finetune them to produce chain of thought reasoning steps before producing the final answer. However, although people use natural language to reason effectively, it may be that LMs could reason more effectively with some intermediate computation that is not in natural language. In this work, we explore an alternative reasoning approach: instead of explicitly producing the chain of thought rea...

Find SimilarView on arXiv

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

January 28, 2022

92% Match

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, ... , Zhou Denny

Computation and Language

Artificial Intelligence

We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain of thought prompting, where a few chain of thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that...

Find SimilarView on arXiv

Think Beyond Size: Adaptive Prompting for More Effective Reasoning

October 10, 2024

92% Match

Kamesh R

Machine Learning

Computation and Language

Pretrained large language models (LLMs) are increasingly utilized across a wide range of natural language processing (NLP) tasks due to their impressive capabilities as few-shot learners. Recent techniques, such as chain-of-thought (CoT) prompting, have significantly advanced multi-step reasoning by introducing step-by-step decomposition, achieving state-of-the-art results on complex reasoning benchmarks. However, these approaches often rely on static prompting templates that...

Find SimilarView on arXiv

Do LLMs Really Think Step-by-step In Implicit Reasoning?

November 24, 2024

92% Match

Yijiong Yu

Computation and Language

Artificial Intelligence

It has been well-known that Chain-of-Thought can remarkably enhance LLMs' performance on complex tasks. However, because it also introduces slower inference speeds and higher computational costs, many researches have attempted to use implicit CoT, which does not need LLMs to explicitly generate the intermediate steps. However, the invisible reasoning process leaves us a doubt that, can implicit CoT really be equal to explicit CoT? Therefore, in this study, we address this que...

Find SimilarView on arXiv

Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding

September 13, 2024

92% Match

Tianqiao Liu, Zui Chen, Zitao Liu, ... , Luo Weiqi

Computation and Language

Artificial Intelligence

Large language models (LLMs) have demonstrated remarkable capabilities in tasks requiring reasoning and multi-step problem-solving through the use of chain-of-thought (CoT) prompting. However, generating the full CoT process results in significantly longer output sequences, leading to increased computational costs and latency during inference. To address this challenge, we propose a novel approach to compress the CoT process through semantic alignment, enabling more efficient...

Find SimilarView on arXiv

Distilling Reasoning Capabilities into Smaller Language Models

December 1, 2022

92% Match

Kumar Shridhar, Alessandro Stolfo, Mrinmaya Sachan

Machine Learning

Computation and Language

Step-by-step reasoning approaches like chain of thought (CoT) have proved to be very effective in inducing reasoning capabilities in large language models. However, the success of the CoT approach is fundamentally tied to the model size, and billion parameter-scale models are often needed to get CoT to work. In this paper, we propose a knowledge distillation approach that leverages the step-by-step CoT reasoning capabilities of larger models and distills these abilities into ...

Find SimilarView on arXiv

Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models

February 18, 2025

92% Match

Yingqian Cui, Pengfei He, Jingying Zeng, Hui Liu, Xianfeng Tang, Zhenwei Dai, Yan Han, Chen Luo, Jing Huang, Zhen Li, Suhang Wang, Yue Xing, ... , He Qi

Computation and Language

Artificial Intelligence

Machine Learning

Chain-of-Thought (CoT) reasoning, which breaks down complex tasks into intermediate reasoning steps, has significantly enhanced the performance of large language models (LLMs) on challenging tasks. However, the detailed reasoning process in CoT often incurs long generation times and high computational costs, partly due to the inclusion of unnecessary steps. To address this, we propose a method to identify critical reasoning steps using perplexity as a measure of their importa...

Find SimilarView on arXiv

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

October 8, 2023

92% Match

Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, ... , Wang Baoyuan

Computation and Language

Artificial Intelligence

Chain-of-Thought (CoT) prompting has proven to be effective in enhancing the reasoning capabilities of Large Language Models (LLMs) with at least 100 billion parameters. However, it is ineffective or even detrimental when applied to reasoning tasks in Smaller Language Models (SLMs) with less than 10 billion parameters. To address this limitation, we introduce Dialogue-guided Chain-of-Thought (DialCoT) which employs a dialogue format to generate intermediate reasoning steps, g...

Find SimilarView on arXiv

CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation

February 28, 2025

91% Match

Zhenyi Shen, Hanqi Yan, Linhai Zhang, Zhanghao Hu, ... , He Yulan

Computation and Language

Chain-of-Thought (CoT) enhances Large Language Models (LLMs) by enabling step-by-step reasoning in natural language. However, the language space may be suboptimal for reasoning. While implicit CoT methods attempt to enable reasoning without explicit CoT tokens, they have consistently lagged behind explicit CoT method in task performance. We propose CODI (Continuous Chain-of-Thought via Self-Distillation), a novel framework that distills CoT into a continuous space, where a sh...

Find SimilarView on arXiv

Think-to-Talk or Talk-to-Think? When LLMs Come Up with an Answer in Multi-Step Reasoning

December 2, 2024

91% Match

Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Ana Brassard, ... , Inui Kentaro

Computation and Language

This study investigates the internal reasoning mechanism of language models during symbolic multi-step reasoning, motivated by the question of whether chain-of-thought (CoT) outputs are faithful to the model's internals. Specifically, we inspect when they internally determine their answers, particularly before or after CoT begins, to determine whether models follow a post-hoc "think-to-talk" mode or a step-by-step "talk-to-think" mode of explanation. Through causal probing ex...

Find SimilarView on arXiv

From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

Implicit Chain of Thought Reasoning via Knowledge Distillation

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Think Beyond Size: Adaptive Prompting for More Effective Reasoning

Do LLMs Really Think Step-by-step In Implicit Reasoning?

Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding

Distilling Reasoning Capabilities into Smaller Language Models

Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation

Think-to-Talk or Talk-to-Think? When LLMs Come Up with an Answer in Multi-Step Reasoning