Planning Actions, Not Predicting Tokens

Post Details

Company

AI21 Labs

Date Published

March 10, 2025

Author

AI21

Word Count

2,470

Company Posts That Month

6

Language

English

Hacker News Points

-

Source URL

www.ai21.com/blog/ai-reasoning-planning-vs-predicting

Summary

AI researchers are exploring more effective methods for reasoning in artificial intelligence beyond the traditional large language models (LLMs), which often focus on predicting the next token in a sequence. The emerging approach, called Large Reasoning Models (LRMs), incorporates intermediate "thinking" tokens and uses reinforcement learning to optimize for correct outcomes, but faces challenges such as inefficiency and lack of robust generalization. Current LRMs struggle with transparency, control, and deterministic outputs, which are critical for enterprise applications. An alternative method involves planning in the space of actions, allowing systematic exploration of action sequences rather than just token sequences, employing decision-theoretic planning, and integrating human users throughout the process. AI21, for instance, is developing a planning and orchestration system called Maestro to enhance the performance of LLMs by combining them with verifiers and employing parallelism, ultimately aiming for more reliable and adaptable AI systems.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	24	4,855	541	180	+51%
Real-time	1	4,629	997	226	+44%
Reinforcement learning	1	217	54	34	+41%