Limitations of Agents Simulated by Predictive Models

The article focuses on the limitations of predictive models when adapted into agent-like systems. Two structural reasons for failure are discussed: auto-suggestive delusions and predictor-policy incoherence. Auto-suggestive delusions refer to models failing to imitate agents that generated the training data if the agents relied on hidden observations. Predictor-policy incoherence refers to the issue where models choose actions as if they expect future actions to be suboptimal, leading to overly conservative decisions. The authors propose a solution in the form of a feedback loop from the environment, where the models are re-trained on their own actions.

Publication date: 9 Feb 2024
Project Page: Not Provided
Paper: https://arxiv.org/pdf/2402.05829

Post Views: 254

Limitations of Agents Simulated by Predictive Models

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Leave a Reply Cancel reply

Please allow ads on our site