Eureka: Human-Level Reward Design via Coding Large Language Models
This article presents EUREKA, a human-level reward design algorithm powered by Large Language Models (LLMs). It employs LLMs’ zero-shot generation and code-writing abilities to perform evolutionary optimization over reward code….
Continue reading