Tool Graph Papers - BytesArchive

Artificial Intelligence Computation and Language

TaskBench: Benchmarking Large Language Models for Task Automation

root December 1, 2023 0

The article introduces TaskBench, a benchmark for evaluating the capabilities of large language models (LLMs) in task automation. Task automation, which decomposes complex tasks into sub-tasks and invokes external tools…

Press ESC to close

Tool Graph

Please allow ads on our site