AlignBench: Benchmarking Chinese Alignment of Large Language Models
The article introduces ALIGN BENCH, a comprehensive benchmark for evaluating alignment in Large Language Models (LLMs) for the Chinese language. The benchmark utilizes a human-in-the-loop data curation pipeline and includes…
Continue reading