CyberMetric: A Benchmark Dataset for Evaluating Large Language Models Knowledge in Cybersecurity
The paper presents ‘CyberMetric’, a benchmark dataset composed of 10,000 questions from various cybersecurity sources. The dataset’s purpose is to assess and compare the knowledge of large language models (LLMs),…
Continue reading