The paper introduces ‘Superfiltering’, a weak-to-strong method for data filtering in instruction tuning of Large Language Models (LLMs). Instruction tuning improves LLMs, but its datasets often contain low-quality and redundant samples, and conventional filtering relies on strong (and expensive) models to score the data, adding cost and computation. Superfiltering instead uses a smaller, weaker model to select the data for fine-tuning a larger, stronger model. Despite the performance gap between the two models, the weaker one perceives instruction difficulty consistently with the stronger one, making it an effective and cheap data selector. This speeds up data filtering considerably while improving the performance of the fine-tuned LLM, as validated through extensive experiments.
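For intuition, below is a minimal sketch of this weak-to-strong scoring, not the authors' reference implementation. The paper quantifies instruction difficulty with the Instruction-Following Difficulty (IFD) score, which Superfiltering computes with a small model such as GPT-2 instead of the LLM being tuned; the code approximates this with Hugging Face `transformers`, and the helper names and the 15% selection ratio are illustrative assumptions.

```python
# Illustrative sketch: scoring instruction-response pairs with a weak model
# (GPT-2) via an IFD-style ratio, in the spirit of Superfiltering.
# Helper names and the selection threshold are assumptions, not the paper's code.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

@torch.no_grad()
def avg_nll(context: str, target: str) -> float:
    """Average negative log-likelihood of `target` tokens, given `context`."""
    target_ids = tokenizer(target, return_tensors="pt").input_ids
    if context:
        context_ids = tokenizer(context, return_tensors="pt").input_ids
        input_ids = torch.cat([context_ids, target_ids], dim=1)
        n_ctx = context_ids.shape[1]
    else:
        input_ids, n_ctx = target_ids, 0
    labels = input_ids.clone()
    labels[:, :n_ctx] = -100  # compute loss only over the target span
    return model(input_ids, labels=labels).loss.item()

def ifd_score(instruction: str, response: str) -> float:
    """IFD-style ratio: how little the instruction helps the model predict
    the response. Higher = harder = more informative for tuning."""
    return avg_nll(instruction, response) / avg_nll("", response)

# Rank a dataset with the weak model and keep the hardest examples
# (the 15% kept fraction here is an arbitrary placeholder).
data = [("Translate 'good morning' to French.", "It means 'Bonjour'.")]
ranked = sorted(data, key=lambda pair: ifd_score(*pair), reverse=True)
selected = ranked[: max(1, int(0.15 * len(ranked)))]
```

The key empirical point reported by the paper is that difficulty rankings from the small model agree closely with those from much larger models, so the cheap scorer preserves the selection quality of the expensive one.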

Publication date: 1 Feb 2024
Project Page: https://github.com/tianyi-lab/Superfiltering
Paper: https://arxiv.org/pdf/2402.00530