A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity
The article focuses on the mechanisms by which alignment algorithms, particularly Direct Preference Optimization (DPO), reduce toxicity in language models such as GPT-2. The researchers first study how…
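For reference, DPO fine-tunes a policy directly on preference pairs, without training a separate reward model. A standard statement of its objective (following Rafailov et al., 2023; the notation below is illustrative and not taken from the article) is:

\[
\mathcal{L}_{\mathrm{DPO}}(\pi_\theta;\, \pi_{\mathrm{ref}})
= -\,\mathbb{E}_{(x,\, y_w,\, y_l)\sim\mathcal{D}}
\left[
\log \sigma\!\left(
\beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
\;-\;
\beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}
\right)
\right]
\]

Here \(y_w\) is the preferred completion (e.g., non-toxic), \(y_l\) the dispreferred (toxic) one, \(\pi_{\mathrm{ref}}\) a frozen reference model, \(\sigma\) the logistic function, and \(\beta\) a hyperparameter controlling how far the policy may drift from the reference.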