Universal Jailbreak Backdoors from Poisoned Human Feedback
This research paper explores the risk of 'jailbreak backdoors' in large language models trained with Reinforcement Learning from Human Feedback (RLHF). It shows that a malicious actor could poison the human feedback data used for RLHF so that a secret trigger word, when included in any prompt, unlocks harmful behavior and acts as a universal jailbreak.