transformer-based models

Hydragen: High-Throughput LLM Inference with Shared Prefixes

root February 8, 2024

This research presents Hydragen, a hardware-aware exact implementation of attention that improves the efficiency of transformer-based large language models (LLMs) working with shared prefixes. It is common for LLMs to perform inference on…

Computation and Language, Machine Learning

Benefits of Transformer: In-Context Learning in Linear Regression Tasks with Unstructured Data

root February 2, 2024

The paper explores the benefits of transformer-based models’ ability to learn in context from unstructured data during linear regression tasks. It conducts experiments to study the architecture of transformers and…

Computation and Language

Mavericks at ArAIEval Shared Task: Towards a Safer Digital Space — Transformer Ensemble Models Tackling Deception and Persuasion

root December 2, 2023

The paper outlines an approach for the ‘Arabic AI Tasks Evaluation (ArAIEval) Shared Task 2023’, focusing on the detection of disinformation and persuasion techniques. Using transformer-based models fine-tuned on Arabic language,…

Computation and Language

Mavericks at ArAIEval Shared Task: Towards a Safer Digital Space — Transformer Ensemble Models Tackling Deception and Persuasion

root December 1, 2023

This paper presents the approach for the ‘Arabic AI Tasks Evaluation (ArAIEval) Shared Task 2023’, focusing on persuasion technique detection and disinformation detection. The authors experiment with several transformer-based models…

Machine Learning

Secure short-term load forecasting for smart grids with transformer-based federated learning

root October 27, 2023

The article explores the use of federated learning in electricity load forecasting. The authors propose a novel transformer-based deep learning approach that improves data privacy by training models locally on…
