Transformer Neural Autoregressive Flows

The paper presents Transformer Neural Autoregressive Flows (T-NAFs), a novel solution to challenges in Normalizing Flows (NFs). NFs are a sequence of invertible transformations used in machine learning for density estimation. The most efficient members of the NF family are Neural Autoregressive Flows (NAFs) and Block Neural Autoregressive Flows (B-NAFs), but they face scalability issues and training instability. T-NAFs use transformers to treat each dimension of a random variable as a separate input token. The experimental results demonstrate that T-NAFs consistently match or outperform NAFs and B-NAFs across multiple datasets. They achieve these results using significantly fewer parameters than previous approaches.

Publication date: 3 Jan 2024
Project Page: https://arxiv.org/abs/2401.01855
Paper: https://arxiv.org/pdf/2401.01855

Post Views: 263

Transformer Neural Autoregressive Flows

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

Optimal cross-learning for contextual bandits with unknown context distributions

The Power of Training: How Different Neural Network Setups Influence the Energy Demand

Leave a Reply Cancel reply

Please allow ads on our site