Backslash: Rate Constrained Optimized Training of Large Language Models

The rapid advancement of large-language models (LLMs) has driven extensive
research into parameter compression after training has been completed, yet
compression during the training phase remains largely unexplored. In this work,
we introduce Rate-Constrained Training (Backslash), a novel training-time
compression approach based on rate-distortion optimization (RDO). Backslash
enables a flexible trade-off between model accuracy and complexity,
significantly reducing parameter redundancy while preserving performance.
Experiments in various architectures and tasks demonstrate that Backslash can
reduce memory usage by 60\% – 90\% without accuracy loss and provides
significant compression gain compared to compression after training. Moreover,
Backslash proves to be highly versatile: it enhances generalization with small
Lagrange multipliers, improves model robustness to pruning (maintaining
accuracy even at 80\% pruning rates), and enables network simplification for
accelerated inference on edge devices.

Este artículo explora los viajes en el tiempo y sus implicaciones.

Descargar PDF:

2504.16968v1

Backslash: Rate Constrained Optimized Training of Large Language Models

Online Platform

Links

Verbalus Mater

Backslash: Rate Constrained Optimized Training of Large Language Models

Backslash: Rate Constrained Optimized Training of Large Language Models

Online Platform

Links

Verbalus Mater

Sign in

Sign up

— NEXT ONLINE COURSE STARTS 15 JANUARY —

The Real Science Behind Time Travel 25% DTO

The Real Science Behind
Time Travel
25% DTO