ReGATE: Learning Faster and Better with Fewer Tokens in MLLMs

Chaoyu Li; Yogesh Kulkarni; Pooyan Fazli

ReGATE: Learning Faster and Better with Fewer Tokens in MLLMs

Chaoyu Li, Yogesh Kulkarni, Pooyan Fazli

Abstract

The computational cost of training multimodal large language models (MLLMs) grows rapidly with the number of processed tokens. Existing efficiency methods mainly target inference via token reduction or merging, offering limited benefits during training. We introduce ReGATE (**Re**ference-**G**uided **A**daptive **T**oken **E**lision), an adaptive token pruning method for accelerating MLLM training. ReGATE adopts a teacher-student framework, in which a frozen teacher LLM provides per-token guidance losses that are fused with an exponential moving average of the student’s difficulty estimates. This adaptive scoring mechanism dynamically selects informative tokens while skipping redundant ones in the forward pass, substantially reducing computation without altering the model architecture. Across three representative MLLMs, ReGATE matches the peak accuracy of standard training on MVBench up to **2 × faster**, using only **38%** of the tokens. With extended training, it even surpasses the baseline across multiple multimodal benchmarks, cutting total token usage by over **41%**. Code and models will be released publicly.

Anthology ID:: 2026.acl-long.2154
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 46430–46446
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.2154/
DOI:
Bibkey:
Cite (ACL):: Chaoyu Li, Yogesh Kulkarni, and Pooyan Fazli. 2026. ReGATE: Learning Faster and Better with Fewer Tokens in MLLMs. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 46430–46446, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ReGATE: Learning Faster and Better with Fewer Tokens in MLLMs (Li et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.2154.pdf
Checklist:: 2026.acl-long.2154.checklist.pdf

PDF Cite Search Checklist Fix data