Ligong Han
2025
Hopscotch: Discovering and Skipping Redundancies in Language Models
Mustafa Eyceoz
|
Nikhil Shivakumar Nayak
|
Hao Wang
|
Ligong Han
|
Akash Srivastava
Findings of the Association for Computational Linguistics: EMNLP 2025
Modern causal language models stack many attention blocks to improve performance, but not all blocks are necessary for every task. We propose Hopscotch, a simple yet effective method that identifies and skips attention blocks with least contributions to a task and adapts to preserve output quality. Hopscotch jointly optimizes which blocks to skip and how to scale the outputs of the remaining layers. By introducing lightweight, trainable scaling parameters to attention and MLP blocks, it mitigates distribution shifts in hidden states caused by removing attention blocks. Hopscotch does not modify model weights or require access to pretraining or instruction-tuning data, and is compatible with existing model compression techniques. When applied to Llama-3.1-8B and Qwen-2.5-7B, Hopscotch achieves less than a 2% drop in performance even after skipping four attention blocks.
2024
Diffusion Models for Sign Language Video Anonymization
Zhaoyang Xia
|
Yang Zhou
|
Ligong Han
|
Carol Neidle
|
Dimitris N. Metaxas
Proceedings of the LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources
Search
Fix author
Co-authors
- Mustafa Eyceoz 1
- Dimitris N. Metaxas 1
- Nikhil Shivakumar Nayak 1
- Carol Neidle 1
- Akash Srivastava 1
- show all...