Joint Structured Pruning and Dense Knowledge Distillation for Efficient Transformer Model Compression

Publication date: Available online 27 May 2021
Source: Neurocomputing
Author(s): Baiyun Cui, Yingming Li, Zhongfei Zhang
Source: Neurocomputing - Category: Neuroscience - Source Type: research