MicroBERT: Distilling MoE-Based Knowledge from BERT into a Lighter Model

Dashun Zheng, Jiaxuan Li, Yunchu Yang, Yapeng Wang, Patrick Cheong Iao Pang

Research output: Contribution to journalArticlepeer-review

Fingerprint

Dive into the research topics of 'MicroBERT: Distilling MoE-Based Knowledge from BERT into a Lighter Model'. Together they form a unique fingerprint.

Computer Science