MicroBERT: Distilling MoE-Based Knowledge from BERT into a Lighter Model

Dashun Zheng, Jiaxuan Li, Yunchu Yang, Yapeng Wang, Patrick Cheong Iao Pang

Research output: Contribution to journal, Article, peer-review
