Title | Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation For Pretrained Models |
Publication Type | Journal Article |
Year of Publication | 2019 |
Authors | Liu, L., H. Wang, J. Lin, R. Socher, and C. Xiong |
Journal | ArXiv |
Volume | abs/1911.03588 |
URL | http://arxiv.org/abs/1911.03588 |