Abstract: The paper introduces DIST, a knowledge distillation method designed to learn effectively from a stronger teacher model. DIST differs from conventional techniques by ...
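The abstract is truncated before it states how DIST departs from conventional distillation. Assuming this is the DIST of Huang et al. (NeurIPS 2022), which relaxes exact KL matching into Pearson-correlation-based relation matching between teacher and student predictions, a minimal PyTorch sketch might look as follows; the function names, temperature, and loss weights are illustrative, not the authors' reference implementation:

```python
import torch
import torch.nn.functional as F

def pearson_corr(a: torch.Tensor, b: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    # Row-wise Pearson correlation between two matrices of the same shape.
    a = a - a.mean(dim=-1, keepdim=True)
    b = b - b.mean(dim=-1, keepdim=True)
    return (a * b).sum(dim=-1) / (a.norm(dim=-1) * b.norm(dim=-1) + eps)

def dist_loss(student_logits: torch.Tensor, teacher_logits: torch.Tensor,
              tau: float = 1.0, beta: float = 1.0, gamma: float = 1.0) -> torch.Tensor:
    """Relation-matching distillation: instead of forcing the student to
    reproduce the teacher's probabilities exactly (KL), maximize the
    correlation of predictions per sample (inter-class relation) and
    per class across the batch (intra-class relation)."""
    p_s = F.softmax(student_logits / tau, dim=-1)
    p_t = F.softmax(teacher_logits / tau, dim=-1)
    inter = (1.0 - pearson_corr(p_s, p_t)).mean()          # rows: one per sample
    intra = (1.0 - pearson_corr(p_s.t(), p_t.t())).mean()  # columns: one per class
    return beta * inter + gamma * intra
```

Because correlation is invariant to shift and scale, this objective tolerates the larger teacher-student capacity gap that a stronger teacher creates, which is presumably the point the truncated sentence goes on to make.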
Abstract: Knowledge distillation transfers soft labels from a teacher to a student through a softmax function with a shared temperature. However, the assumption of a shared temperature between teacher ...
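For reference, the conventional shared-temperature formulation this abstract critiques is the classic KD loss of Hinton et al. (2015): both teacher and student logits are softened with the same temperature T before a KL-divergence match. A minimal sketch, with the function name and default temperature chosen for illustration:

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits: torch.Tensor,
            teacher_logits: torch.Tensor,
            temperature: float = 4.0) -> torch.Tensor:
    """Classic KD loss: the *same* temperature softens both sides,
    and the T**2 factor keeps gradient magnitudes comparable
    across temperature settings."""
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # KL(teacher || student), averaged over the batch.
    return F.kl_div(log_p_student, p_teacher,
                    reduction="batchmean") * temperature ** 2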
Abstract: Previous knowledge distillation (KD) methods mostly focus on compressing network architectures, which is insufficient for deployment, since costs such as transmission bandwidth and ...