The Medium post goes over various flavors of distillation, including response-based distillation, feature-based distillation ...
The generative AI large language model “wars” escalated as Alibaba launched Qwen 2.5 on Wednesday. This ...
The Allen Institute for AI and Alibaba have unveiled powerful language models that challenge DeepSeek's dominance in the open ...
[Notice] This list is no longer maintained because of the overwhelming number of deep learning papers published every day since 2017. A curated list of the most cited deep learning papers ...