-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
about the dimension #14
Comments
Hi @JINzezhong7 , You can see the code below. For logits with shape [B, C], we reshape it to [B, C, 1, 1], and use 1x1 convolutions in diffusion loss (diffusion model, autoencoder).
|
Hi, for this diffusion distillation, is it diffkd with both features and logits? Or just diffkd with features. looking forward to your reply. |
您好,我最近在看您的diffusion knowledge distillation 的paper,然后我尝试将它应用在我的任务上,但我发现ddim_loss 不收敛,请问是什么原因呢。非常期待您的回答。
发件人: Tao Huang ***@***.***>
日期: 星期一, 2024年3月11日 上午10:43
收件人: hunto/image_classification_sota ***@***.***>
抄送: JIN, Zezhong [Student] ***@***.***>, Mention ***@***.***>
主题: Re: [hunto/image_classification_sota] about the dimension (Issue #14)
CAUTION: This email is not originated from PolyU. Do not click links or open attachments unless you recognize the sender and know the content is safe.
Hi @JINzezhong7<https://github.com/JINzezhong7> ,
You can see the code below. For logits with shape [B, C], we reshape it to [B, C, 1, 1], and use 1x1 convolutions in diffusion loss (diffusion model, autoencoder).
https://github.com/hunto/image_classification_sota/blob/6cb144105fc5c2f778e51cc66e35314938f96fae/lib/models/losses/kd_loss.py#L138
―
Reply to this email directly, view it on GitHub<#14 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/A7NOWSTPPS5TW7P4BUDUFZTYXUK3RAVCNFSM6AAAAABEKKWJA6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOBXGUZDONBUGQ>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
[https://www.polyu.edu.hk/emaildisclaimer/PolyU_Email_Signature.jpg]
Disclaimer:
This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you should delete this message and notify the sender and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful.
The University specifically denies any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are only those of the author(s) and do not necessarily represent those of the University and the University accepts no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi, thanks for opening the source code. I read the paper, I find you use logits and features before pooling to perform diffusion. but for the logits, I guess the dimension is [B, C] B is the batch size, and C is the class number. This will cause a dimension mismatch in autoencoder, how to solve it. Thanks for your reply.
The text was updated successfully, but these errors were encountered: