
Other Datasets Problem #9

Open · makimon123 opened this issue Nov 26, 2024 · 3 comments

@makimon123

Dear Author, I attempted to apply this method to other datasets; however, I have observed that the mu_pdist, sigma_pdist, and logits distributions are very concentrated during training, even though the distributions of the mean and std themselves seem fine.
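
For reference, this is roughly how I compute the quantities I am monitoring (a minimal sketch based on my understanding of the closed-form pairwise distance; the `scale`/`shift` values here are only placeholders for the learned ones):

```python
import torch

def pairwise_stats(mu_v, mu_t, log_var_v, log_var_t, scale=1.0, shift=0.0):
    """Pairwise quantities monitored during training (illustrative only).

    mu_* are (B, D) means, log_var_* are (B, D) log-variances;
    scale/shift stand in for the learned logit scale and bias.
    """
    # squared L2 distance between every image/text mean pair -> (B, B)
    mu_pdist = torch.cdist(mu_v, mu_t, p=2) ** 2
    # summed variances of the two Gaussian embeddings for every pair -> (B, B)
    sigma_pdist = (log_var_v.exp().sum(-1, keepdim=True)
                   + log_var_t.exp().sum(-1).unsqueeze(0))
    logits = -scale * (mu_pdist + sigma_pdist) + shift
    return mu_pdist, sigma_pdist, logits
```

All three matrices end up with a very small spread throughout training, even though the per-dimension statistics of the mean and log variance look reasonable.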

@makimon123 (Author)

The final training results are also relatively poor. I suspect this is because the logits have not been trained well. Could you please advise?

@SanghyukChun (Collaborator)

If you are trying to apply this method for from-scratch training (without any pre-trained weights), it will be difficult to optimize. I recently released a new probabilistic VLM project for from-scratch training:

Probabilistic Language-Image Pre-Training
https://arxiv.org/abs/2410.18857
https://github.com/naver-ai/prolip

There is no full training code yet, but you can easily implement the new loss function:
https://github.com/naver-ai/prolip/blob/89aed36968f055fca897dc51c25156b19412c56c/src/prolip/loss.py#L100-L115
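
Roughly speaking, it treats every image-text pair in the batch as a binary classification problem on top of the closed-form probabilistic distance. The sketch below is simplified (it omits details such as the inclusion loss and the exact parameterization), so please refer to the linked loss.py for the actual implementation:

```python
import torch
import torch.nn.functional as F

def pairwise_sigmoid_prob_loss(mu_v, mu_t, log_var_v, log_var_t,
                               logit_scale, logit_bias):
    """Simplified probabilistic pairwise contrastive loss (sketch only).

    Diagonal image-text pairs are positives, all other pairs are negatives.
    The similarity is the negative closed-form distance between Gaussians.
    """
    b = mu_v.size(0)
    mu_pdist = torch.cdist(mu_v, mu_t, p=2) ** 2                  # (B, B)
    sigma_pdist = (log_var_v.exp().sum(-1, keepdim=True)
                   + log_var_t.exp().sum(-1).unsqueeze(0))        # (B, B)
    logits = -logit_scale * (mu_pdist + sigma_pdist) + logit_bias
    labels = 2.0 * torch.eye(b, device=logits.device) - 1.0       # +1 / -1
    return -F.logsigmoid(labels * logits).sum() / b
```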

If you need to use the PCME++ loss for from-scratch training, you will need an additional deterministic loss for stable convergence, as shown in my new paper.
[screenshot from the paper]
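
For example, you could keep the probabilistic matching loss as-is and add a plain InfoNCE term on the mean embeddings. This is only a sketch; `pcmepp_loss` and `lambda_det` are placeholders, and the exact form and weight of the deterministic term are up to you:

```python
import torch
import torch.nn.functional as F

def deterministic_infonce(mu_v, mu_t, temperature=0.07):
    """Plain (deterministic) InfoNCE on the mean embeddings only.

    Illustrative stabilizer for from-scratch training.
    """
    v = F.normalize(mu_v, dim=-1)
    t = F.normalize(mu_t, dim=-1)
    logits = v @ t.t() / temperature                       # (B, B)
    labels = torch.arange(v.size(0), device=v.device)
    return 0.5 * (F.cross_entropy(logits, labels)
                  + F.cross_entropy(logits.t(), labels))

# total objective (pcmepp_loss is the probabilistic loss, however you compute it):
# loss = pcmepp_loss + lambda_det * deterministic_infonce(mu_v, mu_t)
```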

@makimon123 (Author)

Thank you very much for your help! I have tried the new method you suggested, but unfortunately I am still encountering an issue where the values in sigma_pdist remain abnormal and the distribution is very concentrated. This has not improved over the course of training.

I am wondering if this could be related to the data dimensions. In my dataset, both the mean and log variance are encoded with the shape (Batchsize, Dim), specifically (256, 512). I would appreciate your thoughts on whether the dimensionality could be contributing to this issue.
