Reproduce the result in Tab.3 #5
Hi, did you enable these lines?
I hadn't enabled them, but after enabling them it still didn't work: accuracy only rose from 42.4 (not enabled) to 43.7 (enabled).
Does your code look like this?
Here is what my code looks like:
I'm not sure these two lines can have such a huge effect (49 vs. 43)?
Can you follow the code above and re-run it? LLMs are known to be sensitive to prompt format.
Thanks, it works pretty well!
Hi, is it possible to provide the JSON file with your results? I'm not sure the matching is working properly. Thanks a lot!
Hi, what do you mean by "the matching is not working properly"?
When I use "training_analysis/llava/eval_imagewikiqa.ipynb" to get answers from "text", it sometimes extracts an incorrect one.
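For anyone debugging the same matching problem, here is a minimal sketch of a conservative answer extractor. It is not the logic used in the repository's notebook. It assumes the model's "text" field should contain a single option letter from A to D, and the function name `extract_choice` is hypothetical. It returns `None` when no letter can be identified, so those outputs can be checked by hand rather than scored silently.

```python
import re
from typing import Optional

# Assumption: options are labeled A-D; the actual notebook may use another scheme.
_PATTERNS = [
    # Letter at the very start: "B", "(B)", "B.", "B:", "C) ..."
    re.compile(r"^\(?([A-D])\)?(?:[.:)]|\s*$)"),
    # Phrases such as "The answer is (D)."; only "answer is" is case-insensitive.
    re.compile(r"(?i:answer\s+is)\s*:?\s*\(?([A-D])\)?(?![A-Za-z])"),
]


def extract_choice(text: str) -> Optional[str]:
    """Return the option letter found in `text`, or None if no pattern matches."""
    text = text.strip()
    for pattern in _PATTERNS:
        match = pattern.search(text)
        if match:
            return match.group(1)
    return None
```

Outputs that map to `None` (for example free-form text such as "A dog sitting on grass") are the ones worth inspecting when the accuracy looks off.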
Hi, thanks for your great work. I have run into trouble reproducing the results in Tab.3.
I used your script 'training_analysis/llava/eval_imagewikiqa.sh' to do so. For LLaVA1.5-7B, the script is
After running "training_analysis/llava/eval_imagewikiqa.ipynb" with "outputs/llava-v1.5-7b.jsonl", I got a mean accuracy of about 39, which is close to the reported result (denoted "LLaVA1.5-7B" on the right side of Tab.3).
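As a rough cross-check of that number, the accuracy can be recomputed directly from the JSONL output. The sketch below is only an illustration, not the notebook's code: the "text" key is the one mentioned in this thread, while the "answer" key, the exact-match rule, and the function name `mean_accuracy` are assumptions.

```python
import json
from typing import Iterable


def mean_accuracy(lines: Iterable[str],
                  pred_key: str = "text",       # key mentioned in this thread
                  gold_key: str = "answer"      # hypothetical key name
                  ) -> float:
    """Return the percentage of records whose prediction equals the gold answer."""
    total = 0
    correct = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        total += 1
        # Case-insensitive exact match after trimming whitespace (an assumption).
        if record[pred_key].strip().upper() == record[gold_key].strip().upper():
            correct += 1
    if total == 0:
        raise ValueError("no records found")
    return 100.0 * correct / total
```

With a real file this could be called as `mean_accuracy(open("outputs/llava-v1.5-7b.jsonl"))`, adjusting the keys to whatever the output actually contains.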
For "LLaVA1.5-7B Finetuned on ImageNet+LLaVA", I merged your projector weights "imagenet_and_llava_mm_projector.bin" from https://drive.google.com/drive/folders/182yUoLPK9nzKThp6MlzvTSnNpDrP7oXb into "liuhaotian/llava-v1.5-7b" using the script "training_analysis/llava/process_model.ipynb". However, the result is around 42, much lower than the reported one.
Could you please let me know the correct way to reproduce your results?