About the obj tag and text prompt #6

Open
Zysty opened this issue Apr 3, 2023 · 1 comment
Zysty commented Apr 3, 2023

Hello, and thanks for sharing this great work!

According to Eq. (1), the object tag is produced by an argmax operation, while Sec. 3.1.2 of the paper says "we select one O at random for each time".
So I have a question: if the object tag is first determined by the argmax, how is the following situation handled? ("For a certain P, we may have various options for O because the block may contain multiple objects.")

Looking forward to your reply!
Thanks😁!

@FingerRec
Collaborator

Hi Zysty:

Thanks for your question, and sorry for the late reply.

  1. Yes, a grid may have multiple object tags. Such cases account for only a small fraction. When this happens, one object is selected at random.
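For readers following along, here is a minimal sketch of how the two steps could fit together. It is not the repository's code. It assumes that each object in a grid gets its own tag by an argmax over some score table, and that a grid with several objects then has one of those tags drawn uniformly at random each time it is sampled. The names `argmax_tag` and `pick_object_tag` and the score values are hypothetical.

```python
import random


def argmax_tag(scores):
    """Return the tag with the highest score.

    `scores` is a hypothetical dict mapping tag name -> score,
    standing in for the argmax in Eq. (1).
    """
    return max(scores, key=scores.get)


def pick_object_tag(candidate_tags, rng=random):
    """Pick one tag uniformly at random from a grid's candidate tags.

    Assumption: a grid with several objects has several candidate
    tags, and one is drawn each time the grid is sampled.
    """
    if not candidate_tags:
        raise ValueError("grid has no candidate object tags")
    return rng.choice(list(candidate_tags))


# A grid containing two objects, each with its own (made-up) scores.
per_object_scores = [
    {"dog": 0.7, "cat": 0.2},
    {"ball": 0.6, "frisbee": 0.3},
]
candidates = [argmax_tag(s) for s in per_object_scores]  # ["dog", "ball"]

rng = random.Random(0)  # seeded only to make the sketch reproducible
tag = pick_object_tag(candidates, rng)
```

With a single object in the grid the random draw has only one option, so the result is just the argmax tag; the random choice only matters in the less common multi-object case.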
