[Badcase]: Abnormal token (iNdEx) appears in function calls #991

Open
4 tasks done
abiaoa1314 opened this issue Sep 27, 2024 · 3 comments

@abiaoa1314

Model Series

Qwen2.5

What are the models used?

Qwen2.5-14B-int4

What is the scenario where the problem happened?

Qwen2.5-14B-int4 in transformers

Is this badcase known and can it be solved using available techniques?

  • I have followed the GitHub README.
  • I have checked the Qwen documentation and cannot find a solution there.
  • I have checked the documentation of the related framework and cannot find useful information.
  • I have searched the issues and there is not a similar one.

Information about environment

OS: Windows 10
Python: Python 3.11.9
GPUs: 1 * 4060Ti
NVIDIA driver: 555.99(from nvidia-smi)
CUDA compiler: 12.1 (from nvcc -V)
PyTorch: 2.2.1+cu121 (from python -c "import torch; print(torch.__version__)")

Description

image
An abnormal token, iNdEx, appears in the output. Function calling is far too unstable.

github-actions bot commented Nov 9, 2024

This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread.

@ChiNoel-osu

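I can confirm that the 14B version (native precision) does have this problem. Tool calls are not very stable.
The 7B version does not have this problem. This conclusion comes from repeated tests using the same generation parameters.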
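
For anyone trying to reproduce the comparison above, here is a minimal sketch of how repeated generations could be checked for malformed tool calls. It assumes the Hermes-style format in which each call is a JSON object wrapped in `<tool_call>...</tool_call>` tags, with a string `name` and an object `arguments`. The helper names `extract_tool_calls` and `malformed_rate` and the sample strings are hypothetical and not taken from this issue. In particular, where exactly `iNdEx` shows up in the real output is only visible in the screenshot, so the second sample is just an illustration.

```python
import json
import re

# Assumption: each tool call is a JSON object inside <tool_call> tags.
TOOL_CALL_RE = re.compile(r"<tool_call>\s*(.*?)\s*</tool_call>", re.DOTALL)


def extract_tool_calls(text):
    """Return (valid_calls, malformed_payloads) found in one model output."""
    valid, malformed = [], []
    for payload in TOOL_CALL_RE.findall(text):
        try:
            call = json.loads(payload)
        except json.JSONDecodeError:
            # Stray tokens inside the payload typically end up here.
            malformed.append(payload)
            continue
        well_formed = (
            isinstance(call, dict)
            and isinstance(call.get("name"), str)
            and isinstance(call.get("arguments"), dict)
        )
        if well_formed:
            valid.append(call)
        else:
            malformed.append(payload)
    return valid, malformed


def malformed_rate(outputs):
    """Fraction of outputs containing at least one malformed tool call."""
    if not outputs:
        return 0.0
    bad = sum(1 for text in outputs if extract_tool_calls(text)[1])
    return bad / len(outputs)


# Hypothetical outputs, for illustration only.
samples = [
    '<tool_call>\n{"name": "get_weather", "arguments": {"city": "Beijing"}}\n</tool_call>',
    '<tool_call>\n{"name": "get_weather", "arguments": iNdEx{"city": "Beijing"}}\n</tool_call>',
]
print(malformed_rate(samples))
```

Running the same prompts through the 7B and 14B models with identical generation parameters and comparing the two rates would turn "not very stable" into a number that can be tracked across model versions.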

@github-actions github-actions bot removed the inactive label Nov 14, 2024
@jklj077
Collaborator

jklj077 commented Nov 14, 2024

What model is Qwen2.5-14B-int4?

@jklj077 jklj077 assigned tuhahaha and unassigned JianxinMa Dec 11, 2024

5 participants