
Add functionality to suppress tokens during generation #978

Draft · wants to merge 2 commits into base: master
Conversation

@abheesht17 (Collaborator) commented Apr 10, 2023

Resolves #975

@abheesht17 abheesht17 marked this pull request as draft April 10, 2023 13:59
@abheesht17 abheesht17 marked this pull request as ready for review April 10, 2023 15:06
@chenmoneygithub (Contributor) commented:
Thanks, Abi! I am a little unsure whether a blocklist of token IDs would work as expected, mainly because of subword tokenization. Common English curse words are usually a single token, so it could work for those, but it does not work for phrases or for words that the subword tokenizer splits into multiple pieces. Blocklisting words is a complex task as I see it; we will need to discuss it a bit more. Thanks!
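To illustrate the concern above: a subword tokenizer often has no single ID for a given word, so suppressing one token ID cannot block the whole word. This is a minimal sketch with a toy greedy longest-match tokenizer and a hypothetical vocabulary (real BPE/WordPiece vocabularies differ, but the splitting behavior is similar in spirit):

```python
# Hypothetical subword vocabulary, for illustration only.
vocab = {"un": 0, "believ": 1, "able": 2}

def toy_tokenize(text, vocab):
    """Greedy longest-match tokenization: a crude stand-in for
    subword tokenizers such as BPE or WordPiece."""
    ids = []
    while text:
        for end in range(len(text), 0, -1):
            piece = text[:end]
            if piece in vocab:
                ids.append(vocab[piece])
                text = text[end:]
                break
        else:
            raise ValueError(f"no vocabulary piece matches {text!r}")
    return ids

# "unbelievable" has no single token ID; it splits into three pieces,
# so a blocklist containing any one ID would also suppress unrelated
# words that share that subword.
print(toy_tokenize("unbelievable", vocab))  # [0, 1, 2]
```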

@mattdangerw (Member) commented:
Yeah, I think the short status update is that we should leave this for discussion after the 0.5 release. Logit manipulation in general is a good topic for discussion in our generation flow.

@abheesht17 (Collaborator, Author) commented:
Gotcha! We will, however, have to do this for Whisper in any case: https://huggingface.co/openai/whisper-tiny/blob/main/generation_config.json#L126. I guess we can just modify the logits in the next() function for Whisper instead of changing the sampler API.

@mattdangerw (Member) commented:

> Gotcha! We will, however, have to do this for Whisper in any case: https://huggingface.co/openai/whisper-tiny/blob/main/generation_config.json#L126. I guess we can just modify the logits in the next() function for Whisper instead of changing the sampler API.

Good to know! I think we will leave Whisper (like all the seq2seq stuff) out of 0.5, but it does sound like we need to think through this soon.

Successfully merging this pull request may close these issues.

Consider a suppressed_tokens arg in generate()