
Stop sequences fail for some sequences #48

Open
joehoover opened this issue Jun 21, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@joehoover
Contributor

Observed Behavior

Some stop sequence inputs (e.g. "}) trigger an error:

Prediction failed.

E2102 TritonTokenizerError: Tokenizer error: in ensemble 'ensemble', Failed to process the request(s) for model instance 'preprocessing_0_126', message: ValueError: To standardize tokenizer behavior, we prepend '!' to the string representation of each stop sequence. We then strip the corresponding first token from the stop sequence IDs. However, the first token of the stop sequence IDs was not '{arbitrary_start_sequence_id}', which suggests there is a problem with the tokenizer that you are using.
At:
  /src/triton_model_repo/preprocessing/1/model.py(287): _to_word_list_format
  /src/triton_model_repo/preprocessing/1/model.py(182): execute
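For context, here is a minimal sketch of the prepend-and-strip approach the error message describes, and of how it can fail. `ToyTokenizer`, its vocabulary, and `stop_sequence_ids` are all hypothetical stand-ins, not the actual Triton/TensorRT-LLM preprocessing code: the point is only that with a BPE-style tokenizer, "!" can merge with the first character of the stop sequence into a single token, in which case there is no leading "!" token to strip and the ValueError above is raised.

```python
class ToyTokenizer:
    """Hypothetical tokenizer with a merged '!"' token (stand-in for BPE merges)."""
    VOCAB = {"!": 0, '"': 1, "}": 2, ".": 3, '!"': 4}

    def encode(self, text):
        # Greedy longest-match tokenization, mimicking how BPE merges can
        # fuse adjacent characters into one token.
        ids, i = [], 0
        while i < len(text):
            for length in (2, 1):
                piece = text[i:i + length]
                if piece in self.VOCAB:
                    ids.append(self.VOCAB[piece])
                    i += length
                    break
            else:
                raise ValueError(f"unknown character {text[i]!r}")
        return ids


def stop_sequence_ids(tokenizer, stop):
    """Prepend '!' to anchor tokenization, then strip the leading '!' token."""
    bang_id = tokenizer.encode("!")[0]
    ids = tokenizer.encode("!" + stop)
    if ids[0] != bang_id:
        # Failure path corresponding to the reported error: '!' merged with
        # the first character of the stop sequence, so it cannot be stripped.
        raise ValueError("first token of the stop sequence IDs was not '!'")
    return ids[1:]


tok = ToyTokenizer()
print(stop_sequence_ids(tok, "}"))   # → [2]; '!' stayed a separate token
try:
    stop_sequence_ids(tok, '"}')     # '!"' tokenizes as one merged token
except ValueError as e:
    print("error:", e)
```

Which stop sequences break depends entirely on the merge table of the tokenizer in use, which is consistent with only some inputs failing here.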

Expected Behavior

All stop sequence inputs should be handled and applied, so that generation stops whenever one of those sequences is encountered.

Reproduce

This request against llama-3-70b triggers the error reliably:

https://replicate.com/p/1w0ht542kdrgj0cg7c2vpkr4a0

This request ran against:

@joehoover joehoover added the bug Something isn't working label Jun 21, 2024
@joehoover
Contributor Author

stop_sequence = "." also throws this error.
