System Info
transformers version: 4.41.2

Who can help?
@gante

Information

Tasks
- An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
- My own task or dataset (give details below)

Reproduction
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda"  # the device to load the model onto

model = AutoModelForCausalLM.from_pretrained(
    "/localssd/swlu/Qwen1.5-MoE-A2.7B-Chat",
    torch_dtype="auto",
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("/localssd/swlu/Qwen1.5-MoE-A2.7B-Chat")

prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt},
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
)
model_inputs = tokenizer([text], return_tensors="pt").to(device)

generated_ids = model.generate(
    model_inputs.input_ids,
    max_new_tokens=512,
    return_dict_in_generate=True,
    output_router_logits=True,
)
print("outputs:", generated_ids.router_logits)
Expected behavior
I want to get the router_logits of MoE models using model.generate() with the code above, but instead I get:

AttributeError: 'GenerateDecoderOnlyOutput' object has no attribute 'router_logits'