Model stuck until max_new_tokens?
Hello, I'm using the model for an image captioning task.
I use the prompt:
prompt = f"""[INST] You are an assistant tasked with summarizing images for image retrieval. \n
Context:\n
{context}[/INST]"""
But sometimes it gets stuck and repeats the same words until it reaches max_new_tokens. Is there a way to prevent this behavior?
Thanks!
@danielelongo
Does it only get repetitive after a certain length? If so, the model may just not be good at long generations in general. You can mitigate repetition with repetition_penalty
(see https://huggingface.co/docs/transformers/v4.44.2/en/main_classes/text_generation#transformers.GenerationConfig), and you can also verify that the input is formatted correctly, in case the generation quality is bad in general.
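To illustrate what repetition_penalty does under the hood, here is a minimal sketch of the CTRL-style penalty that transformers' RepetitionPenaltyLogitsProcessor applies: the logit of every token that already appears in the generated sequence is divided by the penalty if positive (or multiplied if negative), making it less likely to be picked again. The function name and toy values below are just for illustration.

```python
def apply_repetition_penalty(logits, generated_ids, penalty):
    """Down-weight tokens that were already generated.

    logits: list of raw scores, one per vocabulary token
    generated_ids: token ids produced so far
    penalty: > 1.0 discourages repetition; 1.0 is a no-op
    """
    out = list(logits)
    for tok in set(generated_ids):
        if out[tok] > 0:
            out[tok] /= penalty  # shrink positive scores
        else:
            out[tok] *= penalty  # push negative scores further down
    return out


# Toy example: token 0 was already generated, so its score drops.
scores = apply_repetition_penalty([2.0, -1.0, 0.5], [0], penalty=1.2)
```

In practice you would simply pass the parameter to generate, e.g. model.generate(**inputs, max_new_tokens=200, repetition_penalty=1.2); values slightly above 1.0 usually suffice, while large values can hurt fluency.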
Thank you very much!