HuggingFaceM4/Idefics3-8B-Llama3 · How to use history prompts on the same image?

25 days ago

Is it possible to ask several prompts about the same image in a loop, with each new prompt taking the assistant's previous answers into account? In the example, each user message uses a different image.

HugoLaurencon

HuggingFaceM4 org 22 days ago

edited 22 days ago

Yes, it's totally possible (the model was mostly trained this way during SFT).

MotiHa

18 days ago

Could you give an example?

HugoLaurencon

HuggingFaceM4 org 18 days ago

It's exactly like the given snippet:

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "What do we see in this image?"},
        ]
    },
    {
        "role": "assistant",
        "content": [
            {"type": "text", "text": "In this image, we can see the city of New York, and more specifically the Statue of Liberty."},
        ]
    },
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "And how about this image?"},
        ]
    },
]

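But you can remove the extra {"type": "image"} entry from any message where you don't want to put an image. For follow-up questions about the same image, keep the image entry only in the first user message.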
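To make the loop concrete, here is a minimal sketch of how the history could be built up turn by turn with a single image. The helper names (`add_user_turn`, `add_assistant_turn`, `count_image_placeholders`) and the `fake_generate` stand-in are hypothetical and not part of the model's API. The real model call is only outlined in comments, assuming the `processor` / `model` usage pattern from the model card snippet.

```python
def add_user_turn(messages, question):
    """Append a user turn; only the first turn gets the image placeholder."""
    content = []
    if not messages:
        # First turn of the conversation: attach the single image here.
        content.append({"type": "image"})
    content.append({"type": "text", "text": question})
    messages.append({"role": "user", "content": content})
    return messages


def add_assistant_turn(messages, answer):
    """Append the assistant's answer so the next prompt can build on it."""
    messages.append(
        {"role": "assistant", "content": [{"type": "text", "text": answer}]}
    )
    return messages


def count_image_placeholders(messages):
    """Count {"type": "image"} entries across the whole conversation."""
    return sum(
        1 for m in messages for part in m["content"] if part["type"] == "image"
    )


def fake_generate(messages):
    # Hypothetical stand-in for the real model call. With the actual model,
    # this step would look roughly like (assumed, following the model card):
    #   prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
    #   inputs = processor(text=prompt, images=[image], return_tensors="pt")
    #   generated_ids = model.generate(**inputs, max_new_tokens=500)
    # followed by decoding the newly generated tokens into the answer text.
    last_question = messages[-1]["content"][-1]["text"]
    return f"(answer to: {last_question})"


questions = [
    "What do we see in this image?",
    "What is the weather like there?",
    "Given your previous answers, suggest a caption.",
]

messages = []
for question in questions:
    add_user_turn(messages, question)
    answer = fake_generate(messages)
    add_assistant_turn(messages, answer)
```

The number of images passed to the processor should match the number of `{"type": "image"}` entries in the conversation, so in this sketch the same single image would be passed on every iteration.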