How do model output interleaved text-image with multimodal input? #52

URRealHero · 2024-09-11T01:40:37Z

Does the model require further finetune? I'm wondering why the playground use a 'for' loop to generate a story

URRealHero · 2024-09-11T06:30:47Z

What I mean is how can it generate multi images with each image a caption. If I use a for loop, wouldn't the model repeatedly generate similar scene again and again? How did the playground.py ensure it can generate a sequence of story? How to make a coherent multi-images generation?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How do model output interleaved text-image with multimodal input? #52

How do model output interleaved text-image with multimodal input? #52

URRealHero commented Sep 11, 2024

URRealHero commented Sep 11, 2024

How do model output interleaved text-image with multimodal input? #52

How do model output interleaved text-image with multimodal input? #52

Comments

URRealHero commented Sep 11, 2024

URRealHero commented Sep 11, 2024