
The pre-trained checkpoint generates very short output #38

Open

Richar-Du opened this issue Jul 4, 2023 · 7 comments

Comments

@Richar-Du

Thanks for your awesome work!

I want to use the model to generate the HTML of an image, so I chose the pre-trained checkpoint without fine-tuning. However, the generated output is very short. For example, the following code only generates <img_src=image> without any detailed structure.

from PIL import Image
import torch
from transformers import Pix2StructProcessor, Pix2StructForConditionalGeneration

device = torch.device("cuda")
processor = Pix2StructProcessor.from_pretrained("google/pix2struct-large")
model = Pix2StructForConditionalGeneration.from_pretrained("google/pix2struct-large").to(device)

img_path = 'biography.png'
image = Image.open(img_path)
# disable the VQA preprocessing path so the processor accepts an image with no text prompt
processor.image_processor.is_vqa = False

inputs = processor(images=image, return_tensors="pt").to(device)
generated_ids = model.generate(**inputs, max_length=1000)
generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(generated_text)

My transformers version is 4.28.0. Do you know how to solve this problem? Thanks in advance :)

@nbroad1881

You should probably upgrade transformers; see huggingface/transformers#22903.
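If it helps, here is a minimal sketch for checking whether a local install predates a given release. Note the 4.31.0 threshold below is only a placeholder assumption; this thread never confirms which release, if any, actually fixes the issue.

```python
# Hypothetical helper: decide whether an installed transformers version
# is older than a minimum version, comparing numeric components only.
# The default minimum (4.31.0) is a placeholder, not a confirmed fix version.
def needs_upgrade(installed: str, minimum: str = "4.31.0") -> bool:
    """Return True if `installed` is older than `minimum`."""
    def parts(v: str):
        # take the first three dot-separated numeric components
        return tuple(int(p) for p in v.split(".")[:3])
    return parts(installed) < parts(minimum)

print(needs_upgrade("4.28.0"))  # the reporter's original version -> True
print(needs_upgrade("4.31.0"))  # at the placeholder threshold -> False
```

This only compares plain numeric versions; for pre-release tags you would want `packaging.version` instead.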

@Richar-Du
Author

Richar-Du commented Jul 8, 2023

I have updated transformers to 4.30.2, but the problem persists. The input to the processor is the attached image, and I want to use pix2struct-large to generate its corresponding HTML. However, the generated text is now just: '<>'

@nbroad1881 @younesbelkada

@HeimingX

Hi, I encountered the same problem. I took a screenshot of the left subfigure of Figure 1 in the Pix2Struct paper, and the pix2struct-large model only outputs the same '<>'. This is severely inconsistent with expectations, and I am quite confused. I am eagerly awaiting a response from the authors. Thanks a lot.

PS: my transformers version is 4.31.0.

@ChenDelong1999

+1

@nbroad1881

@kentonl, is there a prompt for pretraining?

@luukvankooten

+1

@Alexwangziyu

+1
