Images in table #21

chillyoung4679 · 2024-05-23T10:14:43Z

Hello,

My PDF file contains long tables, and the tables include images. I tried

md_text = pymupdf4llm.to_markdown("input.pdf", write_images=True)

and the result was that the images were extracted but placed below the table.

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text |  |
| Text | Text |  |
| Text | Text |  |

![image1](images1.png)
![image2](images2.png)
![image3](images3.png)

However, I want the images to be inside the table. like:

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text | ![image1](images1.png) |
| Text | Text | ![image2](images2.png) |
| Text | Text | ![image3](images3.png) |

How can I achieve this?

Best

The text was updated successfully, but these errors were encountered:

JorjMcKie · 2024-05-23T10:23:47Z

No, this is currently not supported.
But let me check if there is any chance to tweak the table finder.

JorjMcKie added the enhancement New feature or request label May 23, 2024

JorjMcKie added the postponed label Jun 8, 2024

JorjMcKie mentioned this issue Jun 13, 2024

Embedded links inside the table are not extracted #42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Images in table #21

Images in table #21

chillyoung4679 commented May 23, 2024

JorjMcKie commented May 23, 2024

Images in table #21

Images in table #21

Comments

chillyoung4679 commented May 23, 2024

JorjMcKie commented May 23, 2024