Images in table #21

Open
chillyoung4679 opened this issue May 23, 2024 · 1 comment
Labels
enhancement (New feature or request), postponed

Comments

@chillyoung4679

Hello,

My PDF file contains long tables that include images. I tried:

md_text = pymupdf4llm.to_markdown("input.pdf", write_images=True)

The images were extracted, but they were placed below the table:

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text |  |
| Text | Text |  |
| Text | Text |  |

![image1](images1.png)
![image2](images2.png)
![image3](images3.png)

However, I want the images to be inside the table, like this:

| Col1  | Col2  | Image |
|---|---|---|
| Text | Text | ![image1](images1.png) |
| Text | Text | ![image2](images2.png) |
| Text | Text | ![image3](images3.png) |

How can I achieve this?

Best

@JorjMcKie
Contributor

No, this is currently not supported.
But let me check if there is any chance to tweak the table finder.
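
Until something like this is supported, one possible workaround is to post-process the Markdown string yourself. The sketch below is not part of pymupdf4llm: `move_images_into_table` and `_fill_rows` are hypothetical helper names, and the approach rests on assumptions that may not hold for every PDF. It assumes the image lines follow the table directly, that each body row has an empty last cell, and that images belong to rows in top-to-bottom order.

```python
import re

# A line holding exactly one Markdown image, e.g. ![alt](path.png)
IMAGE_LINE = re.compile(r"^!\[[^\]]*\]\([^)]*\)$")
# The separator row of a Markdown table, e.g. |---|---|---|
SEPARATOR_ROW = re.compile(r"^\|(\s*:?-+:?\s*\|)+$")


def _fill_rows(table, images):
    """Put images into empty last cells of body rows, in order (assumption)."""
    remaining = list(images)
    filled = []
    seen_separator = False
    for row in table:
        s = row.strip()
        if SEPARATOR_ROW.match(s):
            seen_separator = True
            filled.append(row)
            continue
        # Only touch body rows (after the separator) that are fully delimited.
        if seen_separator and remaining and s.startswith("|") and s.endswith("|"):
            cells = s[1:-1].split("|")
            if cells[-1].strip() == "":
                cells[-1] = f" {remaining.pop(0)} "
                row = "|" + "|".join(cells) + "|"
        filled.append(row)
    return filled, remaining


def move_images_into_table(md_text: str) -> str:
    """Move image lines that directly follow a table into that table."""
    lines = md_text.splitlines()
    out = []
    i = 0
    while i < len(lines):
        if not lines[i].startswith("|"):
            out.append(lines[i])
            i += 1
            continue
        # Collect the contiguous block of table rows.
        start = i
        while i < len(lines) and lines[i].startswith("|"):
            i += 1
        table = lines[start:i]
        # Look past blank lines for a run of image-only lines.
        j = i
        while j < len(lines) and not lines[j].strip():
            j += 1
        images = []
        while j < len(lines) and IMAGE_LINE.match(lines[j].strip()):
            images.append(lines[j].strip())
            j += 1
        if images:
            table, leftover = _fill_rows(table, images)
            out.extend(table)
            if leftover:
                # Images that found no empty cell stay below the table.
                out.append("")
                out.extend(leftover)
            i = j
        else:
            out.extend(table)
    return "\n".join(out)
```

You would call it on the string returned by `to_markdown`, for example `md_text = move_images_into_table(md_text)`. Because the row-to-image mapping is purely positional, check the result for tables where some rows have no image or where a table spans several pages.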
