You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To automate the process of solving exam questions using gpt-resolve, it would be much more efficient to extract each question as a separate image from a PDF exam. Exams with many questions make the process of extracting questions manually very slow and subject to mistakes.
Some Approaches
OpenCV
Convert PDF to Images: Use tools like pdf2image to transform each page of the PDF into an image.
Segment Questions with OpenCV: Apply OpenCV to detect and isolate individual questions through contour or edge detection techniques.
Save and Name Output: Store each extracted question as a separate image file, naming them sequentially (e.g., q1.png, q2.png).
This method may encounter challenges with questions spanning multiple pages but should effectively process the majority of exams, especially those with numerous pages.
Docling
Consider using Docling, a tool designed to parse documents and export them into various formats, which may assist in extracting questions from PDFs. This can be quite good, since it seems to use advanced models for many kind of document extractions.
The text was updated successfully, but these errors were encountered:
Objective
To automate the process of solving exam questions using gpt-resolve, it would be much more efficient to extract each question as a separate image from a PDF exam. Exams with many questions make the process of extracting questions manually very slow and subject to mistakes.
Some Approaches
OpenCV
pdf2image
to transform each page of the PDF into an image.q1.png
,q2.png
).This method may encounter challenges with questions spanning multiple pages but should effectively process the majority of exams, especially those with numerous pages.
Docling
Consider using Docling, a tool designed to parse documents and export them into various formats, which may assist in extracting questions from PDFs. This can be quite good, since it seems to use advanced models for many kind of document extractions.
The text was updated successfully, but these errors were encountered: