Replies: 2 comments
-
This is a huge need! My understanding is that Anthropic is using a ColPali style late interaction mechanism which is far superior to OCR + embedding etc. Much simpler for user experience as well (provided files fit within sizing parameters). Please add! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Checked
Feature request
I propose adding native support for reading PDF files in the Anthropic and Gemini models via their respective APIs (Anthropic API and Vertex AI). This feature would allow users to upload a PDF file directly for processing, enabling the models to extract both text and visual elements, such as images.
Expected functionality:
Motivation
The ability to work with PDFs natively is essential for a wide range of use cases, including legal document analysis, technical reports, academic studies, and any context involving a combination of text and images.
Currently, users need to preprocess PDFs manually before sending them to the models, which adds complexity, time, and potential errors to the workflow. Implementing native support would streamline the process, improve efficiency, and enhance the versatility of the APIs.
Proposal (If applicable)
No response
Beta Was this translation helpful? Give feedback.
All reactions