You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi , I'm using partition_pdf to extract images with below config:
unstructured_kwargs={
"strategy": "hi_res",
"extract_images_in_pdf":True,
"extract_image_block_types": ["Image"],
"extract_image_block_to_payload": True,
"languages": ["eng", "chi_sim","chi"],
"chunking_strategy":"basic",
"max_characters":1000,
}, The result shows that one image will be cut into 2 or even worse, for example:
original image in PDF:
extracted images from PDF:
Is there any way to do image-extraction quality enhancement? thanks
The text was updated successfully, but these errors were encountered:
Hi , I'm using partition_pdf to extract images with below config:
unstructured_kwargs={
"strategy": "hi_res",
"extract_images_in_pdf":True,
"extract_image_block_types": ["Image"],
"extract_image_block_to_payload": True,
"languages": ["eng", "chi_sim","chi"],
"chunking_strategy":"basic",
"max_characters":1000,
},
The result shows that one image will be cut into 2 or even worse, for example:
The text was updated successfully, but these errors were encountered: