OCR on image

#28

by glitchyordis - opened 10 days ago

10 days ago

Obtaining key information is quite straightforward, but is there a way to obtain bbox locations for the detected text?

glitchyordis changed discussion title from OCR text to OCR on image 10 days ago

maxiw

9 days ago

You can prompt the model to return bbox locations (see here: https://huggingface.co/spaces/maxiw/Qwen2-VL-Detection). I also tried the prompt "detect all texts", but the results are not very precise.
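Once the model replies, the boxes still need to be pulled out of the generated text. Below is a minimal sketch of that step, not the Space's actual code. It assumes the reply contains boxes written as `(x1,y1),(x2,y2)` on a 0-1000 normalized grid, which may not match what your prompt returns; `parse_boxes` and its `scale` parameter are hypothetical names used only for illustration.

```python
import re

# Assumption: boxes appear in the reply as "(x1,y1),(x2,y2)" with
# coordinates normalized to a 0-1000 grid. Adjust the pattern and
# `scale` if your prompt yields a different format.
BOX_PATTERN = re.compile(r"\((\d+),\s*(\d+)\),\s*\((\d+),\s*(\d+)\)")


def parse_boxes(model_output, image_width, image_height, scale=1000):
    """Return a list of (x1, y1, x2, y2) pixel boxes found in the text."""
    boxes = []
    for match in BOX_PATTERN.finditer(model_output):
        x1, y1, x2, y2 = (int(value) for value in match.groups())
        # Rescale from the normalized grid to pixel coordinates.
        boxes.append((
            round(x1 * image_width / scale),
            round(y1 * image_height / scale),
            round(x2 * image_width / scale),
            round(y2 * image_height / scale),
        ))
    return boxes
```

Calling `parse_boxes(reply_text, image.width, image.height)` on the decoded reply would give pixel boxes you can draw on the image to check how precise the detections are.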
