Hi there! This repo shows how to use OpenVINO to accelerate the GOT-OCR2.0 model.
- Download all files from the original repo on Hugging Face, then move them into the `weight` folder (a download sketch follows the tree below). The file structure should eventually look like this:
  .
  │  app.py
  │  convert_model.py
  │
  ├─ weight
  │     config.json
  │     generation_config.json
  │     got_vision_b.py
  │     modeling_GOT.py
  │     qwen.tiktoken
  │     render_tools.py
  │     special_tokens_map.json
  │     tokenization_qwen.json
  │     tokenizer_config.json
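
  If you prefer to script the download, here is a minimal sketch using `huggingface_hub`. The repo id shown is an assumption; substitute the actual GOT-OCR2.0 repo you download from.

  ```python
  # Minimal download sketch (assumption: the upstream GOT-OCR2.0 weights live at
  # "stepfun-ai/GOT-OCR2_0"; replace with the repo you actually use).
  from huggingface_hub import snapshot_download

  snapshot_download(
      repo_id="stepfun-ai/GOT-OCR2_0",  # assumed repo id
      local_dir="weight",               # matches the folder layout above
  )
  ```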
- Run the following command:
  python app.py --image-file /path/to/image
  It will automatically convert the model into OpenVINO IR using INT4 quantization. For more information about quantization with OpenVINO, please refer to NNCF.
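
  For reference, below is a hedged sketch of what INT4 weight compression with NNCF looks like on an OpenVINO IR model; the IR path, `ratio`, and `group_size` values are illustrative assumptions, not necessarily the exact settings app.py applies.

  ```python
  # Sketch of INT4 weight compression with NNCF (paths and parameters are
  # illustrative assumptions, not the exact settings used by app.py).
  import nncf
  import openvino as ov

  core = ov.Core()
  model = core.read_model("weight/openvino_model.xml")  # hypothetical IR path

  compressed = nncf.compress_weights(
      model,
      mode=nncf.CompressWeightsMode.INT4_ASYM,  # 4-bit asymmetric weight quantization
      ratio=0.8,                                # fraction of weights compressed to INT4
      group_size=128,                           # per-group quantization granularity
  )
  ov.save_model(compressed, "weight/openvino_model_int4.xml")
  ```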
- The original version generates 19 tokens/s, while OpenVINO with INT4 quantization speeds this up to 37 tokens/s (tested only on an Intel Core i7-1360P, 16 GB RAM, Windows 11 Pro).
- Accuracy has not been tested yet, but the results look good to me.
- Some code is adapted from ov_qwen2_audio_helper.py.
- GOT-OCR2.0: Towards OCR-2.0 via a Unified End-to-end Model
- OpenVINO: an open-source software toolkit for optimizing and deploying deep learning models.