Instructions to use PaddlePaddle/PaddleOCR-VL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PaddleOCR
How to use PaddlePaddle/PaddleOCR-VL with PaddleOCR:
# See https://www.paddleocr.ai/latest/version3.x/pipeline_usage/PaddleOCR-VL.html for installation
from paddleocr import PaddleOCRVL

pipeline = PaddleOCRVL(pipeline_version="v1")
output = pipeline.predict("path/to/document_image.png")
for res in output:
    res.print()
    res.save_to_json(save_path="output")
    res.save_to_markdown(save_path="output")

- Notebooks
- Google Colab
- Kaggle
"Dependency Error"
I've tried installing and running this locally. I followed all of the steps in the documentation and can import paddle successfully. When I do paddle.utils.run_check(), it says:
"PaddlePaddle works well on 1 GPU.
PaddlePaddle is installed successfully! Let's start deep learning with PaddlePaddle now."
But when I try calling PaddleOCRVL() to make a pipeline, it errors out: "RuntimeError: A dependency error occurred during pipeline creation. Please refer to the installation documentation to ensure all required dependencies are installed."
I'm using Python 3.11.9, CUDA 12.9. Anyone have this issue and/or a fix?
This is a known issue; see https://github.com/PaddlePaddle/PaddleX/issues/4552
Solution: pip install "paddlex[ocr]"
(The quotes keep shells such as zsh from treating the square brackets as a glob pattern.)
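If you want to confirm what is missing before reinstalling, something like the following may help. It is only a rough sketch: the module list is a hypothetical placeholder (the actual packages pulled in by the ocr extra are defined by PaddleX and may differ), and the helper names missing_modules and pip_install_command are made up for illustration. It relies only on the standard library.

```python
import importlib.util
import sys

# Hypothetical list for illustration only; the real set of packages behind
# the "ocr" extra is defined by PaddleX and may differ.
CANDIDATE_MODULES = ["paddle", "paddlex", "paddleocr"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def pip_install_command(requirement="paddlex[ocr]"):
    """Build a pip command bound to the interpreter running this script."""
    return [sys.executable, "-m", "pip", "install", requirement]


if __name__ == "__main__":
    missing = missing_modules(CANDIDATE_MODULES)
    print("Missing modules:", missing or "none")
    print("Suggested install command:", " ".join(pip_install_command()))
```

Building the command from sys.executable targets the same Python environment that fails to create the pipeline, which matters when several interpreters or virtual environments are installed. Passing the resulting list to subprocess.run would also sidestep shell quoting of the square brackets.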