Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
          ocr          pdf-parser          kie          document-translation          rag          chineseocr          ai4science          pp-ocr          document-parsing          pp-structure          pdf-extractor-rag          pdf2markdown      
    - 
            Updated
            Oct 22, 2025 
- Python