WeSearch

Turbo-OCR Update: Layout Model + Multilingual

· 0 reactions · 0 comments · 2 views
Turbo-OCR Update: Layout Model + Multilingual

Follow-up to my post 18 days ago about the C++/CUDA OCR server. Two additions: What's New: Layout model: Added PP-StructureV3 for layout detection Multilingual: No longer Latin-only. Now supports Chinese, Japanese, Korean, Cyrillic, Arabic, and Latin-script languages. Same stack: C++, TensorRT FP16, multi-stream, gRPC/HTTP, direct pdf endpoint. Benchmarks (Linux / RTX 5090 / CUDA 13.2): Very text-heavy images: 100+ img/s Sparse/Low-text: 1,000+ img/s 270p/s on FUNSD Dataset Source: github.com/ai

Original article
Reddit
Read full at Reddit →
Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Email

Discussion

0 comments

More from Reddit