Modal + Datalab: Deploy high-throughput document intelligence in <5 minutes
Blog post from Modal
Modal has partnered with Datalab so that developers can quickly deploy Datalab's high-throughput document intelligence models, notably Marker and Surya, which are built for document structure parsing and OCR. Marker, a sub-billion-parameter model, parses documents deterministically and stably at lower cost than larger language models, and it has become widely used and trusted in the tech community. Modal's infrastructure provides scalable, reliable model serving, so developers can deploy these tools quickly and use GPU acceleration for higher throughput. The tools are free for research, personal use, and small startups; commercial licensing is also available. Marker supports over 90 languages and excels at extracting complex data from PDFs, and when deployed on Modal it outperforms other services in both accuracy and throughput. The partnership emphasizes ease of deployment and scalability, making sophisticated document intelligence accessible to a wider range of users and applications.