NVIDIA's Nemotron Nano 2 VL is an advanced open-source vision language model designed for high performance and scalability, particularly in the financial services sector, where it excels in tasks such as know-your-customer compliance, intelligent document processing, and fraud detection. Built on a base of the Nemotron Nano 2, a 9 billion parameter foundation model, Nemotron Nano 2 VL features a 12 billion parameter architecture with a hybrid Mamba-Transformer design, offering enhanced accuracy and efficiency for tasks like multi-image understanding, document intelligence, and video captioning. This model, available on Baseten, supports robust inference capabilities using NVIDIA NIM microservices for high throughput and low latency performance, with applications extending to various industries such as healthcare and media. Additionally, a smaller model, Nemotron Parse 1.1, provides a cost-effective solution for straightforward optical character recognition tasks. Baseten enhances the deployment of these models with enterprise-grade security, multi-cloud infrastructure, and technical support, making them suitable for building secure, reliable AI agents in production environments.