Ollama has introduced streaming responses with tool calling, allowing chat applications to stream content and call tools in real-time, utilizing models like Qwen 3, Llama 3.1, and Llama 4. This functionality enables users to perform tasks such as fetching weather data or adding numbers through tool calls integrated into the chat interface. The system employs a new parser that focuses on understanding the structure of tool calls rather than relying solely on JSON parsing, which facilitates incremental parsing and improves tool call reliability. Ollama's Model Context Protocol (MCP) further enhances tool calling by allowing expanded context windows, thereby boosting accuracy and performance. The update also includes support for Python and JavaScript, enabling developers to integrate and utilize these tool calls in their applications efficiently.