|
Juggernaut FLUX is live on DeepInfra!
|
Oguz Vuruskaner |
2025-03-25 |
349 |
--
|
|
Enhancing Open-Source LLMs with Function Calling Feature
|
Pernekhan Utemuratov |
2024-01-26 |
1,025 |
--
|
|
Guaranteed JSON output on Open-Source LLMs.
|
Patrick Reiter Horn |
2024-03-08 |
624 |
--
|
|
How to use CivitAI LoRAs: 5-Minute AI Guide to Stunning Double Exposure …
|
Oguz Vuruskaner |
2025-01-23 |
391 |
--
|
|
Introducing Tool Calling with LangChain, Search the Web with Tavily and Tool …
|
Oguz Vuruskaner |
2024-07-05 |
583 |
--
|
|
FLUX.1-dev Guide: Mastering Text-to-Image AI Prompts for Stunning and Consistent Visuals
|
Oguz Vuruskaner |
2024-09-04 |
1,276 |
--
|
|
How to deploy Databricks Dolly v2 12b, instruction tuned casual language model.
|
Yessen Kanapin |
2023-04-12 |
349 |
--
|
|
A Milestone on Our Journey Building Deep Infra and Scaling Open Source …
|
Yessen Kanapin |
2025-04-22 |
589 |
--
|
|
Model Distillation Making AI Models Efficient
|
Deep |
2025-04-10 |
1,426 |
--
|
|
Fork of Text Generation Inference.
|
Nikola Borisov |
2023-08-09 |
417 |
--
|
|
Getting Started
|
Nikola Borisov |
2023-03-02 |
278 |
--
|
|
Long Context models incoming
|
Iskren Chernev |
2023-11-21 |
628 |
--
|
|
The easiest way to build AI applications with Llama 2 LLMs.
|
Nikola Borisov |
2023-08-02 |
603 |
--
|
|
A short intro on running Stable Diffusion on DeepInfra
|
Iskren |
2023-03-08 |
218 |
--
|
|
Use OpenAI API clients with LLaMas
|
Iskren Chernev |
2023-08-28 |
343 |
--
|
|
Inference LoRA adapter model
|
Askar Aitzhan |
2024-12-06 |
459 |
--
|
|
Unleashing the Potential of AI for Exceptional Gaming Experiences
|
Tsveta Gavanozova |
2023-11-10 |
500 |
--
|
|
Chat with books using DeepInfra and LlamaIndex
|
Oguz Vuruskaner |
2024-06-07 |
565 |
--
|
|
Seed Anchoring and Parameter Tweaking with SDXL Turbo: Create Stunning Cubist Art
|
Oguz Vuruskaner |
2024-09-12 |
1,233 |
--
|
|
Deploy Custom LLMs on DeepInfra
|
Iskren Chernev |
2024-03-01 |
276 |
--
|
|
Introducing GPU Instances: On-Demand GPU Compute for AI Workloads
|
Deep |
2025-06-09 |
792 |
--
|
|
How to OpenAI Whisper with per-sentence and per-word timestamp segmentation using DeepInfra
|
Yessen Kanapin |
2023-04-05 |
323 |
--
|
|
Building a Voice Assistant with Whisper, LLM, and TTS
|
Askar Aitzhan |
2024-09-20 |
748 |
--
|
|
Search That Actually Works: A Guide to LLM Rerankers
|
Deep |
2025-09-10 |
2,122 |
--
|
|
Lzlv model for roleplaying and creative work
|
Nikola Borisov |
2023-11-02 |
532 |
--
|
|
Compare Llama2 vs OpenAI models for FREE.
|
Nikola Borisov |
2023-09-28 |
406 |
--
|
|
Langchain improvements: async and streaming
|
Iskren Chernev |
2023-10-25 |
292 |
--
|
|
How to deploy google/flan-ul2 - simple. (open source ChatGPT alternative)
|
Nikola Borisov |
2023-03-17 |
495 |
--
|
|
Art That Talks Back: A Hands-On Tutorial on Talking Images
|
Oguz Vuruskaner |
2025-03-07 |
591 |
--
|
|
Deep Infra Launches Access to NVIDIA Nemotron Models for Vision, Retrieval, and …
|
Yessen Kanapin |
2025-10-28 |
814 |
--
|
|
How to deploy Databricks Dolly v2 12b, instruction tuned casual language model.
|
Yessen Kanapin |
2023-04-12 |
541 |
--
|
|
Power the Next Era of Image Generation with FLUX.2 Visual Intelligence on …
|
Deep |
2025-11-25 |
749 |
--
|
|
Kimi K2 0905 API from Deepinfra: Practical Speed, Predictable Costs, Built for …
|
Deep |
2025-12-01 |
1,837 |
--
|
|
GLM-4.6 API: Get fast first tokens at the best $/M from Deepinfra's …
|
Deep |
2025-12-01 |
2,022 |
--
|
|
Llama 3.1 70B Instruct API from DeepInfra: Snappy Starts, Fair Pricing, Production …
|
Deep |
2025-12-01 |
2,197 |
--
|
|
Accelerating Reasoning Workflows with Nemotron 3 Nano on DeepInfra
|
Yessen Kanapin |
2025-12-15 |
909 |
--
|
|
Pricing 101: Token Math & Cost-Per-Completion Explained
|
Deep |
2026-01-13 |
6,002 |
--
|
|
From Precision to Quantization: A Practical Guide to Faster, Cheaper LLMs
|
Deep |
2026-01-13 |
2,911 |
--
|
|
How the Models Perform on DeepInfra: Long-Context Performance, Throughput, and Cost
|
Deep |
2026-01-13 |
1,730 |
--
|
|
Nemotron 3 Nano vs GPT-OSS-20B: Performance, Benchmarks & DeepInfra Results
|
Deep |
2026-01-13 |
1,673 |
--
|
|
Build an OCR-Powered PDF Reader & Summarizer with DeepInfra (Kimi K2)
|
Deep |
2026-01-13 |
3,944 |
--
|
|
LLM API Provider Performance KPIs 101: TTFT, Throughput & End-to-End Goals
|
Deep |
2026-01-13 |
2,103 |
--
|
|
Nemotron 3 Nano Explained: NVIDIA’s Efficient Small LLM and Why It Matters
|
Deep |
2026-01-13 |
2,280 |
--
|