|
LLMs in Prod Comes to Bangalore
|
Vrushank Vyas |
2024-07-29 |
1,045 |
--
|
|
AI audit checklist for internal AI platforms & enablement teams
|
Drishti Shah |
2025-12-10 |
2,089 |
--
|
|
May: Major Updates
|
Vrushank Vyas |
2024-05-31 |
355 |
--
|
|
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace - …
|
The Quill |
2023-04-20 |
197 |
--
|
|
Agent observability: measuring tools, plans, and outcomes
|
Drishti Shah |
2025-11-28 |
1,619 |
--
|
|
My Journey with AI-Driven Development: From Curiosity to Necessity
|
Ayush |
2023-08-18 |
602 |
--
|
|
We're Afraid Language Models Aren't Modeling Ambiguity - Summary
|
The Quill |
2023-05-07 |
225 |
--
|
|
Language Models are Few-Shot Learners - Summary
|
Rohit Agarwal |
2023-04-15 |
230 |
--
|
|
What It Means To Go To Prod
|
Rohit Agarwal |
2024-04-13 |
312 |
--
|
|
Instruction Tuning with GPT-4 - Summary
|
The Quill |
2023-04-16 |
241 |
--
|
|
CAMEL: Communicative Agents for "Mind" Exploration of LLMs - Summary
|
Rohit Agarwal |
2023-04-14 |
352 |
--
|
|
LLMs in Prod: The Reality of AI Outages, No LLM is Immune
|
Siddharth Sambharia |
2024-12-14 |
504 |
--
|
|
Gemini 3.0 vs GPT-5.1: a clear comparison for builders
|
Drishti Shah |
2025-11-19 |
921 |
--
|
|
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller …
|
The Quill |
2023-05-07 |
237 |
--
|
|
Towards Reasoning in Large Language Models: A Survey - Summary
|
The Quill |
2023-06-09 |
248 |
--
|
|
A Survey of Large Language Models - Summary
|
The Quill |
2023-04-16 |
274 |
--
|
|
Training language models to follow instructions with human feedback - Summary
|
Rohit Agarwal |
2023-04-15 |
249 |
--
|
|
Attention Is All You Need - Summary
|
Rohit Agarwal |
2023-04-14 |
248 |
--
|
|
Expressive Text-to-Image Generation with Rich Text - Summary
|
Rohit Agarwal |
2023-04-15 |
264 |
--
|
|
The Confidence Checklist for LLMs in Production
|
Vrushank Vyas |
2023-07-01 |
51 |
--
|
|
Eight Things to Know about Large Language Models - Summary
|
The Quill |
2023-04-16 |
132 |
--
|
|
LLMs In Prod: Day 1
|
Siddharth Sambharia |
2024-12-13 |
505 |
--
|
|
Fortifying Your AI Stack: Palo Alto Networks Prisma AIRS Now on Portkey
|
Jason Roberts |
2025-08-13 |
547 |
--
|
|
AI tool sprawl: causes, risks, and how teams can regain control
|
Drishti Shah |
2025-11-20 |
1,327 |
--
|
|
June at Portkey: Agents, AI Governance, and Twenty Six Cookbooks
|
Vrushank Vyas |
2024-07-08 |
405 |
--
|
|
LLM access control in multi-provider environments
|
Drishti Shah |
2025-12-03 |
1,531 |
--
|
|
Portkey in September
|
Vrushank Vyas |
2024-10-05 |
922 |
--
|
|
GPT-4 is Getting Faster 🐇
|
Vrushank Vyas |
2023-10-16 |
257 |
--
|
|
How Snorkel evaluates and trains top AI models
|
Shae Selix |
2025-11-04 |
2,388 |
--
|
|
LoRA: Low-Rank Adaptation of Large Language Models - Summary
|
Rohit Agarwal |
2023-04-15 |
271 |
--
|
|
Open WebUI vs LibreChat: Choose the Right ChatGPT UI for Your Organization
|
Vrushank Vyas |
2025-02-19 |
1,154 |
--
|
|
Elevate Your ToolJet Experience with Portkey AI
|
Kavya MD |
2024-10-29 |
853 |
--
|
|
OpenAI DevDay's Implications for LLM Apps in Prod
|
Rohit Agarwal |
2023-11-07 |
1,041 |
--
|
|
Are We Really Making Much Progress in Text Classification? A Comparative Review …
|
The Quill |
2023-06-09 |
239 |
--
|
|
Building Production-Ready RAG Apps
|
Team Hasura |
2023-10-18 |
1,143 |
--
|
|
Launching Prompt Engineering Studio
|
Vrushank Vyas |
2025-03-17 |
840 |
--
|
|
How does an AI gateway improve building AI apps
|
Drishti Shah |
2025-12-16 |
1,527 |
--
|
|
Portkey Named a Cool Vendor in the 2025 Gartner® Cool Vendors™ in …
|
Drishti Shah |
2025-10-29 |
1,051 |
--
|
|
April Cool: 154 PR Merges
|
Rohit Agarwal |
2024-04-30 |
410 |
--
|
|
Portkey is Joining Hacktoberfest
|
Ashirwad Karande |
2024-10-09 |
339 |
--
|
|
Prompt Injection Attacks in LLMs: What Are They and How to Prevent …
|
Sabrina Shoshani |
2024-12-10 |
3,011 |
--
|
|
Generative Agents: Interactive Simulacra of Human Behavior - Summary
|
The Quill |
2023-04-16 |
283 |
--
|
|
⭐ The Developer’s Guide to OpenTelemetry: A Real-Time Journey into Observability
|
Kavya MD |
2024-10-15 |
1,124 |
--
|
|
Tracking LLM token usage across providers, teams and workloads
|
Drishti Shah |
2025-12-04 |
1,437 |
--
|
|
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, …
|
The Quill |
2024-12-26 |
471 |
--
|
|
Unpacking Semantic Caching at Walmart
|
Vrushank Vyas |
2024-02-05 |
576 |
--
|
|
Architecting for Trust: A Strategic Perspective on the MCP Registry for the …
|
Rohit Agarwal |
2025-09-09 |
793 |
--
|
|
Portkey on the AWS Marketplace
|
Siddharth Sambharia |
2025-03-16 |
572 |
--
|
|
Scaling Transformer to 1M tokens and beyond with RMT - Summary
|
The Quill |
2023-05-07 |
233 |
--
|
|
GPT Understands, Too - Summary
|
Rohit Agarwal |
2023-04-15 |
244 |
--
|
|
MiLMo:Minority Multilingual Pre-trained Language Model - Summary
|
The Quill |
2023-04-20 |
200 |
--
|
|
⭐️ Analyze your LLM calls - 2.0
|
Rohit Agarwal |
2023-08-07 |
621 |
--
|
|
Everything We Know About Claude Code Limits
|
Rohit Agarwal |
2025-07-29 |
736 |
--
|
|
Buyer’s guide to LLM observability tools 2026
|
Drishti Shah |
2025-11-27 |
1,518 |
--
|
|
SLiC-HF: Sequence Likelihood Calibration with Human Feedback - Summary
|
The Quill |
2023-05-21 |
222 |
--
|
|
Deep Dive: OpenAI's o1 - The Dawn of Deliberate AI
|
Rohit Agarwal |
2024-12-08 |
1,268 |
--
|
|
Segment Everything Everywhere All at Once - Summary
|
The Quill |
2023-04-16 |
201 |
--
|
|
AI cost observability: A practical guide to understanding and managing LLM spend
|
Drishti Shah |
2025-11-21 |
1,861 |
--
|
|
⭐ Building Reliable LLM Apps: 5 Things To Know
|
Rohit Agarwal |
2023-08-01 |
1,159 |
--
|
|
Supercharging Open-source LLMs: Your Gateway to 250+ Models
|
Vrushank Vyas |
2024-08-05 |
890 |
--
|
|
Attention Isn’t All You Need
|
Siddharth Sambharia |
2024-08-22 |
838 |
--
|
|
Generative Agents: Interactive Simulacra of Human Behavior - Summary
|
The Quill |
2023-04-16 |
197 |
--
|
|
Mixtral of Experts - Summary
|
The Quill |
2024-01-09 |
343 |
--
|
|
Portkey x Pillar - Enterprise-grade Security for LLMs in Production
|
Vrushank Vyas |
2024-08-15 |
485 |
--
|
|
Post Processing Recommender Systems with Knowledge Graphs for Recency, Popularity, and Diversity …
|
The Quill |
2023-06-02 |
221 |
--
|
|
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding - Summary
|
The Quill |
2023-08-21 |
174 |
--
|
|
Expanding the AI Gateway with Google Vertex AI Integration
|
The Quill |
2024-04-29 |
394 |
--
|
|
Anyscale's OSS Models + Portkey's Ops Stack
|
Rohit Agarwal |
2023-12-12 |
372 |
--
|
|
OpenAI - Fine-tune GPT-4o with images and text
|
Kavya MD |
2024-10-20 |
1,044 |
--
|
|
Evaluating Long-Context LLMs
|
The Quill |
2025-02-14 |
371 |
--
|
|
How to differentiate your AI product - Jasper style!
|
The Quill |
2023-09-20 |
439 |
--
|
|
August at Portkey: 2 BILLION Requests, Guardrails, Tracing, and More
|
Vrushank Vyas |
2024-09-05 |
559 |
--
|
|
Discovering Language Model Behaviors with Model-Written Evaluations - Summary
|
The Quill |
2023-05-08 |
415 |
--
|
|
LLM routing techniques for high-volume applications
|
Drishti Shah |
2025-12-05 |
1,453 |
--
|
|
Understanding MCP Authorization
|
Drishti Shah |
2025-12-17 |
1,188 |
--
|
|
What is a virtual MCP server: Need, benefits, use cases
|
Drishti Shah |
2025-12-22 |
634 |
--
|
|
Enterprise MCP access control: managing tools, servers, and agents
|
Drishti Shah |
2025-12-23 |
1,024 |
--
|
|
MCP tool discovery for autonomous LLM agents
|
Drishti Shah |
2025-12-26 |
737 |
--
|
|
OpenCode: token usage, costs, and access control
|
Drishti Shah |
2025-12-30 |
870 |
--
|
|
LLM hallucinations in production
|
Danna Wermus |
2026-01-06 |
1,147 |
--
|
|
How Fontys ICT built an institutional AI platform with a gateway architecture
|
Koen Suilen |
2026-01-10 |
1,751 |
--
|
|
We Tracked $93M in LLM Spends Last Year. Now the Data is …
|
Vrushank Vyas |
2026-01-12 |
717 |
--
|
|
The State of AI FinOps 2025: Key Insights from FinOps Foundation's Latest …
|
Vrushank Vyas |
2025-02-20 |
1,378 |
--
|
|
Benchmarking the new moderation model from OpenAI
|
Rohit Agarwal |
2024-09-27 |
1,678 |
--
|
|
Beyond Implementation: Why Audit Logs are Critical for Enterprise AI Governance
|
Vrushank Vyas |
2025-01-28 |
378 |
--
|
|
Dive into what is LLMOps
|
Vrushank Vyas |
2023-07-01 |
6,444 |
--
|
|
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models - Summary
|
The Quill |
2023-10-14 |
230 |
--
|
|
Open Sourcing Guardrails on the Gateway Framework
|
Rohit Agarwal |
2024-08-14 |
542 |
--
|
|
Multi-LLM Text Summarization
|
The Quill |
2024-12-26 |
396 |
--
|
|
MCP primitives: the mental model behind the protocol
|
Drishti Shah |
2025-12-15 |
952 |
--
|
|
Beyond the Hype: The Enterprise AI Blueprint You Need Now (And Why …
|
Rohit Agarwal |
2025-04-08 |
1,309 |
--
|
|
Transforming E-Commerce Search with Semantic Cache: Insights from Walmart's Journey
|
Rohit Agarwal |
2024-02-09 |
861 |
--
|
|
OpenAI's New Agent Tools: Navigating Strategic Implications for Enterprise AI
|
Vrushank Vyas |
2025-03-12 |
1,742 |
--
|
|
Jamming on Event-Driven Architecture and MCP for Multi-Agentic Systems
|
Siddharth Sambharia |
2025-01-13 |
530 |
--
|
|
Introducing the MCP Gateway
|
Rohit Agarwal |
2026-01-21 |
1,055 |
--
|