Portkey Blog - Plushcap

Blog URL

portkey.ai/blog

Posts YTD

44 ↑ vs 9 last year

Avg Posts/Month

2.8 since 2023

Monthly Post Volume

Post Details

Title	Author	Published	Words	HN Pts
LLMs in Prod Comes to Bangalore	Vrushank Vyas	2024-07-29	1,045	--
AI audit checklist for internal AI platforms & enablement teams	Drishti Shah	2025-12-10	2,089	--
May: Major Updates	Vrushank Vyas	2024-05-31	355	--
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace - …	The Quill	2023-04-20	197	--
Agent observability: measuring tools, plans, and outcomes	Drishti Shah	2025-11-28	1,619	--
My Journey with AI-Driven Development: From Curiosity to Necessity	Ayush	2023-08-18	602	--
We're Afraid Language Models Aren't Modeling Ambiguity - Summary	The Quill	2023-05-07	225	--
Language Models are Few-Shot Learners - Summary	Rohit Agarwal	2023-04-15	230	--
What It Means To Go To Prod	Rohit Agarwal	2024-04-13	312	--
Instruction Tuning with GPT-4 - Summary	The Quill	2023-04-16	241	--
CAMEL: Communicative Agents for "Mind" Exploration of LLMs - Summary	Rohit Agarwal	2023-04-14	352	--
LLMs in Prod: The Reality of AI Outages, No LLM is Immune	Siddharth Sambharia	2024-12-14	504	--
Gemini 3.0 vs GPT-5.1: a clear comparison for builders	Drishti Shah	2025-11-19	921	--
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller …	The Quill	2023-05-07	237	--
Towards Reasoning in Large Language Models: A Survey - Summary	The Quill	2023-06-09	248	--
A Survey of Large Language Models - Summary	The Quill	2023-04-16	274	--
Training language models to follow instructions with human feedback - Summary	Rohit Agarwal	2023-04-15	249	--
Attention Is All You Need - Summary	Rohit Agarwal	2023-04-14	248	--
Expressive Text-to-Image Generation with Rich Text - Summary	Rohit Agarwal	2023-04-15	264	--
The Confidence Checklist for LLMs in Production	Vrushank Vyas	2023-07-01	51	--
Eight Things to Know about Large Language Models - Summary	The Quill	2023-04-16	132	--
LLMs In Prod: Day 1	Siddharth Sambharia	2024-12-13	505	--
Fortifying Your AI Stack: Palo Alto Networks Prisma AIRS Now on Portkey	Jason Roberts	2025-08-13	547	--
AI tool sprawl: causes, risks, and how teams can regain control	Drishti Shah	2025-11-20	1,327	--
June at Portkey: Agents, AI Governance, and Twenty Six Cookbooks	Vrushank Vyas	2024-07-08	405	--
LLM access control in multi-provider environments	Drishti Shah	2025-12-03	1,531	--
Portkey in September	Vrushank Vyas	2024-10-05	922	--
GPT-4 is Getting Faster 🐇	Vrushank Vyas	2023-10-16	257	--
How Snorkel evaluates and trains top AI models	Shae Selix	2025-11-04	2,388	--
LoRA: Low-Rank Adaptation of Large Language Models - Summary	Rohit Agarwal	2023-04-15	271	--
Open WebUI vs LibreChat: Choose the Right ChatGPT UI for Your Organization	Vrushank Vyas	2025-02-19	1,154	--
Elevate Your ToolJet Experience with Portkey AI	Kavya MD	2024-10-29	853	--
OpenAI DevDay's Implications for LLM Apps in Prod	Rohit Agarwal	2023-11-07	1,041	--
Are We Really Making Much Progress in Text Classification? A Comparative Review …	The Quill	2023-06-09	239	--
Building Production-Ready RAG Apps	Team Hasura	2023-10-18	1,143	--
Launching Prompt Engineering Studio	Vrushank Vyas	2025-03-17	840	--
How does an AI gateway improve building AI apps	Drishti Shah	2025-12-16	1,527	--
Portkey Named a Cool Vendor in the 2025 Gartner® Cool Vendors™ in …	Drishti Shah	2025-10-29	1,051	--
April Cool: 154 PR Merges	Rohit Agarwal	2024-04-30	410	--
Portkey is Joining Hacktoberfest	Ashirwad Karande	2024-10-09	339	--
Prompt Injection Attacks in LLMs: What Are They and How to Prevent …	Sabrina Shoshani	2024-12-10	3,011	--
Generative Agents: Interactive Simulacra of Human Behavior - Summary	The Quill	2023-04-16	283	--
⭐ The Developer’s Guide to OpenTelemetry: A Real-Time Journey into Observability	Kavya MD	2024-10-15	1,124	--
Tracking LLM token usage across providers, teams and workloads	Drishti Shah	2025-12-04	1,437	--
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, …	The Quill	2024-12-26	471	--
Unpacking Semantic Caching at Walmart	Vrushank Vyas	2024-02-05	576	--
Architecting for Trust: A Strategic Perspective on the MCP Registry for the …	Rohit Agarwal	2025-09-09	793	--
Portkey on the AWS Marketplace	Siddharth Sambharia	2025-03-16	572	--
Scaling Transformer to 1M tokens and beyond with RMT - Summary	The Quill	2023-05-07	233	--
GPT Understands, Too - Summary	Rohit Agarwal	2023-04-15	244	--
MiLMo:Minority Multilingual Pre-trained Language Model - Summary	The Quill	2023-04-20	200	--
⭐️ Analyze your LLM calls - 2.0	Rohit Agarwal	2023-08-07	621	--
Everything We Know About Claude Code Limits	Rohit Agarwal	2025-07-29	736	--
Buyer’s guide to LLM observability tools 2026	Drishti Shah	2025-11-27	1,518	--
SLiC-HF: Sequence Likelihood Calibration with Human Feedback - Summary	The Quill	2023-05-21	222	--
Deep Dive: OpenAI's o1 - The Dawn of Deliberate AI	Rohit Agarwal	2024-12-08	1,268	--
Segment Everything Everywhere All at Once - Summary	The Quill	2023-04-16	201	--
AI cost observability: A practical guide to understanding and managing LLM spend	Drishti Shah	2025-11-21	1,861	--
⭐ Building Reliable LLM Apps: 5 Things To Know	Rohit Agarwal	2023-08-01	1,159	--
Supercharging Open-source LLMs: Your Gateway to 250+ Models	Vrushank Vyas	2024-08-05	890	--
Attention Isn’t All You Need	Siddharth Sambharia	2024-08-22	838	--
Generative Agents: Interactive Simulacra of Human Behavior - Summary	The Quill	2023-04-16	197	--
Mixtral of Experts - Summary	The Quill	2024-01-09	343	--
Portkey x Pillar - Enterprise-grade Security for LLMs in Production	Vrushank Vyas	2024-08-15	485	--
Post Processing Recommender Systems with Knowledge Graphs for Recency, Popularity, and Diversity …	The Quill	2023-06-02	221	--
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding - Summary	The Quill	2023-08-21	174	--
Expanding the AI Gateway with Google Vertex AI Integration	The Quill	2024-04-29	394	--
Anyscale's OSS Models + Portkey's Ops Stack	Rohit Agarwal	2023-12-12	372	--
OpenAI - Fine-tune GPT-4o with images and text	Kavya MD	2024-10-20	1,044	--
Evaluating Long-Context LLMs	The Quill	2025-02-14	371	--
How to differentiate your AI product - Jasper style!	The Quill	2023-09-20	439	--
August at Portkey: 2 BILLION Requests, Guardrails, Tracing, and More	Vrushank Vyas	2024-09-05	559	--
Discovering Language Model Behaviors with Model-Written Evaluations - Summary	The Quill	2023-05-08	415	--
LLM routing techniques for high-volume applications	Drishti Shah	2025-12-05	1,453	--
Understanding MCP Authorization	Drishti Shah	2025-12-17	1,188	--
What is a virtual MCP server: Need, benefits, use cases	Drishti Shah	2025-12-22	634	--
Enterprise MCP access control: managing tools, servers, and agents	Drishti Shah	2025-12-23	1,024	--
MCP tool discovery for autonomous LLM agents	Drishti Shah	2025-12-26	737	--
OpenCode: token usage, costs, and access control	Drishti Shah	2025-12-30	870	--
LLM hallucinations in production	Danna Wermus	2026-01-06	1,147	--
How Fontys ICT built an institutional AI platform with a gateway architecture	Koen Suilen	2026-01-10	1,751	--
We Tracked $93M in LLM Spends Last Year. Now the Data is …	Vrushank Vyas	2026-01-12	717	--
The State of AI FinOps 2025: Key Insights from FinOps Foundation's Latest …	Vrushank Vyas	2025-02-20	1,378	--
Benchmarking the new moderation model from OpenAI	Rohit Agarwal	2024-09-27	1,678	--
Beyond Implementation: Why Audit Logs are Critical for Enterprise AI Governance	Vrushank Vyas	2025-01-28	378	--
Dive into what is LLMOps	Vrushank Vyas	2023-07-01	6,444	--
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models - Summary	The Quill	2023-10-14	230	--
Open Sourcing Guardrails on the Gateway Framework	Rohit Agarwal	2024-08-14	542	--
Multi-LLM Text Summarization	The Quill	2024-12-26	396	--
MCP primitives: the mental model behind the protocol	Drishti Shah	2025-12-15	952	--
Beyond the Hype: The Enterprise AI Blueprint You Need Now (And Why …	Rohit Agarwal	2025-04-08	1,309	--
Transforming E-Commerce Search with Semantic Cache: Insights from Walmart's Journey	Rohit Agarwal	2024-02-09	861	--
OpenAI's New Agent Tools: Navigating Strategic Implications for Enterprise AI	Vrushank Vyas	2025-03-12	1,742	--
Jamming on Event-Driven Architecture and MCP for Multi-Agentic Systems	Siddharth Sambharia	2025-01-13	530	--
Introducing the MCP Gateway	Rohit Agarwal	2026-01-21	1,055	--
Securing the MCP Gateway: Lasso Partners with Portkey to Deliver Enterprise-Grade Agentic …	Eliran Suisa	2026-02-16	908	--
Portkey Raises $15M Series A to Scale the Unified Control Plane for …	Rohit Agarwal	2026-02-19	698	--
Open AI Responses API vs. Chat Completions vs. Anthropic Messages API	Drishti Shah	2026-02-23	1,377	--
The best approach to compare LLM outputs	Drishti Shah	2026-02-24	1,143	--
How to host an AI Hackathon without losing control of your keys …	Drishti Shah	2026-02-25	1,326	--
LLM Deployment Pipeline Explained Step by Step	Rebecca McCandler	2026-02-27	1,845	--
Claude Code agents: what they are, how they work, and how to …	Drishti Shah	2026-03-09	1,096	--
Claude Code best practices for enterprise teams	Drishti Shah	2026-03-10	1,336	--
MCP vs RAG Compared for Production Teams	Drishti Shah	2026-03-12	1,686	--
What Makes Enterprise LLMs Different from General-Purpose AI Tools	Rebecca McCandler	2026-03-13	1,699	--
1 Trillion Tokens and the Death of the Chatbot	Rohit Agarwal	2026-03-19	1,194	--
MCP vs Function Calling – How They Actually Work Together	Drishti Shah	2026-03-20	1,979	--
GPT-5.4 vs Claude Opus 4.6: a guide to choosing the right model	Drishti Shah	2026-03-23	1,777	--
The Gateway Grew Up	Swetha Sridhar	2026-03-24	600	--
Enterprise AI Architecture From Pilot to Production	Vrushank Vyas	2026-03-23	2,900	--
What is AI lifecycle management?	Drishti Shah	2026-03-24	1,138	--
Akto Partners with Portkey to Bring Guardrails to AI Gateway	Drishti Shah	2026-03-31	891	--
What is autoinstrumentation?	Drishti Shah	2026-04-01	1,061	--
Stop hardcoding API keys in your AI apps	Drishti Shah	2026-04-02	980	--
Rate limiting for LLM applications: Why it matters and how to implement …	Drishti Shah	2026-04-06	1,375	--
Tool Provisioning in MCP Servers: Controlling AI Agent Access in Production	Siddharth Sambharia	2026-04-07	1,633	--
Cursor best practices for enterprise teams	Drishti Shah	2026-04-09	1,515	--
The Harness Tax: The Dead Weight Inside Your Coding Agent	Siddharth Sambharia	2026-04-13	861	--
LLM pricing is 100x harder than you think	Siddharth Sambharia	2026-04-15	1,395	--
Moving Fast Has a Security Bill and It Just Came Due	Rohit Agarwal	2026-04-16	1,749	--
What is AIOps?	Drishti Shah	2026-04-16	1,175	--
Conductor × Portkey is now live	Swetha Sridhar	2026-04-17	452	--
How to choose the right AIOps platform	Drishti Shah	2026-04-17	868	--
Semantic caching thresholds and why they matter	Swetha Sridhar	2026-04-18	2,267	--
AI Agent governance	Drishti Shah	2026-04-19	1,058	--
OpenAI Codex best practices	Drishti Shah	2026-04-20	1,058	--
n8n Best Practices	Drishti Shah	2026-04-21	1,427	--
Introducing the Agent Gateway	Rohit Agarwal	2026-04-21	440	--
Your First AI Agent Will Go Fine. Your Fiftieth Is Where Things …	Swetha Sridhar	2026-04-22	1,405	--
Who owns Claude Code at your company? A platform team's guide to …	Siddharth Sambharia	2026-04-24	1,234	--
Introducing Skills Registry	Siddharth Sambharia	2026-04-23	503	--
GitHub Copilot best practices for teams	Drishti Shah	2026-04-27	1,035	--
What is AgentOps?	Drishti Shah	2026-04-29	1,208	--
What’s an agent gateway?	Drishti Shah	2026-05-03	1,266	--
What MCP Governance Actually Means in Production	Swetha Sridhar	2026-05-24	1,821	--

Plushcap, by Matt Makai. 2021-2026.