Modal Blog - Plushcap

62 blog posts published by month since the start of 2024. Start from a different year: 2024
2024
2025

Blog URL

Posts year-to-date

6 (9 posts by this month last year.)

Average posts per month since 2024

2.6

Post details (2024 to today)

Title	Author	Date	Word count	HN points
What is Flux Dev?	Kenny Ning	Oct 17, 2024	469	-
Top open-source text-to-speech libraries in 2025	Yiren Lu	Mar 10, 2025	876	-
How Modal speeds up container launches in the cloud	Yiren Lu	Aug 16, 2024	1082	-
Top embedding models for RAG	Yiren Lu	Oct 30, 2024	557	-
A1111 vs ComfyUI	Kenny Ning	Aug 23, 2024	640	-
RabbitMQ vs. Kafka: choosing the right messaging system	Yiren Lu	Sep 25, 2024	920	-
How Contextual AI automated CI with Modal GPUs	-	Sep 18, 2024	740	-
Google Cloud Run vs. Cloud Run Functions: understanding Google's serverless offerings	Yiren Lu	Sep 25, 2024	757	-
How OpenArt scaled their Gen AI art platform on hundreds of GPUs	-	Nov 20, 2024	620	-
A10 vs. A100 vs. H100 - Which one should you choose?	Yiren Lu	Jan 27, 2025	844	-
Why Substack moved their AI and ML pipelines to Modal	-	May 20, 2024	453	-
Best open-source LLMs in 2025	Yiren Lu	Mar 10, 2025	1344	-
Fast, lazy container loading in Modal.com	Jonathan Belotti	Sep 08, 2024	2582	-
How Suno shaved 4 months off their launch timeline with Modal	-	Feb 21, 2024	509	-
Open-source AI agents	Kenny Ning	Sep 23, 2024	692	-
Build interactive workflows using Kestra and Modal	Anna Geller	Oct 15, 2024	1883	-
Introducing: Region selection	-	May 13, 2024	481	-
Inside the Modal Code Playground	Rachel Park	Aug 16, 2024	672	-
How Ramp automated receipt processing with fine-tuned LLMs	-	Mar 26, 2024	517	2
Google Cloud Run functions pricing: understanding costs and optimization	Yiren Lu	Sep 25, 2024	742	-
Batch processing vs. stream processing by example	Yiren Lu	Sep 04, 2024	615	-
Dogfooding Modal: What we learned at our internal hackathon	-	Dec 09, 2024	611	-
WireGuard at Modal: Static IPs for Serverless Containers	Eric Zhang	Dec 02, 2024	3035	125
How to get GPUs with a Jupyter notebook on Modal	Yiren Lu	Sep 15, 2024	261	-
Introducing: L40S GPUs on Modal	-	Dec 19, 2024	466	-
Beating Proprietary Models with a Quick Fine-Tune	Jason Liu	Apr 26, 2024	2384	6
Stable Diffusion 3.5 vs. Flux	Yiren Lu	Nov 02, 2024	643	-
How to deploy code in AWS Lambda: the easy way for beginners	Yiren Lu	Sep 14, 2024	752	-
How to deploy a Gradio app	Yiren Lu	Sep 15, 2024	632	-
Run GPU jobs from Airflow with Modal	Kenny Ning	Jun 20, 2024	1664	2
Best practices for serverless inference	Yiren Lu	Sep 25, 2024	636	-
How to run Ollama	Yiren Lu	Sep 15, 2024	537	-
How to run XTTS	Yiren Lu	Sep 15, 2024	460	-
How to run Llama 3.1 as an API	Kenny Ning	Sep 18, 2024	396	-
What is Flash Attention?	Yiren Lu	Oct 16, 2024	627	-
Glossary: LLM fine-tuning hyperparameters	Yiren Lu	Oct 15, 2024	681	-
All the open-source Whisper variations	Yiren Lu	Aug 15, 2024	703	-
How much VRAM do I need for LLM model fine-tuning?	Yiren Lu	Sep 01, 2024	393	-
vLLM vs. TGI	Yiren Lu	Oct 15, 2024	541	-
ChatTTS: Running an open source text-to-speech model	Yiren Lu	Sep 15, 2024	498	-
Llama3-405B: How to run an extra large open source LLM on Modal	Yiren Lu	Sep 15, 2024	515	-
Upload files to S3 with AWS Lambda and AWS API Gateway in TypeScript: A Step-by-Step Guide	Yiren Lu	Sep 04, 2024	640	-
How a top tier European soccer team sped up their data processing and reduced costs by 50%	-	Dec 04, 2024	525	-
Fine-tuning vs. RAG	Yiren Lu	Oct 15, 2024	1523	-
How much VRAM do I need for LLM inference?	Yiren Lu	Sep 01, 2024	261	-
Top ComfyUI custom node packs	Kenny Ning	Nov 12, 2024	874	-
Embedding English Wikipedia in under 15 minutes	Jason Liu	Jan 23, 2024	2433	7
Top embedding models on the MTEB leaderboard	Yiren Lu	Jan 27, 2025	701	-
Top 5 serverless GPU providers	Yiren Lu	Sep 27, 2024	857	-
How much is an Nvidia H100?	Yiren Lu	Aug 15, 2024	531	-
AWS Lambda vs. Google Cloud functions: a comprehensive comparison	Yiren Lu	Sep 25, 2024	853	-
How much is an Nvidia A100?	-	Oct 31, 2024	794	-
Top open-source text-to-video AI models	Yiren Lu	Oct 30, 2024	563	-
Best frameworks for fine-tuning LLMs in 2025	Yiren Lu	Jan 27, 2025	614	-
Create an infinite icon library by fine-tuning Stable Diffusion	Yiren Lu	May 21, 2024	2435	-
How to run cron jobs	Kenny Ning	Apr 30, 2024	681	-
Create a custom video generator by fine-tuning a Mochi LoRA on Modal	-	Nov 26, 2024	640	-
Building a cost-effective analytics stack with Modal, dlt, and dbt	Kenny Ning	Sep 10, 2024	2487	-
Modal is SOC 2 Type II Compliant	-	Jan 02, 2025	216	-
Top image segmentation models	Yiren Lu	Oct 30, 2024	648	-
Dagster vs. Airflow: a comprehensive comparison	Yiren Lu	Sep 25, 2024	767	-
LoRA vs. QLoRA: Efficient fine-tuning techniques for LLMs	Yiren Lu	Aug 22, 2024	757	-

Modal blog content

62 blog posts published by month since the start of 2024. Start from a different year: 202420242025

Post details (2024 to today)

62 blog posts published by month since the start of 2024. Start from a different year: 2024
2024
2025