62 blog posts published by month since the start of 2024.

Posts year-to-date: 6 (8 posts by this month last year)
Average posts per month since 2024: 2.6

Post details (2024 to today)

| Title | Author | Date | Word count | HN points |
| --- | --- | --- | --- | --- |
| What is Flux Dev? | Kenny Ning | Oct 17, 2024 | 469 | - |
| Top open-source text-to-speech libraries in 2025 | Yiren Lu | Mar 10, 2025 | 876 | - |
| How Modal speeds up container launches in the cloud | Yiren Lu | Aug 16, 2024 | 1082 | - |
| Top embedding models for RAG | Yiren Lu | Oct 30, 2024 | 557 | - |
| A1111 vs ComfyUI | Kenny Ning | Aug 23, 2024 | 640 | - |
| RabbitMQ vs. Kafka: choosing the right messaging system | Yiren Lu | Sep 25, 2024 | 920 | - |
| How Contextual AI automated CI with Modal GPUs | - | Sep 18, 2024 | 740 | - |
| Google Cloud Run vs. Cloud Run Functions: understanding Google's serverless offerings | Yiren Lu | Sep 25, 2024 | 757 | - |
| How OpenArt scaled their Gen AI art platform on hundreds of GPUs | - | Nov 20, 2024 | 620 | - |
| A10 vs. A100 vs. H100 - Which one should you choose? | Yiren Lu | Jan 27, 2025 | 844 | - |
| Why Substack moved their AI and ML pipelines to Modal | - | May 20, 2024 | 453 | - |
| Best open-source LLMs in 2025 | Yiren Lu | Mar 10, 2025 | 1344 | - |
| Fast, lazy container loading in Modal.com | Jonathan Belotti | Sep 08, 2024 | 2582 | - |
| How Suno shaved 4 months off their launch timeline with Modal | - | Feb 21, 2024 | 509 | - |
| Open-source AI agents | Kenny Ning | Sep 23, 2024 | 692 | - |
| Build interactive workflows using Kestra and Modal | Anna Geller | Oct 15, 2024 | 1883 | - |
| Introducing: Region selection | - | May 13, 2024 | 481 | - |
| Inside the Modal Code Playground | Rachel Park | Aug 16, 2024 | 672 | - |
| How Ramp automated receipt processing with fine-tuned LLMs | - | Mar 26, 2024 | 517 | 2 |
| Google Cloud Run functions pricing: understanding costs and optimization | Yiren Lu | Sep 25, 2024 | 742 | - |
| Batch processing vs. stream processing by example | Yiren Lu | Sep 04, 2024 | 615 | - |
| Dogfooding Modal: What we learned at our internal hackathon | - | Dec 09, 2024 | 611 | - |
| WireGuard at Modal: Static IPs for Serverless Containers | Eric Zhang | Dec 02, 2024 | 3035 | 125 |
| How to get GPUs with a Jupyter notebook on Modal | Yiren Lu | Sep 15, 2024 | 261 | - |
| Introducing: L40S GPUs on Modal | - | Dec 19, 2024 | 466 | - |
| Beating Proprietary Models with a Quick Fine-Tune | Jason Liu | Apr 26, 2024 | 2384 | 6 |
| Stable Diffusion 3.5 vs. Flux | Yiren Lu | Nov 02, 2024 | 643 | - |
| How to deploy code in AWS Lambda: the easy way for beginners | Yiren Lu | Sep 14, 2024 | 752 | - |
| How to deploy a Gradio app | Yiren Lu | Sep 15, 2024 | 632 | - |
| Run GPU jobs from Airflow with Modal | Kenny Ning | Jun 20, 2024 | 1664 | 2 |
| Best practices for serverless inference | Yiren Lu | Sep 25, 2024 | 636 | - |
| How to run Ollama | Yiren Lu | Sep 15, 2024 | 537 | - |
| How to run XTTS | Yiren Lu | Sep 15, 2024 | 460 | - |
| How to run Llama 3.1 as an API | Kenny Ning | Sep 18, 2024 | 396 | - |
| What is Flash Attention? | Yiren Lu | Oct 16, 2024 | 627 | - |
| Glossary: LLM fine-tuning hyperparameters | Yiren Lu | Oct 15, 2024 | 681 | - |
| All the open-source Whisper variations | Yiren Lu | Aug 15, 2024 | 703 | - |
| How much VRAM do I need for LLM model fine-tuning? | Yiren Lu | Sep 01, 2024 | 393 | - |
| vLLM vs. TGI | Yiren Lu | Oct 15, 2024 | 541 | - |
| ChatTTS: Running an open source text-to-speech model | Yiren Lu | Sep 15, 2024 | 498 | - |
| Llama3-405B: How to run an extra large open source LLM on Modal | Yiren Lu | Sep 15, 2024 | 515 | - |
| Upload files to S3 with AWS Lambda and AWS API Gateway in TypeScript: A Step-by-Step Guide | Yiren Lu | Sep 04, 2024 | 640 | - |
| How a top tier European soccer team sped up their data processing and reduced costs by 50% | - | Dec 04, 2024 | 525 | - |
| Fine-tuning vs. RAG | Yiren Lu | Oct 15, 2024 | 1523 | - |
| How much VRAM do I need for LLM inference? | Yiren Lu | Sep 01, 2024 | 261 | - |
| Top ComfyUI custom node packs | Kenny Ning | Nov 12, 2024 | 874 | - |
| Embedding English Wikipedia in under 15 minutes | Jason Liu | Jan 23, 2024 | 2433 | 7 |
| Top embedding models on the MTEB leaderboard | Yiren Lu | Jan 27, 2025 | 701 | - |
| Top 5 serverless GPU providers | Yiren Lu | Sep 27, 2024 | 857 | - |
| How much is an Nvidia H100? | Yiren Lu | Aug 15, 2024 | 531 | - |
| AWS Lambda vs. Google Cloud functions: a comprehensive comparison | Yiren Lu | Sep 25, 2024 | 853 | - |
| How much is an Nvidia A100? | - | Oct 31, 2024 | 794 | - |
| Top open-source text-to-video AI models | Yiren Lu | Oct 30, 2024 | 563 | - |
| Best frameworks for fine-tuning LLMs in 2025 | Yiren Lu | Jan 27, 2025 | 614 | - |
| Create an infinite icon library by fine-tuning Stable Diffusion | Yiren Lu | May 21, 2024 | 2435 | - |
| How to run cron jobs | Kenny Ning | Apr 30, 2024 | 681 | - |
| Create a custom video generator by fine-tuning a Mochi LoRA on Modal | - | Nov 26, 2024 | 640 | - |
| Building a cost-effective analytics stack with Modal, dlt, and dbt | Kenny Ning | Sep 10, 2024 | 2487 | - |
| Modal is SOC 2 Type II Compliant | - | Jan 02, 2025 | 216 | - |
| Top image segmentation models | Yiren Lu | Oct 30, 2024 | 648 | - |
| Dagster vs. Airflow: a comprehensive comparison | Yiren Lu | Sep 25, 2024 | 767 | - |
| LoRA vs. QLoRA: Efficient fine-tuning techniques for LLMs | Yiren Lu | Aug 22, 2024 | 757 | - |