32 Hacker News submissions by month with at least  points since the start of

32 submissions with 1 points or greater

HN Points HN Title (Links to original post) Submitted Date
308 Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models 2023-08-11
143 Llama 2 is about as factually accurate as GPT-4 for summaries and is 30X cheaper 2023-08-29
110 Continuous batching to increase LLM inference throughput and reduce p50 latency 2023-08-15
95 Numbers every LLM Developer should know 2023-08-12
78 ThirdAI Uses Ray for Parallel Training of Billion-Parameter NN on Commodity CPUs 2023-08-30
36 Ray breaks the $1/TB barrier as the world’s most cost-efficient sorting system 2023-01-24
24 Anyscale's Aviary: Open-Source Multi-LLM Serving 2023-05-31
22 Fine-Tuning LLMs: LoRA or Full-Parameter? An In-Depth Analysis with Llama 2 2023-09-06
14 Anyscale's Aviary is a dashboard for evaluating Open Source LLMs 2023-05-31
11 Production Guide for Building Rag-Based LLM Applications 2023-09-13
7 ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data 2023-08-15
5 Loading LLM (Llama-2 70B) 20x faster with Anyscale Endpoints 2023-10-13
5 Lessons from training a Stable Diffusion model on 2B images 2024-05-11
4 Scaling data loading for ML training with Ray Data 2023-09-15
4 Cloud Infrastructure for LLM and Generative AI Applications 2023-09-14
3 Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning 2023-10-24
3 How to build a LLM search engine using a self-hosted LLM 2023-04-21
2 Anyscale Appoints Keerti Melkote as CEO 2024-07-31
2 Canva Built a Modern AI Platform Using Anyscale 2024-04-03
2 Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs 2023-12-21
2 Anyscale Endpoints: JSON Mode and Function Calling Features 2023-12-14
2 Reproducible Performance Metrics for LLM Inference 2023-11-02
2 Fine Tuning is for form not facts 2023-08-27
1 Direct Preference Optimization with Synthetic Data on Anyscale 2024-08-21
1 Building an LLM Router for High-Quality and Cost-Effective Responses 2024-07-02
1 End-to-End LLM Workflows Guide 2024-06-18
1 Fine-tuning LLMs for longer context and better RAG systems 2024-02-13
1 RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone 2024-01-16
1 LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality 2023-11-10
1 Anyscale Endpoints: LLM inference and fine-tuning 2023-10-25
1 Ray solves common production challenges for generative AI infrastructure 2023-03-28
1 Uv and Ray: Pain-Free Python Dependencies in Clusters 2025-02-28