Anyscale Hacker News

Filters

Min points: 1 10 25 50 100 250 500

Year:

Posts by Month (23 total)

Hacker News Posts

Search:

Title	Points	Comments	Date
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models	308	--	2023-08-11
Llama 2 is about as factually accurate as GPT-4 for summaries and …	143	--	2023-08-29
Continuous batching to increase LLM inference throughput and reduce p50 latency	110	--	2023-08-15
Numbers every LLM Developer should know	95	--	2023-08-12
ThirdAI Uses Ray for Parallel Training of Billion-Parameter NN on Commodity CPUs	78	--	2023-08-30
Ray breaks the $1/TB barrier as the world’s most cost-efficient sorting system	36	--	2023-01-24
Anyscale's Aviary: Open-Source Multi-LLM Serving	24	--	2023-05-31
Fine-Tuning LLMs: LoRA or Full-Parameter? An In-Depth Analysis with Llama 2	22	--	2023-09-06
Anyscale's Aviary is a dashboard for evaluating Open Source LLMs	14	--	2023-05-31
Production Guide for Building Rag-Based LLM Applications	11	--	2023-09-13
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data	7	--	2023-08-15
Loading LLM (Llama-2 70B) 20x faster with Anyscale Endpoints	5	--	2023-10-13
Scaling data loading for ML training with Ray Data	4	--	2023-09-15
Cloud Infrastructure for LLM and Generative AI Applications	4	--	2023-09-14
Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning	3	--	2023-10-24
How to build a LLM search engine using a self-hosted LLM	3	--	2023-04-21
Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs	2	--	2023-12-21
Anyscale Endpoints: JSON Mode and Function Calling Features	2	--	2023-12-14
Reproducible Performance Metrics for LLM Inference	2	--	2023-11-02
Fine Tuning is for form not facts	2	--	2023-08-27
LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality	1	--	2023-11-10
Anyscale Endpoints: LLM inference and fine-tuning	1	--	2023-10-25
Ray solves common production challenges for generative AI infrastructure	1	--	2023-03-28

Anyscale on HN