Anyscale Hacker News

Posts by Month (51 total)

Hacker News Posts

Title	Points	Comments	Date
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models	308	--	2023-08-11
Llama 2 is about as factually accurate as GPT-4 for summaries and …	143	--	2023-08-29
Continuous batching to increase LLM inference throughput and reduce p50 latency	110	--	2023-08-15
Numbers every LLM Developer should know	95	--	2023-08-12
ThirdAI Uses Ray for Parallel Training of Billion-Parameter NN on Commodity CPUs	78	--	2023-08-30
Ray breaks the $1/TB barrier as the world’s most cost-efficient sorting system	36	--	2023-01-24
Anyscale's Aviary: Open-Source Multi-LLM Serving	24	--	2023-05-31
Fine-Tuning LLMs: LoRA or Full-Parameter? An In-Depth Analysis with Llama 2	22	--	2023-09-06
Anyscale's Aviary is a dashboard for evaluating Open Source LLMs	14	--	2023-05-31
Production Guide for Building Rag-Based LLM Applications	11	--	2023-09-13
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data	7	--	2023-08-15
Loading LLM (Llama-2 70B) 20x faster with Anyscale Endpoints	5	--	2023-10-13
Lessons from training a Stable Diffusion model on 2B images	5	--	2024-05-11
Scaling data loading for ML training with Ray Data	4	--	2023-09-15
Cloud Infrastructure for LLM and Generative AI Applications	4	--	2023-09-14
Model Batch Inference in Ray: Actors, ActorPool, and Datasets	4	--	2022-11-04
Ant Group – scaling to 1.37M QPS on Ray	3	--	2022-12-13
Ant Group Uses Ray to Build a Large-Scale Online Serverless Platform	3	--	2022-12-12
Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning	3	--	2023-10-24
How to build a LLM search engine using a self-hosted LLM	3	--	2023-04-21
An informal introduction to reinforcement learning	3	--	2022-02-23
Joins and Hash-Shuffle in Ray Data	3	--	2025-07-09
Anyscale Appoints Keerti Melkote as CEO	2	--	2024-07-31
Canva Built a Modern AI Platform Using Anyscale	2	--	2024-04-03
Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs	2	--	2023-12-21
Anyscale Endpoints: JSON Mode and Function Calling Features	2	--	2023-12-14
Reproducible Performance Metrics for LLM Inference	2	--	2023-11-02
Fine Tuning is for form not facts	2	--	2023-08-27
Serving PyTorch Models with FastAPI and Ray Serve	2	--	2022-12-17
Ray Datasets for large-scale machine learning ingest and scoring	2	--	2022-02-25
Ray 1.10 Released	2	--	2022-02-25
Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure	2	--	2025-07-01
Native LLM APIs in Ray Data and Ray Serve	2	--	2025-07-10
Massively Parallel Agentic Simulations with Ray	2	--	2025-09-11
Major upgrades to Ray Serve: 88% lower latency and 11.1x higher throughput	2	--	2026-03-26
Direct Preference Optimization with Synthetic Data on Anyscale	1	--	2024-08-21
Building an LLM Router for High-Quality and Cost-Effective Responses	1	--	2024-07-02
End-to-End LLM Workflows Guide	1	--	2024-06-18
Fine-tuning LLMs for longer context and better RAG systems	1	--	2024-02-13
RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone	1	--	2024-01-16
LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality	1	--	2023-11-10
Anyscale Endpoints: LLM inference and fine-tuning	1	--	2023-10-25
Ray solves common production challenges for generative AI infrastructure	1	--	2023-03-28
Training One Million Machine Learning Models in Record Time with Ray	1	--	2022-12-18
Gang Scheduling Ray Clusters on K8s with Multi-Cluster-App-Dispatcher (MCAD)	1	--	2022-11-16
Redis in Ray: Past and Future	1	--	2022-03-18
Ray 1.11 Released	1	--	2022-03-11
Uv and Ray: Pain-Free Python Dependencies in Clusters	1	--	2025-02-28
An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch …	1	--	2025-06-13
Open Source RL Libraries for LLMs	1	--	2025-07-02
Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes	1	--	2025-08-11

Plushcap, by Matt Makai. 2021-2026.