Anyscale on HN

50 posts with 1+ points since 2022

Hacker News Posts
Title Points Comments Date
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models 308 -- 2023-08-11
Llama 2 is about as factually accurate as GPT-4 for summaries and … 143 -- 2023-08-29
Continuous batching to increase LLM inference throughput and reduce p50 latency 110 -- 2023-08-15
Numbers every LLM Developer should know 95 -- 2023-08-12
ThirdAI Uses Ray for Parallel Training of Billion-Parameter NN on Commodity CPUs 78 -- 2023-08-30
Ray breaks the $1/TB barrier as the world’s most cost-efficient sorting system 36 -- 2023-01-24
Anyscale's Aviary: Open-Source Multi-LLM Serving 24 -- 2023-05-31
Fine-Tuning LLMs: LoRA or Full-Parameter? An In-Depth Analysis with Llama 2 22 -- 2023-09-06
Anyscale's Aviary is a dashboard for evaluating Open Source LLMs 14 -- 2023-05-31
Production Guide for Building Rag-Based LLM Applications 11 -- 2023-09-13
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data 7 -- 2023-08-15
Loading LLM (Llama-2 70B) 20x faster with Anyscale Endpoints 5 -- 2023-10-13
Lessons from training a Stable Diffusion model on 2B images 5 -- 2024-05-11
Scaling data loading for ML training with Ray Data 4 -- 2023-09-15
Cloud Infrastructure for LLM and Generative AI Applications 4 -- 2023-09-14
Model Batch Inference in Ray: Actors, ActorPool, and Datasets 4 -- 2022-11-04
Ant Group – scaling to 1.37M QPS on Ray 3 -- 2022-12-13
Ant Group Uses Ray to Build a Large-Scale Online Serverless Platform 3 -- 2022-12-12
Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning 3 -- 2023-10-24
How to build a LLM search engine using a self-hosted LLM 3 -- 2023-04-21
An informal introduction to reinforcement learning 3 -- 2022-02-23
Joins and Hash-Shuffle in Ray Data 3 -- 2025-07-09
Anyscale Appoints Keerti Melkote as CEO 2 -- 2024-07-31
Canva Built a Modern AI Platform Using Anyscale 2 -- 2024-04-03
Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs 2 -- 2023-12-21
Anyscale Endpoints: JSON Mode and Function Calling Features 2 -- 2023-12-14
Reproducible Performance Metrics for LLM Inference 2 -- 2023-11-02
Fine Tuning is for form not facts 2 -- 2023-08-27
Serving PyTorch Models with FastAPI and Ray Serve 2 -- 2022-12-17
Ray Datasets for large-scale machine learning ingest and scoring 2 -- 2022-02-25
Ray 1.10 Released 2 -- 2022-02-25
Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure 2 -- 2025-07-01
Native LLM APIs in Ray Data and Ray Serve 2 -- 2025-07-10
Massively Parallel Agentic Simulations with Ray 2 -- 2025-09-11
Direct Preference Optimization with Synthetic Data on Anyscale 1 -- 2024-08-21
Building an LLM Router for High-Quality and Cost-Effective Responses 1 -- 2024-07-02
End-to-End LLM Workflows Guide 1 -- 2024-06-18
Fine-tuning LLMs for longer context and better RAG systems 1 -- 2024-02-13
RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone 1 -- 2024-01-16
LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality 1 -- 2023-11-10
Anyscale Endpoints: LLM inference and fine-tuning 1 -- 2023-10-25
Ray solves common production challenges for generative AI infrastructure 1 -- 2023-03-28
Training One Million Machine Learning Models in Record Time with Ray 1 -- 2022-12-18
Gang Scheduling Ray Clusters on K8s with Multi-Cluster-App-Dispatcher (MCAD) 1 -- 2022-11-16
Redis in Ray: Past and Future 1 -- 2022-03-18
Ray 1.11 Released 1 -- 2022-03-11
Uv and Ray: Pain-Free Python Dependencies in Clusters 1 -- 2025-02-28
An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch … 1 -- 2025-06-13
Open Source RL Libraries for LLMs 1 -- 2025-07-02
Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes 1 -- 2025-08-11