Anyscale on HN
23 posts with 1+ points in 2023
Posts by Month (23 total)
Hacker News Posts
Title | Points | Comments | Date
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models | 308 | -- | 2023-08-11
Llama 2 is about as factually accurate as GPT-4 for summaries and … | 143 | -- | 2023-08-29
Continuous batching to increase LLM inference throughput and reduce p50 latency | 110 | -- | 2023-08-15
Numbers every LLM Developer should know | 95 | -- | 2023-08-12
ThirdAI Uses Ray for Parallel Training of Billion-Parameter NN on Commodity CPUs | 78 | -- | 2023-08-30
Ray breaks the $1/TB barrier as the world’s most cost-efficient sorting system | 36 | -- | 2023-01-24
Anyscale's Aviary: Open-Source Multi-LLM Serving | 24 | -- | 2023-05-31
Fine-Tuning LLMs: LoRA or Full-Parameter? An In-Depth Analysis with Llama 2 | 22 | -- | 2023-09-06
Anyscale's Aviary is a dashboard for evaluating Open Source LLMs | 14 | -- | 2023-05-31
Production Guide for Building Rag-Based LLM Applications | 11 | -- | 2023-09-13
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data | 7 | -- | 2023-08-15
Loading LLM (Llama-2 70B) 20x faster with Anyscale Endpoints | 5 | -- | 2023-10-13
Scaling data loading for ML training with Ray Data | 4 | -- | 2023-09-15
Cloud Infrastructure for LLM and Generative AI Applications | 4 | -- | 2023-09-14
Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning | 3 | -- | 2023-10-24
How to build a LLM search engine using a self-hosted LLM | 3 | -- | 2023-04-21
Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs | 2 | -- | 2023-12-21
Anyscale Endpoints: JSON Mode and Function Calling Features | 2 | -- | 2023-12-14
Reproducible Performance Metrics for LLM Inference | 2 | -- | 2023-11-02
Fine Tuning is for form not facts | 2 | -- | 2023-08-27
LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality | 1 | -- | 2023-11-10
Anyscale Endpoints: LLM inference and fine-tuning | 1 | -- | 2023-10-25
Ray solves common production challenges for generative AI infrastructure | 1 | -- | 2023-03-28