
Together AI on HN

34 posts with 1+ points since 2022

Hacker News Posts
| Title | Points | Comments | Date |
|---|---|---|---|
| FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-Precision | 287 | -- | 2024-07-11 |
| RedPajama v2 Open Dataset with 30T Tokens for Training LLMs | 236 | -- | 2023-10-30 |
| Paving the way to efficient architectures: StripedHyena-7B | 221 | -- | 2023-12-08 |
| AdapTive-LeArning Speculator System (ATLAS): Faster LLM inference | 198 | -- | 2025-10-12 |
| Based: Simple linear attention language models | 165 | -- | 2024-03-05 |
| Dragonfly: A large vision-language model with multi-resolution zoom | 143 | -- | 2024-06-06 |
| Llama 32K Context Released by Together AI | 84 | -- | 2023-07-29 |
| A practitioner's guide to testing and running GPU clusters | 80 | -- | 2024-08-13 |
| Together AI raises a $102.5M Series A | 70 | -- | 2023-11-29 |
| Llama 2 on togetherAI is as bad of a privacy nightmare as … | 54 | -- | 2023-09-08 |
| Direct Preference Optimization vs. RLHF | 37 | -- | 2025-05-25 |
| DeepCoder: An Open-Source 14B Coder at O3-Mini Level | 31 | -- | 2025-04-09 |
| The Mamba in the Llama: Distilling and Accelerating Hybrid Models | 4 | -- | 2024-09-09 |
| LlamaTutor | 4 | -- | 2024-07-24 |
| Fine-tuning Llama-3 to get 90% of GPT-4's performance at a fraction of … | 3 | -- | 2024-07-19 |
| Together Inference Engine 2.0 with new Turbo and Lite endpoints | 3 | -- | 2024-07-18 |
| Fine-Tuning LLMs for Multi-Turn Conversations: A Technical Deep Dive | 3 | -- | 2024-11-27 |
| Together AI acquires CodeSandbox to launch code interpreter for generative AI | 3 | -- | 2024-12-12 |
| Speculative decoding for high-throughput long-context inference | 2 | -- | 2024-09-05 |
| Together MoA–collective intelligence of open-source models pushing LLM frontier | 2 | -- | 2024-06-15 |
| Evo: Long-context modeling from molecular to genome scale | 2 | -- | 2024-02-27 |
| Together Inference Engine – the fastest inference available | 2 | -- | 2023-12-12 |
| Generate react apps with Llama 3.1 | 2 | -- | 2024-08-02 |
| Llama-2-7B-32K-Instruct – and fine-tuning for Llama-2 models with Together API | 2 | -- | 2023-08-22 |
| FlashAttention-2: Faster attention with better parallelism and work partitioning | 2 | -- | 2023-07-17 |
| Together API hosts open source models | 2 | -- | 2023-07-14 |
| Flux API available on Together AI:FLUX1.1 [pro] and free access FLUX.1 [schnell] | 1 | -- | 2024-10-03 |
| Together AI embeddings endpoint with higher quality, 4x lower cost than OpenAI | 1 | -- | 2024-01-11 |
| Linearizing LLMs with LoLCATs | 1 | -- | 2024-10-15 |
| Free Llama 3.2 vision API | 1 | -- | 2024-09-25 |
| New SOTA Reranker from Salesforce | 1 | -- | 2024-09-10 |
| RedPajama-Data-v2: An open dataset with 30T tokens (2023) | 1 | -- | 2024-04-22 |
| Together Code Sandbox | 1 | -- | 2025-05-20 |
| The Frontier Is Open | 1 | -- | 2025-06-09 |