Home
/
Companies
/
Together AI
/
Hacker News
Together AI on HN
34 posts with 1+ points since 2022
Filters
Min points:
1
10
25
50
100
250
500
Since:
2023
2024
2025
2026
Posts by Month (34 total)
Hacker News Posts
Search:
Title
Points
Comments
Date
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-Precision
287
--
2024-07-11
RedPajama v2 Open Dataset with 30T Tokens for Training LLMs
236
--
2023-10-30
Paving the way to efficient architectures: StripedHyena-7B
221
--
2023-12-08
AdapTive-LeArning Speculator System (ATLAS): Faster LLM inference
198
--
2025-10-12
Based: Simple linear attention language models
165
--
2024-03-05
Dragonfly: A large vision-language model with multi-resolution zoom
143
--
2024-06-06
Llama 32K Context Released by Together AI
84
--
2023-07-29
A practitioner's guide to testing and running GPU clusters
80
--
2024-08-13
Together AI raises a $102.5M Series A
70
--
2023-11-29
Llama 2 on togetherAI is as bad of a privacy nightmare as …
54
--
2023-09-08
Direct Preference Optimization vs. RLHF
37
--
2025-05-25
DeepCoder: An Open-Source 14B Coder at O3-Mini Level
31
--
2025-04-09
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
4
--
2024-09-09
LlamaTutor
4
--
2024-07-24
Fine-tuning Llama-3 to get 90% of GPT-4's performance at a fraction of …
3
--
2024-07-19
Together Inference Engine 2.0 with new Turbo and Lite endpoints
3
--
2024-07-18
Fine-Tuning LLMs for Multi-Turn Conversations: A Technical Deep Dive
3
--
2024-11-27
Together AI acquires CodeSandbox to launch code interpreter for generative AI
3
--
2024-12-12
Speculative decoding for high-throughput long-context inference
2
--
2024-09-05
Together MoA–collective intelligence of open-source models pushing LLM frontier
2
--
2024-06-15
Evo: Long-context modeling from molecular to genome scale
2
--
2024-02-27
Together Inference Engine – the fastest inference available
2
--
2023-12-12
Generate react apps with Llama 3.1
2
--
2024-08-02
Llama-2-7B-32K-Instruct – and fine-tuning for Llama-2 models with Together API
2
--
2023-08-22
FlashAttention-2: Faster attention with better parallelism and work partitioning
2
--
2023-07-17
Together API hosts open source models
2
--
2023-07-14
Flux API available on Together AI:FLUX1.1 [pro] and free access FLUX.1 [schnell]
1
--
2024-10-03
Together AI embeddings endpoint with higher quality, 4x lower cost than OpenAI
1
--
2024-01-11
Linearizing LLMs with LoLCATs
1
--
2024-10-15
Free Llama 3.2 vision API
1
--
2024-09-25
New SOTA Reranker from Salesforce
1
--
2024-09-10
RedPajama-Data-v2: An open dataset with 30T tokens (2023)
1
--
2024-04-22
Together Code Sandbox
1
--
2025-05-20
The Frontier Is Open
1
--
2025-06-09