Together AI Hacker News

Filters

Min points: 1 10 25 50 100 250 500

Year:

Posts by Month (20 total)

Hacker News Posts

Search:

Title	Points	Comments	Date
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-Precision	287	--	2024-07-11
Based: Simple linear attention language models	165	--	2024-03-05
Dragonfly: A large vision-language model with multi-resolution zoom	143	--	2024-06-06
A practitioner's guide to testing and running GPU clusters	80	--	2024-08-13
The Mamba in the Llama: Distilling and Accelerating Hybrid Models	4	--	2024-09-09
LlamaTutor	4	--	2024-07-24
Fine-tuning Llama-3 to get 90% of GPT-4's performance at a fraction of …	3	--	2024-07-19
Together Inference Engine 2.0 with new Turbo and Lite endpoints	3	--	2024-07-18
Fine-Tuning LLMs for Multi-Turn Conversations: A Technical Deep Dive	3	--	2024-11-27
Together AI acquires CodeSandbox to launch code interpreter for generative AI	3	--	2024-12-12
Speculative decoding for high-throughput long-context inference	2	--	2024-09-05
Together MoA–collective intelligence of open-source models pushing LLM frontier	2	--	2024-06-15
Evo: Long-context modeling from molecular to genome scale	2	--	2024-02-27
Generate react apps with Llama 3.1	2	--	2024-08-02
Flux API available on Together AI:FLUX1.1 [pro] and free access FLUX.1 [schnell]	1	--	2024-10-03
Together AI embeddings endpoint with higher quality, 4x lower cost than OpenAI	1	--	2024-01-11
Linearizing LLMs with LoLCATs	1	--	2024-10-15
Free Llama 3.2 vision API	1	--	2024-09-25
New SOTA Reranker from Salesforce	1	--	2024-09-10
RedPajama-Data-v2: An open dataset with 30T tokens (2023)	1	--	2024-04-22

Together AI on HN