HuggingFace Hacker News

Filters

Min points: 1 10 25 50 100 250 500

Year:

Posts by Month (39 total)

Hacker News Posts

Search:

Title	Points	Comments	Date
MonadGPT – What would have happened if ChatGPT was invented in the …	323	--	2023-11-24
LLM in a Flash: Efficient LLM Inference with Limited Memory	252	--	2023-12-20
Falcon 180B	238	--	2023-09-06
OpenLLaMA 13B Released	229	--	2023-06-18
Hugging Face Releases Agents	214	--	2023-05-10
BigCode Project Releases StarCoder: A 15B Code LLM	185	--	2023-05-04
StackLlama: A hands-on guide to train LlaMa with RLHF	165	--	2023-04-06
Mistral-8x7B-Chat	131	--	2023-12-10
Yi-34B-Chat	115	--	2023-11-24
GPT-3.5 and Wolfram Alpha via LangChain	107	--	2023-01-18
The Falcon has landed in the Hugging Face ecosystem	105	--	2023-06-05
Hugging Face and AWS partner to make AI more accessible	102	--	2023-02-21
HuggingFace Training Cluster as a Service	101	--	2023-09-05
Segmind Stable Diffusion – A smaller version of Stable Diffusion XL	95	--	2023-10-25
HuggingChat	93	--	2023-04-25
Yarn-Mistral-7B-128k	88	--	2023-11-11
Sparse LLM Inference on CPU: 75% fewer parameters	78	--	2023-10-19
Switch Transformers C – 2048 experts (1.6T params for 3.1 TB) (2022)	73	--	2023-11-20
Multimodal Neurons in Pretrained Text-Only Transformers	66	--	2023-08-04
HuggingChat – ChatGPT alternative with open source models	61	--	2023-12-15
OpenLLaMA 7B Training Completed to 1T Tokens	58	--	2023-06-07
Phi-2	57	--	2023-12-13
Dolphin-2_6-Phi-2	56	--	2023-12-24
Alibaba releases 72B LLM with 32k context length	55	--	2023-11-30
Open LLAMA 13B released, trained on 1T tokens	47	--	2023-06-19
4-Bit Quantization and QLoRA	41	--	2023-05-25
BLOOMChat, a 176B parameter, Multi-lingual, fine tuned chat	40	--	2023-05-19
What's Going on with the Open LLM Leaderboard?	40	--	2023-06-23
Kai-Fu Li's Yi-34B uses exactly Llama's architecture except for 2 tensor renamed	39	--	2023-11-14
Zephyr 7B – Mistral Finetune that responds like ChatGPT	37	--	2023-10-15
Whisper Jax: Transcribe a 1 hour of audio in under 15 seconds	36	--	2023-04-22
MistralLite by Amazon Web Services	34	--	2023-11-01
Mixture of Experts Explained	29	--	2023-12-11
TinyLlama at 2T of 3T	29	--	2023-11-19
Real-Time Latent Consistency Model	27	--	2023-10-30
Language Modeling Is Compression	27	--	2023-09-21
Pixel Art XL: Stable Diffusion XL for Pixel Art	26	--	2023-08-03
UC Berkeley's open-source Vicuna LLM chatbot released new improved model weights	26	--	2023-04-14
Llama 1.3B Trained on 200B Tokens for Commercial Use	25	--	2023-04-28

Plushcap, by Matt Makai. 2021-2026.

HuggingFace on HN