HuggingFace on HN
63 posts with 10+ points in 2023
Posts by Month (63 total)
Hacker News Posts
Title | Points | Comments | Date
MonadGPT – What would have happened if ChatGPT was invented in the … | 323 | -- | 2023-11-24
LLM in a Flash: Efficient LLM Inference with Limited Memory | 252 | -- | 2023-12-20
Falcon 180B | 238 | -- | 2023-09-06
OpenLLaMA 13B Released | 229 | -- | 2023-06-18
Hugging Face Releases Agents | 214 | -- | 2023-05-10
BigCode Project Releases StarCoder: A 15B Code LLM | 185 | -- | 2023-05-04
StackLlama: A hands-on guide to train LlaMa with RLHF | 165 | -- | 2023-04-06
Mistral-8x7B-Chat | 131 | -- | 2023-12-10
Yi-34B-Chat | 115 | -- | 2023-11-24
GPT-3.5 and Wolfram Alpha via LangChain | 107 | -- | 2023-01-18
The Falcon has landed in the Hugging Face ecosystem | 105 | -- | 2023-06-05
Hugging Face and AWS partner to make AI more accessible | 102 | -- | 2023-02-21
HuggingFace Training Cluster as a Service | 101 | -- | 2023-09-05
Segmind Stable Diffusion – A smaller version of Stable Diffusion XL | 95 | -- | 2023-10-25
HuggingChat | 93 | -- | 2023-04-25
Yarn-Mistral-7B-128k | 88 | -- | 2023-11-11
Sparse LLM Inference on CPU: 75% fewer parameters | 78 | -- | 2023-10-19
Switch Transformers C – 2048 experts (1.6T params for 3.1 TB) (2022) | 73 | -- | 2023-11-20
Multimodal Neurons in Pretrained Text-Only Transformers | 66 | -- | 2023-08-04
HuggingChat – ChatGPT alternative with open source models | 61 | -- | 2023-12-15
OpenLLaMA 7B Training Completed to 1T Tokens | 58 | -- | 2023-06-07
Phi-2 | 57 | -- | 2023-12-13
Dolphin-2_6-Phi-2 | 56 | -- | 2023-12-24
Alibaba releases 72B LLM with 32k context length | 55 | -- | 2023-11-30
Open LLAMA 13B released, trained on 1T tokens | 47 | -- | 2023-06-19
4-Bit Quantization and QLoRA | 41 | -- | 2023-05-25
BLOOMChat, a 176B parameter, Multi-lingual, fine tuned chat | 40 | -- | 2023-05-19
What's Going on with the Open LLM Leaderboard? | 40 | -- | 2023-06-23
Kai-Fu Li's Yi-34B uses exactly Llama's architecture except for 2 tensor renamed | 39 | -- | 2023-11-14
Zephyr 7B – Mistral Finetune that responds like ChatGPT | 37 | -- | 2023-10-15
Whisper Jax: Transcribe a 1 hour of audio in under 15 seconds | 36 | -- | 2023-04-22
MistralLite by Amazon Web Services | 34 | -- | 2023-11-01
Mixture of Experts Explained | 29 | -- | 2023-12-11
TinyLlama at 2T of 3T | 29 | -- | 2023-11-19
Real-Time Latent Consistency Model | 27 | -- | 2023-10-30
Language Modeling Is Compression | 27 | -- | 2023-09-21
Pixel Art XL: Stable Diffusion XL for Pixel Art | 26 | -- | 2023-08-03
UC Berkeley's open-source Vicuna LLM chatbot released new improved model weights | 26 | -- | 2023-04-14
Llama 1.3B Trained on 200B Tokens for Commercial Use | 25 | -- | 2023-04-28
NousResearch/Nous-Hermes-2-Yi-34B | 24 | -- | 2023-12-26
Accelerating Stable Diffusion XL Inference with Jax on Cloud TPU v5e | 23 | -- | 2023-10-03
Llama 22B: 13B V2 with 33B attention heads frankensteined on | 22 | -- | 2023-08-18
Mistral-7B-OpenOrca. First 7B model to beat all other models <30B | 21 | -- | 2023-10-02
Würstchen: Fast Diffusion for Image Generation | 21 | -- | 2023-09-13
AMD and: Large Language Models Out-of-the-Box Acceleration with AMD GPU | 19 | -- | 2023-12-13
Encrypted Large Language Models with Homomorphic Encryption | 18 | -- | 2023-08-03
Orca 2: Teaching Small Language Models How to Reason | 18 | -- | 2023-11-21
Show HN: MiniSearch, a minimalist search engine with integrated browser-based AI | 17 | -- | 2023-10-15
Gemini vs. GPT-4V: A Preliminary Comparison Through Qualitative Cases | 17 | -- | 2023-12-28
Una-Cybertron-7B | 17 | -- | 2023-12-08
GPT Baker lets you build your own open-source GPTs | 17 | -- | 2023-11-23
Deploy Livebook (Elixir) Notebooks as Apps to Hugging Face Spaces | 17 | -- | 2023-06-15
ChatRWKV | 17 | -- | 2023-03-23
Airoboros-13B: 98% against GPT-3.5 | 14 | -- | 2023-05-22
Create a GPT3 powered Q&A Chatbot for *any* GitHub repo by posting … | 13 | -- | 2023-02-05
Attention Sinks in LLMs for endless fluency | 12 | -- | 2023-10-09
Idefics: Open Access 60B multimodal model | 12 | -- | 2023-08-22
30B uncensored OSS model with no guardrails | 11 | -- | 2023-11-07
Hierarchical Masked 3D Diffusion Model for Video Outpainting | 11 | -- | 2023-09-06
Shallow Feed-Forward Neural Networks as Alternative to Attention in Transformers | 11 | -- | 2023-11-21
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting | 10 | -- | 2023-09-11
Origin of LLMs: An Evolutionary Tree and Graph for 15K Large Language … | 10 | -- | 2023-07-20
Show HN: Image Filtering App Using Homomorphic Encryption | 10 | -- | 2023-02-23