Light
Home
/
Companies
/
HuggingFace
/
Hacker News
HuggingFace on HN
77 posts with 10+ points in 2024
Filters
Min points:
1
10
25
50
100
250
500
Year:
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
Posts by Month (77 total)
Hacker News Posts
Search:
Title
Points
Comments
Date
Uncensor any LLM with abliteration
586
--
2024-06-13
Llama-3.3-70B-Instruct
425
--
2024-12-06
A Replacement for BERT
348
--
2024-12-19
Microsoft Phi-2 model changes licence to MIT
240
--
2024-01-06
Space secrets leak disclosure
197
--
2024-06-01
Best 7B LLM on leaderboards made by an amateur following a medium …
181
--
2024-01-05
Llama 3 8B is almost as good as Wizard 2 8x22B
168
--
2024-04-19
Nvidia releases NVLM 1.0 72B open weight model
167
--
2024-10-02
Explaining the SDXL Latent Space
163
--
2024-02-05
Hugging Face and Google partner for AI collaboration
152
--
2024-01-25
A CC-By Open-Source TTS Model with Voice Cloning
131
--
2024-11-04
FineWeb: Decanting the web for the finest text data at scale
127
--
2024-06-02
HuggingChat: Chat with Open Source Models
103
--
2024-02-21
More than 80 AI models from Qualcomm
95
--
2024-02-28
LLaMA-Pro-8B
94
--
2024-01-06
Apple/OpenELM: Efficient Open-Source Family Language Models
82
--
2024-04-24
YouTube-Commons: Audio transcripts of 2,063,066 YouTube videos, CC-By license
75
--
2024-04-18
Show HN: Simply Reading Analog Gauges – GPT4, CogVLM Can't
66
--
2024-01-22
MSFT's WizardLM2 models have been taken down
58
--
2024-04-16
LiteLlama-460M-1T has 460M parameters trained with 1T tokens
54
--
2024-01-07
Fine-Tuning LLMs to 1.58bit
52
--
2024-09-18
LLaMA 3 70B Llamafiles
51
--
2024-04-19
DeepSeek v3 beats Claude sonnet 3.5 and way cheaper
48
--
2024-12-26
Improving Parquet Dedupe on Hugging Face Hub
47
--
2024-10-08
Open-LLM performances are plateauing
46
--
2024-06-29
Mixtral-8x22B on HuggingFace
33
--
2024-04-10
General OCR Theory: Towards OCR-2.0 via a Unified End-to-End Model
31
--
2024-09-11
Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat
30
--
2024-04-12
OpenFLUX.1
30
--
2024-10-04
Mistral 7B v0.2
29
--
2024-03-31
Video2Game: Real-Time, Interactive, Realistic Environment from a Single Video
28
--
2024-04-16
Llama-3.2-3B-Instruct-uncensored
26
--
2024-09-27
Llama can now see and run on your device – welcome Llama …
26
--
2024-09-25
New Phi-3.5 Models from Microsoft, including new MoE
25
--
2024-08-20
LLM: Transformer Is Linear
25
--
2024-05-24
HuggingFace - Tencent launches Hunyuan Large which outperforms Llama 3.1 405B
23
--
2024-11-05
Lineage Explorer for open source models – Hugging Face Space
22
--
2024-01-18
Show HN: Fineweb-Edu-Fortified dataset: Fineweb-Edu deduped, embeddings included
22
--
2024-08-14
Llama 3.2
21
--
2024-09-25
Fine-tune and deploy open LLMs as containers using AIKit - Part 1
19
--
2024-06-06
makeMoE: Implement a Sparse Mixture of Experts LLM from Scratch
19
--
2024-01-23
HuggingFace to Replace Git LFS with Xet
18
--
2024-08-23
Fake Insects: a game where you have to identify AI-generated insects
18
--
2024-08-17
Mixtral-8x22B-Instruct-v0.1
18
--
2024-04-17
Hermes-2-Pro-Llama-3-8B
18
--
2024-05-01
StableLM-2-12B
17
--
2024-04-08
NuExtract: A LLM for Structured Extraction
16
--
2024-06-29
An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct
16
--
2024-06-09
Phi-3 Weights Released
16
--
2024-04-23
New medical LLM beats Med-PaLM-2, GPT-4 on MMLU benchmarks
16
--
2024-07-31
Miqu 70B – possible leak of the mistral-medium LLM
16
--
2024-01-29
Ollama can run any GGUF Model on Hugging Face Hub now
15
--
2024-10-16
Llama-3-70B-Instruct-Gradient-1048k
14
--
2024-05-04
New finance LLM passed the CFA Level III exam
14
--
2024-07-31
Run Mistral 7B model using less than 4GB of memory on your …
14
--
2024-07-23
Stable Diffusion 3 Medium Released
14
--
2024-06-12
Pre-computed vector embeddings available on HuggingFace
14
--
2024-01-22
Yi-9B-200K
13
--
2024-03-17
An Introduction to Vision-Language Modeling
13
--
2024-05-28
FineWeb: 15T tokens of the finest data the web has to offer
12
--
2024-04-21
Language model can listen while speaking
12
--
2024-08-07
ML for 3D Course on Hugging Face
12
--
2024-05-16
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
12
--
2024-04-09
Command-R: open weights 35B params / 128k tokens context length model by …
12
--
2024-03-11
StarCoder2 and The Stack v2: new code LLMs and dataset
12
--
2024-02-28
Jamba-v0.1: An Apache 2.0 licensed 52B Mamba Transformer hybrid LLM base model
12
--
2024-03-28
HuggingFace Is Down
11
--
2024-02-28
Experiments with Bitnet 1.5 (Ngmi)
11
--
2024-03-23
FalconMamba 7B: The first attention-free and general-purpose pure Mamba model
11
--
2024-08-13
NPC-Playground, a 3D playground to interact with LLM-powered NPCs
11
--
2024-06-05
Open LLM Leaderboard
11
--
2024-01-02
CryptGPT: A Simple Approach to Privacy-Preserving LLMs Using Vigenere Cipher
10
--
2024-06-15
Whisperfile
10
--
2024-08-19
Llava Model for Video
10
--
2024-05-16
Show HN: Encrypted Credit Card Approval Using Homomorphic Encryption
10
--
2024-01-31
Vector embeddings model for medical literature
10
--
2024-01-08
Show HN: Downloadable AI Musical Instruments
10
--
2024-12-10