HuggingFace Hacker News

Filters

Min points: 1 10 25 50 100 250 500

Year:

Posts by Month (77 total)

Hacker News Posts

Search:

Title	Points	Comments	Date
Uncensor any LLM with abliteration	586	--	2024-06-13
Llama-3.3-70B-Instruct	425	--	2024-12-06
A Replacement for BERT	348	--	2024-12-19
Microsoft Phi-2 model changes licence to MIT	240	--	2024-01-06
Space secrets leak disclosure	197	--	2024-06-01
Best 7B LLM on leaderboards made by an amateur following a medium …	181	--	2024-01-05
Llama 3 8B is almost as good as Wizard 2 8x22B	168	--	2024-04-19
Nvidia releases NVLM 1.0 72B open weight model	167	--	2024-10-02
Explaining the SDXL Latent Space	163	--	2024-02-05
Hugging Face and Google partner for AI collaboration	152	--	2024-01-25
A CC-By Open-Source TTS Model with Voice Cloning	131	--	2024-11-04
FineWeb: Decanting the web for the finest text data at scale	127	--	2024-06-02
HuggingChat: Chat with Open Source Models	103	--	2024-02-21
More than 80 AI models from Qualcomm	95	--	2024-02-28
LLaMA-Pro-8B	94	--	2024-01-06
Apple/OpenELM: Efficient Open-Source Family Language Models	82	--	2024-04-24
YouTube-Commons: Audio transcripts of 2,063,066 YouTube videos, CC-By license	75	--	2024-04-18
Show HN: Simply Reading Analog Gauges – GPT4, CogVLM Can't	66	--	2024-01-22
MSFT's WizardLM2 models have been taken down	58	--	2024-04-16
LiteLlama-460M-1T has 460M parameters trained with 1T tokens	54	--	2024-01-07
Fine-Tuning LLMs to 1.58bit	52	--	2024-09-18
LLaMA 3 70B Llamafiles	51	--	2024-04-19
DeepSeek v3 beats Claude sonnet 3.5 and way cheaper	48	--	2024-12-26
Improving Parquet Dedupe on Hugging Face Hub	47	--	2024-10-08
Open-LLM performances are plateauing	46	--	2024-06-29
Mixtral-8x22B on HuggingFace	33	--	2024-04-10
General OCR Theory: Towards OCR-2.0 via a Unified End-to-End Model	31	--	2024-09-11
Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat	30	--	2024-04-12
OpenFLUX.1	30	--	2024-10-04
Mistral 7B v0.2	29	--	2024-03-31
Video2Game: Real-Time, Interactive, Realistic Environment from a Single Video	28	--	2024-04-16
Llama-3.2-3B-Instruct-uncensored	26	--	2024-09-27
Llama can now see and run on your device – welcome Llama …	26	--	2024-09-25
New Phi-3.5 Models from Microsoft, including new MoE	25	--	2024-08-20
LLM: Transformer Is Linear	25	--	2024-05-24
HuggingFace - Tencent launches Hunyuan Large which outperforms Llama 3.1 405B	23	--	2024-11-05
Lineage Explorer for open source models – Hugging Face Space	22	--	2024-01-18
Show HN: Fineweb-Edu-Fortified dataset: Fineweb-Edu deduped, embeddings included	22	--	2024-08-14
Llama 3.2	21	--	2024-09-25
Fine-tune and deploy open LLMs as containers using AIKit - Part 1	19	--	2024-06-06
makeMoE: Implement a Sparse Mixture of Experts LLM from Scratch	19	--	2024-01-23
HuggingFace to Replace Git LFS with Xet	18	--	2024-08-23
Fake Insects: a game where you have to identify AI-generated insects	18	--	2024-08-17
Mixtral-8x22B-Instruct-v0.1	18	--	2024-04-17
Hermes-2-Pro-Llama-3-8B	18	--	2024-05-01
StableLM-2-12B	17	--	2024-04-08
NuExtract: A LLM for Structured Extraction	16	--	2024-06-29
An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct	16	--	2024-06-09
Phi-3 Weights Released	16	--	2024-04-23
New medical LLM beats Med-PaLM-2, GPT-4 on MMLU benchmarks	16	--	2024-07-31
Miqu 70B – possible leak of the mistral-medium LLM	16	--	2024-01-29
Ollama can run any GGUF Model on Hugging Face Hub now	15	--	2024-10-16
Llama-3-70B-Instruct-Gradient-1048k	14	--	2024-05-04
New finance LLM passed the CFA Level III exam	14	--	2024-07-31
Run Mistral 7B model using less than 4GB of memory on your …	14	--	2024-07-23
Stable Diffusion 3 Medium Released	14	--	2024-06-12
Pre-computed vector embeddings available on HuggingFace	14	--	2024-01-22
Yi-9B-200K	13	--	2024-03-17
An Introduction to Vision-Language Modeling	13	--	2024-05-28
FineWeb: 15T tokens of the finest data the web has to offer	12	--	2024-04-21
Language model can listen while speaking	12	--	2024-08-07
ML for 3D Course on Hugging Face	12	--	2024-05-16
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs	12	--	2024-04-09
Command-R: open weights 35B params / 128k tokens context length model by …	12	--	2024-03-11
StarCoder2 and The Stack v2: new code LLMs and dataset	12	--	2024-02-28
Jamba-v0.1: An Apache 2.0 licensed 52B Mamba Transformer hybrid LLM base model	12	--	2024-03-28
HuggingFace Is Down	11	--	2024-02-28
Experiments with Bitnet 1.5 (Ngmi)	11	--	2024-03-23
FalconMamba 7B: The first attention-free and general-purpose pure Mamba model	11	--	2024-08-13
NPC-Playground, a 3D playground to interact with LLM-powered NPCs	11	--	2024-06-05
Open LLM Leaderboard	11	--	2024-01-02
CryptGPT: A Simple Approach to Privacy-Preserving LLMs Using Vigenere Cipher	10	--	2024-06-15
Whisperfile	10	--	2024-08-19
Llava Model for Video	10	--	2024-05-16
Show HN: Encrypted Credit Card Approval Using Homomorphic Encryption	10	--	2024-01-31
Vector embeddings model for medical literature	10	--	2024-01-08
Show HN: Downloadable AI Musical Instruments	10	--	2024-12-10

Plushcap, by Matt Makai. 2021-2026.

HuggingFace on HN