Home / Companies / HuggingFace / Hacker News

HuggingFace on HN

77 posts with 10+ points in 2024

Filters
Year:
Posts by Month (77 total)
Hacker News Posts
Title Points Comments Date
Uncensor any LLM with abliteration 586 -- 2024-06-13
Llama-3.3-70B-Instruct 425 -- 2024-12-06
A Replacement for BERT 348 -- 2024-12-19
Microsoft Phi-2 model changes licence to MIT 240 -- 2024-01-06
Space secrets leak disclosure 197 -- 2024-06-01
Best 7B LLM on leaderboards made by an amateur following a medium … 181 -- 2024-01-05
Llama 3 8B is almost as good as Wizard 2 8x22B 168 -- 2024-04-19
Nvidia releases NVLM 1.0 72B open weight model 167 -- 2024-10-02
Explaining the SDXL Latent Space 163 -- 2024-02-05
Hugging Face and Google partner for AI collaboration 152 -- 2024-01-25
A CC-By Open-Source TTS Model with Voice Cloning 131 -- 2024-11-04
FineWeb: Decanting the web for the finest text data at scale 127 -- 2024-06-02
HuggingChat: Chat with Open Source Models 103 -- 2024-02-21
More than 80 AI models from Qualcomm 95 -- 2024-02-28
LLaMA-Pro-8B 94 -- 2024-01-06
Apple/OpenELM: Efficient Open-Source Family Language Models 82 -- 2024-04-24
YouTube-Commons: Audio transcripts of 2,063,066 YouTube videos, CC-By license 75 -- 2024-04-18
Show HN: Simply Reading Analog Gauges – GPT4, CogVLM Can't 66 -- 2024-01-22
MSFT's WizardLM2 models have been taken down 58 -- 2024-04-16
LiteLlama-460M-1T has 460M parameters trained with 1T tokens 54 -- 2024-01-07
Fine-Tuning LLMs to 1.58bit 52 -- 2024-09-18
LLaMA 3 70B Llamafiles 51 -- 2024-04-19
DeepSeek v3 beats Claude sonnet 3.5 and way cheaper 48 -- 2024-12-26
Improving Parquet Dedupe on Hugging Face Hub 47 -- 2024-10-08
Open-LLM performances are plateauing 46 -- 2024-06-29
Mixtral-8x22B on HuggingFace 33 -- 2024-04-10
General OCR Theory: Towards OCR-2.0 via a Unified End-to-End Model 31 -- 2024-09-11
Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat 30 -- 2024-04-12
OpenFLUX.1 30 -- 2024-10-04
Mistral 7B v0.2 29 -- 2024-03-31
Video2Game: Real-Time, Interactive, Realistic Environment from a Single Video 28 -- 2024-04-16
Llama-3.2-3B-Instruct-uncensored 26 -- 2024-09-27
Llama can now see and run on your device – welcome Llama … 26 -- 2024-09-25
New Phi-3.5 Models from Microsoft, including new MoE 25 -- 2024-08-20
LLM: Transformer Is Linear 25 -- 2024-05-24
HuggingFace - Tencent launches Hunyuan Large which outperforms Llama 3.1 405B 23 -- 2024-11-05
Lineage Explorer for open source models – Hugging Face Space 22 -- 2024-01-18
Show HN: Fineweb-Edu-Fortified dataset: Fineweb-Edu deduped, embeddings included 22 -- 2024-08-14
Llama 3.2 21 -- 2024-09-25
Fine-tune and deploy open LLMs as containers using AIKit - Part 1 19 -- 2024-06-06
makeMoE: Implement a Sparse Mixture of Experts LLM from Scratch 19 -- 2024-01-23
HuggingFace to Replace Git LFS with Xet 18 -- 2024-08-23
Fake Insects: a game where you have to identify AI-generated insects 18 -- 2024-08-17
Mixtral-8x22B-Instruct-v0.1 18 -- 2024-04-17
Hermes-2-Pro-Llama-3-8B 18 -- 2024-05-01
StableLM-2-12B 17 -- 2024-04-08
NuExtract: A LLM for Structured Extraction 16 -- 2024-06-29
An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct 16 -- 2024-06-09
Phi-3 Weights Released 16 -- 2024-04-23
New medical LLM beats Med-PaLM-2, GPT-4 on MMLU benchmarks 16 -- 2024-07-31
Miqu 70B – possible leak of the mistral-medium LLM 16 -- 2024-01-29
Ollama can run any GGUF Model on Hugging Face Hub now 15 -- 2024-10-16
Llama-3-70B-Instruct-Gradient-1048k 14 -- 2024-05-04
New finance LLM passed the CFA Level III exam 14 -- 2024-07-31
Run Mistral 7B model using less than 4GB of memory on your … 14 -- 2024-07-23
Stable Diffusion 3 Medium Released 14 -- 2024-06-12
Pre-computed vector embeddings available on HuggingFace 14 -- 2024-01-22
Yi-9B-200K 13 -- 2024-03-17
An Introduction to Vision-Language Modeling 13 -- 2024-05-28
FineWeb: 15T tokens of the finest data the web has to offer 12 -- 2024-04-21
Language model can listen while speaking 12 -- 2024-08-07
ML for 3D Course on Hugging Face 12 -- 2024-05-16
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs 12 -- 2024-04-09
Command-R: open weights 35B params / 128k tokens context length model by … 12 -- 2024-03-11
StarCoder2 and The Stack v2: new code LLMs and dataset 12 -- 2024-02-28
Jamba-v0.1: An Apache 2.0 licensed 52B Mamba Transformer hybrid LLM base model 12 -- 2024-03-28
HuggingFace Is Down 11 -- 2024-02-28
Experiments with Bitnet 1.5 (Ngmi) 11 -- 2024-03-23
FalconMamba 7B: The first attention-free and general-purpose pure Mamba model 11 -- 2024-08-13
NPC-Playground, a 3D playground to interact with LLM-powered NPCs 11 -- 2024-06-05
Open LLM Leaderboard 11 -- 2024-01-02
CryptGPT: A Simple Approach to Privacy-Preserving LLMs Using Vigenere Cipher 10 -- 2024-06-15
Whisperfile 10 -- 2024-08-19
Llava Model for Video 10 -- 2024-05-16
Show HN: Encrypted Credit Card Approval Using Homomorphic Encryption 10 -- 2024-01-31
Vector embeddings model for medical literature 10 -- 2024-01-08
Show HN: Downloadable AI Musical Instruments 10 -- 2024-12-10