Reinforcement learning

Reinforcement and preference learning for AI models

Mentions Over Time
512
Total Mentions
200
Posts with Topic
6
Companies Mentioning
-74.5%
WoW Change
Historical Data
Week Mentions Posts Companies WoW Change
Mar 23, 2026 14 6 6 -74.5%
Mar 16, 2026 55 21 14 +129.2%
Mar 09, 2026 24 12 9 +14.3%
Mar 02, 2026 21 8 6 -12.5%
Feb 23, 2026 24 15 11 -31.4%
Feb 16, 2026 35 8 6 -16.7%
Feb 09, 2026 42 16 14 +100.0%
Feb 02, 2026 21 15 12 0%
Jan 26, 2026 21 10 10 0%
Jan 19, 2026 21 9 8 -56.3%
Jan 12, 2026 48 11 9 +14.3%
Jan 05, 2026 42 18 11 +50.0%
Dec 29, 2025 28 4 2 +47.4%
Dec 22, 2025 19 5 4 -42.4%
Dec 15, 2025 33 13 12 -10.8%
Dec 08, 2025 37 16 10 +37.0%
Dec 01, 2025 27 13 9 --
Recent Blog Posts 50 posts
Date Company Title Mentions
2026-03-27 Deepchecks 7 Top Enterprise Generative AI Tools for Fine-Tuning 4
2026-03-27 Cursor A technical report on Composer 2 2
2026-03-26 StackHawk Best AI Pentesting Tools in 2026: Top Picks Compared 1
2026-03-25 AI21 Labs Stride and prejudice: How a 32-bit overflow corrupted a CUDA kernel (and stayed… 2
2026-03-25 Browserbase Introducing BrowserEnv: Train browser agents on real websites 4
2026-03-24 Vantage Cursor's Composer 2: What It Means for Your AI Coding Costs 1
2026-03-20 Roboflow DeepSeek Vision Models 1
2026-03-19 Cursor Introducing Composer 2 2
2026-03-19 Deepchecks Human Feedback vs. Synthetic Feedback in LLM Precision 1
2026-03-19 Harness CI Pipeline Optimization Guide for Platform Engineering Lead 1
2026-03-18 Bright Data Web Data for AI Agents: 6 Use Cases and the Benchmarks That Tell You Which Tool… 1
2026-03-18 Together AI Mamba-3 1
2026-03-17 AI21 Labs Mind the gap 1
2026-03-17 Galileo What MT-Bench and Chatbot Arena Reveal About Most LLM Judges 1
2026-03-17 Prem AI Which LLM Alignment Method? RLHF vs DPO vs KTO Tradeoffs Explained 31
2026-03-17 Cursor Training Composer for longer horizons 2
2026-03-17 Galileo What MT-Bench and Chatbot Arena Reveal About Most LLM Judges 1
2026-03-17 HuggingFace Holotron-12B - High Throughput Computer Use Agent 1
2026-03-17 HuggingFace Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 1
2026-03-17 HuggingFace State of Open Source on Hugging Face: Spring 2026 1
2026-03-17 Prem AI How to Generate Synthetic Training Data for LLM Fine-Tuning (2026 Guide) 1
2026-03-17 WorkOS Prompt injection attacks: What are they and how to defend against them 1
2026-03-16 Prem AI Air-Gapped AI Fine-Tuning: How to Train Custom LLMs Without Internet Access 2
2026-03-16 HuggingFace Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models… 1
2026-03-16 Lambda Lambda at NVIDIA GTC 2026: building the Superintelligence Cloud 2
2026-03-16 LangChain LangChain Announces Enterprise Agentic AI Platform Built with NVIDIA 1
2026-03-16 Redpanda Redpanda pushes the envelope on NVIDIA Vera 1
2026-03-12 Unsloth What are RL environments and how to build them 5
2026-03-12 Nanonets Are OpenAI and Google intentionally downgrading their models? 1
2026-03-12 Prem AI Reasoning Models Explained: OpenAI o1/o3 vs DeepSeek R1 vs QwQ-32B 3
2026-03-11 Zilliz Top 10 Context Engineering Techniques You Should Know for Production RAG 1
2026-03-11 Together AI Together AI Brings NVIDIA Nemotron 3 to Developers on Day 0 1
2026-03-10 Fireworks AI Training-Inference Parity in MoE Models: Where Numerics Drift 5
2026-03-10 Bugcrowd Bugcrowd policy changes to address “AI slop” submissions 1
2026-03-10 HuggingFace Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries 1
2026-03-10 HuggingFace How NVIDIA Builds Open Data for AI 1
2026-03-10 Hume Opensourcing TADA: Fast, Reliable Speech Generation Through Text-Acoustic Synch… 1
2026-03-10 Fireworks AI Best Open Source LLMs in 2026: We Reviewed 7 Models 3
2026-03-09 HuggingFace LeRobot v0.5.0: Scaling Every Dimension 1
2026-03-08 HuggingFace ShopRLVE-GYM: Adaptive Verifiable Environments for E-Commerce Conversational Ag… 6
2026-03-06 Clarifai MiniMax M2.5 vs GPT-5.2 vs Claude Opus 4.6 vs Gemini 3.1 Pro 1
2026-03-06 Deepchecks The Best 5 LLM Fine-Tuning Tools of 2026 3
2026-03-05 HuggingFace Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning,… 1
2026-03-05 HuggingFace Building Tucano 2: Open-Source Language Models That Actually Think in Portuguese 1
2026-03-05 Snowplow What Is Agentic Analytics? A Guide for Data Leaders 1
2026-03-05 Together AI Key research and product announcements at the AI Native Conf 7
2026-03-02 Roboflow Inference as a Service: How Roboflow Makes Vision AI Production-Ready 1
2026-02-28 Prem AI 19 Best Together AI Alternatives for Private Model Fine-Tuning (2026) 6
2026-02-28 Prem AI Qwen 2.5 vs Llama 3.2 vs DeepSeek R1: Enterprise Model Comparison (2026) 2
2026-02-28 Prem AI PremAI vs Google Vertex AI: Privacy, Flexibility, and Cost Compared 1