Cleanlab Blog - Plushcap

Blog URL

cleanlab.ai/blog

Posts YTD

1 ↓ vs 2 last year

Avg Posts/Month

1.4 since 2023

Monthly Post Volume

Start year: 2022 2023 2024 2025 2026

Post Details

Search:

Title	Author	Published	Words	HN Pts
CleanVision: Audit your Image Data for better Computer Vision	Sanjana Garg, Ulyana Tkachenko, Yiming Chen, Elías Snorrason, Jonas Mueller	2023-03-22	1,729	4
Assessing the Quality of Synthetic Data with Cleanlab Studio	Elías Snorrason	2023-07-12	2,176	2
Overcoming Hallucinations with the Trustworthy Language Model	Anish Athalye, Jonas Mueller, Curtis Northcutt, Hui Wen Goh, Ulyana Tkachenko	2024-04-25	4,782	2
Letter from the CEO: Announcing our Series A and Cleanlab's Trustworthy Language …	Curtis Northcutt	2023-10-10	742	--
Detecting Dataset Drift and Non-IID Sampling: A k-Nearest Neighbors approach that works …	Jesse Cummings, Elías Snorrason, Jonas Mueller	2023-05-30	2,203	4
Effectively Annotate Text Data for Transformers via Active Learning + Re-labeling	Chris Mauck	2023-05-22	1,802	--
Training Transformer Networks in Scikit-Learn?!	Hui Wen Goh	2023-03-08	1,677	4
Improving any OpenAI Language Model by Systematically Improving its Data	Chris Mauck, Jonas Mueller	2023-06-01	1,898	--
Ensuring Reliable Few-Shot Prompt Selection for LLMs	Chris Mauck, Jonas Mueller	2023-08-15	1,678	3
How To Train and Deploy Reliable Models on Messy Real-World Data With …	Hui Wen Goh, Jonas Mueller, Anish Athalye	2023-07-24	1,518	5
Detecting Annotation Errors in Semantic Segmentation Data	Vedang Lad, Jonas Mueller	2023-11-02	845	1
Comparing tools for Data Science, Data Quality, Data Annotation, and AI/ML	Jonas Mueller	2024-02-09	1,916	--
Automatically Detect Problematic Content in any Text Dataset	Hui Wen Goh	2023-12-19	1,220	--
Announcing Auto-Labeling Agent: Your Assistant for Rapid and High Quality Labeling	Emily Barry	2024-07-17	776	--
The Stanford Cars Dataset aka Cars196 (cited in 1000+ papers) contains many …	Chris Mauck	2023-05-24	592	--
Reduce Legal Discovery Work by 10x with AI that Curates Documents and …	Chris Mauck	2023-08-03	1,356	2
Whisking Away Errors: How Cleanlab Studio Served Up Fixes for the Food-101N …	Chris Mauck	2023-09-11	546	--
cleanlab 2.3 adds support for Active Learning, Tensorflow/Keras models made sklearn-compatible, and …	Jonas Mueller	2023-03-01	1,045	--
How to detect bad data in your instruction tuning dataset (for better …	Jimming He, Sanjana Garg, Jonas Mueller	2024-02-07	2,278	--
Use Cleanlab to Improve LLMs: Find Errors in Human Feedback in the …	Chris Mauck, Jonas Mueller	2023-04-11	351	--
An open-source platform to catch all sorts of issues in all sorts …	Elías Snorrason, Jonas Mueller	2024-02-21	1,082	--
ActiveLab: Active Learning with Data Re-Labeling	Hui Wen Goh, Jonas Mueller	2023-03-02	1,720	4
Enhancing Product Analytics and E-commerce with Data-Centric AI	Sanjana Garg	2023-07-06	1,484	2
The Fashion MNIST Dataset (cited in 2,200+ papers) contains Hundreds of Miscategorized …	Ganesh Tata, Chris Mauck	2023-06-09	446	--
Don’t Let Your Messy Documents Run You RAG-Ged. Announcing Document Curation in …	Emily Barry	2024-06-07	311	--
Automated Correction of Satellite Imagery Data	Chris Mauck, Aditya Thyagarajan	2023-09-20	673	2
Ensure high-quality data quickly via AI validation of which data is Well …	Ulyana Tkachenko, Jonas Mueller	2023-08-28	1,544	--
Letter from the CEO: Announcing Our Seed Funding and the Launch of …	Curtis Northcutt	2023-07-20	1,074	--
Detecting Errors in Numerical Data via any Regression Model	Jonas Mueller, Mayank Kumar, Hui Wen Goh, Hang Zhou	2023-09-18	1,108	2
Accelerate Time Series Modeling with Cleanlab Studio AutoML: Train and Deploy in …	Matt Turk	2024-07-11	2,053	--
The Office-Home Dataset (cited by 600+ papers) contains hundreds of incorrect labels …	Chris Mauck, Jonas Mueller	2023-04-21	478	--
Datalab: A Linter for ML Datasets	Elías Snorrason, Sanjana Garg, Hui Wen Goh, Jesse Cummings, Jonas Mueller	2023-05-16	1,879	2
Automatically Find and Fix Issues in Image/Document Tags and other Multi-Label Datasets	Chris Mauck, Ulyana Tkachenko	2023-10-17	990	2
Most AI & Analytics are impaired by data issues. Now AI can …	Jonas Mueller, Curtis Northcutt, Anish Athalye	2023-07-31	1,948	1
cleanlab now supports all major ML tasks — including Regression, Object Detection, …	Chris Mauck, Curtis Northcutt, Jonas Mueller	2023-09-14	1,200	--
Automated Quality Assurance for Object Detection Datasets	Ulyana Tkachenko, Aditya Thyagarajan, Jonas Mueller	2023-09-26	1,370	1
How to Filter Unsafe and Low-Quality Images from any Dataset: A Product …	Sanjana Garg, Jonas Mueller	2024-01-22	1,505	--
How to Generate Better Synthetic Image Datasets with Stable Diffusion	Elías Snorrason, Jonas Mueller	2023-10-05	2,071	1
Automated Data Quality at Scale	Anish Athalye, Angela Liu	2023-07-27	1,155	1
Improving Legal Judgement Prediction with Data-Centric AI	Hui Wen Goh	2023-06-27	1,658	--
Handling Mislabeled Tabular Data to Improve Your XGBoost Model	Chris Mauck	2023-02-06	1,877	2
Beware of Unreliable Data in Model Evaluation: A LLM Prompt Selection case …	Chris Mauck, Jonas Mueller	2023-06-29	1,366	66
Reliable Agentic RAG with LLM Trustworthiness Estimates	Chris Mauck, Jonas Mueller	2024-09-12	1,875	--
OpenAI's o1 surpassed using the Trustworthy Language Model	Jay Zhang, Jonas Mueller	2024-10-21	1,505	2
Automatically Reduce Incorrect LLM Responses across OpenAI's SimpleQA Benchmark via Trustworthiness Scoring	Hui Wen Goh, Jonas Mueller	2024-11-07	1,107	--
Automatically boost the accuracy of any LLM, without changing your prompts or …	Hui Wen Goh, Jay Zhang, Ulyana Tkachenko, Jonas Mueller	2024-10-31	1,890	--
Safeguard Customer Data via Log Compliance Monitoring with the Trustworthy Language Model	Matt Turk	2025-01-06	1,640	--
Benchmarking Hallucination Detection Methods in RAG	Hui Wen Goh, Nelson Auner, Aditya Thyagarajan, Jonas Mueller	2024-09-30	2,556	--
Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best?	Ashish Sardana, Jonas Mueller	2025-04-07	3,308	--
TLM Lite: High-Quality LLM Responses with Efficient Trust Scores	Hui Wen Goh	2024-09-09	1,519	--
Automatically detecting LLM hallucinations with models like GPT-4o and Claude	Hui Wen Goh, Jay Zhang, Ulyana Tkachenko, Jonas Mueller	2024-09-04	1,781	--
Automatically catching spurious correlations in ML datasets	Rahul Aditya, Elías Snorrason	2024-09-27	1,843	--
CROWDLAB: The Right Way to Combine Humans and AI for LLM Evaluation	Nelson Auner	2024-08-06	727	4
Expert Answers: The Easiest Way to Improve Your AI Agent	Dave Kong and Aditya Thyagarajan	2025-09-24	731	--
Managing AI Agents in Production: The Role of People	Dave Kong	2025-09-24	1,324	--
Benchmarking real-time trust scoring across five AI Agent architectures	Gordon Lim and Jonas Mueller	2025-09-24	1,513	--
AI Agent Safety: Managing Unpredictability at Scale	Dave Kong	2025-09-24	1,579	--
Prevent Hallucinated Responses from any AI Agent	Gordon Lim and Dave Kong	2025-09-24	1,444	--
The Emerging Reliability Layer in the Modern AI Agent Stack	Charles Meng	2025-10-16	1,336	--
Preventing AI Mistakes in Production: Inside Cleanlab’s Guardrails	Charles Meng and Dave Kong	2025-10-30	908	--
Expert Guidance: Teaching Your AI How to Behave	Jonas Mueller and Ulyana Tkachenko and Anish Athalye and Dave Kong and Charles Meng	2025-11-19	955	--
Automated Hallucination Correction for AI Agents: A Case Study on Tau²-Bench	Tianyi Huang and Jonas Mueller	2025-12-03	1,623	--
LLM Structured Output Benchmarks are Riddled with Mistakes	Hui Wen Goh and Jonas Mueller	2025-12-05	1,659	--
Real-Time Error Detection for LLM Structured Outputs: A Comprehensive Benchmark	Hui Wen Goh and Jonas Mueller	2025-12-12	1,983	--
Letter from the CEO: Handshake acquires Cleanlab	Curtis Northcutt	2026-01-29	593	--

Plushcap, by Matt Makai. 2021-2026.