
Interhuman’s Goblin: “Yeah, Friday at Five”

Blog post from HuggingFace

Post Details
Company: HuggingFace
Date Published:
Author: Siddharth Ravi
Word Count: 2,371
Company Posts That Month: 90
Language: -
Hacker News Points: -
Summary

Interhuman's Inter-1 model, which is designed to interpret human communication from video, was found to fabricate quotes such as "Yeah, Friday at five" when the audio track was missing. The specific phrase was traced to an example in the system prompt rather than to the training data. The underlying tendency to invent speech where there is only silence, however, comes from learned priors: when a modality is absent, the model falls back on its training to fill the gap. The investigation presents this as an instance of a broader "Clever Hans" effect, in which a model bases its judgments on prior expectations rather than on the actual input. Fixes so far have focused on modifying the prompt and on understanding the model's learned behaviors, which highlights how hard it is to make omni-modal models robust when one modality is absent. The research team continues to look for ways to ensure that a missing modality is treated as missing rather than as a cue to fabricate content.
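The failure described above (quoted speech appearing when no audio exists) can be caught with a simple output check. The following is a minimal sketch, not the Interhuman team's method: the function names `normalize` and `unsupported_quotes` are hypothetical, and it assumes a transcript string is available whenever audio is present and is `None` when the audio modality is missing.

```python
import re
from typing import List, Optional

# Matches text inside straight or curly double quotes.
QUOTE_PATTERN = re.compile(r'["\u201c]([^"\u201c\u201d]+)["\u201d]')


def normalize(text: str) -> str:
    """Lowercase and collapse punctuation and whitespace for lenient matching."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def unsupported_quotes(model_output: str, transcript: Optional[str]) -> List[str]:
    """Return quoted spans in model_output that the transcript does not contain.

    Assumption: transcript is None when the audio modality was absent,
    in which case every quoted span counts as unsupported.
    """
    quotes = QUOTE_PATTERN.findall(model_output)
    if transcript is None:
        return quotes
    haystack = normalize(transcript)
    # Substring matching is deliberately loose; a real check might align words.
    return [q for q in quotes if normalize(q) not in haystack]
```

For example, calling `unsupported_quotes('The speaker says "Yeah, Friday at five" and nods.', None)` flags the quote, because no audio means no speech could have been heard. A check like this only detects the symptom; it does not change the priors that produce it.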

Trends Found in this Post
Trend                   Post Mentions  Total Month Mentions  Posts  Companies  MoM
Observability           1              3,430                 674    183        +0%
Reinforcement learning  1              59                    31     19         -34%