Interhuman’s Goblin: “Yeah, Friday at Five”
Blog post from HuggingFace
Interhuman's Inter-1 model, designed to interpret human communication from videos, displayed a peculiar behavior of fabricating quotes like "Yeah, Friday at five" when audio was missing. This anomaly, attributed to a fallback mechanism where the model draws from its training to fill gaps, was traced back to a specific example in the system prompt rather than the training data itself. The investigation revealed that the model's tendency to invent speech in silence was a result of learned priors, demonstrating a broader "Clever Hans" effect, where models rely on prior expectations rather than actual data to make judgments. Efforts to address this issue have focused on modifying prompts and understanding the model's learned behaviors, highlighting the challenge of making omni-modal models robust when data from a modality is absent. The research team continues to explore solutions to ensure that missing modalities are treated as such, rather than prompting fabricated responses.
| Trend | Post Mentions | Total Month Mentions | Posts | Companies | MoM |
|---|---|---|---|---|---|
| Observability | 1 | 3,430 | 674 | 183 | +0% |
| Reinforcement learning | 1 | 59 | 31 | 19 | -34% |