Replicate Intelligence #4
Blog post from Replicate
Replicate's weekly bulletin covers recent developments in open-source AI. The headline item is the release of Stable Diffusion 3 Medium, a new image generation model that renders legible text well but struggles with anatomy and composition; it is available under a non-commercial license. In other developments, OpenAI is using dictionary learning to extract patterns from GPT models, a technique similar to the one Anthropic used for Golden Gate Claude, and has released a research paper along with code for steering the GPT-2-small model. Transformers.js now runs OpenAI’s Whisper model in JavaScript, allowing real-time speech-to-text transcription in the browser with no coding required. Researchers at ByteDance have introduced a method for tokenizing an image into a single short vector, which could make multimodal models more efficient. The bulletin also announces upcoming support for NVIDIA’s H100 GPUs and invites interested users to contact Replicate for early access.