QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices
Blog post from HuggingFace
QVAC MedPsy represents a significant advancement in medical and healthcare language models, specifically designed for deployment on edge devices like smartphones and wearables. Developed by Tether Data's AI Research group, these models, with 1.7B and 4B parameters, offer medical reasoning capabilities that rival models several times their size, setting a new standard for efficient medical AI. The MedPsy-1.7B model outperforms larger models such as Google's MedGemma-1.5-4B-it, while the MedPsy-4B model exceeds the performance of the MedGemma-27B-text-it model, despite being significantly smaller. These models achieve high parameter and token efficiency, reducing latency and computational costs, thereby enabling clinical-grade AI in resource-constrained settings. The models are designed for private, on-device inference, preserving patient privacy and data security, and are made available under the Apache 2.0 license for research and educational purposes. The comprehensive evaluation across various medical benchmarks demonstrates the models' capability in accurate, real-time clinical decision support, marking a shift towards more accessible and secure medical AI applications.