Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI
Blog post from Hugging Face
Nemotron-Personas-Singapore is a synthetic dataset developed by NVIDIA in collaboration with AI Singapore to support AI systems that align with local cultural contexts and governance standards, as part of Singapore's push for AI sovereignty. It contains 888,000 synthetic Singaporean personas spanning diverse demographic, occupational, and cultural traits, and is intended for developers building models tailored to Singapore's society. The dataset is licensed under CC BY 4.0, so it can be used in both commercial and public-sector development, and it contains no personally identifiable information. It is grounded in public statistics and designed to work with existing AI models, with applications in fields such as financial services and healthcare, where it offers a culturally relevant, privacy-preserving basis for AI evaluation and development. The initiative underscores Singapore's commitment to trustworthy AI deployment, with transparency, local relevance, and shared infrastructure as key components of responsible AI governance.
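To make the evaluation use case more concrete, here is a minimal Python sketch of how one persona record might be turned into a system prompt for testing an existing model. The field names (`age`, `occupation`, `planning_area`, `persona`), the helper `persona_to_system_prompt`, and the sample record are all hypothetical and chosen for illustration; the dataset card should be checked for the actual schema.

```python
from typing import Mapping

# Hypothetical field names, assumed for illustration only.
# The real dataset schema may differ; consult the dataset card.
PERSONA_FIELDS = ("age", "occupation", "planning_area", "persona")


def persona_to_system_prompt(record: Mapping[str, object]) -> str:
    """Render one persona record as a system prompt for an evaluation run."""
    missing = [field for field in PERSONA_FIELDS if field not in record]
    if missing:
        raise KeyError(f"record is missing fields: {missing}")
    return (
        "You are role-playing the following synthetic persona.\n"
        f"Age: {record['age']}\n"
        f"Occupation: {record['occupation']}\n"
        f"Area: {record['planning_area']}\n"
        f"Background: {record['persona']}\n"
        "Answer as this person would, and do not claim to be a real individual."
    )


# A made-up record for illustration; it is not taken from the dataset.
example_record = {
    "age": 42,
    "occupation": "nurse",
    "planning_area": "Tampines",
    "persona": "A shift nurse who budgets carefully and compares insurance plans.",
}

prompt = persona_to_system_prompt(example_record)
print(prompt)
```

In practice the records would come from the published dataset rather than a hand-written dictionary, for example by loading it with the Hugging Face `datasets` library and passing each row to a function like this one.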