AI models often struggle with bilingual and multimodal STEM tasks due to a lack of high-quality, domain-specific datasets in languages like Malay and English.
We created a curated dataset of 500 Math and Physics questions in Malay and English, complemented by a public leaderboard to benchmark AI model performance.
AI teams now have a reliable resource for fine-tuning and evaluating models on real-world STEM tasks, setting a new standard for bilingual and multimodal AI development.

This dataset provides a comprehensive evaluation set for tasks assessing reasoning skills in Science, Technology, Engineering, and Mathematics (STEM) subjects. It features questions in both English and Malay, catering to a diverse audience.
The dataset is comprised of two configurations: data_en (English) and data_ms (Malay). Both configurations share the same features and structure.
imgs list.

SUPA leveraged an elite team of 12 UI experts to deliver a highly complex UI optimization curation project.
.png)
SUPA leveraged domain-specific talent to source and label 600k stylized vector images, solving data diversity challenges
.png)