AI models often struggle with bilingual and multimodal STEM tasks due to a lack of high-quality, domain-specific datasets in languages like Malay and English.
We created a curated dataset of 500 Math and Physics questions in Malay and English, complemented by a public leaderboard to benchmark AI model performance.
AI teams now have a reliable resource for fine-tuning and evaluating models on real-world STEM tasks in both Malay and English, including questions that rely on figures.
This dataset provides a comprehensive evaluation set for tasks assessing reasoning skills in Science, Technology, Engineering, and Mathematics (STEM) subjects. It features questions in both English and Malay, catering to a diverse audience.
The dataset comprises two configurations: data_en (English) and data_ms (Malay). Both configurations share the same features and structure.
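As a minimal sketch of working with the two configurations, the Python below loads data_en and data_ms through a caller-supplied loader and checks that both expose the same field names. Only the configuration names come from the description above. The helper names, the toy loader, the repository ID, and the example fields ("question", "answer") are hypothetical placeholders, not the dataset's actual schema.

```python
from typing import Any, Callable, Dict, List, Mapping

# Configuration names as given in the dataset description.
CONFIG_NAMES = ("data_en", "data_ms")


def load_both(loader: Callable[[str, str], Any], repo_id: str) -> Dict[str, Any]:
    """Load each configuration via a caller-supplied loader(repo_id, config_name)."""
    return {name: loader(repo_id, name) for name in CONFIG_NAMES}


def same_features(rows_by_config: Mapping[str, List[Mapping[str, Any]]]) -> bool:
    """Return True if every row in every configuration has the same field names."""
    key_sets = {
        frozenset(row.keys())
        for rows in rows_by_config.values()
        for row in rows
    }
    return len(key_sets) <= 1


def toy_loader(repo_id: str, config_name: str) -> List[Dict[str, str]]:
    """Stand-in loader with made-up rows; the field names are hypothetical."""
    text = "What is 2 + 2?" if config_name == "data_en" else "Berapakah 2 + 2?"
    return [{"question": text, "answer": "4"}]


# "placeholder/repo" is not a real repository ID.
data = load_both(toy_loader, "placeholder/repo")
print(sorted(data), same_features(data))
```

If the dataset is published on a hub that works with the Hugging Face datasets library (an assumption, since the text does not say where it is hosted), datasets.load_dataset could be passed as the loader in place of toy_loader, since it accepts a repository path followed by a configuration name.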
imgs
list.