AI models often struggle with bilingual and multimodal STEM tasks due to a lack of high-quality, domain-specific datasets in languages like Malay and English.
We created a curated dataset of 500 Math and Physics questions in Malay and English, complemented by a public leaderboard to benchmark AI model performance.
AI teams now have a reliable resource for fine-tuning and evaluating models on real-world STEM tasks, setting a new standard for bilingual and multimodal AI development.
This dataset provides a comprehensive evaluation set for tasks assessing reasoning skills in Science, Technology, Engineering, and Mathematics (STEM) subjects. It features questions in both English and Malay, catering to a diverse audience.
Key Features
Dataset Structure
The dataset is comprised of two configurations: data_en
(English) and data_ms
(Malay). Both configurations share the same features and structure.
Data Fields
imgs
list.
Discover how SUPA's specialized data labeling services enhanced an autonomous driving company's models, achieving 95% accuracy