AI models often struggle with bilingual and multimodal STEM tasks due to a lack of high-quality, domain-specific datasets in languages like Malay and English.
We created a curated dataset of 500 Math and Physics questions in Malay and English, complemented by a public leaderboard to benchmark AI model performance.
AI teams now have a reliable resource for fine-tuning and evaluating models on real-world STEM tasks, setting a new standard for bilingual and multimodal AI development.
This dataset provides a comprehensive evaluation set for tasks assessing reasoning skills in Science, Technology, Engineering, and Mathematics (STEM) subjects. It features questions in both English and Malay, catering to a diverse audience.
The dataset is comprised of two configurations: data_en
(English) and data_ms
(Malay). Both configurations share the same features and structure.
imgs
list.
Discover how SUPA's specialized data labeling services enhanced an autonomous driving company's models, achieving 95% accuracy
SUPA helped a mobile app design company scale their interface database 16x in 3 months with 90% accuracy, freeing resources for AI innovation.