In a move aimed at boosting the representation of Southeast Asian languages in artificial intelligence (AI), Sony Research and AI Singapore (AISG) have announced a new partnership. The collaboration focuses on the development of SEA-LION, a large language model designed to enhance AI's capabilities in understanding and processing the diverse languages spoken across Southeast Asia.
SEA-LION, which stands for "Southeast Asian Languages in One Network," is being developed to tackle the region's linguistic complexity. Southeast Asia is home to over a thousand languages, and SEA-LION aims to improve the accuracy and relevance of AI models in capturing this diversity. According to PR Newswire, the collaboration between Sony and AISG seeks to increase Southeast Asian representation within the global landscape of large language models (LLMs).
Hiroaki Kitano, President of Sony Research, highlighted the importance of diversity and localization in AI development. “In Southeast Asia specifically, there are more than a thousand different languages spoken by the citizens of the region. This linguistic diversity underscores the importance of ensuring AI models and tools are designed to support the needs of all populations around the world,” said Kitano.
A key focus of the collaboration will be enhancing SEA-LION's proficiency in Tamil, a language spoken by an estimated 60 to 85 million people worldwide. Sony Research plans to use its extensive expertise in Indian languages, speech generation, content analysis, and recognition to refine the model’s capabilities in Tamil and other Southeast Asian languages.
Kitano further emphasized the broader goal of the project: “Access to LLMs that address the global landscape of language and culture has been a barrier to driving research and developing new technologies that are representative and equitable for the global populations we serve."
Sony's strong ties to Singapore’s technology ecosystem will play a significant role in the partnership. Kitano serves on several key advisory councils in the country, including the Advisory Council on the Ethical Use of AI and Data, and the Infocomm Media Development Authority (IMDA). These relationships are expected to bolster the SEA-LION project’s success.
AISG’s Deputy Director, Teo, also expressed excitement over the project, noting the potential impact of SEA-LION on Southeast Asian languages. “We are particularly eager to contribute to the testing and refinement of the SEA-LION models for Tamil and other Southeast Asian languages, while also sharing our expertise and best practices in LLM development,” Teo stated.As AI continues to evolve, the partnership between Sony and AI Singapore marks a significant step forward in ensuring that the linguistic diversity of Southeast Asia is recognized and represented within the global AI landscape.