Preserving Telugu Language and Culture through AI-Driven Datathon
The Government of Telangana’s Information Technology, Electronics, and Communications (ITE&C) department has embarked on a groundbreaking initiative to create a Telugu Large Language Model (LLM). This ambitious project aims to collect and digitize a diverse range of Telugu linguistic and cultural resources, including folk tales, songs, local histories, and traditional knowledge about food and cuisine.
In a collaborative effort with Swecha, the datathon will bring together participants from various engineering colleges across Telangana to gather data from oral sources. This innovative event is facilitated by Emerging Technologies Wing, with the Telangana Academy of Skill and Knowledge (TASK) serving as the outreach partner. IIIT Hyderabad will be the research partner, and Ozonetel, DigiQuanta, and TechVedika are the industry partners.
Participants will engage with rich oral traditions to preserve and promote the Telugu language and culture.
Earlier this year, Ozonetel, in collaboration with Swecha movement, developed a 7 billion parameter Telugu small language model (SLM) called ‘AI Chandamama Kathalu.’ This achievement underscores the potential of AI-driven initiatives in promoting regional languages and cultural heritage.
The upcoming Global AI Summit in Hyderabad, slated for September, will provide a platform for stakeholders to converge and discuss the possibilities and challenges of AI adoption in various sectors. The Telugu LLM datathon will serve as a precursor to this event, showcasing the power of collaborative efforts in harnessing AI for social impact.
“By engaging with these rich oral traditions, the datathon will play a crucial role in preserving and promoting the Telugu language and its cultural heritage.” - [ Organizer’s quote ]
Telangana’s lush landscapes, rich in cultural heritage, will serve as the backdrop for the datathon.
The Telugu LLM datathon signals a significant step forward in the use of AI for social good. By empowering local communities to take ownership of their cultural heritage, this initiative has the potential to inspire similar efforts across India. As we navigate the complexities of language and culture in the digital age, the Telugu LLM datathon offers a beacon of hope for the preservation and promotion of regional languages and cultural diversity.