SignGen: An LLM-Based Thai Sign Language Generator

Abstract

The Thai Sign Language Generation System aims to create a comprehensive 3D modeling and animation platform that translates Thai sentences into dynamic and accurate representations of Thai Sign Language (TSL) gestures. This project enhances communication for the Thai deaf community by leveraging a landmark-based approach using a Vector Quantized Variational Autoencoder (VQVAE) and a Large Language Model (LLM) for sign language generation. The system first trains a VQVAE encoder using landmark data extracted from sign videos, allowing it to learn compact latent representations of TSL gestures. These encoded representations are then used to generate additional landmark-based sign sequences, effectively expanding the training dataset using the BigSign ThaiPBS dataset. Once the dataset is augmented, an LLM is trained to output accurate landmark sequences from Thai text inputs, which are then used to animate a 3D model in Blender, ensuring fluid and natural TSL gestures. The project is implemented using Python, incorporating MediaPipe for landmark extraction, OpenCV for real-time image processing, and Blender’s Python API for 3D animation. By integrating AI, VQVAE-based encoding, and LLM-driven landmark generation, this system aspires to bridge the communication gap between written Thai text and expressive TSL gestures, providing the Thai deaf community with an interactive, real-time sign language animation platform.

Objective

การสื่อสารระหว่างบุคคลที่ได้ยินกับผู้พิการทางการได้ยินหรือบุคคลที่ไม่สามารถใช้คำพูดเป็นอุปสรรคสำคัญ โดยเฉพาะเมื่อบุคคลที่ได้ยินไม่มีความคุ้นเคยกับภาษามือไทย (TSL) ภาษามือไทยมีความซับซ้อนและต้องใช้การฝึกฝน ความตั้งใจ และประสบการณ์อย่างมากจึงจะเชี่ยวชาญได้ ซึ่งมักต้องใช้เวลาหลายเดือนของการศึกษาอย่างต่อเนื่อง แม้แต่เพื่อให้ได้ระดับความคล่องแคล่วขั้นพื้นฐาน สิ่งนี้ก่อให้เกิดช่องว่างด้านการเข้าถึง โดยเฉพาะกับผู้พิการทางการได้ยินที่อาจไม่เคยเรียนรู้ไวยากรณ์หรือภาษาไทยมาตรฐาน อาจพบความยากลำบากในการทำความเข้าใจข้อความภาษาไทยที่เขียน ซึ่งยิ่งทำให้การสื่อสารเป็นเรื่องที่ท้าทายยิ่งขึ้น แม้ว่าการใช้ข้อความเขียนอาจดูเหมือนเป็นทางออกหนึ่งของการสื่อสาร แต่ในความเป็นจริงอาจไม่ได้ผลเสมอไป เนื่องจากผู้พิการทางการได้ยินบางคนอาจไม่สามารถอ่านหรือทำความเข้าใจข้อความภาษาไทยได้อย่างคล่องแคล่ว อันเนื่องมาจากการไม่ได้สัมผัสกับไวยากรณ์ของภาษาพูดแบบดั้งเดิม นี่จึงเป็นเหตุผลสำคัญที่ต้องมีเครื่องมือแปลภาษาแบบเรียลไทม์ที่สามารถแปลงข้อความภาษาไทยเป็นภาษามือไทยได้อย่างถูกต้อง เพื่อช่วยลดช่องว่างนี้ และส่งเสริมให้เกิดปฏิสัมพันธ์ที่มีความหมายระหว่างผู้พิการทางการได้ยิน บุคคลที่ไม่สามารถใช้คำพูด และบุคคลที่ได้ยิน

Other Innovations

คณะวิศวกรรมศาสตร์

SOH Estimation for Li-ion battery

Currently, lithium batteries are widely used in electronic devices and electric vehicles, making the estimation of their State of Health (SOH) crucial. Accurate SOH estimation helps extend battery lifespan, reduce maintenance costs, and prevent safety issues such as overheating or explosions. This project aims to study and analyze mathematical models of batteries and develop SOH estimation techniques using Neural Networks to enhance accuracy and evaluation speed. The experiment involved collecting charge and discharge data from three lithium battery cells under controlled temperature conditions while maintaining a constant current. The current, voltage, and time data were recorded and analyzed to determine the battery capacity for each cycle. These data were then used to train a Neural Network model. The results demonstrated an effective method for predicting battery health status. The outcomes of this project can contribute to the development of a Battery Management System (BMS) that improves battery efficiency and longevity. Additionally, it provides a foundation for applying artificial intelligence techniques in the energy sector effectively.

คณะอุตสาหกรรมอาหาร

Designed Quality coffee from fermentation

Coffee is a critical agricultural commodity to be used to produce a premium beverage to serve people worldwide. Coffee microbiome turned to be an essential tool to improve the bean quality through the natural fermentation. Therefore, understanding the microbial diversities could create the final product's better quality. This study investigated the natural microbial consortium during the wet process fermentation of coffee onsite in Thailand to characterize the microorganisms involved in correlation toward the biochemical characteristics and metabolic attributes. Roasting is another important step in developing the complex flavor/ aroma that make coffee to be enjoyable. During the roasting process, the beans undergo many complex and alternatively change in the physicochemical properties from the gained substances in the fermentation process. The changing in the formation of the substances responsible for the sensory qualities, physicochemical/ aroma attributes as well as the health benefits of the final product. Using the starter culture could also develop the distinguished characteristics of coffee (Research collaboration with Van Hart company)

คณะอุตสาหกรรมอาหาร

Unpolished Rice Yogurt with Trio Probiotic Popping Pearls and Healthy Rice Cereal

This black rice yogurt combines Trio Probiotic popping pearls and healthy rice cereal, rich in anthocyanins that help slow down bodily aging. It contains 3 types of probiotics to support gut balance and enhance digestive efficiency. This zero-waste product repurposes rice residue from the production process into nutritious cereal, offering a delicious and health-packed experience in one cup.