Page 85 - 2024F

P. 85

78 UEC Int’l Mini-Conference No.53

53rd UEC International mini-Conference on Informatics, Sciences and Engineering

Deep Learning Based Japanese Sign Language Translation

Rattapoom KEDTIWERASAK and Hiroki TAKAHASHI
Department of Informatics
The University of Electro-Communications

Introduction Methodology

Focus on:
Japanese Sign Language (JSL) Translation
Convert sign language video to spoken language
text
Serve better accessibility and communication
Knowledge Transferring
Transfer American Sign Language (ASL)
knowledge to JSL
Challenges:
A Scarcity of JSL dataset

Fig. 4 Detail Architecture of the Knowledge Transferring
and Fine-Tuned Translation
Fig. 1 Parallel SL dataset [1] Mask Data
Differences in grammar, vocabulary, and cultural Support autoregressive translation generation
context Self-supervised Video Pretraining
Collect ASL gestures
Supervised Sign Language Translation
Transformer Encoder
Fig. 2 “Weather” Sign Language Symbols [4] Receive knowledge
Proposed Method: Fine-tune
Self-Supervised Video Pretraining for Sign Language Transformer Decoder
Translation (SSVP-SLT) [2] Learn for translation
For Sign Language (SL) knowledge transferring
Transformer-based model
For fine-tuned JSL translation [3] Conclusion
Our proposed SSVP-SLT will be beneficial for the JSL
translation. We will be able to transfer the ASL knowledge
by fine-tuning technique. The proposed model will
overcomes data limitations and demonstrates the potential
Fig. 3 Overall Structure of the Proposed Method of cross-lingual transfer learning in JSL translation. The
proposed method will be useful for better accessibility and
Expected Contributions communication.

We will develop the novel approach to overcome the
scarcity of JSL dataset by transferring knowledge from References
the pre-trained ASL model.
[1] Tanzer, G., & Zhang, B. “YouTube-SL-25: A large-scale, open-domain
multilingual sign language parallel corpus”. (2024).
We will propose the SSVP-SLT to learn ASL dataset, [2] Rust, P., Shi, B., Wang, S., Camgoz, N.C., & Maillard, J. “Towards privacy-
which are then fine-tuned by JSL dataset. aware sign language translation at scale”. In ACL, pp. 8624–8641 (2024).
[3] Camgoz, N. C., Koller, O., Hadfield, S., & Bowden, R. “Sign language
transformers: Joint end-to-end sign language recognition and
The SSVP-SLT will improve the JSL translation to spoken translation”. In CVPR, pp. 10023–10033 (2020).
language with limited data. [4] “Spreadthesign”, https://www.spreadthesign.com/, Accessed: 2025-02-23

80 81 82 83 84 85 86 87 88 89 90