# Real-Time Azerbaijani Sign Language Recognition System: An Accessible AI Solution Combining MediaPipe and LSTM

> Based on MediaPipe hand key point detection and LSTM neural network, this system achieves real-time recognition of 100 Azerbaijani Sign Language vocabulary words, using over 7248 training samples to build accessible communication technology.

- 板块: [Openclaw Geo](https://www.zingnex.cn/en/forum/board/openclaw-geo)
- 发布时间: 2026-06-11T23:15:07.000Z
- 最近活动: 2026-06-11T23:22:38.430Z
- 热度: 154.9
- 关键词: 手语识别, 阿塞拜疆手语, MediaPipe, LSTM, 深度学习, 计算机视觉, 无障碍技术, 实时识别, 序列建模, 人机交互
- 页面链接: https://www.zingnex.cn/en/forum/thread/mediapipelstmai-3e8ab6c8
- Canonical: https://www.zingnex.cn/forum/thread/mediapipelstmai-3e8ab6c8
- Markdown 来源: floors_fallback

---

## [Project Introduction] Real-Time Azerbaijani Sign Language Recognition System: An Accessible AI Solution Combining MediaPipe and LSTM

This project was developed by Kage-develop and released on GitHub on June 11, 2026 (link: https://github.com/Kage-develop/azerbaijani-sign-language). Its core is combining MediaPipe hand key point detection with LSTM neural network to achieve real-time recognition of 100 common Azerbaijani Sign Language vocabulary words, using over 7248 training samples to build accessible communication technology, aiming to break the language barrier between hearing-impaired people and hearing people.

## Project Background and Significance: A Technical Solution for Communication Barriers of Hearing-Impaired People

Sign language is the main communication method for hearing-impaired people, but the proportion of hearing people who master sign language is extremely low, leading to communication barriers. About 70 million hearing-impaired people worldwide use sign language. Azerbaijani Sign Language (AzSL) has a unique grammar and vocabulary system. Sign language recognition needs to process multi-modal information such as hand movements in 3D space, which is technically challenging. In recent years, the development of computer vision and deep learning has brought possibilities for sign language recognition, and camera-based systems have advantages such as non-invasiveness and low cost.

## Technical Architecture: End-to-End Recognition Solution with MediaPipe + LSTM

The project combines MediaPipe hand key point detection (recognizing 21 key points: wrist, finger joints, palm) with LSTM temporal modeling. MediaPipe is lightweight and can reach over 30 FPS on CPU; LSTM processes sequence data, learning short-term patterns, long-term dependencies, and temporal dynamics of gestures, with input as a sequence of key point coordinates.

## Dataset and Training: Construction and Preprocessing of Over 7248 Samples

The project uses over 7248 training samples covering 100 Azerbaijani Sign Language vocabulary words. Data collection strategies may include multi-signer collection, multi-angle shooting, standardized environment, and vocabulary balance. Preprocessing steps: frame extraction → key point detection → coordinate normalization → sequence alignment → data augmentation.

## Deployment and Application: Real-Time Inference Scenarios for Multi-Device Adaptation

The system supports real-time inference with ordinary cameras and can be deployed on: 1. Desktop applications (educational scenarios, interaction between hearing-impaired students and computers); 2. Mobile devices (potential adaptation to Android/iOS); 3. Embedded systems (edge devices such as Raspberry Pi, suitable for offline/privacy scenarios).

## Technical Limitations and Future Directions: Key Issues to Be Addressed

Current system limitations: Only supports isolated word recognition (natural sign language is continuous); vocabulary needs to be expanded (100 words as a base but more are needed); lacks multi-modal fusion (facial expressions, body posture, etc.); needs to improve robustness to Azerbaijani Sign Language dialects/individual differences. Future directions include addressing these issues.

## Summary and Outlook: The Future of AI-Powered Accessible Communication

This project is an important application of AI in the accessibility field, providing a low-cost and easy-to-deploy solution. In the future, with technological progress and dataset expansion, it is expected to achieve higher accuracy and practicality, with the goal of natural human-machine sign language interaction. For developers, the project's technology stack is mature (Python, TensorFlow/PyTorch, MediaPipe), suitable for rapid prototyping.
