| --- |
| license: apache-2.0 |
| datasets: |
| - VinayHajare/Marathi-Sign-Language |
| language: |
| - en |
| base_model: |
| - google/siglip2-base-patch16-224 |
| pipeline_tag: image-classification |
| library_name: transformers |
| tags: |
| - Marathi-Sign-Language-Detection |
| - SigLIP2 |
| - 93M |
| --- |
| |
| %3C!----%3E%3C%2Ftd%3E%3C%2Ftr%3E%3Ctr id="L18"> | |
|
| # Marathi-Sign-Language-Detection |
|
|
| > Marathi-Sign-Language-Detection is a vision-language model fine-tuned from google/siglip2-base-patch16-224 for multi-class image classification. It is trained to recognize Marathi sign language hand gestures and map them to corresponding Devanagari characters using the SiglipForImageClassification architecture. |
|
|
| ```py |
| Classification Report: |
| precision recall f1-score support |
| |
| अ 0.9881 0.9911 0.9896 1009 |
| आ 0.9926 0.9237 0.9569 1022 |
| इ 0.8132 0.9609 0.8809 1101 |
| ई 0.9424 0.8894 0.9151 1103 |
| उ 0.9477 0.9073 0.9271 1198 |
| ऊ 0.9436 1.0000 0.9710 1071 |
| ए 0.9153 0.9378 0.9264 1141 |
| ऐ 0.7790 0.8871 0.8295 1089 |
| ओ 0.9188 0.9581 0.9381 1075 |
| औ 1.0000 0.9226 0.9598 1021 |
| क 0.9566 0.9160 0.9358 1083 |
| क्ष 0.9287 0.9667 0.9473 1200 |
| ख 0.9913 1.0000 0.9956 1140 |
| ग 0.9753 0.9982 0.9866 1109 |
| घ 0.8398 0.7908 0.8146 1200 |
| च 0.9388 0.9016 0.9198 1158 |
| छ 0.9764 0.8127 0.8870 1169 |
| ज 0.9599 0.9967 0.9779 1200 |
| ज्ञ 0.9878 0.9483 0.9677 1200 |
| झ 0.9939 0.9567 0.9749 1200 |
| ट 0.8917 0.8992 0.8954 1200 |
| ठ 0.9075 0.8425 0.8738 1200 |
| ड 0.9354 0.9900 0.9619 1200 |
| ढ 0.8616 0.9025 0.8816 1200 |
| ण 0.9114 0.9425 0.9267 1200 |
| त 0.9280 0.9025 0.9151 1200 |
| थ 0.9388 0.9717 0.9550 1200 |
| द 0.8648 0.9275 0.8951 1200 |
| ध 0.9876 0.9917 0.9896 1200 |
| न 0.7256 0.8967 0.8021 1200 |
| प 0.9991 0.9683 0.9835 1200 |
| फ 0.8909 0.8575 0.8739 1200 |
| ब 0.9814 0.7917 0.8764 1200 |
| भ 0.9758 0.8383 0.9018 1200 |
| म 0.8121 0.8142 0.8132 1200 |
| य 0.5726 0.9133 0.7039 1200 |
| र 0.7635 0.7339 0.7484 1210 |
| ल 0.9239 0.8800 0.9014 1200 |
| ळ 0.8950 0.7533 0.8181 1200 |
| व 0.9597 0.7542 0.8446 1200 |
| श 0.8829 0.8667 0.8747 1200 |
| स 0.8449 0.8758 0.8601 1200 |
| ह 0.9604 0.8883 0.9229 1200 |
| |
| accuracy 0.9027 50099 |
| macro avg 0.9117 0.9039 0.9051 50099 |
| weighted avg 0.9107 0.9027 0.9040 50099 |
| ``` |
|
|
| --- |
|
|
| ## Label Space: 43 Classes |
|
|
| The model classifies a hand sign into one of the following 43 Marathi characters: |
|
|
| ```json |
| "id2label": { |
| "0": "अ", "1": "आ", "2": "इ", "3": "ई", "4": "उ", "5": "ऊ", |
| "6": "ए", "7": "ऐ", "8": "ओ", "9": "औ", "10": "क", "11": "क्ष", |
| "12": "ख", "13": "ग", "14": "घ", "15": "च", "16": "छ", "17": "ज", |
| "18": "ज्ञ", "19": "झ", "20": "ट", "21": "ठ", "22": "ड", "23": "ढ", |
| "24": "ण", "25": "त", "26": "थ", "27": "द", "28": "ध", "29": "न", |
| "30": "प", "31": "फ", "32": "ब", "33": "भ", "34": "म", "35": "य", |
| "36": "र", "37": "ल", "38": "ळ", "39": "व", "40": "श", "41": "स", "42": "ह" |
| } |
| ``` |
|
|
| --- |
|
|
| ## Install Dependencies |
|
|
| ```bash |
| pip install -q transformers torch pillow gradio |
| ``` |
|
|
| --- |
|
|
| ## Inference Code |
|
|
| ```python |
| import gradio as gr |
| from transformers import AutoImageProcessor, SiglipForImageClassification |
| from PIL import Image |
| import torch |
| |
| # Load model and processor |
| model_name = "prithivMLmods/Marathi-Sign-Language-Detection" # Replace with actual path |
| model = SiglipForImageClassification.from_pretrained(model_name) |
| processor = AutoImageProcessor.from_pretrained(model_name) |
| |
| # Marathi label mapping |
| id2label = { |
| "0": "अ", "1": "आ", "2": "इ", "3": "ई", "4": "उ", "5": "ऊ", |
| "6": "ए", "7": "ऐ", "8": "ओ", "9": "औ", "10": "क", "11": "क्ष", |
| "12": "ख", "13": "ग", "14": "घ", "15": "च", "16": "छ", "17": "ज", |
| "18": "ज्ञ", "19": "झ", "20": "ट", "21": "ठ", "22": "ड", "23": "ढ", |
| "24": "ण", "25": "त", "26": "थ", "27": "द", "28": "ध", "29": "न", |
| "30": "प", "31": "फ", "32": "ब", "33": "भ", "34": "म", "35": "य", |
| "36": "र", "37": "ल", "38": "ळ", "39": "व", "40": "श", "41": "स", "42": "ह" |
| } |
| |
| def classify_marathi_sign(image): |
| image = Image.fromarray(image).convert("RGB") |
| inputs = processor(images=image, return_tensors="pt") |
| |
| with torch.no_grad(): |
| outputs = model(**inputs) |
| logits = outputs.logits |
| probs = torch.nn.functional.softmax(logits, dim=1).squeeze().tolist() |
| |
| prediction = { |
| id2label[str(i)]: round(probs[i], 3) for i in range(len(probs)) |
| } |
| |
| return prediction |
| |
| # Gradio Interface |
| iface = gr.Interface( |
| fn=classify_marathi_sign, |
| inputs=gr.Image(type="numpy"), |
| outputs=gr.Label(num_top_classes=5, label="Marathi Sign Classification"), |
| title="Marathi-Sign-Language-Detection", |
| description="Upload an image of a Marathi sign language hand gesture to identify the corresponding character." |
| ) |
| |
| if __name__ == "__main__": |
| iface.launch() |
| ``` |
|
|
| --- |
|
|
| ## Intended Use |
|
|
| Marathi-Sign-Language-Detection can be applied in: |
|
|
| * Educational platforms for learning regional sign language. |
| * Assistive communication tools for Marathi-speaking users with hearing impairments. |
| * Interactive applications that translate signs into text. |
| * Research and data collection for sign language development and recognition. |