Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@ base_model:
|
|
| 8 |
pipeline_tag: visual-question-answering
|
| 9 |
---
|
| 10 |
|
| 11 |
-
# Pathumma-llm-vision-
|
| 12 |
|
| 13 |
## Model Overview
|
| 14 |
Pathumma-llm-vision-2.0.0-preview is a multi-modal language model fine-tuned for Visual Question Answering (VQA) and Image Captioning tasks. It contains 8 billion parameters and leverages both image and text processing to understand and generate multi-modal content.
|
|
|
|
| 8 |
pipeline_tag: visual-question-answering
|
| 9 |
---
|
| 10 |
|
| 11 |
+
# Pathumma-llm-vision-2.0.0-preview
|
| 12 |
|
| 13 |
## Model Overview
|
| 14 |
Pathumma-llm-vision-2.0.0-preview is a multi-modal language model fine-tuned for Visual Question Answering (VQA) and Image Captioning tasks. It contains 8 billion parameters and leverages both image and text processing to understand and generate multi-modal content.
|