Efficient-Large-Model
/

Sana_1600M_4Kpx_BF16_diffusers

4Kpx_based_image_size

Model card Files Files and versions

Lawrence-cj commited on Jan 10, 2025

Commit

ed51b70

·

verified ·

1 Parent(s): 0b3fa0d

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -51,7 +51,7 @@ Source code is available at https://github.com/NVlabs/Sana.
 - **Model type:** Linear-Diffusion-Transformer-based text-to-image generative model
 - **Model size:** 1648M parameters
 - **Model resolution:** This model is developed to generate 4Kpx based images with multi-scale heigh and width.
-- **License:** [CC BY-NC-SA 4.0 License](./LICENSE.txt)
 - **Model Description:** This is a model that can be used to generate and modify images based on text prompts.
 It is a Linear Diffusion Transformer that uses one fixed, pretrained text encoders ([Gemma2-2B-IT](https://huggingface.co/google/gemma-2-2b-it))
 and one 32x spatial-compressed latent feature encoder ([DC-AE](https://hanlab.mit.edu/projects/dc-ae)).

 - **Model type:** Linear-Diffusion-Transformer-based text-to-image generative model
 - **Model size:** 1648M parameters
 - **Model resolution:** This model is developed to generate 4Kpx based images with multi-scale heigh and width.
+- **License:** [NSCL v2-custom](./LICENSE.txt). Governing Terms:  NVIDIA License.  Additional Information:  [Gemma Terms of Use  |  Google AI for Developers](https://ai.google.dev/gemma/terms) for Gemma-2-2B-IT, [Gemma Prohibited Use Policy  |  Google AI for Developers](https://ai.google.dev/gemma/prohibited_use_policy).
 - **Model Description:** This is a model that can be used to generate and modify images based on text prompts.
 It is a Linear Diffusion Transformer that uses one fixed, pretrained text encoders ([Gemma2-2B-IT](https://huggingface.co/google/gemma-2-2b-it))
 and one 32x spatial-compressed latent feature encoder ([DC-AE](https://hanlab.mit.edu/projects/dc-ae)).