HY World 2.0 Demo
A Multi-Modal World Model for Reconstruction
Add pydantic==2.10.6 to requirements.txt, or upgrade Gradio to the latest version.
Use torch>=2.2.0 for Zero GPU Spaces.
Use transformers<=4.49.0 for Spaces using Transformers or Diffusers.
Downgrade huggingface_hub to the old version (huggingface_hub==0.25.2) if an error like "cached_download is not available" occurs or inference does not work properly.
WORKDIR in a Dockerfile may cause the application to fail to start with error 137 (Docker Spaces; see https://discuss.huggingface.co/t/error-code-137-cache-error/152177).
Identify human poses in images
Turn a drawing into a real photo
Generate detailed captions for any image
Rich Prompt Collections for Image Generation
Vectorizer AI | Convert Image to SVG
Extract 68-point landmarks from MediaPipe's 468 landmarks
Text-guided object tracking, point tracking, reasoning.
Generate depth map and camera data from a photo
Generate a masked image by detecting and segmenting objects
Tag images with descriptive labels