google/paligemma2-10b-mix-448
Image-Text-to-Text • 1.2k • 36
Google ❤️ Open Source AI
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment