2 14 34

Yanhong Zeng

zengyh1900

https://zengyh1900.github.io/

AI & ML interests

Generative AI for Content Creation.

Recent Activity

authored a paper 4 days ago

Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting

authored a paper 4 days ago

Learning Joint Spatial-Temporal Transformations for Video Inpainting

authored a paper 4 days ago

A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting

View all activity

Organizations

None yet

authored 13 papers 4 days ago

Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting

Paper • 1904.07475 • Published Apr 16, 2019

Learning Joint Spatial-Temporal Transformations for Video Inpainting

Paper • 2007.10247 • Published Jul 20, 2020

A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting

Paper • 2312.03594 • Published Dec 6, 2023 • 2

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions

Paper • 2111.10337 • Published Nov 19, 2021

AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models

Paper • 2505.20255 • Published May 26 • 1

CharacterShot: Controllable and Consistent 4D Character Animation

Paper • 2508.07409 • Published Aug 10 • 39

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23 • 40

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

Paper • 2512.03046 • Published 7 days ago • 11

authored a paper 9 months ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 35

authored a paper 12 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48

authored 5 papers over 1 year ago

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Paper • 2407.17438 • Published Jul 24, 2024 • 26

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

Paper • 2407.08701 • Published Jul 11, 2024 • 13

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Paper • 2406.20085 • Published Jun 28, 2024 • 13

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

Paper • 2407.01494 • Published Jul 1, 2024 • 15

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Paper • 2406.17758 • Published Jun 25, 2024 • 19

Yanhong Zeng

AI & ML interests

Recent Activity

Organizations

zengyh1900's activity