JoyAI-VL-Interaction-Preview

An 8B vision-language model from JD, built on Qwen3-VL, that understands images and video clips. Upload an image or short video and ask any question about it.

Model Card · Project Page

JoyAI-VL-Interaction Chat

MultimodalTextbox
Examples