

GLM-4.1V-Thinking: Image+Text > Text
Intern-S1: Image+Text > Text
Wan 2.2 - Text +Image > video
Skywork-R1V3: Image+Text > Text
Skywork-UniPic: Text > Image / Image > Text
Tar-7B: Any-to-Any
Ming-Lite-Omni-1.5: Any-to-Any
Step3: Image+Text > Text
HunyuanWorld-1: Image > 3D
ThinkSound: Video > Audio
Neta-Lumina: Text > Image
SmallThinker runs on 1GB RAM
Qwen3-Coder: fully spec'd tool calling
GLM-4.5: browser agents, IDE assistant
Qwen3 WebDev demo: text-to-frontend code
Science one S1: Scientific model
Agentar DeepFinance: Finance dataset
ObjectClear: Interactive Vision Tool
Qwen3 MT Demo: Machine Translation Tool





