QVQ-Max: Think with Evidence
Qwen released QVQ-Max, a visual reasoning model successor to QVQ-72B-Preview, capable of understanding and reasoning across images, video, math, code, and creative tasks.
Excerpt
QWEN CHAT GITHUB HUGGING FACE MODELSCOPE DISCORD
Introduction Last December, we launched QVQ-72B-Preview as an exploratory model, but it had many issues. Today, we are officially releasing the first version of QVQ-Max, our visual reasoning model. This model can not only “understand” the content in images and videos but also analyze and reason with this information to provide solutions. From math problems to everyday questions, from programming code to artistic creation, QVQ-Max has demonstrated impressive capabilities.
Read at source: https://qwenlm.github.io/blog/qvq-max-preview/