DeepSeek released 'Thinking-with-Visual-Primitives' framework
DeepSeek, Peking University, and Tsinghua release 'Thinking with Visual Primitives,' a multimodal reasoning framework that elevates spatial tokens—coordinates and bounding boxes—into minimal units of thought, enabling models to 'point' within images during chain-of-thought reasoning.
Excerpt
https://preview.redd.it/47r9qee44cyg1.png?width=1450&format=png&auto=webp&s=0d6f9687115be6ff96d0a194d95232ac0413a7e9
DeepSeek, in collaboration with Peking University and Tsinghua University, has released the paper "Thinking with Visual Primitives" along with its open-source repository, introducing a new multimodal reasoning framework. The core approach of this framework is to elevate spatial tokens—specifically coordinate points and bounding boxes—into the "minimal units of thought" within the model's chain-of-thought. These are directly interleaved during the reasoning process, enabling the model to "point" to specific locations within an image while it "thinks."
[https://github.com/deepseek-ai/Thinking-with-Visual-Primitives](https://github.com/deepseek-ai/Thinking-with-Visual-Primitives)
https://preview.redd.it/lt5qu53g0cyg1.png?width=1844&format=png&auto=webp&s=5d6f0a8de6481035faa22c9d57873c51ca97b1fb
**notice: deepseek removed the repo**
Read at source: https://www.reddit.com/r/LocalLLaMA/comments/1szwi1d/deepseek_released_thinkingwithvisualprimitives/
Discussions
- reddit · 101 points · 10 comments
- reddit · 123 points · 11 comments
- reddit · 142 points · 13 comments
- reddit · 159 points · 12 comments
- reddit · 168 points · 14 comments
- reddit · 188 points · 15 comments
- reddit · 198 points · 15 comments
- reddit · 198 points · 17 comments
- reddit · 202 points · 18 comments
- reddit · 209 points · 18 comments
- reddit · 216 points · 18 comments
- reddit · 220 points · 18 comments
- reddit · 224 points · 18 comments
- reddit · 231 points · 19 comments
- reddit · 236 points · 19 comments
- reddit · 239 points · 19 comments
- reddit · 249 points · 20 comments
- reddit · 251 points · 20 comments
- reddit · 253 points · 20 comments
- reddit · 256 points · 20 comments
- reddit · 253 points · 20 comments
- reddit · 260 points · 20 comments
- reddit · 258 points · 22 comments
- reddit · 259 points · 22 comments
- reddit · 267 points · 24 comments
- reddit · 266 points · 24 comments
- reddit · 267 points · 24 comments
- reddit · 269 points · 24 comments
- reddit · 272 points · 24 comments
- reddit · 274 points · 24 comments
- reddit · 278 points · 24 comments
- reddit · 281 points · 24 comments
- reddit · 285 points · 24 comments
- reddit · 287 points · 24 comments
- reddit · 286 points · 25 comments
- reddit · 288 points · 25 comments