Hacker news

Top
New
Past
Ask
Show
Jobs

OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision (https://opencv.org)

21 points by ternaus 1 day ago | 3 comments | View on ycombinator

leoncos 1 day ago |

When I use Codex/Claude to complete a computer vision task, such as extracting assets from an image, OpenCV is their default solution. However, I believe that using YOLO and other methods is outdated. The best solution now is to directly use Nano Banana or other AI image models. A paper has proven that image generation models can perform most CV tasks well. I believe the new OpenCV should become a wrapper for VLM or AI image models.

hbcondo714 about 15 hours ago |

> LLMs and VLMs, Running Inside OpenCV…Qwen 2.5, Gemma 3, PaliGemma, and the GPT-2 / GPT-4 family

Why these specific models / versions?