Story Detail of id 48457965 | Liveview Hacker News

wongarsu 12 hours ago | on: OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision

YOLO has basically solved that for my use cases for a couple of years now. If you need classes that aren't among the pretrained labels, it's also easy to fine-tune, provided you're willing to label 200 or so images.

If you need something less restricted to existing labels (say, wanting all the red apples, or all cardboard signs), SAM3 is great, as the sibling comment says.

IanCal 11 hours ago | parent | next

> provided you're willing to label 200 or so images

A quick note to say that the labeling is also a task you can hand to something like Gemini.

IX-103 3 hours ago | parent

How do you handle object disambiguation with YOLO? All the examples I've played with have the same problem: if two "cars" get too close to each other, the tracking IDs keep switching between them, meaning we'd need an additional kinematic (motion) model for disambiguation.
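
To make the ID-switch problem concrete, here is a minimal sketch, not anyone's actual pipeline. It assumes each detection has already been reduced to a box center, and it uses a toy constant-velocity prediction with brute-force assignment. The names `Track`, `associate`, and `run` are made up for illustration. Trackers in the SORT family do something similar in spirit, with a Kalman filter and Hungarian assignment in place of these simplifications.

```python
from dataclasses import dataclass
from itertools import permutations
import math


@dataclass
class Track:
    track_id: int
    pos: tuple[float, float]
    vel: tuple[float, float] = (0.0, 0.0)  # displacement per frame

    def predict(self) -> tuple[float, float]:
        # Constant-velocity guess for where this object will be next frame.
        return (self.pos[0] + self.vel[0], self.pos[1] + self.vel[1])


def associate(tracks, detections, use_motion):
    """Pick the detection index for each track that minimizes total distance.

    Brute force over permutations: fine for a handful of objects,
    far too slow for real scenes.
    """
    anchors = [t.predict() if use_motion else t.pos for t in tracks]
    best = min(
        permutations(range(len(detections)), len(tracks)),
        key=lambda perm: sum(
            math.dist(a, detections[j]) for a, j in zip(anchors, perm)
        ),
    )
    return list(best)


def run(frames, use_motion):
    """Track objects through frames; return the final position of each track."""
    tracks = [Track(i, p) for i, p in enumerate(frames[0])]
    for detections in frames[1:]:
        assignment = associate(tracks, detections, use_motion)
        for track, j in zip(tracks, assignment):
            new_pos = detections[j]
            track.vel = (new_pos[0] - track.pos[0], new_pos[1] - track.pos[1])
            track.pos = new_pos
    return [t.pos for t in tracks]


# Two "cars" passing each other in adjacent lanes (y = 0.0 and y = 0.5).
# Track 0 starts on the left and drives right; track 1 does the opposite.
frames = [
    [(2.0 * t, 0.0), (10.0 - 2.0 * t, 0.5)]
    for t in range(6)
]

without_motion = run(frames, use_motion=False)
with_motion = run(frames, use_motion=True)
print("track 0 ends at", without_motion[0], "without a motion model")
print("track 0 ends at", with_motion[0], "with constant-velocity prediction")
```

Without the prediction step, track 0 latches onto the other car at the moment they pass, because the nearest detection to its last position belongs to the wrong vehicle. With the prediction, each track is matched against where it was heading, so the IDs survive the crossing. Real footage adds occlusion, missed detections, and acceleration, which this sketch ignores.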
