Replies: 1 comment
-
|
Potential approaches: Soln A: Bigger MoondreamOnce bigger Moondream model is out, it should be a lot better at picking up these semi-trivial target descriptions Soln B: Zoom InZoom in approach: Zoom into the area where the target lives and reprompt moondream for a more accurate click. Challenges:
Find general areaApproaches:
When did it miss?
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Moondream 2B is susceptible to misclick targets with descriptions that are even somewhat complex. Bigger Moondream models are on the way but how should Magnitude address these low-accuracy situations?
Beta Was this translation helpful? Give feedback.
All reactions