AI doesn't know how to interact with touchscreens (blog.allada.com)

4 points by allada 4 hours ago | 1 comment

lifecodes 3 hours ago
I guess it would be much cheaper to put an API on everything we've built so far than to teach these AIs to control things in the real world the way humans do.
I mean, given the cost of training, building more APIs for everything we have makes sense to me.
What do you think?
- allada 3 hours ago
  I think the big thing I was trying to highlight in this article was the fact that not much effort has been put into spatial and image awareness. In my limited experiments, when I manually asked the models to take an image and highlight things (like "circle all elbows"), they did a great job... but if you ask a model where an elbow is in the image (in pixels), it does a poor job.
  Or maybe put another way, going from `image->model->tool` seems to be an area for improvement.
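
  For anyone who wants to put a number on the "where is it, in pixels" gap, here is a rough sketch of how a model's answer could be scored against a hand-labeled box. Everything in it is hypothetical and for illustration only: the `Box` type, the `score_point` helper, and the example coordinates do not come from the article or from any particular model API.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hand-labeled bounding box in pixel coordinates (hypothetical type)."""
    left: int
    top: int
    right: int
    bottom: int

    def contains(self, x: float, y: float) -> bool:
        # A tap counts as a hit if it lands inside the box, edges included.
        return self.left <= x <= self.right and self.top <= y <= self.bottom

    def center(self) -> tuple[float, float]:
        return ((self.left + self.right) / 2, (self.top + self.bottom) / 2)


def score_point(box: Box, x: float, y: float) -> tuple[bool, float]:
    """Return (hit, distance in pixels from the box center) for a predicted point."""
    cx, cy = box.center()
    return box.contains(x, y), math.hypot(x - cx, y - cy)


# Assumed example: an elbow labeled at (100, 200)-(160, 260).
elbow = Box(left=100, top=200, right=160, bottom=260)
hit, error_px = score_point(elbow, 133, 234)   # point reported by the model
```

  The hit flag answers "would a tap at that point have landed on the target?", while the distance shows how far off a miss was, which is the part that matters once the coordinates are handed to a tool.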