Hacker News
new
top
best
ask
show
job
End-to-end model that listens, sees, thinks and responds on video in real time
(
twitter.com
)
1 point
by
dawkins
3 hours ago
1 comment
linzhangrun
3 hours ago
How is the first token latency for real-time scene processing being addressed?