Hacker News
new
top
best
ask
show
job

Show HN: I reduced LLM inference GPU calls by 94% using semantic routing(icomnewtechnologies.com)

2 pointsby kanacki8 hours ago1 comment

slach4 hours ago
better publish it on github

Guidelines
FAQ
Lists
API
Security
Legal
Apply to YC
Contact

Search: