HN2
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Browser Agent Benchmark: Comparing LLM models for web automation
(
browser-use.com
)
13 points
by
MagMueller
5 days ago
|
hide
|
past
|
favorite
|
5 comments
wiradikusuma
5 days ago
|
next
[–]
Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?
reply
pixel_popping
5 days ago
|
prev
|
next
[–]
It's lacking the best model (Opus 4.5) on the benchmark tho.
reply
djohnston
5 days ago
|
parent
|
next
[–]
Yeah but then their own product might not score the highest.
reply
pixel_popping
4 days ago
|
root
|
parent
|
next
[–]
Exactly why I'm pointing it out, which feels a bit corrupt, but understandable.
reply
djohnston
4 days ago
|
root
|
parent
|
next
[–]
tbh i was a bit cranky yesterday - even if they are #2 on a legit benchmark that would be impressive
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
reply