HN2new | past | comments | ask | show | jobs | submitlogin

The author of this post should benchmark his own blog for accessibility metrics, text contrast is dreadful..

On the other hand, this would be interesting for measuring agents in coding tasks, but there's quite a lot of context to provide here, both input and output would be massive.



Pushed a fix. Could you check, please?

Any resources you can recommend to properly tackle this going forward?


Appreciate the feedback, will work on that.


Do you have any insights on the platform evaluation for coding tasks?


One more vote on fixing contrast from me.


Will fix, thanks :)


Tried Evalry, its a really nice concept, thanks for sharing it!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: