| 1. | | Show HN: Stirrup – A lightweight and customizable foundation for building agents (github.com/artificialanalysis) |
| 2 points by Gcam 6 months ago | past |
|
| 2. | | MicroEvals – Easily run vibe checks against models (artificialanalysis.ai) |
| 3 points by Gcam 12 months ago | past |
|
| 3. | | From GPT-4 to Mistral 7B, there is a 300x range in the cost of LLM inference (twitter.com/artificialanlys) |
| 2 points by Gcam on Feb 12, 2024 | past |
|
| 4. | | Show HN: LLM Benchmarks Leaderboard with 60 model and API host combinations (artificialanalysis.ai) |
| 3 points by Gcam on Feb 7, 2024 | past | 1 comment |
|
| 5. | | Mistral API reduces time to first token by 10x (only place for Mistral Medium) (twitter.com/artificialanlys) |
| 4 points by Gcam on Feb 5, 2024 | past |
|
| 6. | | 240 Tokens/s achieved by Groq's custom chips on Llama 2 Chat (70B) (twitter.com/artificialanlys) |
| 5 points by Gcam on Jan 31, 2024 | past |
|
| 7. | | New GPT-4 Turbo (0125 Preview) slightly faster per initial benchmarks (twitter.com/artificialanlys) |
| 2 points by Gcam on Jan 26, 2024 | past |
|
| 8. | | Benchmarks and comparison of LLM AI models and API hosting providers (artificialanalysis.ai) |
| 152 points by Gcam on Jan 16, 2024 | past | 70 comments |
|
| 9. | | Taro – Write React Code for Web/React Native/Other (googleusercontent.com) |
| 1 point by Gcam on June 10, 2018 | past |
|