Hacker News .hnnew | past | comments | ask | show | jobs | submit | moconnor's commentslogin

Seems like China is using them in buses already?

https://english.cas.cn/newsroom/cas-in-media/202606/t2026060...


Literally the very first time I used ChatGPT. I had already been experimenting with GPT3 for various jokes and games via the API but the naturalness of it as a chat interface that understood you changed everything.

The first time I used a terminal agent was another one.


“Flow” moves agents through a yaml flowchart of prompts and decisions. It’s working quite well for a couple of us in Tenstorrent, more to discover here though:

https://github.com/yieldthought/flow

Happily, 5.5 is good at writing and using it.


I find Flow really interesting, thanks for pointing it out.

Deterministic workflows using AI to help perform those steps not requiring human input has been an area of interest for me for some time. Particularly interesting how you are using the AI to determine what a step has achieved and the action of the next step.

Combine it with workflow elements that does handle human steps together with a notification/routing/task system would make for a helpful system for so many.


This is the future of all software; the benefits of making it accessible to agents are overwhelming.


I don’t want to play a game that was playtested by an AI (and likely built by one as well). Games have a certain aspect that requires such finesse that no machine will ever be able to solve, test, capture or design: fun.

If your game was built and playtested by AI, it is not a game worth playing.


This is an incredibly good first approximation and is often all you need.

Source: spent a couple of years developing an energy and performance profiler for cpus and gpus with various government labs.


Very large, fast, read-only memory now has an incredible use-case: NN weights.


And that's described in this article is the opposite of fast! All of these technologies are


Is... is this named because they have a lemon they're trying to make the most of?


I think saying "L-L-M" sounds kind of like "lemon," so this is an LLM-aid (sounds like lemonade).


Wonder why they didn't call it LLMonade, which would be unique.


so obvious and yet I didn't connect the dots. thank you


wait until you discover the LuLuleMonade -connection /s


If life keeps giving it them, they should instead invent a combustible lemon.


Do they know who you are? They're the guys who are going to blow your house up ... with the lemons.


On an unrelated note, do you think this software supports running models from a CD?...


Lemonsqueeze was considered too violent


If you run it in a cluster, does it become a Lemon Party?


If you run it on someone else's computer it becomes Lemon Stealing


I exclusively buy AMD hardware for local inference. For open drivers, power efficiency, and cost AMD beats Nvidia easily for consumers.


You have got to be joking.

My three NVIDIA cards are more power efficient than my one AMD card, both at idle and during usage.

Official ROCm is like pulling teeth with poor support for desktop cards. Debian, a volunteer led project, have better ROCm CI than AMD and support more cards.

Look at any benchmarks. NV midrange cards are faster than AMD and at least a generation in front. Owning a 7900XTX is an embarrassing disappointment.

I like AMD and want them to succeed, but they are way behind NV in this area.


> Official ROCm is like pulling teeth with poor support for desktop cards...

I agree with most of your post and fled the AMD ecosystem some time ago because of the machine learning situation, but their problem seemed to be more the firmware bugs and memory management of compute shaders than the higher level libraries.

The obvious solution to this one would be not to use ROCm. ROCm has always been a bit of a train wreck for small users and it doesn't seem to do anything special anyway. The way forward would be something more like Vulkan which the server that today's link points to seems to be using. The existence of a badly managed software package doesn't really imply that users have to use it, they can use an alternative.

It would be nice if AMD sorts themselves out though. The NVidia driver situation on linux is painful and if AMD can reliably run LLMs without the hardware locking then I'd much rather move back to using their products.


Yes, AMD themselves even use Vulkan tg numbers in their marketing material, because it's faster than ROCm on everything RDNA2 onwards (seems embarrassing).

However for pp, Vulkan is still nowhere near close to ROCm. That matters for long context and/or quick response. A lot of people really care about that time-to-first-token.


Have a Strix Halo 128 running Qwen 3.5 122b at 35t/s using Vulkan and kernel 7.0.0 on a 400w PSU. Pretty hard to beat for the price and power consumption IMO. But to be fair I compile everything myself so proprietary drivers required by nvidia are a non starter for me.


Any recommendations in the current market? Love how plug and play and is on Linux from the driver side of things.


Strix Halo 128 w/ linux 7x


Whoever did this must have realised the users will hate it. So… is this just demonstrating that the internal culture emphasises other things than user happiness?

I also note that ”for PRs” - will we see these appearing as comments in generated code?


We also outlaw vices like physical violence and property theft.

Society is fundamentally counter to individual freedom, and the degree determines the nature of that society and the degree of cooperation possible within it.


The article’s “worst case” is not dark enough.

The real evil is when someone ensures the famine occurs so they can profit from an outside betting position.


Was thinking this as well. Not shocking though, the pundit class, of which Derek is a high-ranking member, is pretty unimaginative (in a way I imagine is deliberate)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: