I think the right solution is to endow the LLM with just enough permissions to do whatever it was meant to do in the first place.
In the customer service case, it has read access to the customer data who is calling, read access to support docs, write access to creating a ticket, and maybe write access to that customer's account within reason. Nothing else. It cannot search the internet, it cannot run a shell, nothing else whatsoever.
You treat it like you would an entry level person who just started - there is no reason to give the new hire the capability to SMS the entire customer base.
> People are free and probably do this because it is slow. Alternatives often are not a bad thing.
Alternatives are always good but IMO brew is just not something I interact with all that much and to me it's "good enough". It works and does what I expect, although to be fair maybe I'm on the happy path <shrug>.
Roblox is in some ways there, I think Epic thought fortnite could have competed. IMO they made a strategic mistake in shackling their game-as-a-platform to Fortnite. I thought the music fortnite thing looked interesting, but I have negative interest in installing Fortnite.
Call it something else and make it literally the first thing you see on epicgames.com, have it work on mobile, and maybe things would be different today.
(Aside: Roblox wins because I can go from typing in roblox.com into my browser and be playing a game with a friend in under 20s)
Makes me wonder if there's a bet you can take on Polymarket that Polymarket will get shut down due to it negatively influencing behavior. The insider trading on that one should get interesting.
2. If it’s only banned in the US, yes it pays out, you just need to get a VPN or go to another country.
Also even it’s banned everywhere, the markets are blockchain contracts so you should be able to access it without the website, which is just the frontend. (this is where my technical expertise breaks down, someone who knows blockchain is a better expert)
I've also found it to be better to ask the LLM to come up with several ideas and then spawn additional agents to evaluate each approach individually.
I think the general problem is that context cuts both ways, and the LLM has no idea what is "important". It's easier to make sure your context doesn't contain pink elephants than it is to tell it to forget about the pink elephants.
> what "red/green" means: the red phase watches the tests fail, then the green phase confirms that they now pass.
> Every good model understands "red/green TDD" as a shorthand for the much longer "use test driven development, write the tests first, confirm that the tests fail before you implement the change that gets them to pass".
Unfortunately the only way this changes is if a company writes a just enough unreasonable ToS, and someone violates it in just the right way and the company decides to enforce said ToS, and the user fights back, and this all ends in court.
I'd be surprised if all those stars align anytime soon.
Installing humanoid robots in a factory is like using regexes to parse data.
It makes sense if it's a one-off but there are better solutions.
Maybe it does make sense for small scale businesses that need just a little automation? Like a humanoid robot could restock shelves and do inventory in a grocery store at night, and you wouldn't need to retrofit anything to be able to do that.
Large scale factories seems like the wrong use case for humanoid robots.
Personally I stopped using Facebook because even in the before-AI days it started becoming a glamour photo book of everyone you ever knew (and probably lots of people you only kind of sorta know), and while people certainly deserve to do and see great things, seeing it all shoved in your face every day becomes exhausting in a keeping-up-with-the-joneses kind of way.
I totally get that not everybody is like that, but I am, and so I stopped going to Facebook.
These days I'm in private Whatsapp groups for my direct family and so I learn about what they do, and not the random stuff that my neighbors and 20-years-past classmates did.
My wife is still active on Facebook and I actually do still visit occasionally to boost her posts but that's about it.
I agree with this a lot. In the late 2000s, which for me was when I was about 20, posts were very throwaway and low effort -- in a good way! You never really knew what you'd see when you logged in. Photos of stupid things or silly status updates, etc.
Over the next five years though, content gradually shifted to mainly image crafting. Over-processed photos, highlight reel curated trip photos, major life updates, etc. It felt like the bar was higher on what people would share, but unfortunately that removed a lot of the things that made FB fun in the first place.
I don't know whether it was a more universal shift or whether it had more to do with the age of my peers.
I met a user from an antique land
Who said: Two squares of a clip of video
Stand in at the end of the search. Near them,
Lossly compressed, a profile with a pfp, whose smile,
And vacant eyes, and shock of content baiting,
Tell that its creator well those passions read
Which yet survive, stamped on these unclicked things,
The hand that mocked them and the heart that fed:
And on the title these words appear:
"My name is Ozymandias, Top Youtuber of All Time:
Look on my works, ye Mighty, and like and subscribe!"
No other video beside remains. Round the decay
Of that empty profile, boundless and bare
The lone and level page stretch far away.
Would've been, once. These days I assume bentcorner asked their favourite LLM to generate a poem parodying Ozymandias about once-popular youtube videos.
It doesn't feel like it at all (I'd never expect an LLM to say 'pfp' like that, or 'lossly[sic] compressed', ASCII instead of fancy quotes) but who knows at this point.
I may have gotten incredibly neurotic about online text since 2022.
I actually considered using an LLM but in my experience they "warp" the content too much for anything like this. The effort required to get them to retain what I would consider something to my taste would take longer than just writing the poem myself. (Although tbf it's been awhile since I've asked a LLM to do parody work, so I could be wrong)
In the customer service case, it has read access to the customer data who is calling, read access to support docs, write access to creating a ticket, and maybe write access to that customer's account within reason. Nothing else. It cannot search the internet, it cannot run a shell, nothing else whatsoever.
You treat it like you would an entry level person who just started - there is no reason to give the new hire the capability to SMS the entire customer base.
reply