More

zaptheimpaler · 2026-06-15T18:23:35 1781547815

I tried gemma-4-26B-A4B just to see if it could help me read/sort my emails on a relatively under-powered setup (16GB VRAM + 32GB RAM) and it's not going well.. the model burns 24K tokens just on searching for the right tool and then dumps the email contents into context - i tried to get it to use code-mode to save context but the code-mode implementation can't save files so it was useless and im going to try to switch to "ssh-mode" into my devbox container. Still relatively new to this, so I'm probably doing something wrong

anana_ · 2026-06-15T18:38:06 1781548686

Perhaps try a different model? Just from anecdotal experience, I find that the Gemma models smaller than 31B do not tool call as often as they should.

Some of the benchmarks appear to back this up [0]

Of course, a lot depends how you are using it (inference parameters, harness, prompting, etc.), but the model is quite important too.

[0]: https://artificialanalysis.ai/models/open-source/small?model...

Rzor · 2026-06-16T04:01:56 1781582516

So there was a problem with gemma 4 when it comes to tool calling that Google apparently fixed like 2 or 3 days ago. I remember reading something about this.

zaptheimpaler · 2026-06-15T17:51:13 1781545873

0? OpenCode is just a harness, it can connect to any model hosted online.

jagged-chisel · 2026-06-15T23:04:53 1781564693

> ... hosted online.

Oh, so not that kind of Home Lab.

Carrok · 2026-06-16T02:25:32 1781576732

It can connect to any model hosted anywhere.

zaptheimpaler · 2026-06-14T03:20:01 1781407201

They would have no internally/externally defensible justification to stop the launch as they are partners/part-owners of Anthropic. They would have to let the rank-and-file keep moving on the Fable launch.

milch · 2026-06-14T07:07:11 1781420831

IIRC Anthropic models haven't all been available on day 1, so it does feel like a deliberate choice, especially since they are partners/part-owners like you say. No one would have bat an eye about some corporate PR thing "blah blah mythos is too powerful and too intelligent and so we've decided to focus our capacity on Opus 4.6/4.7/4.8 for now until we have the proper safeguards in place blah blah"

zaptheimpaler · 2026-06-14T03:16:13 1781406973

This is corporate Game of Thrones, nothing more. Amazon, maybe in alliance/deals with others as well saw an opportunity to hurt their rival. Or maybe they were instructed to report this by the WH themselves. Hegseth and the WH will happily take any excuse to hurt Anthropic after the confrontation with DOW, being the vindictive cronies they are.

spprashant · 2026-06-14T03:18:15 1781407095

I thought Amazon has a stake in Anthropic, and would want them to succeed.

zaptheimpaler · 2026-06-11T04:59:33 1781153973

If you don't trust them, then no policy is enough. Technically everything you send to the model could be stored by them. Personally I do worry about that especially as an average consumer not an enterprise, no one is looking out for us and we don't get any guarantees. But enterprises will get the right treatment because they would find out and sue Anthropic if they lied.

coldtea · 2026-06-11T05:55:32 1781157332

>If you don't trust them, then no policy is enough.

No policy is enough, period. There should be technical and legal solutions to it.

jamespo · 2026-06-11T09:10:25 1781169025

There should be legal ramifications if they don't do what they say, but the practical solution is "don't use it".

zaptheimpaler · 2026-06-09T09:46:44 1780998404

I made a little tool where I can just drop HTML into a folder and it will deploy it either to my internal Caddy or publicly on Cloudflare based in folder. Can be a single html or a folder.

https://github.com/ankitson/webby

CF pages still required too much confusing clicking around on a webpage for me. This way I can just point any little report or app at a directory and done.

There’s others that are more server shaped and tightly coupled - a pipeline to pull in all my data like Garmin, Twitter bookmarks, messages into a Postgres DB. Kind of a personal data warehouse that i can use with apps/automations, like alerting me if my sleep schedule is drifting, and a custom web interface to my Garmin data

zaptheimpaler · 2026-06-05T00:34:45 1780619685

I was trying WSL years ago and this is one of the reasons I just moved to a full linux server instead. We still have way too many problems interfacing across filesystems. I hope with AI we will see an iteration on ExFAT that has all the journalling, versioning etc. magic of modern FS' and can be adopted across all 3 OSes. Probably a long shot but I can dream :)

zaptheimpaler · 2026-06-04T23:18:00 1780615080

Yes that would be great. Right now, there are many applications that use pinned certificates to communicate to servers meaning there is literally no way to see the data your own device is sending/receiving from the internet. It's an insane thing that should be banned.

trumpdong · 2026-06-05T00:29:29 1780619369

There is one way, you can modify the app or the OS to change which certificate is pinned, ignore all certificate failures, lie about the certificate in use, log encryption keys, or not even ask the app whether it likes the certificate.

Not on iOS, of course.

zaptheimpaler · 2026-06-04T03:44:51 1780544691

I tried the VPS briefly, it didn't really solve anything for me. The personal assistant agent is only as useful as the data & tools it has, that's where the real risk is. Separate box gives you isolated FS but docker also does that very easily.

jon-wood · 2026-06-04T10:34:12 1780569252

Docker is not a security boundary. It never has been, but given recent demonstrations of container escapes its even less of one than it ever was. If you want to properly contain a process it needs to be running in a VM of its own, or you need to accept that there's a risk of it escaping and ending up with more access than you planned.

zaptheimpaler · 2026-06-03T08:03:58 1780473838

Yes literally it threw an invoice from AWS into spam recently. Regularly throws financial docs into spam. Like you, i have a problem with false positives not false negatives.

I brought it up with support once and the domain was failing DMARC. Headers from the AWS email that has a spam score of 9.1! I don’t know much about email but something must be wrong there on Fastmails side.

x-sieve: CMU Sieve 3.0

X-Spam-known-sender: no

X-Spam-sender-reputation: 500 (none)

X-Spam-score: 9.1

X-Spam-hits: DCC_CHECK 1.1, HEADER_FROM_DIFFERENT_DOMAINS 0.249, HTML_MESSAGE 0.001, ME_SENDERREP_NEUTRAL 0.001, MIME_HTML_ONLY 0.1, MPART_ALT_DIFF 0.724, RCVD_IN_DNSWL_NONE -0.0001, RCVD_IN_MSPIKE_H4 0.001, RCVD_IN_MSPIKE_WL 0.001, SPF_HELO_NONE 0.001, SPF_PASS -0.001, T_REMOTE_IMAGE 0.01, URIBL_DBL_ABUSE_PHISH 7, LANGUAGES en, BAYES_USED none, SA_VERSION 4.0.1

X-Spam-source: IP='54.240.9.110', Host='a9-110.smtp-out.amazonses.com', Country='US', FromHeader='com', MailFrom='com'

X-Spam-charsets: subject='UTF-8', html='UTF-8'

sam_lowry_ · 2026-06-04T14:31:01 1780583461

> URIBL_DBL_ABUSE_PHISH 7

Hm...

Are you sure it was a legit invoice?

;-)