HN2new | past | comments | ask | show | jobs | submit | aliljet's commentslogin

Is this actually true? I want to see actual evals that match this up with Sonnet 4.5.

The Qwen3.5 27B model did almost the same as Sonnet 4.5 in this[1] reasoning benchmark, results here[2].

Obviously there's more to a model than that but it's a data point.

[1]: https://github.com/fairydreaming/lineage-bench

[2]: https://github.com/fairydreaming/lineage-bench-results/tree/...


Not exactly, but pretty close: https://artificialanalysis.ai/models/capabilities/coding?mod...

Somewhere between Haiku 4.5 and Sonnet 4.5


> Somewhere between Haiku 4.5 and Sonnet 4.5

That's like saying "somewhere between Eliza and Haiku 4.5". Haiku is not even a so-called 'reasoning model'.¹

¹ To preempt the easily-offended, this is what the latest Opus 4.6 in today's Claude Code update says: "Claude Haiku 4.5 is not a reasoning model — it's optimized for speed and cost efficiency. It's the fastest model in the Claude family, good for quick, straightforward tasks, but it doesn't have extended thinking/reasoning capabilities."


Haiku 4.5 is a reasoning model. [0]

[0]: https://www-cdn.anthropic.com/7aad69bf12627d42234e01ee7c3630...

> Claude Haiku 4.5, a new hybrid reasoning large language model from Anthropic in our small, fast model class.

> As with each model released by Anthropic beginning with Claude Sonnet 3.7, Claude Haiku 4.5 is a hybrid reasoning model. This means that by default the model will answer a query rapidly, but users have the option to toggle on “extended thinking mode”, where the model will spend more time considering its response before it answers. Note that our previous model in the Haiku small-model class, Claude Haiku 3.5, did not have an extended thinking mode.


Sure, marketing people gonna market. But Haiku's 'extended thinking' mode is very different than the reasoning capabilities of Sonnet or Opus.

I would absolutely believe mar-ticles that Qwen has achieved Haiku 4.5 'extended thinking' levels of coding prowess.


>Sure, marketing people gonna market.

Oh HN never change.


Not sure what this means, but as a marketing person myself, here's what happened: One day, an Anthropican involved in the Haiku 4.5 launch shrugged, weighed the odds of getting spanked for equating "extended thinking" with "reasoning", and then used Claude to generate copy declaring that. It's not rocket surgery!

It's mainly that people on here, regardless of profession, speak incorrectly but confidentally about things that could be easily verified with a Google search or basic familiarity with the thing in question.

Haiku 4.5 is a reasoning model, regardless of whatever hallucination you read. Being a hybrid reasoning model means that, depending on the complexity of the question and whether you explicitly enable reasoning (this is "extended thinking" in the API and other interfaces) when making a request to the LLM, it will emit reasoning tokens separately prior to the tokens used in the main response.

I love your theory that there was some mix up on their side because they were lazy and it was just some marketing dude being quirky with the technical language.


We are all reasonable people here, and while you are (mostly) correct, I think we can all agree that Anthropic documentation sucks. If I have to infer from the doc:

* Haiku 4.5 by default doesn't think, i.e. it has a default thinking budget of 0.

* By setting a non-zero thinking budget, Haiku 4.5 can think. My guess is that Claude Code may set this differently for different tasks, e.g. thinking for Explore, no thinking for Compact.

* This hybrid thinking is different from the adaptive thinking introduced in Opus 4.6, which when enabled, can automatically adjust the thinking level based on task difficulty.


Looks much closer to Haiku than Sonnet.

Maybe "Qwen3.5 122B offers Haiku 4.5 performance on local computers" would be a more realistic and defensible claim.


I won't disagree - the guideline prescribes to keep the original title as much as possible, and I failed to find more neutral source.

How do you even pull away from a Gmail address? I'm nearly twenty years into that service. Getting banned would be absolutely devastating...

Use your own domain to sign up for a paid email service, provided by a company that focuses on email. I use Fastmail, but there are many other options.

Set up forwarding in Gmail to your new address.

Then, whenever you log in to a website or app with your Gmail, take a moment to change it to your new address. In a few weeks, most of your important accounts will be covered. In a few months, almost everything you still actively use will be done.

I did this ~5 years ago and the only thing that still arrives at my Gmail is spam.


Same here but ~8 years.

You can mitigate/speed the process using your password manager too.

I still use a filter in my email so that if something comes in under my Gmail, it gets a special tag that I can filter on and treat those as a todo list. Rarely happens beyond the occasional Google Meet connection.


> Use your own domain to sign up for a paid email service, provided by a company that focuses on email.

Note you don't need to pay. just use zoho mail or any other free email that lets you bring your own domain. Switch email providers as needed without changing your domain

The trouble with paying is that if you forget to pay, you may lose email. (arguably this is also a problem with domains, generally you should pay some years in advance)


I prefer to pay for the product so I do not become the product.

Zoho lets you pay a small monthly (yearly?) fee and link several domains to it.

Solid advice, but I want to double, watch out for things you only log into once a year.

Making a new local account on your machine is a good first step.


I did this but don't forward. Instead, every new email in Gmail I got would prompt me to go update that service's contact info for me.

It probably doesn't matter, but it made me feel a little better because that way Google wouldn't have direct info on to which email/domain I transfered (ignoring other Gmail contacts that start emailing me at my new address(es) ).


^this is the way.

You can buy a domain name for like $10 per year; I recommend getting it from porkbun.com.

Cloudflare.com is good too, EXCEPT if you buy your domain from them, you'll be required to use their nameservers until and unless you transfer your domain elsewhere (which you won't be able to do for a while). Though to be fair, their free DNS is good and lots of people use it anyway. It makes email setup slightly more complicated, but it's still doable.

Spaceship.com also has a pretty good reputation, but I think their customer service isn't as good, they're quite new, and they're owned by Namecheap (a bigger domain registrar with a much worse reputation).

Whatever you do, DO NOT buy from GoDaddy. Do not even search for the domain you're considering on GoDaddy. Literally any option is better than GoDaddy.

By far the most reliable TLD options are .com, .net, and .org. These will look relatively trustworthy for email, and the price stays very very stable from year to year. If you don't want to think about it, just get one of these. You can even still find single dictionary word domains for .org or .net relatively easily.

Do not buy any domain marked "premium". This means the owner of the TLD can change the price at renewal as dramatically as they want, for any reason (e.g. if you have a website hosted at that domain that becomes popular). Your $20 per year domain might suddenly become a $300 or $3000 per year domain for no reason but greed, and you wouldn't be able to do anything about it.

Non-premium nTLD's (.club, .horse, .rocks, .theater, etc) can increase quite dramatically in price, BUT the price is required to be set the same for all domains using that nTLD, so they can't target any individual person for having a successful website or whatever. Also, you can pre-buy up to 10 years, which locks in your price for those 10 years. I'd still not recommend them for a primary email, but it's better than buying a "premium" domain. Just be aware that the yearly price might unexpectedly increase in the future.

Some country code TLD's are also good, but for email, probably stay away from the ones that spammers like to use.

___

Anyway, what I actually originally meant to comment about is: if you set up forwarding from gmail and don't check that account regularly anymore, I recommend setting up a gmail filter rule that forwards all your gmail spam to you (their regular forwarding setting leaves it out and just sends it to the gmail spam folder). It's a little annoying to have to re-flag some of the spam as spam in your new email, but gmail has a habit of marking non-spam as spam for me, and if you're not regularly checking that spam folder you can easily miss important email.


Porkbun have started demanding ID verification for registrations, which depending how you feel about current events might make you reconsider having them on your list

They've been doing it for a few years. KYC laws. See: legally required for registrants from India

> Your $20 per year domain might suddenly become a $300 or $3000 per year domain for no reason but greed, and you wouldn't be able to do anything about it.

Seconding this. Tthis is exactly what happened with the .sexy TLD: https://www.reddit.com/r/Domaining/comments/uia8pc/sexy_tlds...


For quite some time (approx 8 years) I've used an email forwarding (Blur, but any works) to avoid spam.

This looks like perfect case for change of email, since lot of these accounts can be moved out from Gmail by changing the address that email is forwarded too.

Looks like all this hassle with generating a new email for each service pays for the second time (by ease of changing the main mail), in addition to spam and privacy protection.


I had my Gmail for almost 20 years and made the transition. It's annoying and time consuming but I think well worth it. I bought a domain and host it on iCloud. It's like $3/month for 6 email addresses (you can use it with the family). That includes a little cloud data and other services like hidden email addresses. DNS is handled by Cloudflare for free. Then start moving each service/login to the new email address. Every time you log into something, change the email address. I took the opportunity to update passwords and passkeys too, using Vaultwarden. I was lazy and had used similar passwords for a lot of services. Passwords are all long and unique now.

Now, even if Apple bans me, I can move my host within minutes. I never lose access to my email domain. It's much more professional and I can do catch-all. E.g. netflix@[domain.com]. This way I can see who sells my email address to spammers and block it.


I just sold a domain I had for 25 years and used for everything including API endpoints, email, authentication, etc. It took a couple weeks to transition myself and my family/friends.

Pretty sure just moving emails would have take a lot less effort. I had the advantage of keeping the domain until I was ready to move, now imagine Google just turned it off one day and what your workload would be. I shudder to think about having to deal with that.


Register your own domain, use a third-party provider to handle actual sending and receiving (I use proton, which makes the setup very easy), forward your Gmail to your personal domain address and as renewals and reminders come in switch your email on services to your personal domain.

After a year or two losing Gmail becomes an inconvenience; after a few more years it is nothing. As everything is now on your own domain name you can switch providers without affecting anything.

That's what I did about 5 years ago and my only regret is not doing it earlier.


Just start changing addresses. Forward the rest. It takes about a year. Changing your name is way harder and tons of folks do that all the time.

I moved away from a gmail address that was that old, dating back to the invitation-only days. It had become more spam than not, mostly other people who share my initials not knowing their own email addresses. But the possible devastation you mention was more worrying. It had become too much of a risk for my banking and identity generally to not own my email address.

I got a custom domain. I still host it on google, because I know how impossible it is for small companies to have a reasonable program to deal with insider threats. Because of that, I think only one of the giant companies can realistically provide secure email. And the google app suite is great. Now that I pay for google workspace, there's support and appeals available, and if they ban me anyway, I still control the domain and can regain access to everything.

I have not been able to delete the old address, even after 3 years. There are some things like Google Fi that can only use a non-workplace google account. Very, very rarely, I still get an email that matters on it. But I got to the point where I could stop checking it in about 2 months, and now I look at it about once a week quickly, more out of habit than anything else.

The switch was annoying, but not "hard". It was worth it.


buy a domain.

create icloud account.

use their custom domain email setup (free btw) - https://support.apple.com/en-us/102540

Start replacing important account emails with your custom domain.

Every time you get an important email in gmail, login and update.

Bonus: icloud let's you create catch all emails, so you can create many burner emails such as hackernews@mydomain.com


Get your own domain so you can easily change providers in the future. Start with your password manager and change the address on all the accounts you have in there.

After a few years you'll notice you stop bothering to check your Gmail and you can delete it to close the address.

If you need motivation, skim the /r/GMail subreddit and see how many people are getting locked out daily.


Do you have a recommendation for a major email provider as a fallback if you have to pick one?

I vaguely recall encountering a service that only accepted addresses from a whitelist of big providers (Gmail, Yahoo, Outlook, etc.), even @icloud did not qualify.


That's a service that doesn't want your business. If you care, message them about it

I've never once run into a service with such a restriction, but I can imagine someone being that short-sighted. I have seen services that only support "log in with Google or Facebook", which is comparably terrible.


Discogs will not let me login with my own domain (of 30 years) and required one of the big providers. It kept complaining about "risky domain". But that is the only incident I can think of.

Discogs

Who? Never heard of them, and it sounds like there's a good reason for that.


It is a top 1000 web site according to Alexa rankings. It would take you about 5 seconds to Google about it. Probably less time than it took you to write your post.


I've run into services that will flag specific tlds as invalid.

I have heard of that, yeah. It's still busted, but marginally more understandable if they're dealing with a lot of scams. For instance, `.xyz` and some others have bad reputations. I've never seen something that'll reject an arbitrary self-owned `.org`, by way of example.

Begin with making a list of all services where you subscribed using gmail...

I switched to my own domain ages ago; it only took 2-3 years to stop getting relevant mail to the old one (I put a forwarding rule in place and just used the new one for everything).

Imported all my past mail on day one, forwarding meant I had one inbox only, and I only sent mail from the new domain. A few gentle “please stop using my old address” conversations with family.


Google Takeout :)

It will never be easier than right now. Every day you stay, you dig their moat around you even deeper

gmail uses IMAP.

make another mailbox (another provider - migadu, fastmail, proton, whoever) that has IMAP as well. (selfhosting.. is PITA. only if u really need it).

install some standalone mail-client - thunderbird, clawsmail, applemail, or k9 , aqua on android, whatever. Attach both mailboxes into that. Find out how to copy an e-mail from one folder into another.

Folder by folder, select all mails, copy from one mailbox into the other. Will take time.

(Beware, some clients (apple) will fuckup the mail-date, anything older than 5 years becomes 5 years old. or it shows like that. YMMV.)

i have made this multiple times, for 20+ years of mails...


I just went through all accounts in my password manager, logged in and changed my email. It takes a little while but not that much.

took about 30 minutes to switch to proton mail

Now move all the services and accounts you have registered to that account, along with all the friends and family who have your old email account.

This service is basically a nightmare to export/move away from. 30 minutes to switch to, maybe 30 hours to switch away from.

Sign up at fastmail.com, set up forwarding, change your "reply-to" address. A year later, you'll have nothing arriving in gmail except marketing cruft.

It's really not that hard. I switched about 10 years ago. Just every time you log in with your old email, replace it with your new one. Every time you email someone, email them from your new one with a note: "this is my new email". In a few months I had migrated everything to the new email.

Same. I still have an old Gmail address that receives forgotten but still considered important emails from various services.

What's the playbook for migrating away in this situation?


Companies need to allow you update your personal information including your email. It may need tickets to support but it's doable.

Just have to get started and suffer for a while and make it a practice to switch emails when you log into places.

I switched to fastmail with my own domain.


I went with SimpleLogin.

Although I am increasingly concerned with its longevity since there's a non-zero risk that Proton might shut down SimpleLogin since Proton Pass has its own alias feature.


I really really want to see how these images are starting to form into videos. The stills are clearly getting better and better, but what about when you need the stills to organically conform to a keyed script?

Check out Seedance 2: https://seed.bytedance.com/en/seedance2_0

Nano Banana was technically impressive the first time, but after Seedance it's not really. It's all just an internet pollution machine anyway.


The page looks promising but how can I try it out?

They have an API.

I'm seeing more and more AI video memes and they are getting really good. Still just bunch of short clips, long shots are not working well enough, but typical Hollywood movies have few second cuts anyway so this is almost good enough to make a marvel fanfic.

the workflow right now would be to take this images, make a sequence of them for key "shots" and send them to an I2V model. LTX-2 is the model the r/stablediffusion folks are playing with right now, but there are a fair few.

The public can absolutely participate in this by way of syndication deals. Those syndicates are what's covering up the true extent of ownership and they're essentially charging for access with their fees. It's oddly shady, poorly regulated, and more expensive than just being public, but everyone can ride this ride.

The general public absolutely cannot. You have to be an accredited investor or qualified purchaser; you need to have access; you have to pay carry & fees (maybe multiple, stacked middlemen).

The path to declaring yourself accredited is uniquely easy. Just say it. The whole space is deeply unregulated and unaudited. What makes it insane is that those middleman are making a small fortune exploiting this loophole protecting large companies from being forced to go public. The number is 2000 private investors. Rest assured, more than 2000 individuals have money in Stripe today. It's a total scam.

Yep. I know several people who became accredited investors through one simple trick: they lied. No one actually checks.

And now they’ve thrown their legal rights away should any dispute arise. Wouldn’t recommend this strategy.

“Rights”. If you’re lying about accredited investor status, you definitely don’t have the resources to pursue your adversaries in civil litigation, much less actually collect on any judgment.

But they had the option of buying VOO or whatever broad market index the entire time, so I don’t see the need to protect people who feel the need to gamble.


No, that's not how the US legal system works. I don't endorse fraud (or dishonesty in general) but your claim that lying about being an accredited investor extinguishes all legal rights in a dispute is simply false and has no basis in law. Most rights would still be retained.

Is there a pure stripe fund though? What if I just want exposure to stripe and not the rest of their portfolio?

Not really. You still have to be an accredited investor AND financially savvy enough to have the awareness of what syndicate deals are and how to find them and participate.

> You still have to be an accredited investor

Which means that you have to be able to repeat after me “I am an accredited investor”, that’s the actual full list of requirements.


No, the actual full list of requirements is on the SEC website site.

I think they are trying to point out the seller may not be required to verify. Only ask.

Like the age check on logged out YouTube videos


No, those are not actual requirements.

Legally speaking that is not correct. Maybe in practice you can do that, but it opens you up to very real risk.

To your bog standard human being, trading Stripe stock has significant barriers vs trading a public stock.


What “very real risk”?

Not legal risk, certainly.

“Risk” in economic terms? Yeah that’s the whole purpose of claiming to be an accredited investor.


How exactly?

Robinhood just launched a new ventures vehicle and Stripe could be "supposedly" part of it. Robinhood Ventures Fund I (RVI),

Thank you for sharing this, I was unaware! It launches tomorrow.

This is the footnote regarding Stripe: "New deal incoming: RVI has entered a binding agreement to invest in Stripe, Inc. This investment is expected to close after RVI’s IPO, subject to meeting customary closing conditions. Note: This deal isn’t guaranteed to close."


The easiest way? Angelist.

I’ve poked around – looks like it’s not enough to just declare you’re an accredited investor, but also prove it?

What evidence do I need to provide as a US investor: https://help.angellist.com/hc/en-us/articles/15569926884109-...

As a non-US investor: https://help.angellist.com/hc/en-us/articles/15570114297869-...


That's a bit confusing. Do you believe LLMs coming out of non-chinese labs are censoring information about Israel and/or Palestine? Can you provide examples?

I will let you explore the Israel Palestine angle yourself as it is more subtle than Qwen's Tiananmen hard filtering.

But there are topics that ChatGPT hard blocks just like Qwen [1].

[1] https://www.independent.co.uk/tech/chatgpt-ai-david-mayer-op...


I wonder if this will extend SEV-SNP and TDX to the child VMs?


It says VSM is automatically disabled... so I would assume not.


I wonder if these are starting to get reasonable enough to use locally?


The problem here is that it looks like this is released with almost no real access. How are people using this without submitting to a $250/mo subscription?


I gather this isn't intended a consumer product. It's for academia and research institutions.


I have some very difficult to debug bugs that Opus 4.6 is failing at. Planning to pay $250 to see if it can solve those.


People are paying for the subscriptions.


What I really want is for the peon voice to be replicated and for custom things to be in that voice. Or even better, the starcraft battlecruiser guy's voice!


All the Starcraft voice acting is amazing. It's in the pipe - 5 by 5!


Am I just a cynic, or do any of the LLMs deserve love too?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: