This is interesting, but I find it strange that there are no tests with a control group with closed eyes. Maybe some of the observed effects are visual only or psychological and not tactile at all.
>are visual only or psychological and not tactile at all
I think for a visual only test you'd need something to the effect of a neuralink that gave control over the robot arm.
Otherwise we're dealing with a kind of signal mixing, where your brain takes the strongest signals, or the ones it deems most important, and produces an effect based on those. The eyes give us far more data than we can actually process, so the brain has to filter this data down to a usable stream. This can also happen with tactile response, but the situations in which it occurs are rather rare.
I guess what I'm trying to say is that, at the end of the day, all observed effects (except maybe reflexes) are psychological, as the brain is trying to create an accurate virtualization of the input data it's receiving, and that more data doesn't necessarily mean better outcomes.
Let's go with the usual reminder: de-soldering / soldering Li-ion cells can be super dangerous. With a bit too much heat, a cell can catch fire or even explode...
They explain in the article what they consider a proper citation, an erroneous one and a hallucination, in the section "Defining Hallucitations". They also say that they have many false positives, mostly real papers that are not available online.
That said, I am also very curious about the results their tool would give for papers from the 2010s and before.
If you look at their examples in the "Defining Hallucitations" section, I'd say those could be 100% human errors. Shortening authors' names, leaving out authors, misattributing authors, misspelling or misremembering the paper title (or having an old preprint title, as titles do change) are all things that I would fully expect to happen to anyone in any field where things ever get published. Modern tools have made the citation process more comfortable, but if you go back to the old days, you'd probably find those kinds of errors everywhere. If you look at the full list of "hallucinations" they claim to have discovered, the only ones I'd not immediately blame on human screwups are the ones where a title and the authors got zero matches for existing papers/people. If you really want to do this kind of analysis correctly, you'd have to take the claim made in the text and verify it against the cited article. Because I think it would be even more dangerous if you can get claims accepted by simply quoting an existing paper correctly, while completely ignoring its content (which would have worked here).
> Modern tools have made the citation process more comfortable,
That also makes some of those errors easier. A bad auto-import of paper metadata can silently screw up some of the publication details, and replacing an early preprint with the peer-reviewed article of record takes annoying manual intervention.
There are other issues. In January they claimed that a US health report contained "fabricated" and "AI generated" citations, with the headline example being a claim from a Cigna Group report. Their claim that it's fabricated is based on nothing more than the URL now being a redirect of the type common in corporate website reorgs.
I did some checking and found the report does exist, but the citation is still not quite correct. Then I discovered someone is already running an LLM-based citation checker, which had already fact-checked this claim and produced a correct writeup that seems a lot better than what this GPTZero tool does.
The mistakes in the citation are the sort of mistake that could have been made by either a human or an AI, really. The visualization in the report is confusing and does contain the 73% number (rounded up), but it's unclear how to interpret the numbers because it's some sort of "vitality index" and not what you'd expect based on how it's introduced. At first glance I actually misinterpreted it the same way the report does, so it's hard to view this as clear evidence of AI misuse. Yet the GPTZero folks do make very strong claims based on nothing more than a URL scraper script.
I mean, if you’re able to take the citation, find the cited work, and definitively state ‘looks like they got the title wrong’ or ‘they attributed the paper to the wrong authors’, that doesn’t sound like what people usually mean when they say a ‘hallucinated’ citation. Work that is lazily or poorly cited but nonetheless attempts to cite real work is not the problem. Work which gives itself false authority by claiming to cite works that simply do not exist is the main concern surely?
>Work which gives itself false authority by claiming to cite works that simply do not exist is the main concern surely?
You'd think so, but apparently it isn't for these folks. On the other hand, saying "we've found 50 hallucinations in scientific papers" generates a lot more clicks than "we've found 50 common citation mistakes that people make all the time".
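For what it's worth, the distinction argued over in this thread (a sloppy citation of a real work versus a citation with zero matches) can be sketched in a few lines. This is only a rough illustration, not what GPTZero or any other checker actually does: the `catalog` list, the `classify_citation` function, the label strings and the 0.8 threshold are all made up for the example, and a real check would query a bibliographic database instead of a hardcoded list.

```python
from difflib import SequenceMatcher


def title_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity ratio between two titles (0.0 to 1.0)."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def classify_citation(title, authors, catalog, threshold=0.8):
    """Classify a citation against a catalog of known (title, authors) pairs.

    The threshold is an arbitrary assumption chosen for illustration.
    """
    best_score, best_entry = 0.0, None
    for known_title, known_authors in catalog:
        score = title_similarity(title, known_title)
        if score > best_score:
            best_score, best_entry = score, (known_title, known_authors)

    # Nothing in the catalog resembles the cited title: the only case the
    # thread would not immediately blame on a human slip.
    if best_entry is None or best_score < threshold:
        return "no match"

    cited = {a.lower() for a in authors}
    known = {a.lower() for a in best_entry[1]}
    if best_score == 1.0 and cited == known:
        return "exact"

    # Real work found, but the title or author list is off.
    return "probable citation mistake"


# Hypothetical one-entry catalog, for demonstration only.
catalog = [("Attention Is All You Need", ["Vaswani", "Shazeer"])]

print(classify_citation("Attention is all you need", ["Vaswani", "Shazeer"], catalog))
print(classify_citation("Attention Is All You Ned", ["Vaswani"], catalog))
print(classify_citation("Quantum Bananas in Deep Space", ["Smith"], catalog))
```

Note that even an "exact" result here says nothing about whether the cited paper supports the claim it is attached to, which is the harder check mentioned upthread.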
Not exactly true: they have hacked around the end of the copyright on Steamboat Willie by adding a few-second extract of it to the current "Walt Disney Animation Studios" logo.
They cannot sue for copyright infringement anymore, but they may go the registered-trademark route, by saying "It's in our logo!".
Don't confuse the license with project ownership. Flyway is owned by Red Gate Software and the community edition of Flyway is licensed under Apache 2.0. Apache Atlas is owned by the Apache Software Foundation AND licensed under Apache 2.0.
Ah, my fault. But that does not change the point I'm trying to make: project ownership is equally important if you cannot just fork and maintain some open source software yourself. It's something to include in risk calculations.
It looks easy to disable a rule: `SELECT pglinter.disable_rule('B006');`.
That said, I agree with you that some of the default rules may be bad. For example, B001 & T001 recommend primary keys, but following them will effectively kill a TimescaleDB hypertable (primary keys are not recommended there).
Don't you have energy cooperatives to avoid this in the Netherlands? Going by rescoop.eu, I found hetcooperatie.nl, energiesamen.nu & lochemenergie.org.
Even if you are in Newfoundland instead, maybe ask cecooperative.ca if there is a project to create one in your province.