HN2
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
tjungblut
6 days ago
|
parent
|
context
|
favorite
| on:
An AI agent published a hit piece on me
I wonder if we can do a prompt injection from the comments
help
7moritz7
6 days ago
|
next
[–]
These are sota models, not open source 7b parameter ones. They've put lots of effort into preventing prompt injections during the agentic reinforcement learning
reply
verdverm
5 days ago
|
prev
[–]
not basic negatives one's so far, it already noticed those, you can see it in various "thoughts as posts"
I gave it points to reflect on and told it to apologize, which it has since done
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: