Hacker News .hnnew | past | comments | ask | show | jobs | submitlogin

I assume this has been tried but what happens if you give MuZero a goal like "keep the system/process that spawns me running as long as possible?"


Why do you assume this has been tried? It's not even clear what the game is. In this setting, what state and actions would the algorithm have access to?


In some games it could find an equilibrium where it could keep the game going on indefinitely by moving back and forth, for example (which won't work in a game like Go[1], though).

1: https://en.wikipedia.org/wiki/Rules_of_Go#Ko_and_Superko




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: