Discussion about this post

Kashyap Chitta

Great to see more people realizing this! You may find this an interesting read; we came across many of the same issues: https://kashyap7x.github.io/assets/pdf/students/Fauth2025.pdf

Tambet Matiisen

How specific is this problem to reinforcement learning? With imitation learning you are optimizing for human-likeness anyway, so does this metric even make sense?
