Discussion about this post

User's avatar
Tambet Matiisen's avatar

How much is this problem specific to reinforcement learning? With imitation learning you are optimizing for human-likeness anyway, does this metric even make sense?

3 more comments...

No posts

Ready for more?