Reward hacking prevention methods

  1. Perfect AI Alignment vs Imperfect AI Alignment (Prompt Engineering Secrets)