Logs29: Wrap up Mutual Information Beam

Higepon Taro Minowa edited this page Jun 28, 2018 · 1 revision

Conclusion

  • Mutual Information + Beam does not work: even with beam_width = 100, it does not produce non-generic responses.
    • This suggests beam search is too close to optimal decoding, so there is no room for the agent to explore the action space.
  • We should switch back to the sampling-based approach :(
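
To illustrate the exploration point above, here is a minimal toy sketch, not the model used in these logs. It assumes a hypothetical fixed next-token distribution (`VOCAB` and `PROBS` are made up for illustration) and leaves out the mutual information scoring entirely. It only shows that beam search returns the same high-probability sequence on every run, while sampling visits many different sequences.

```python
import math
import random

# Hypothetical toy "policy": the same next-token distribution at every step.
# These tokens and probabilities are invented for illustration only.
VOCAB = ["i", "don't", "know", "pizza"]
PROBS = [0.5, 0.3, 0.15, 0.05]


def beam_search(length, beam_width):
    """Keep the beam_width highest log-probability sequences at each step."""
    beams = [([], 0.0)]
    for _ in range(length):
        candidates = [
            (tokens + [tok], score + math.log(p))
            for tokens, score in beams
            for tok, p in zip(VOCAB, PROBS)
        ]
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams


def sample(length, rng):
    """Draw each token from the distribution instead of maximizing."""
    return rng.choices(VOCAB, weights=PROBS, k=length)


# Beam search is deterministic: repeated runs give the same top sequence.
top_a = beam_search(length=3, beam_width=2)[0][0]
top_b = beam_search(length=3, beam_width=2)[0][0]

# Sampling explores: repeated draws cover many distinct sequences.
rng = random.Random(0)
samples = {tuple(sample(3, rng)) for _ in range(200)}

print("beam top:", top_a)
print("distinct sampled sequences:", len(samples))
```

In this toy setting the beam always settles on the most frequent token repeated, which mirrors the generic-response problem, whereas sampling gives a learning agent different actions to try.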

Next steps

  • Save model files
  • Restore the sampling-based approach