Hello Professor,
I've recently been studying your paper and reproducing your code, and I have a few questions:
- After training stage1 and stage2, I was able to reproduce Figure 4 from your paper, but how can I obtain the other figures, such as Figure 5? Also, how can I get the results in Table II, such as success rate and extra time, either directly from your code or through some calculation?
- How can I get the code for the baselines, such as SL-policy and NH-ORCA?
- How long did you train stage1 and stage2? What output in the terminal or the GUI indicates that the policy has been trained well?
I hope you can give me some advice. Thank you very much!