Coding agents take turns to carry out tasks. Using Claude code as the harness, the repo contains code to
- collect per-turn token distributions
- collect per-turn inference metrics for open source/weight models
- collect per-turn tool call distributions
To collect above metrics, please check the README in the folder /isl_osl here.
Install prerequisites and setup python virtual env.
./install.sh