Hi urchade,
Thanks a lot for this great work.
I'm trying to reproduce your results on CONLL04. But I found that the code reported an error:
Arguments: ("RuntimeError: The size of the 2D attn_mask is not correct.")
Through debugging, The error is reported in the "loss = model(batch)"l in the train.py file.In the source code, this statement is placed in a try-catch, which will cause the model to not be trained, how can I fix this?

Looking forward to your reply, thank you.
Sincerely
Hi urchade,
Thanks a lot for this great work.

I'm trying to reproduce your results on CONLL04. But I found that the code reported an error:
Arguments: ("RuntimeError: The size of the 2D attn_mask is not correct.")
Through debugging, The error is reported in the "loss = model(batch)"l in the train.py file.In the source code, this statement is placed in a try-catch, which will cause the model to not be trained, how can I fix this?
Looking forward to your reply, thank you.
Sincerely