Squares as Tokens, Geometric Attention Bias (GAB), Dot-Product Attention, Source-Destination Policy Head, and Sparse Transcoders allow for clear identification of where the model is "looking." Is there any intention to create a tool that explains FEN chess positions in natural language (NLP) using Maia-3 (Maia 2600)?
Squares as Tokens, Geometric Attention Bias (GAB), Dot-Product Attention, Source-Destination Policy Head, and Sparse Transcoders allow for clear identification of where the model is "looking." Is there any intention to create a tool that explains FEN chess positions in natural language (NLP) using Maia-3 (Maia 2600)?