Discrete Attn Transformer
Implementation of Discrete Attn Transformer. Includes from scratch training of transformer for Templatic Generation Tasks (dataset available at: https://huggingface.co/datasets/rfernand/templatic_generation_tasks), along with code to evaluate LLMs on same dataset. See our paper: “Mechanisms of Symbol Processing…