attention-matrix

Here are 3 public repositories matching this topic...

hila-chefer / Transformer-Explainability

[CVPR 2021] Official PyTorch implementation of Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer-based networks.

deep-learning vit bert perturbation attention-visualization bert-model explainability attention-matrix vision-transformer transformer-interpretability visualize-classifications cvpr2021

Updated Jan 24, 2024
Jupyter Notebook

LeviBorodenko / garnn

TensorFlow implementation of Graphical Attention Recurrent Neural Networks based on work by Cirstea et al., 2019.

tensorflow shape batch rnn attention attention-mechanism rnn-tensorflow graph-convolutional-networks temporal-data paper-implementations graph-signals graph-neural-networks diffusion-graph-convolution attention-matrix

Updated Jan 2, 2020
Python

Attention Saver extracts entire attention matrices or row-wise statistics (e.g. entropy) from any layer of a HuggingFace causal LLM at ultra-long context lengths, while using flash-attention and without running out of GPU memory.

interpretability attention-map attention-matrix long-context llms

Updated Oct 6, 2025
Python
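
To make "row-wise statistics (e.g. entropy)" concrete, here is a minimal sketch of computing the entropy of each causal attention row one row at a time, so the full attention matrix is never stored. It is not Attention Saver's API: the function name `causal_attention_row_entropy` and the plain-list inputs are assumptions made for illustration, and the code uses ordinary scaled dot-product attention in pure Python rather than flash-attention.

```python
import math


def causal_attention_row_entropy(queries, keys):
    """Return the entropy (in nats) of each causal attention row.

    Hypothetical helper for illustration only.
    queries, keys: lists of equal-length float vectors, one per token.
    Only one row of attention weights exists in memory at a time.
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    entropies = []
    for i, query in enumerate(queries):
        # Causal mask: token i attends only to positions 0..i.
        scores = [
            scale * sum(q * k for q, k in zip(query, keys[j]))
            for j in range(i + 1)
        ]
        # Numerically stable softmax over the visible positions.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        probs = [e / total for e in exps]
        # Shannon entropy of this row; the row is discarded afterwards.
        entropies.append(-sum(p * math.log(p) for p in probs if p > 0.0))
    return entropies


# Identical keys give uniform attention, so row i has entropy log(i + 1).
example = causal_attention_row_entropy(
    [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]],
)
```

Keeping only a per-row scalar reduces storage from quadratic to linear in sequence length, which is the general idea behind extracting statistics instead of whole matrices at long context.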
