Project author: irugina

Project description:
attention pruning
Language: Python
Repository: git://github.com/irugina/AP.git
Created: 2020-10-02T05:56:16Z
Project community: https://github.com/irugina/AP


Attention Pruning

This repo hosts code for the paper "Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks" and borrows starter code from fairseq, huggingface, and transformer-xl.
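To make the idea in the title concrete, here is a minimal sketch of one way a data-informed global mask could be built: average attention matrices over a dataset, then zero out the positions with the smallest average weight. This is an illustration under stated assumptions, not the code in this repo. The function names `average_attention` and `global_mask`, the toy matrices, and the rule of pruning the lowest-average entries are all hypothetical.

```python
def average_attention(attention_maps):
    """Element-wise mean of equally sized attention matrices (lists of lists)."""
    n = len(attention_maps)
    rows, cols = len(attention_maps[0]), len(attention_maps[0][0])
    return [
        [sum(m[i][j] for m in attention_maps) / n for j in range(cols)]
        for i in range(rows)
    ]


def global_mask(avg, prune_percent):
    """Return a 0/1 mask that drops the prune_percent% smallest average entries.

    Assumption: "pruning" here means masking fixed positions for all inputs.
    """
    flat = sorted(
        (avg[i][j], i, j) for i in range(len(avg)) for j in range(len(avg[0]))
    )
    n_prune = int(len(flat) * prune_percent / 100)
    mask = [[1] * len(avg[0]) for _ in avg]
    for _, i, j in flat[:n_prune]:
        mask[i][j] = 0
    return mask


# Toy attention matrices from two hypothetical inputs (each row sums to 1).
maps = [
    [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.3, 0.3, 0.4]],
    [[0.5, 0.4, 0.1], [0.2, 0.7, 0.1], [0.1, 0.3, 0.6]],
]
avg = average_attention(maps)
mask = global_mask(avg, 50)
for row in mask:
    print(row)
```

The mask is computed once from data and then reused for every input, which is what makes it "global". How the masked attention is applied and whether the model is fine-tuned afterwards is described in the paper, not here.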

Results

Performance

See the paper for details, a comparison with entmax, and out-of-distribution (OOD) results.

Computational Efficiency

Results on the SQuAD question answering task:

Pruned (%)    Exact match / F1    Time (s)    GPU memory (GB)
0             81.02 / 88.63       95.41       6.85
90            79.62 / 87.32       86.44       5.00
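The trade-off in the two rows above is easier to read as relative changes. The short calculation below uses only the numbers from the table; the dictionary layout and the helper `relative_reduction` are illustrative names, and the comparison assumes both rows were measured under the same setup.

```python
# Values copied from the SQuAD table: key is the pruning percentage.
rows = {
    0: {"em": 81.02, "f1": 88.63, "time_s": 95.41, "mem_gb": 6.85},
    90: {"em": 79.62, "f1": 87.32, "time_s": 86.44, "mem_gb": 5.00},
}


def relative_reduction(before, after):
    """Percentage by which `after` is smaller than `before`."""
    return 100.0 * (before - after) / before


base, pruned = rows[0], rows[90]
time_saving = relative_reduction(base["time_s"], pruned["time_s"])
mem_saving = relative_reduction(base["mem_gb"], pruned["mem_gb"])
em_drop = base["em"] - pruned["em"]
f1_drop = base["f1"] - pruned["f1"]

print(f"time saved:   {time_saving:.1f}%")
print(f"memory saved: {mem_saving:.1f}%")
print(f"EM drop:      {em_drop:.2f} points")
print(f"F1 drop:      {f1_drop:.2f} points")
```

At 90% pruning, this works out to roughly 9% less time and 27% less GPU memory, for a drop of about 1.4 exact-match points and 1.3 F1 points.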