Focal Attention for Long-Range Interactions in Vision Transformers
Focal Attention for Long-Range Interactions in Vision Transformers
Jianwei Yang,Chunyuan Li,4 作者,Jianfeng Gao
2021 · DBLP: conf/nips/YangLZDXYG21
Neural Information Processing Systems · 引用数 148
TLDR
A new variant of Vision Transformer models, called Focal Transformers, is built, which achieve superior performance over the state-of-the-art (SoTA) Vision Transformers on a range of public image classification and object detection benchmarks.
