Focal Attention for Long-Range Interactions in Vision Transformers

Jianwei Yang, Chunyuan Li, 4 more authors, Jianfeng Gao

2021 · DBLP: conf/nips/YangLZDXYG21
Neural Information Processing Systems · 148 citations

TLDR

The paper builds a new variant of Vision Transformer models, called Focal Transformers, which achieves superior performance over state-of-the-art (SoTA) Vision Transformers on a range of public image classification and object detection benchmarks.