Meta AI Researchers Propose Token Merging (ToMe) to Accelerate Vision Transformer Execution
Vision transformers (ViT) were introduced to the literature two years ago and have become a central part of computer vision research. Taking a component that performed exceptionally well in linguistic tasks and converting it into the realm of computer vision was a bold move, but it worked. Since then, advancements in the field of computer …