Image-Specific Adaptation of Transformer Encoders for Compute-Efficient Segmentation
Vision transformer-based models bring significant improvements for image segmentation tasks. Although these architectures offer powerful capabilities irrespective of specific segmentation tasks, their use of computational resources can be taxing on deployed devices. One way to overcome this challenge

