Networking Videotutorial

TM2SP: A Transformer-Based Multi-Level Spatiotemporal Feature Pyramid Network for Video Saliency Prediction

Abstract: This paper proposes an end-to-end video saliency prediction network model, termed TM2SP-Net (Transformer-based Multi-level Spatiotemporal Feature Pyramid Network). Leveraging the strong ...

IEEE

Bidirectional Error-Aware Fusion Network for Video Inpainting

Abstract: Existing video inpainting approaches tend to adopt vision transformers with few customized designs, which poses two limitations. First, the conventional self-attention mechanism treats ...
