News
In this study, we propose a novel lightweight model specifically for reliable traffic perception in low-light conditions, utilizing an encoder-decoder architecture ... a feature fusion module based on ...
Unlike the commonly used UNet-based diffusion models, Diffusion Transformers apply the transformer architecture ... the model into a dedicated condition encoder for semantic extraction and a velocity ...
IBM says TerraMind is a multimodal model based on a novel “symmetric transformer-based encoder-decoder architecture.” It can handle pixel-based, token-based and sequence-based inputs and learn ...
Our method employs a hierarchical encoder-decoder network constructed with Transformer Blocks and introduces three key innovations: (1) We propose a Transformer Block based on cross-attention ...
A real-time violence detection system leveraging YOLO models for object detection and scene analysis. Extracted features are processed using an LSTM-based classifier to differentiate violent and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results