What are the key components of TransDLANet's architecture?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
The architecture of TransDLANet includes:
- A CNN base network (ResNet-101 pretrained on ImageNet) for feature extraction.
- A Transformer encoder for self-attentive feature learning on query embedding vectors.
- A dynamic decoder that fuses query vectors with RoI features and image features.
- Shared multi-layer perceptron (MLP) branches for multi-task learning, decoding classification confidence, bounding box coordinates, and segmentation masks.