What are the key components of TransDLANet's architecture?

Question

Answers ( 1 )

    0
    2025-03-28T01:56:36+00:00

    The architecture of TransDLANet includes:
    - A CNN base network (ResNet-101 pretrained on ImageNet) for feature extraction.
    - A Transformer encoder for self-attentive feature learning on query embedding vectors.
    - A dynamic decoder that fuses query vectors with RoI features and image features.
    - Shared multi-layer perceptron (MLP) branches for multi-task learning, decoding classification confidence, bounding box coordinates, and segmentation masks.

Leave an answer