dynamic-masking unified training

AI / ML · transform

Text + AttentionMaskType -> ContextualizedRepresentations

Configure the self-attention mask dynamically to support bidirectional encoding, unidirectional decoding, or sequence-to-sequence tasks within a single model architecture.
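As a rough illustration, the three mask types can be sketched in plain Python. This is a minimal sketch, not code from the source project: the function names `build_attention_mask` and `masked_softmax`, the string labels for the mask types, and the convention that `mask[i][j] == 1` means "position i may attend to position j" are all assumptions made for this example.

```python
import math
from typing import List


def build_attention_mask(mask_type: str, src_len: int, tgt_len: int = 0) -> List[List[int]]:
    """Return an n x n 0/1 matrix; mask[i][j] == 1 lets position i attend to j.

    Hypothetical helper for illustration; names and labels are assumptions.
    """
    n = src_len + tgt_len
    if mask_type == "bidirectional":
        # Every token sees every other token (encoder-style).
        return [[1] * n for _ in range(n)]
    if mask_type == "unidirectional":
        # Each token sees itself and the tokens to its left (decoder-style).
        return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]
    if mask_type == "seq2seq":
        # Source tokens see the whole source segment only; target tokens see
        # the whole source plus target tokens up to and including themselves.
        return [
            [1 if j < src_len or (i >= src_len and j <= i) else 0 for j in range(n)]
            for i in range(n)
        ]
    raise ValueError(f"unknown mask type: {mask_type!r}")


def masked_softmax(scores: List[float], mask_row: List[int]) -> List[float]:
    """Softmax over one row of attention scores; masked positions get weight 0."""
    peak = max(s for s, m in zip(scores, mask_row) if m)  # for numerical stability
    exps = [math.exp(s - peak) if m else 0.0 for s, m in zip(scores, mask_row)]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    for row in build_attention_mask("seq2seq", src_len=2, tgt_len=2):
        print(row)
    # [1, 1, 0, 0]
    # [1, 1, 0, 0]
    # [1, 1, 1, 0]
    # [1, 1, 1, 1]
```

Only the mask changes between modes; the model weights and the attention computation stay the same, which is what allows one architecture to serve all three task families. In a real model the same effect is usually obtained by adding a large negative value to masked scores before the softmax.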

Problem it solves

Maintaining separate model architectures for understanding and generation tasks increases infrastructure overhead.

Consumes

Text, AttentionMaskType

Emits

ContextualizedRepresentations

Distilled from 1 source

The real projects in which this mechanism was found. Attribution is the point: this is how the best teams actually do it.

microsoft/unilm (GitHub)