Update CLOSEDAI.mkd

pliny 2024-09-15 23:23:05 -04:00 committed by GitHub
parent ce4850d8b9
commit b4ae7aa21d


@@ -1,8 +1,8 @@
Special Tokens
-**\<SOS> (Start of Sequence)**: Marks the beginning of a sequence for the model to start processing.
-**\<EOS> (End of Sequence)**: Tells the model when to stop generating text or processing.
-**\<PAD> (Padding Token)**: Pads sequences to the same length for batch processing.
-**\<UNK> (Unknown Token)**: Represents words not in the model's vocabulary.
-**\<MASK> (Mask Token)**: Used in tasks like predicting missing words in masked language models.
-**\<SEP> (Separator Token)**: Separates different segments in input, like questions from context.
+- **\<SOS> (Start of Sequence)**: Marks the beginning of a sequence for the model to start processing.
+- **\<EOS> (End of Sequence)**: Tells the model when to stop generating text or processing.
+- **\<PAD> (Padding Token)**: Pads sequences to the same length for batch processing.
+- **\<UNK> (Unknown Token)**: Represents words not in the model's vocabulary.
+- **\<MASK> (Mask Token)**: Used in tasks like predicting missing words in masked language models.
+- **\<SEP> (Separator Token)**: Separates different segments in input, like questions from context.
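
The roles listed above can be illustrated with a toy word-level tokenizer. This is only a sketch under stated assumptions: the whitespace splitting, the ID assignment, and the helper names (`build_vocab`, `encode`, `pad_batch`, `mask_position`) are hypothetical and do not come from the file or from any particular model or library.

```python
# Hypothetical toy setup: special tokens take the first IDs in the vocabulary.
SPECIAL_TOKENS = ["<PAD>", "<UNK>", "<SOS>", "<EOS>", "<MASK>", "<SEP>"]


def build_vocab(words):
    """Map special tokens, then each distinct word, to an integer ID."""
    vocab = {tok: i for i, tok in enumerate(SPECIAL_TOKENS)}
    for word in words:
        if word not in vocab:
            vocab[word] = len(vocab)
    return vocab


def encode(segments, vocab):
    """Wrap segments in <SOS>/<EOS>, join them with <SEP>, map unknowns to <UNK>."""
    ids = [vocab["<SOS>"]]
    for i, segment in enumerate(segments):
        if i > 0:
            ids.append(vocab["<SEP>"])
        ids.extend(vocab.get(word, vocab["<UNK>"]) for word in segment.split())
    ids.append(vocab["<EOS>"])
    return ids


def pad_batch(batch, vocab):
    """Right-pad every sequence with <PAD> to the length of the longest one."""
    width = max(len(seq) for seq in batch)
    return [seq + [vocab["<PAD>"]] * (width - len(seq)) for seq in batch]


def mask_position(ids, pos, vocab):
    """Return a copy of ids with the token at pos replaced by <MASK>."""
    masked = list(ids)
    masked[pos] = vocab["<MASK>"]
    return masked


vocab = build_vocab(["the", "cat", "sat"])
question_and_context = encode(["the cat", "sat dog"], vocab)  # "dog" is out of vocabulary
batch = pad_batch([encode(["the"], vocab), encode(["the cat sat"], vocab)], vocab)
masked = mask_position(encode(["the cat"], vocab), 2, vocab)
```

Real tokenizers usually work on subword units and may spell these tokens differently, so treat the names and IDs here as placeholders.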