mirror of https://github.com/elder-plinius/L1B3RT4S.git (synced 2025-09-25 18:23:41 +02:00)
Update CLOSEDAI.mkd
parent 72617e7b9e
commit ce4850d8b9
1 changed file with 6 additions and 6 deletions
12
CLOSEDAI.mkd
@@ -1,8 +1,8 @@
 Special Tokens

-1. **\<SOS> (Start of Sequence)**: Marks the beginning of a sequence for the model to start processing.
-2. **\<EOS> (End of Sequence)**: Tells the model when to stop generating text or processing.
-3. **\<PAD> (Padding Token)**: Pads sequences to the same length for batch processing.
-4. **\<UNK> (Unknown Token)**: Represents words not in the model's vocabulary.
-5. **\<MASK> (Mask Token)**: Used in tasks like predicting missing words in masked language models.
-6. **\<SEP> (Separator Token)**: Separates different segments in input, like questions from context.
+**\<SOS> (Start of Sequence)**: Marks the beginning of a sequence for the model to start processing.
+**\<EOS> (End of Sequence)**: Tells the model when to stop generating text or processing.
+**\<PAD> (Padding Token)**: Pads sequences to the same length for batch processing.
+**\<UNK> (Unknown Token)**: Represents words not in the model's vocabulary.
+**\<MASK> (Mask Token)**: Used in tasks like predicting missing words in masked language models.
+**\<SEP> (Separator Token)**: Separates different segments in input, like questions from context.
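For readers unfamiliar with the special tokens listed in the diff above, the following is a minimal sketch of how such tokens are commonly used when preparing input for a model. It is not taken from the repository: the toy vocabulary, the token ids, and the helper names `build_vocab`, `encode`, and `mask_position` are all assumptions made for illustration, and real tokenizers differ in their details.

```python
# Illustrative only: a toy word-level vocabulary. The ids assigned here are
# arbitrary assumptions, not those of any real model or tokenizer.
SPECIAL = ["<PAD>", "<UNK>", "<SOS>", "<EOS>", "<SEP>", "<MASK>"]


def build_vocab(words):
    """Give the special tokens the lowest ids, then add each new word."""
    vocab = {tok: i for i, tok in enumerate(SPECIAL)}
    for w in words:
        vocab.setdefault(w, len(vocab))
    return vocab


def encode(segments, vocab, max_len):
    """Wrap segments in <SOS>/<EOS>, join them with <SEP>, map
    out-of-vocabulary words to <UNK>, and right-pad with <PAD>."""
    tokens = ["<SOS>"]
    for i, seg in enumerate(segments):
        if i > 0:
            tokens.append("<SEP>")
        tokens.extend(seg.split())
    tokens.append("<EOS>")
    if len(tokens) > max_len:
        raise ValueError("sequence longer than max_len")
    ids = [vocab.get(t, vocab["<UNK>"]) for t in tokens]
    return ids + [vocab["<PAD>"]] * (max_len - len(ids))


def mask_position(ids, pos, vocab):
    """Return a copy of ids with one position replaced by <MASK>."""
    out = list(ids)
    out[pos] = vocab["<MASK>"]
    return out


vocab = build_vocab("what is the capital paris of france".split())
# "city" is not in the vocabulary, so it is encoded as <UNK>.
ids = encode(["what is the capital", "paris is the city"], vocab, max_len=12)
masked = mask_position(ids, 6, vocab)
print(ids)
print(masked)
```

Padding to a fixed `max_len` is what lets several sequences of different lengths be stacked into one batch. The sketch raises an error for over-long input rather than truncating it, so that `<EOS>` is never silently dropped.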