Special Tokens
<SOS> (Start of Sequence): Marks the beginning of a sequence for the model to start processing.
<EOS> (End of Sequence): Tells the model when to stop generating text or processing.
<PAD> (Padding Token): Pads sequences to the same length for batch processing.
<UNK> (Unknown Token): Represents words not in the model's vocabulary.
<MASK> (Mask Token): Used in tasks such as predicting missing words in masked language models.
<SEP> (Separator Token): Separates different segments of the input, such as a question from its context.
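A minimal sketch of how these tokens come together in practice, using a hypothetical toy vocabulary (real tokenizers expose the same ideas through their own APIs): <SOS>/<EOS> frame each sequence, <UNK> stands in for out-of-vocabulary words, and <PAD> fills shorter sequences so a batch is rectangular.

```python
# Sketch only: a toy vocabulary, not a real tokenizer.
SPECIAL_TOKENS = ["<PAD>", "<UNK>", "<SOS>", "<EOS>", "<SEP>", "<MASK>"]
WORDS = ["what", "is", "padding", "tokens", "pad", "sequences"]
VOCAB = {tok: i for i, tok in enumerate(SPECIAL_TOKENS + WORDS)}


def encode(text: str) -> list[int]:
    """Wrap a sentence in <SOS>/<EOS>; map unknown words to <UNK>."""
    ids = [VOCAB["<SOS>"]]
    for word in text.lower().split():
        ids.append(VOCAB.get(word, VOCAB["<UNK>"]))
    ids.append(VOCAB["<EOS>"])
    return ids


def pad_batch(batch: list[list[int]]) -> list[list[int]]:
    """Right-pad every sequence with <PAD> so all rows have equal length."""
    max_len = max(len(seq) for seq in batch)
    return [seq + [VOCAB["<PAD>"]] * (max_len - len(seq)) for seq in batch]


if __name__ == "__main__":
    batch = pad_batch([
        encode("what is padding"),
        encode("padding tokens pad sequences evenly"),  # "evenly" -> <UNK>
    ])
    for seq in batch:
        print(seq)
```

In masked language models, <MASK> replaces tokens the model must predict, and <SEP> marks the boundary between paired segments (for example, a question and its context) within one input.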