byte pair encoding
Inflected forms
byte pair encodings
plural
alias
byte-pair encoding
Definitions and examples
n.
1.
A lossless data compression algorithm that iteratively replaces the most frequent pair of adjacent bytes in a sequence with a new byte not already present in the data.
countable uncountable computing engineering mathematics
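The compression sense can be illustrated with a minimal sketch. It assumes the input still has at least one unused byte value and stops once no pair occurs twice; the names `bpe_compress` and `bpe_decompress` are hypothetical and chosen only for this illustration.

```python
from collections import Counter

def bpe_compress(data: bytes):
    """Replace the most frequent adjacent byte pair with an unused byte, repeatedly."""
    seq = list(data)
    table = []  # (new_byte, (left, right)) in the order the replacements were made
    while True:
        present = set(seq)
        new = next((b for b in range(256) if b not in present), None)
        pairs = Counter(zip(seq, seq[1:]))
        if new is None or not pairs:
            break  # no free byte value, or fewer than two bytes left
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # replacing a pair that occurs once saves nothing
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(new)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
        table.append((new, pair))
    return bytes(seq), table

def bpe_decompress(data: bytes, table):
    """Undo the replacements in reverse order to recover the original bytes."""
    seq = list(data)
    for new, (left, right) in reversed(table):
        out = []
        for b in seq:
            if b == new:
                out.extend((left, right))
            else:
                out.append(b)
        seq = out
    return bytes(seq)
```

The replacement table must be stored alongside the compressed bytes, since decompression depends on it.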
2.
A subword tokenization method that starts from individual characters (or bytes) and iteratively merges the most frequent pair of adjacent symbols in a corpus into a new, longer token, typically until a predefined vocabulary size is reached.
countable uncountable
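The tokenization sense can be sketched in a similar way. This is a simplified illustration, assuming the corpus is a list of whitespace-free words and that merges never cross word boundaries; `learn_bpe` and its parameters are hypothetical names, not the API of any particular library.

```python
from collections import Counter

def learn_bpe(words, vocab_size):
    """Learn merge rules by repeatedly merging the most frequent adjacent symbol pair."""
    # Each distinct word is stored as a tuple of symbols with its frequency.
    corpus = Counter(tuple(w) for w in words)
    vocab = {ch for symbols in corpus for ch in symbols}
    merges = []
    while len(vocab) < vocab_size:
        pairs = Counter()
        for symbols, freq in corpus.items():
            for pair in zip(symbols, symbols[1:]):
                pairs[pair] += freq
        if not pairs:
            break  # every word is already a single symbol
        best = max(pairs, key=pairs.get)
        merged = best[0] + best[1]
        new_corpus = Counter()
        for symbols, freq in corpus.items():
            out, i = [], 0
            while i < len(symbols):
                if symbols[i:i + 2] == best:
                    out.append(merged)
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_corpus[tuple(out)] += freq
        corpus = new_corpus
        merges.append(best)
        vocab.add(merged)
    return merges, vocab
```

For example, `learn_bpe(["aab", "aab", "ab"], 3)` learns the single merge `("a", "b")`, because that pair occurs three times against two for `("a", "a")`.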