byte pair encoding

短语

词形变化

byte pair encodings 复数

别名

byte-pair encoding

释义与例句

n.
  1. 1.

    A lossless data compression algorithm that iteratively replaces the most frequent pair of adjacent bytes in a sequence with a new byte not already present in the data.

    可数 不可数 计算机 工程 数学
  2. 2.

    A subword tokenization method that iteratively merges the most frequent pairs of adjacent characters in a corpus to form longer and more meaningful tokens, typically until a predefined vocabulary size is reached.

    可数 不可数