US Patent 10911063 Adaptive speculative decoding

Patent 10911063 was granted and assigned to Intel on February, 2021 by the United States Patent and Trademark Office.

Overview Structured Data Issues Contributors Activity

All edits

Edits on 7 Aug, 2024

"update inverses"

Golden AI

edited on 7 Aug, 2024

Edits made to:

Infobox (+1 properties)

Infobox

Patent Citations Received

Edits on 12 Dec, 2023

"Add patent abstract"

Golden AI

edited on 12 Dec, 2023

Edits made to:

Article (+858 characters)

Article

Patent abstract

Examples herein relate to decoding tokens using speculative decoding operations to decode tokens at an offset from a token decoded by a sequential decoding operation. At a checkpoint, a determination is made as to whether tokens to be decoded by the sequential and speculative decoding operations align. If there is alignment, the speculatively decoded tokens after a discard window are committed and made available for access. If there is not alignment, the speculatively decoded tokens are discarded. A miss in alignment and a fullness level of a buffer that stores speculatively decoded tokens are assessed to determine a next offset level for a start of speculative decoding. A size of a discard window can be set using a relationship based on the offset level to improve buffer utilization and to attempt to improve changes of alignments.

Edits on 19 Jul, 2023

"update inverses"

Golden AI

edited on 19 Jul, 2023

Edits made to:

Infobox (+1 properties)

Infobox

Patent Citations Received

‌

US Patent 11706290 Direct server reply for infrastructure services

Edits on 21 May, 2023

"Remove website redirecting to Patent Public Search front page"