ARTICLE
|
doi:10.20944/preprints202310.1487.v2
Subject:
Computer Science And Mathematics,
Artificial Intelligence And Machine Learning
Keywords:
post-training pruning; combinatorial optimization; large language models; inference acceleration
Online: 27 May 2024 (08:34:18 CEST)