fix adaptive p sampler rewinding too far back (#1359)

* fix adaptive p sampler rewinding too far back
* update comments
* correct default value for total_weight, more comments
* new variables/names
* update comment for n_rewind
* move null pointer check back to common_sampler_review()
* refactor weighted_sum and total_weight to vector<pair>, better boundary check in llama_review_adaptive_p_impl()
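
The last bullet and the new `n_rewind` parameter suggest a history of (weighted_sum, total_weight) pairs that is trimmed by a bounded count. The following is only a minimal sketch of that idea, not the code from this commit: `adaptive_p_history`, `history_record` and `history_rewind` are hypothetical names, and keeping the first snapshot as a floor is an assumption made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical stand-in for the sampler state; NOT the real llama_sampler_adaptive_p.
struct adaptive_p_history {
    // Each entry is one (weighted_sum, total_weight) snapshot.
    std::vector<std::pair<float, float>> snapshots;
};

// Append one snapshot to the history.
void history_record(adaptive_p_history & h, float weighted_sum, float total_weight) {
    h.snapshots.emplace_back(weighted_sum, total_weight);
}

// Drop up to n_rewind snapshots from the end of the history.
// Assumption: the first snapshot is the initial state and is never dropped,
// so the request is clamped instead of rewinding too far back.
// Returns the number of snapshots actually dropped.
int32_t history_rewind(adaptive_p_history & h, int32_t n_rewind) {
    if (n_rewind <= 0 || h.snapshots.size() <= 1) {
        return 0; // nothing requested, or nothing that may be dropped
    }
    const size_t max_drop = h.snapshots.size() - 1;
    const size_t n_drop   = std::min<size_t>(static_cast<size_t>(n_rewind), max_drop);
    h.snapshots.erase(h.snapshots.end() - static_cast<std::ptrdiff_t>(n_drop),
                      h.snapshots.end());
    return static_cast<int32_t>(n_drop);
}
```

A single count replaces the two boolean flags of the old signature in this sketch: zero leaves the history alone and a positive value trims it, with the clamp guarding the lower boundary.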
2026-03-06 20:10:08 +00:00 · 2026-03-04 07:26:25 -05:00
parent f27678d39b
commit a903409a5e
7 changed files with 75 additions and 43 deletions
--- a/include/llama.h
+++ b/include/llama.h
@@ -1415,7 +1415,7 @@ LLAMA_API struct llama_grammar* llama_sampler_init_grammar_lazy_patterns(
                               llama_token_data_array * candidates,
                      struct llama_sampler_adaptive_p * adapt_p_ctx);

-    void llama_review_adaptive_p(struct llama_sampler_adaptive_p * adapt_p_ctx, const bool record, const bool rewind);
+    void llama_review_adaptive_p(struct llama_sampler_adaptive_p * adapt_p_ctx, const int32_t n_rewind);


    /// @details Mirostat 1.0 algorithm described in the paper https://arxiv.org/abs/2007.14966. Uses tokens instead of words.