mirror of https://github.com/ml-explore/mlx-examples.git, synced 2025-07-02 23:01:15 +08:00
* Predict stop sequence matches during streaming

  Check for overlap of stop sequences and the tokens array for potential sequence matches after more tokens get generated. Generate tokens until we can confirm that the stop sequence is not met.
* fix typo
* Change sequence_overlap logic
* range isn't inclusive, add 1 to max_overlap
* Add test_server.py

  Added a test for the sequence_overlap method
* nits
* eos sequence
* finalize

Co-authored-by: Y4hL <43219534+Y4hL@users.noreply.github.com>
Co-authored-by: Awni Hannun <awni@apple.com>

| Name |
|---|
| .. |
| test_datsets.py |
| test_gguf.py |
| test_lora.py |
| test_models.py |
| test_sample_utils.py |
| test_server.py |
| test_tuner_utils.py |
| test_utils_load_model.py |
| test_utils.py |
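
The commit message above describes checking whether the end of the generated tokens overlaps the start of a stop sequence, so that streaming can hold back output until a match is confirmed or ruled out. A minimal sketch of that idea follows. Only the name `sequence_overlap` and the "add 1 to max_overlap" detail come from the commit message; the signature, the body, and the token IDs are assumptions for illustration and may differ from the code in the repository.

```python
from typing import Sequence


def sequence_overlap(s1: Sequence, s2: Sequence) -> bool:
    """Return True if a non-empty suffix of s1 equals a prefix of s2.

    Assumed semantics: s1 is the list of tokens generated so far and
    s2 is one stop sequence, both as lists of token IDs.
    """
    max_overlap = min(len(s1), len(s2))
    # range() excludes its upper bound, so add 1 to also test the
    # longest possible overlap.
    return any(s1[-i:] == s2[:i] for i in range(1, max_overlap + 1))


# Hypothetical token IDs, for illustration only.
tokens = [5, 8, 13]
partial_match = sequence_overlap(tokens, [13, 21])   # suffix [13] starts the stop sequence
no_match = sequence_overlap(tokens, [21, 13])        # no suffix of tokens is a prefix
full_match = sequence_overlap([1, 2, 3], [2, 3])     # the whole stop sequence is present
```

Under these assumptions, a streaming loop would keep generating while any stop sequence reports an overlap, and emit the buffered text only once no stop sequence can still be completed.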