mlx-examples/llms/mlx_lm
mark e5c98f4715 Update utils.py
Enable saving the KV cache as a safetensors file after a text completion. Once the generate step has finished producing all the tokens, the key-value cache is converted into a dict and saved with mx.save_safetensors to a user-specified file location, similar to cache_prompt.
2024-09-26 16:58:02 +01:00
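
The dict-building step described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper name `flatten_kv_cache`, the `"{layer}_keys"` / `"{layer}_values"` naming scheme, and the assumption that the cache is a per-layer sequence of (keys, values) pairs are all hypothetical. Placeholder strings stand in for real arrays so the sketch runs without MLX installed.

```python
from typing import Any, Dict, Sequence, Tuple


def flatten_kv_cache(cache: Sequence[Tuple[Any, Any]]) -> Dict[str, Any]:
    """Flatten per-layer (keys, values) pairs into a flat name -> array dict.

    The key naming scheme here is an assumption made for illustration.
    """
    cache_dict: Dict[str, Any] = {}
    for layer, (keys, values) in enumerate(cache):
        cache_dict[f"{layer}_keys"] = keys
        cache_dict[f"{layer}_values"] = values
    return cache_dict


# Stand-in data: two layers, with strings in place of real arrays.
demo = flatten_kv_cache([("k0", "v0"), ("k1", "v1")])
print(sorted(demo))
```

With MLX installed and `mx.array` entries in place of the placeholders, a dict of this shape could then be written out with `mx.save_safetensors(path, cache_dict)`, where `path` is the user-specified file location.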

Generate Text with MLX and 🤗 Hugging Face

This is an example of large language model text generation that can pull models from the Hugging Face Hub.

For more information on this example, see the README in the parent directory.

This package also supports fine-tuning with LoRA or QLoRA. For more information, see the LoRA documentation.