mlx-examples

zhangyiss/mlx-examples

Fork 0

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-06-25 09:51:19 +08:00

Commit Graph

Author	SHA1	Message	Date
Cavit Erginsoy	7ee76a32a4	Add memory estimation tool for MLX language models This commit introduces a comprehensive memory estimation utility for MLX language models, supporting: - Dynamic parameter calculation across diverse model architectures - Handling of quantized and standard models - Estimation of model weights, KV cache, and overhead memory - Support for bounded and unbounded KV cache modes - Flexible configuration via command-line arguments The new tool provides detailed memory usage insights for different model configurations and generation scenarios.	2025-03-10 03:03:01 +00:00

Author

SHA1

Message

Date

Cavit Erginsoy

7ee76a32a4

Add memory estimation tool for MLX language models

This commit introduces a comprehensive memory estimation utility for MLX language models, supporting:
- Dynamic parameter calculation across diverse model architectures
- Handling of quantized and standard models
- Estimation of model weights, KV cache, and overhead memory
- Support for bounded and unbounded KV cache modes
- Flexible configuration via command-line arguments

The new tool provides detailed memory usage insights for different model configurations and generation scenarios.

2025-03-10 03:03:01 +00:00

1 Commits