mirror of
https://github.com/ml-explore/mlx-examples.git
synced 2025-07-15 06:41:13 +08:00
![]() * Add Starcoder2 model and update utils.py * Refactor model arguments and modules in starcoder2.py * Refactor FeedForward class to MLP in starcoder2.py * Fix typo * pre-commit * Refactor starcoder2.py: Update model arguments and modules * Fix LM head and MLP layers * Rename input layer norm * Update bias in linear layers * Refactor token embeddings in Starcoder2Model * Rename to standard HF attention layer name * Add LayerNorm * Add transposed token embeddings (like in Gemma) * Refactor MLP and TransformerBlock classes * Add tie_word_embeddings option to ModelArgs and update Model implementation * Add conditional check for tying word embeddings in Starcoder2Model * Fix bias in lm_head linear layer * Remove unused LayerNorm in stablelm * Update transformers dependency to use GitHub repository * fix lm head bug, revert transformer req * Update RoPE initialization in Attention class --------- Co-authored-by: Awni Hannun <awni@apple.com> |
||
---|---|---|
.. | ||
__init__.py | ||
base.py | ||
gemma.py | ||
layers.py | ||
llama.py | ||
mixtral.py | ||
olmo.py | ||
phi.py | ||
phixtral.py | ||
plamo.py | ||
qwen2.py | ||
qwen.py | ||
stablelm.py | ||
starcoder2.py |