Anchen
|
561dcf5643
|
Add support for deepseek coder v2 lite (#882)
* feat: add support for deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
* fix softmax + some cleanup
* more nits
* fix rope
* fix original_max_position_embeddings in rope
* fix original_max_position_embeddings in rope config
* add group greedy
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-07-17 07:23:28 -07:00 |
|