Anchen
561dcf5643
Add support for deepseek coder v2 lite (#882)
* feat: add support for deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
* fix softmax + some cleanup
* more nits
* fix rope
* fix original_max_position_embeddings in rope
* fix original_max_position_embeddings in rope config
* add group greedy
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2024-07-17 07:23:28 -07:00
..
2024-01-12 10:25:56 -08:00
2024-07-17 07:23:28 -07:00
2024-05-08 08:18:13 -07:00
2024-05-08 08:18:13 -07:00
2024-07-17 07:23:28 -07:00
2024-07-02 07:52:39 -07:00
2024-05-08 08:18:13 -07:00
2024-06-02 16:33:20 -07:00
2024-05-21 20:16:31 -07:00
2024-07-11 06:13:17 -07:00
2024-05-27 06:22:21 -07:00
2024-06-10 15:18:34 -07:00
2024-05-08 08:18:13 -07:00
2024-05-31 12:36:05 -07:00
2024-05-08 08:18:13 -07:00
2024-05-08 08:18:13 -07:00
2024-07-12 07:19:11 -07:00
2024-06-12 06:53:55 -07:00
2024-05-08 08:18:13 -07:00
2024-05-31 12:36:05 -07:00
2024-05-08 08:18:13 -07:00
2024-06-14 09:44:50 -07:00
2024-06-14 09:44:50 -07:00
2024-05-08 08:18:13 -07:00
2024-07-08 12:34:31 -07:00
2024-05-08 08:18:13 -07:00
2024-06-14 09:44:50 -07:00
2024-06-11 06:20:04 -07:00
2024-05-23 19:47:35 -07:00