Shunta Saito
269faa5fa4
Fix plamo2 model to use rms_norm ( #1308 )
...
* Fix plamo2 model to use rms_norm and enable sliding window attention
* Fix missing variable
* Remove sliding window attention impl. cause it should be done by using RotatingKVCache
* Remove unused imports
2025-03-03 06:12:02 -08:00
..
2024-01-12 10:25:56 -08:00
2024-12-18 19:43:52 -08:00
2024-11-05 10:24:24 -08:00
2025-01-12 12:58:08 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2025-02-11 16:26:59 -08:00
2025-02-28 11:33:18 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2025-01-13 10:22:32 -08:00
2024-12-18 19:43:52 -08:00
2025-02-08 15:46:15 -08:00
2025-02-03 13:36:08 -08:00
2025-02-08 15:46:47 -08:00
2024-12-18 19:43:52 -08:00
2025-01-15 14:55:41 -08:00
2024-12-18 19:43:52 -08:00
2025-02-03 13:36:08 -08:00
2025-02-03 13:36:08 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2025-02-26 16:21:54 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2025-03-03 06:12:02 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2024-12-09 07:58:25 -08:00
2024-12-18 19:43:52 -08:00
2024-12-18 19:43:52 -08:00
2025-02-26 16:21:54 -08:00
2024-08-16 15:28:39 -07:00