Commit Graph

1208 Commits

Author SHA1 Message Date
Awni Hannun
b7d2d1e08c docs update 2025-06-04 01:01:52 +00:00
Awni Hannun
50b9530af5 docs up 2025-06-04 01:01:51 +00:00
Awni Hannun
8437c80779 docs update 2025-06-04 01:01:51 +00:00
Awni Hannun
8463661c92 docs update 2025-06-04 01:01:51 +00:00
Awni Hannun
fa8f47f396 docs update 2025-06-04 01:01:51 +00:00
Awni Hannun
e9fd799887 docs update 2025-06-04 01:01:51 +00:00
Awni Hannun
1a2573d6cc docs update 2025-06-04 01:01:50 +00:00
Awni Hannun
6269c69d0e docs update 2025-06-04 01:01:50 +00:00
Awni Hannun
889b708a69 docs 2025-06-04 01:01:50 +00:00
Awni Hannun
9b53df2ca9 docs update 2025-06-04 01:01:50 +00:00
Awni Hannun
6e63a67805 docs update 2025-06-04 01:01:50 +00:00
Awni Hannun
ec5aab0071 docs update 2025-06-04 01:01:49 +00:00
Awni Hannun
62c5052c19 docs update 2025-06-04 01:01:49 +00:00
Awni Hannun
ca1b2f4a4e docs update 2025-06-04 01:01:49 +00:00
Awni Hannun
8152f20c65 use proper version 2025-06-04 01:01:49 +00:00
Awni Hannun
c098245664 docs update 2025-06-04 01:01:49 +00:00
Awni Hannun
981bf7ae2b docs update 2025-06-04 01:01:49 +00:00
Awni Hannun
ce95a29690 docs update 2025-06-04 01:01:49 +00:00
Awni Hannun
9e69a72b8c docs update 2025-06-04 01:01:48 +00:00
Awni Hannun
17470bf630 remove uneeded files in docs 2025-06-04 01:01:48 +00:00
Awni Hannun
a693e6e1d8 update docs 2025-06-04 01:01:48 +00:00
Awni Hannun
fd34610634 docs update 2025-06-04 01:01:48 +00:00
Awni Hannun
84bebc2161 docs up 2025-06-04 01:01:48 +00:00
Awni Hannun
9882295582 docs up 2025-06-04 01:01:48 +00:00
Awni Hannun
ebd913400a docs update 2025-06-04 01:01:48 +00:00
Awni Hannun
217cdf3fc9 docs 2025-06-04 01:01:48 +00:00
Awni Hannun
43cd655ba1 docs 2025-06-04 01:01:48 +00:00
Awni Hannun
8c406bcb9b update docs 2025-06-04 01:01:48 +00:00
Awni Hannun
01489e172d docs 2025-06-04 01:01:48 +00:00
Awni Hannun
616449e363 docs 2025-06-04 01:01:48 +00:00
Awni Hannun
a66e6d3214 docs 2025-06-04 01:01:48 +00:00
Awni Hannun
d3d0ad9564 docs 2025-06-04 01:01:47 +00:00
Awni Hannun
a60a600c6a docs 2025-06-04 01:01:47 +00:00
Awni Hannun
e84ebcf0b9 docs 2025-06-04 01:01:47 +00:00
Awni Hannun
372f2ac025 docs 2025-06-04 01:01:47 +00:00
Awni Hannun
80322b562e docs 2025-06-04 01:01:47 +00:00
Awni Hannun
fbd10a48d4 docs 2025-06-04 01:01:47 +00:00
Angelos Katharopoulos
aede70e81d
Perf regression fix (#2243) 2025-06-03 17:55:12 -07:00
Cheng
85a8beb5e4
Avoid atomic updates across CPU/GPU in CUDA event (#2231) 2025-06-03 16:49:06 -07:00
Cheng
0bb89e9e5f
Share more common code in Compiled (#2240)
* Share more common code in Compiled

* Remove build_lib_name
2025-06-03 16:48:50 -07:00
Cheng
5685ceb3c7
Avoid invoking allocator::malloc when creating CUDA event (#2232) 2025-06-03 16:48:40 -07:00
Suryash Malviya
0408ba0a76
Optimizing Complex Matrix Multiplication using Karatsuba’s Algorithm (#2220)
* Implementing Complex Matmul using Karatsuba Algorithm

* Implemented Karatsuba's Algorithm for complex matmul and pre-commit them

* fix

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2025-06-02 15:58:46 -07:00
Awni Hannun
cbad6c3093
version (#2237) 2025-06-02 15:58:33 -07:00
Cheng
1b021f6984
Fast primitives decide when to use the fallback (#2216) 2025-06-02 13:26:37 -07:00
Cheng
95b7551d65
Do not check event.is_signaled() in eval_impl (#2230) 2025-06-02 13:23:34 -07:00
Cheng
db5a7c6192
Add memory cache to CUDA backend (#2221)
* Move BufferCache out of allocator

* Add memory cache to cuda backend allocator

* Simplify BufferCache assuming buf can not be null
2025-05-30 12:12:54 -07:00
Awni Hannun
6ef2f67e7f
5bit quants (#2226)
* 5bit quants

* 5bit quants
2025-05-30 12:12:10 -07:00
Cheng
f76ee1ffd2
Move some dims utils to common (#2223) 2025-05-29 06:48:30 -07:00
Cheng
54a71f270a
Remove unused defines (#2217) 2025-05-23 06:14:58 -07:00
Awni Hannun
55b4062dd8
copyright in docs (#2214) 2025-05-21 17:13:04 -07:00