Commit Graph

  • aaef4327b4 try older image Awni Hannun 2025-07-15 13:19:40 -0700
  • b5f5462cd4 Test with CUDA 12.0 Awni Hannun 2025-07-15 07:22:15 -0700
  • f409b229a4 fix ring distributed test (#2380) Awni Hannun 2025-07-16 11:25:24 -0700
  • df6d9e972f nits and adding it to test Goekdeniz-Guelmez 2025-07-16 19:13:40 +0200
  • 30571e2326 Rename the copy util in cpu/copy.h to copy_cpu (#2378) Cheng 2025-07-16 23:34:24 +0900
  • 650c956fe6 Merge branch 'ml-explore:main' into adding-Muon-optimizer Gökdeniz Gülmez 2025-07-16 16:29:10 +0200
  • 9a85170473 fix ring distributed test Awni Hannun 2025-07-16 07:25:36 -0700
  • 82ff1505f8 Rename the copy util in cpu/copy.h to copy_cpu Cheng 2025-07-16 20:23:32 +0900
  • d7734edd9f fix complex reduce + nan propagation in min and max (#2377) Awni Hannun 2025-07-15 18:19:47 -0700
  • a0957f208a fix complex reduce + nan propagation in min and max Awni Hannun 2025-07-15 17:00:07 -0700
  • 2ba69bc8fa lower memory uniform sampling (#2361) Awni Hannun 2025-07-15 14:22:07 -0700
  • 2d7e412298 fix Awni Hannun 2025-07-15 13:12:45 -0700
  • cb349a291c [CUDA] Use cuda::std::complex in place of cuComplex (#2372) Cheng 2025-07-15 16:36:13 +0900
  • f0ac833e17 [CUDA] Use cuda::std::complex in place of cuComplex Cheng 2025-07-15 01:25:08 +0000
  • f0a0b077a0 Install linux with mlx[cuda] and mlx[cpu] (#2356) Awni Hannun 2025-07-14 17:17:33 -0700
  • 7fe6f03a5b use fp32 Awni Hannun 2025-07-14 17:02:27 -0700
  • ddd132ca26 lower memory uniform Awni Hannun 2025-07-11 13:39:37 -0700
  • 49114f28ab fix flaky test (#2371) Awni Hannun 2025-07-14 17:16:18 -0700
  • 559ccb6acc fix flaky test Awni Hannun 2025-07-14 16:27:25 -0700
  • e7d2ebadd2 [CUDA] Affine quantize (#2354) Awni Hannun 2025-07-14 15:45:44 -0700
  • e569803d7c update linux build (#2370) Awni Hannun 2025-07-14 15:13:56 -0700
  • 51854d1a19 format Awni Hannun 2025-07-10 14:34:01 -0700
  • 73bb93318f fix Awni Hannun 2025-07-10 14:33:44 -0700
  • e4a3be4411 format Awni Hannun 2025-07-10 11:05:05 -0700
  • 6896043fdd affine quantize and dequantize kernels Awni Hannun 2025-07-10 11:03:11 -0700
  • fab85a9f72 update linux build Awni Hannun 2025-07-14 14:18:28 -0700
  • d34f887abc Add Primitive::name and remove Primitive::print (#2365) Cheng 2025-07-15 06:06:35 +0900
  • 5201df5030 Fix imag() vjp (#2367) Angelos Katharopoulos 2025-07-14 13:11:16 -0700
  • e146ee829a Add Primitive::name and remove Primitive::print Cheng 2025-07-13 09:46:46 +0900
  • 7a875358a0 Fix the test Angelos Katharopoulos 2025-07-14 00:14:13 -0700
  • c92dd818ea Fix imag() vjp Angelos Katharopoulos 2025-07-13 23:53:09 -0700
  • 07537e6040 decouple python bindings from core libraries Awni Hannun 2025-07-12 14:10:13 -0700
  • c288e548f7 Stabilize Newton-Schulz convergence + tooling Mason James 2025-07-12 20:54:58 -0400
  • ffbeacf974 Implement cubic Newton-Schulz method with fallback Mason James 2025-07-12 20:27:15 -0400
  • 49a304e362 Fix dtype promotion & state dict test logic Mason James 2025-07-12 20:07:38 -0400
  • 4b0bc46832 Fix transpose bug Mason James 2025-07-12 19:54:07 -0400
  • b93564eb5d Add Muon optimizer implementation to MLX Mason James 2025-07-12 18:59:10 -0400
  • 2d3c26c565 [CUDA] Do not put kernels in annoymous namespace (#2362) Cheng 2025-07-13 06:24:45 +0900
  • af05c106c8 update circle Awni Hannun 2025-07-11 08:32:16 -0700
  • a7da07c4f0 Only call get_primitive_string on error Cheng 2025-07-12 07:17:08 +0000
  • 6325f60d52 [CUDA] Bundle CCCL for JIT compilation (#2357) Cheng 2025-07-12 10:45:37 +0900
  • c830b398e0 [CUDA] Do not put kernels in annoymous namespace Cheng 2025-07-12 00:49:22 +0000
  • a9c720e8cd Improve the ring backend initialization ring-init Angelos Katharopoulos 2025-07-11 15:31:28 -0700
  • 42cc9cfbc7 fix copy dispatch (#2360) Awni Hannun 2025-07-11 10:59:35 -0700
  • 38c9085938 update circle Awni Hannun 2025-07-11 08:32:16 -0700
  • 8f93ca9e52 cleanup circle, fix cuda repair Awni Hannun 2025-07-11 08:17:58 -0700
  • dea8324f59 fix copy dispatch Awni Hannun 2025-07-11 06:57:19 -0700
  • c623cc7683 temp for testing Awni Hannun 2025-07-10 19:36:36 -0700
  • 9381163788 install linux with mlx[cuda] and mlx[cpu] Awni Hannun 2025-07-10 17:18:14 -0700
  • 15390f80a3 Remove cexpf Cheng 2025-07-11 04:46:09 +0000
  • c55e0fb083 Ship CCCL for JIT compilation Cheng 2025-07-11 03:41:51 +0000
  • 8347575ba1 [CUDA] Implement Scan kernel (#2347) Cheng 2025-07-11 08:54:12 +0900
  • b6eec20260 Fix edge check in qmm_n QuantizedLoader (#2355) Angelos Katharopoulos 2025-07-10 16:28:50 -0700
  • 1629b3b4e6 Fix edge check in qmm_n QuantizedLoader Angelos Katharopoulos 2025-07-10 15:43:05 -0700
  • e9a2190a04 Use cexpf in Metal Cheng 2025-07-11 07:37:59 +0900
  • 3564913327 Fix failing logaddexp test Cheng 2025-07-10 02:13:24 +0000
  • f797b1b3e5 Enable tests Cheng 2025-07-09 11:15:10 +0000
  • b89d8ef1c0 Strided scan Cheng 2025-07-09 10:42:05 +0000
  • e769fcca60 Contiguous scan Cheng 2025-07-08 23:21:07 +0000
  • 0eb035b4b1 Fix type promotion in Adam with bias correction (#2350) Angelos Katharopoulos 2025-07-10 11:14:42 -0700
  • afb9817599 [CUDA] Put version in ptx cache dir path (#2352) Cheng 2025-07-10 23:24:21 +0900
  • 8fb3e7a26c [CUDA] Set current device before cudaGraphLaunch (#2351) Cheng 2025-07-10 23:24:02 +0900
  • 8c7bc30ce4 Align mlx::core::min op nan propagation with NumPy (#2346) jhavukainen 2025-07-10 06:20:43 -0700
  • bf7236ea42 [CUDA] Put version in ptx cache dir path Cheng 2025-07-10 10:27:45 +0000
  • 26abcff181 [CUDA] Set current device before cudaGraphLaunch Cheng 2025-07-03 08:04:34 +0000
  • 85873cb162 [CUDA] Do vectorized store/load in contiguous elementwise ops (#2342) Cheng 2025-07-10 10:48:43 +0900
  • 067950ce00 Fix type promotion in Adam w bias correction Angelos Katharopoulos 2025-07-09 18:08:36 -0700
  • 61003524ee Align mlx::core::min op nan propagation with NumPy Joona Havukainen 2025-07-09 16:27:10 -0700
  • e3534c2db8 Contig uses uint as index and non-contig uses int Cheng 2025-07-09 23:05:21 +0000
  • e14ee12491 add zero for argsort vjp (#2345) Awni Hannun 2025-07-09 14:37:14 -0700
  • 970e991c81 add zero for argsort vjp Awni Hannun 2025-07-09 11:54:35 -0700
  • 8b9a3f3cea Align mlx::core::max op nan propagation with NumPy (#2339) jhavukainen 2025-07-09 11:26:27 -0700
  • 5c932c7bb0 Use uint as index type Cheng 2025-07-09 01:00:13 +0000
  • 9a742090ae Remove tuple unpacking syntax to comply with earlier python versions. Add cuda skip to nanpropagation tests, fix cuda implementation in a separate PR. Joona Havukainen 2025-07-08 20:52:09 -0700
  • 5c3663d4a7 Fix tests on large arrays Cheng 2025-07-08 01:25:05 +0000
  • e66b685a08 binary => binary_two in binary_two.cu Cheng 2025-07-08 00:56:11 +0000
  • 70ade3015f Use int32_t for IdxT Cheng 2025-07-08 00:43:39 +0000
  • 5459b54bcd Do vectorized store/load in ternary ops Cheng 2025-07-08 00:36:07 +0000
  • 3eb59aab6e Do vectorized store/load in copy ops Cheng 2025-07-08 00:22:12 +0000
  • bbff91f920 Do vectorized store/load in binary_two ops Cheng 2025-07-07 23:53:42 +0000
  • 5962fa66bc Do vectorized store/load in unary ops Cheng 2025-07-07 23:34:26 +0000
  • aca7fac9ef Make the max nanpropagation test more meaningful for integer types Joona Havukainen 2025-07-08 16:42:19 -0700
  • 8b15773206 Add cpu Max nanpropagation. Fix a small fib in cpu max dispatch data types for int8/int16. Joona Havukainen 2025-07-08 16:41:56 -0700
  • fb4e8b896b patch bump (#2343) v0.26.3 Awni Hannun 2025-07-08 14:26:07 -0700
  • 5c17d2134f patch bump Awni Hannun 2025-07-08 13:53:59 -0700
  • 2ca533b279 Fix compilation with CUDA 11 (#2331) Cheng 2025-07-08 12:00:43 +0900
  • 3e885f583a Cleanup using namespace alias Joona Havukainen 2025-07-07 18:25:57 -0700
  • c7af3016eb Only check nans on non-integral types in simd_reduce_impl. Joona Havukainen 2025-07-07 18:24:30 -0700
  • 4a9b29a875 MoE backward improvements (#2335) Angelos Katharopoulos 2025-07-07 17:59:53 -0700
  • 3336a35512 Fix the segments type in the test Angelos Katharopoulos 2025-07-07 17:25:19 -0700
  • bbdc34a0cc Fix compilation with CUDA 11 Cheng 2025-07-04 07:00:22 +0000
  • 1c589298ec Address comments Angelos Katharopoulos 2025-07-07 17:03:28 -0700
  • a4fcc893cd auto build linux release (#2341) Awni Hannun 2025-07-07 09:29:23 -0700
  • 4af09362cc auto build linux release Awni Hannun 2025-07-07 06:55:42 -0700
  • 9d10239af7 [CUDA] Do vectorized store/load in binary ops (#2330) Cheng 2025-07-08 00:44:14 +0900
  • 19facd4b20 Build with all cpu cores by default (#2336) Cheng 2025-07-07 22:06:45 +0900
  • f5299f72cd Fix layernorm race condition (#2340) Angelos Katharopoulos 2025-07-07 06:06:01 -0700
  • d5cd9aa8f4 Fix layernorm race condition Angelos Katharopoulos 2025-07-07 02:45:45 -0700
  • 8ea5729ee4 CI weirdness due to large arrays Angelos Katharopoulos 2025-07-07 00:18:42 -0700
  • b35b81ae94 Build with all cpu cores by default Cheng 2025-07-06 19:51:47 +0900