2024-10-15 23:12:17 +08:00
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "https://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
< html xmlns = "http://www.w3.org/1999/xhtml" lang = "en-US" >
< head >
< meta http-equiv = "Content-Type" content = "text/xhtml;charset=UTF-8" / >
< meta http-equiv = "X-UA-Compatible" content = "IE=11" / >
2025-02-07 04:16:29 +08:00
< meta name = "generator" content = "Doxygen 1.13.2" / >
2024-10-15 23:12:17 +08:00
< meta name = "viewport" content = "width=device-width, initial-scale=1" / >
< title > MLX: mlx/backend/metal/kernels/copy.h Source File< / title >
< link href = "tabs.css" rel = "stylesheet" type = "text/css" / >
< script type = "text/javascript" src = "jquery.js" > < / script >
< script type = "text/javascript" src = "dynsections.js" > < / script >
< script type = "text/javascript" src = "clipboard.js" > < / script >
< link href = "navtree.css" rel = "stylesheet" type = "text/css" / >
2025-01-10 05:56:20 +08:00
< script type = "text/javascript" src = "navtreedata.js" > < / script >
< script type = "text/javascript" src = "navtree.js" > < / script >
2024-10-15 23:12:17 +08:00
< script type = "text/javascript" src = "resize.js" > < / script >
< script type = "text/javascript" src = "cookie.js" > < / script >
< link href = "search/search.css" rel = "stylesheet" type = "text/css" / >
< script type = "text/javascript" src = "search/searchdata.js" > < / script >
< script type = "text/javascript" src = "search/search.js" > < / script >
2025-01-10 05:56:20 +08:00
< script type = "text/javascript" >
/* @license magnet:?xt=urn:btih:d3d9a9a6595521f9666a5e94cc830dab83b65699& dn=expat.txt MIT */
$(function() { init_search(); });
/* @license-end */
< / script >
2024-10-15 23:12:17 +08:00
< link href = "doxygen.css" rel = "stylesheet" type = "text/css" / >
< / head >
< body >
< div id = "top" > <!-- do not remove this div, it is closed by doxygen! -->
< div id = "titlearea" >
< table cellspacing = "0" cellpadding = "0" >
< tbody >
< tr id = "projectrow" >
< td id = "projectalign" >
< div id = "projectname" > MLX
< / div >
< / td >
2025-01-10 05:56:20 +08:00
< td > < div id = "MSearchBox" class = "MSearchBoxInactive" >
< span class = "left" >
< span id = "MSearchSelect" onmouseover = "return searchBox.OnSearchSelectShow()" onmouseout = "return searchBox.OnSearchSelectHide()" >   < / span >
< input type = "text" id = "MSearchField" value = "" placeholder = "Search" accesskey = "S"
onfocus="searchBox.OnSearchFieldFocus(true)"
onblur="searchBox.OnSearchFieldFocus(false)"
onkeyup="searchBox.OnSearchFieldChange(event)"/>
< / span > < span class = "right" >
< a id = "MSearchClose" href = "javascript:searchBox.CloseResultsWindow()" > < img id = "MSearchCloseImg" border = "0" src = "search/close.svg" alt = "" / > < / a >
< / span >
< / div >
< / td >
2024-10-15 23:12:17 +08:00
< / tr >
< / tbody >
< / table >
< / div >
<!-- end header part -->
2025-02-07 04:16:29 +08:00
<!-- Generated by Doxygen 1.13.2 -->
2024-10-15 23:12:17 +08:00
< script type = "text/javascript" >
/* @license magnet:?xt=urn:btih:d3d9a9a6595521f9666a5e94cc830dab83b65699& dn=expat.txt MIT */
var searchBox = new SearchBox("searchBox", "search/",'.html');
/* @license-end */
< / script >
< script type = "text/javascript" >
/* @license magnet:?xt=urn:btih:d3d9a9a6595521f9666a5e94cc830dab83b65699& dn=expat.txt MIT */
$(function() { codefold.init(0); });
/* @license-end */
< / script >
2025-01-10 05:56:20 +08:00
< / div > <!-- top -->
< div id = "side-nav" class = "ui-resizable side-nav-resizable" >
< div id = "nav-tree" >
< div id = "nav-tree-contents" >
< div id = "nav-sync" class = "sync" > < / div >
< / div >
< / div >
< div id = "splitbar" style = "-moz-user-select:none;"
class="ui-resizable-handle">
< / div >
< / div >
2024-10-15 23:12:17 +08:00
< script type = "text/javascript" >
/* @license magnet:?xt=urn:btih:d3d9a9a6595521f9666a5e94cc830dab83b65699& dn=expat.txt MIT */
2025-01-10 05:56:20 +08:00
$(function(){initNavTree('metal_2kernels_2copy_8h_source.html',''); initResizable(true); });
2024-10-15 23:12:17 +08:00
/* @license-end */
< / script >
2025-01-10 05:56:20 +08:00
< div id = "doc-content" >
2024-10-15 23:12:17 +08:00
<!-- window showing the filter options -->
< div id = "MSearchSelectWindow"
onmouseover="return searchBox.OnSearchSelectShow()"
onmouseout="return searchBox.OnSearchSelectHide()"
onkeydown="return searchBox.OnSearchSelectKey(event)">
< / div >
<!-- iframe showing the search results (closed by default) -->
< div id = "MSearchResultsWindow" >
< div id = "MSearchResults" >
< div class = "SRPage" >
< div id = "SRIndex" >
< div id = "SRResults" > < / div >
< div class = "SRStatus" id = "Loading" > Loading...< / div >
< div class = "SRStatus" id = "Searching" > Searching...< / div >
< div class = "SRStatus" id = "NoMatches" > No Matches< / div >
< / div >
< / div >
< / div >
< / div >
< div class = "header" >
< div class = "headertitle" > < div class = "title" > copy.h< / div > < / div >
< / div > <!-- header -->
< div class = "contents" >
< a href = "metal_2kernels_2copy_8h.html" > Go to the documentation of this file.< / a > < div class = "fragment" > < div class = "line" > < a id = "l00001" name = "l00001" > < / a > < span class = "lineno" > 1< / span > < span class = "comment" > // Copyright © 2024 Apple Inc.< / span > < / div >
< div class = "line" > < a id = "l00002" name = "l00002" > < / a > < span class = "lineno" > 2< / span > < / div >
< div class = "line" > < a id = "l00003" name = "l00003" > < / a > < span class = "lineno" > 3< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U> < / div >
< div class = "foldopen" id = "foldopen00004" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00004" name = "l00004" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#aef09f9b9475345b1bba121d037d222ea" > 4< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#aef09f9b9475345b1bba121d037d222ea" > copy_s< / a > (< / div >
< div class = "line" > < a id = "l00005" name = "l00005" > < / a > < span class = "lineno" > 5< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00006" name = "l00006" > < / a > < span class = "lineno" > 6< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00007" name = "l00007" > < / a > < span class = "lineno" > 7< / span > uint index [[thread_position_in_grid]]) {< / div >
< div class = "line" > < a id = "l00008" name = "l00008" > < / a > < span class = "lineno" > 8< / span > dst[index] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[0]);< / div >
< div class = "line" > < a id = "l00009" name = "l00009" > < / a > < span class = "lineno" > 9< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00010" name = "l00010" > < / a > < span class = "lineno" > 10< / span > < / div >
< div class = "line" > < a id = "l00011" name = "l00011" > < / a > < span class = "lineno" > 11< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U> < / div >
< div class = "foldopen" id = "foldopen00012" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00012" name = "l00012" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#ae26a13e0c8e6c15f7b10078e65970659" > 12< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#ae26a13e0c8e6c15f7b10078e65970659" > copy_v< / a > (< / div >
< div class = "line" > < a id = "l00013" name = "l00013" > < / a > < span class = "lineno" > 13< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00014" name = "l00014" > < / a > < span class = "lineno" > 14< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00015" name = "l00015" > < / a > < span class = "lineno" > 15< / span > uint index [[thread_position_in_grid]]) {< / div >
< div class = "line" > < a id = "l00016" name = "l00016" > < / a > < span class = "lineno" > 16< / span > dst[index] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[index]);< / div >
< div class = "line" > < a id = "l00017" name = "l00017" > < / a > < span class = "lineno" > 17< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00018" name = "l00018" > < / a > < span class = "lineno" > 18< / span > < / div >
< div class = "line" > < a id = "l00019" name = "l00019" > < / a > < span class = "lineno" > 19< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U> < / div >
< div class = "foldopen" id = "foldopen00020" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00020" name = "l00020" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a8023e9335cc5334847a8d315042be3a3" > 20< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a8023e9335cc5334847a8d315042be3a3" > copy_s2< / a > (< / div >
< div class = "line" > < a id = "l00021" name = "l00021" > < / a > < span class = "lineno" > 21< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00022" name = "l00022" > < / a > < span class = "lineno" > 22< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00023" name = "l00023" > < / a > < span class = "lineno" > 23< / span > uint2 index [[thread_position_in_grid]],< / div >
< div class = "line" > < a id = "l00024" name = "l00024" > < / a > < span class = "lineno" > 24< / span > uint2 grid_dim [[threads_per_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00025" name = "l00025" > < / a > < span class = "lineno" > 25< / span > < span class = "keyword" > auto< / span > offset = index.x + grid_dim.x * int64_t(index.y);< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00026" name = "l00026" > < / a > < span class = "lineno" > 26< / span > dst[offset] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[0]);< / div >
< div class = "line" > < a id = "l00027" name = "l00027" > < / a > < span class = "lineno" > 27< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00028" name = "l00028" > < / a > < span class = "lineno" > 28< / span > < / div >
< div class = "line" > < a id = "l00029" name = "l00029" > < / a > < span class = "lineno" > 29< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U> < / div >
< div class = "foldopen" id = "foldopen00030" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00030" name = "l00030" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#aee14a5326f53d9b30b0b38e27d180ef3" > 30< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#aee14a5326f53d9b30b0b38e27d180ef3" > copy_v2< / a > (< / div >
< div class = "line" > < a id = "l00031" name = "l00031" > < / a > < span class = "lineno" > 31< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00032" name = "l00032" > < / a > < span class = "lineno" > 32< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00033" name = "l00033" > < / a > < span class = "lineno" > 33< / span > uint2 index [[thread_position_in_grid]],< / div >
< div class = "line" > < a id = "l00034" name = "l00034" > < / a > < span class = "lineno" > 34< / span > uint2 grid_dim [[threads_per_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00035" name = "l00035" > < / a > < span class = "lineno" > 35< / span > < span class = "keyword" > auto< / span > offset = index.x + grid_dim.x * int64_t(index.y);< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00036" name = "l00036" > < / a > < span class = "lineno" > 36< / span > dst[offset] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[offset]);< / div >
< div class = "line" > < a id = "l00037" name = "l00037" > < / a > < span class = "lineno" > 37< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00038" name = "l00038" > < / a > < span class = "lineno" > 38< / span > < / div >
2024-12-07 05:22:39 +08:00
< div class = "line" > < a id = "l00039" name = "l00039" > < / a > < span class = "lineno" > 39< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
2024-10-15 23:12:17 +08:00
< div class = "foldopen" id = "foldopen00040" data-start = "{" data-end = "}" >
2024-12-07 05:22:39 +08:00
< div class = "line" > < a id = "l00040" name = "l00040" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a232c5c6b8386cf8ecbf4cdadb6e4176e" > 40< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a232c5c6b8386cf8ecbf4cdadb6e4176e" > copy_g_nd1< / a > (< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00041" name = "l00041" > < / a > < span class = "lineno" > 41< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00042" name = "l00042" > < / a > < span class = "lineno" > 42< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00043" name = "l00043" > < / a > < span class = "lineno" > 43< / span > constant < span class = "keyword" > const< / span > int64_t& src_stride [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00044" name = "l00044" > < / a > < span class = "lineno" > 44< / span > uint index [[thread_position_in_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00045" name = "l00045" > < / a > < span class = "lineno" > 45< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a6787efcdf7a898d5bafb48f2a2f1e555" > elem_to_loc_1< IdxT> < / a > (index, src_stride);< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00046" name = "l00046" > < / a > < span class = "lineno" > 46< / span > dst[index] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00047" name = "l00047" > < / a > < span class = "lineno" > 47< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00048" name = "l00048" > < / a > < span class = "lineno" > 48< / span > < / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00049" name = "l00049" > < / a > < span class = "lineno" > 49< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
2024-10-15 23:12:17 +08:00
< div class = "foldopen" id = "foldopen00050" data-start = "{" data-end = "}" >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00050" name = "l00050" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a39ec5b7b8351e4332b842982a2ee6260" > 50< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a39ec5b7b8351e4332b842982a2ee6260" > copy_g_nd2< / a > (< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00051" name = "l00051" > < / a > < span class = "lineno" > 51< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00052" name = "l00052" > < / a > < span class = "lineno" > 52< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00053" name = "l00053" > < / a > < span class = "lineno" > 53< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00054" name = "l00054" > < / a > < span class = "lineno" > 54< / span > uint2 index [[thread_position_in_grid]],< / div >
< div class = "line" > < a id = "l00055" name = "l00055" > < / a > < span class = "lineno" > 55< / span > uint2 grid_dim [[threads_per_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00056" name = "l00056" > < / a > < span class = "lineno" > 56< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#aac0e227f82198021246aa91d8c427b3e" > elem_to_loc_2< IdxT> < / a > (index, src_strides);< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00057" name = "l00057" > < / a > < span class = "lineno" > 57< / span > IdxT dst_idx = index.x + IdxT(grid_dim.x) * index.y;< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00058" name = "l00058" > < / a > < span class = "lineno" > 58< / span > dst[dst_idx] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00059" name = "l00059" > < / a > < span class = "lineno" > 59< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00060" name = "l00060" > < / a > < span class = "lineno" > 60< / span > < / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00061" name = "l00061" > < / a > < span class = "lineno" > 61< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
2024-10-15 23:12:17 +08:00
< div class = "foldopen" id = "foldopen00062" data-start = "{" data-end = "}" >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00062" name = "l00062" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#aab82689380897ff4716b5eafd6ef3ecc" > 62< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#aab82689380897ff4716b5eafd6ef3ecc" > copy_g_nd3< / a > (< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00063" name = "l00063" > < / a > < span class = "lineno" > 63< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00064" name = "l00064" > < / a > < span class = "lineno" > 64< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00065" name = "l00065" > < / a > < span class = "lineno" > 65< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00066" name = "l00066" > < / a > < span class = "lineno" > 66< / span > uint3 index [[thread_position_in_grid]],< / div >
< div class = "line" > < a id = "l00067" name = "l00067" > < / a > < span class = "lineno" > 67< / span > uint3 grid_dim [[threads_per_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00068" name = "l00068" > < / a > < span class = "lineno" > 68< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#ac8f4258ba306870b0280079f1c5eb23e" > elem_to_loc_3< IdxT> < / a > (index, src_strides);< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00069" name = "l00069" > < / a > < span class = "lineno" > 69< / span > IdxT dst_idx =< / div >
< div class = "line" > < a id = "l00070" name = "l00070" > < / a > < span class = "lineno" > 70< / span > index.x + IdxT(grid_dim.x) * (index.y + IdxT(grid_dim.y) * index.z);< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00071" name = "l00071" > < / a > < span class = "lineno" > 71< / span > dst[dst_idx] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00072" name = "l00072" > < / a > < span class = "lineno" > 72< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00073" name = "l00073" > < / a > < span class = "lineno" > 73< / span > < / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00074" name = "l00074" > < / a > < span class = "lineno" > 74< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keywordtype" > int< / span > N = 1, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
2024-10-15 23:12:17 +08:00
< div class = "foldopen" id = "foldopen00075" data-start = "{" data-end = "}" >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00075" name = "l00075" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a71e4103db4689d90ef6f9d5ba93604cf" > 75< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a71e4103db4689d90ef6f9d5ba93604cf" > copy_g< / a > (< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00076" name = "l00076" > < / a > < span class = "lineno" > 76< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00077" name = "l00077" > < / a > < span class = "lineno" > 77< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00078" name = "l00078" > < / a > < span class = "lineno" > 78< / span > constant < span class = "keyword" > const< / span > < span class = "keywordtype" > int< / span > * src_shape [[buffer(2)]],< / div >
< div class = "line" > < a id = "l00079" name = "l00079" > < / a > < span class = "lineno" > 79< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00080" name = "l00080" > < / a > < span class = "lineno" > 80< / span > constant < span class = "keyword" > const< / span > < span class = "keywordtype" > int< / span > & ndim [[buffer(5)]],< / div >
< div class = "line" > < a id = "l00081" name = "l00081" > < / a > < span class = "lineno" > 81< / span > uint3 index [[thread_position_in_grid]],< / div >
< div class = "line" > < a id = "l00082" name = "l00082" > < / a > < span class = "lineno" > 82< / span > uint3 grid_dim [[threads_per_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00083" name = "l00083" > < / a > < span class = "lineno" > 83< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a497dd9f1a00c8a4303d8782158a0812a" > elem_to_loc< IdxT> < / a > (< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00084" name = "l00084" > < / a > < span class = "lineno" > 84< / span > {N * index.x, index.y, index.z}, src_shape, src_strides, ndim);< / div >
< div class = "line" > < a id = "l00085" name = "l00085" > < / a > < span class = "lineno" > 85< / span > < span class = "keywordflow" > if< / span > (N == 1) {< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00086" name = "l00086" > < / a > < span class = "lineno" > 86< / span > IdxT dst_idx =< / div >
< div class = "line" > < a id = "l00087" name = "l00087" > < / a > < span class = "lineno" > 87< / span > index.x + grid_dim.x * (index.y + IdxT(grid_dim.y) * index.z);< / div >
2024-10-15 23:12:17 +08:00
< div class = "line" > < a id = "l00088" name = "l00088" > < / a > < span class = "lineno" > 88< / span > dst[dst_idx] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00089" name = "l00089" > < / a > < span class = "lineno" > 89< / span > < span class = "keywordflow" > return< / span > ;< / div >
< div class = "line" > < a id = "l00090" name = "l00090" > < / a > < span class = "lineno" > 90< / span > }< / div >
< div class = "line" > < a id = "l00091" name = "l00091" > < / a > < span class = "lineno" > 91< / span > < span class = "keyword" > auto< / span > xshape = src_shape[ndim - 1];< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00092" name = "l00092" > < / a > < span class = "lineno" > 92< / span > IdxT dst_idx = N * index.x + xshape * (index.y + IdxT(grid_dim.y) * index.z);< / div >
< div class = "line" > < a id = "l00093" name = "l00093" > < / a > < span class = "lineno" > 93< / span > < span class = "keyword" > auto< / span > src_xstride = src_strides[ndim - 1];< / div >
< div class = "line" > < a id = "l00094" name = "l00094" > < / a > < span class = "lineno" > 94< / span > < span class = "keywordflow" > for< / span > (< span class = "keywordtype" > int< / span > i = 0; i < N & & (int(N * index.x) + i) < xshape; ++i) {< / div >
< div class = "line" > < a id = "l00095" name = "l00095" > < / a > < span class = "lineno" > 95< / span > dst[dst_idx + i] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00096" name = "l00096" > < / a > < span class = "lineno" > 96< / span > src_idx += src_xstride;< / div >
< div class = "line" > < a id = "l00097" name = "l00097" > < / a > < span class = "lineno" > 97< / span > }< / div >
< div class = "line" > < a id = "l00098" name = "l00098" > < / a > < span class = "lineno" > 98< / span > }< / div >
2024-10-15 23:12:17 +08:00
< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00099" name = "l00099" > < / a > < span class = "lineno" > 99< / span > < / div >
2024-12-07 05:22:39 +08:00
< div class = "line" > < a id = "l00100" name = "l00100" > < / a > < span class = "lineno" > 100< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
2024-11-23 04:24:16 +08:00
< div class = "foldopen" id = "foldopen00101" data-start = "{" data-end = "}" >
2024-12-07 05:22:39 +08:00
< div class = "line" > < a id = "l00101" name = "l00101" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a370d7bbba1a4b0d64da873bafd29a78b" > 101< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a370d7bbba1a4b0d64da873bafd29a78b" > copy_gg_nd1< / a > (< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00102" name = "l00102" > < / a > < span class = "lineno" > 102< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00103" name = "l00103" > < / a > < span class = "lineno" > 103< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00104" name = "l00104" > < / a > < span class = "lineno" > 104< / span > constant < span class = "keyword" > const< / span > int64_t& src_stride [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00105" name = "l00105" > < / a > < span class = "lineno" > 105< / span > constant < span class = "keyword" > const< / span > int64_t& dst_stride [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00106" name = "l00106" > < / a > < span class = "lineno" > 106< / span > uint index [[thread_position_in_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00107" name = "l00107" > < / a > < span class = "lineno" > 107< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a6787efcdf7a898d5bafb48f2a2f1e555" > elem_to_loc_1< IdxT> < / a > (index, src_stride);< / div >
< div class = "line" > < a id = "l00108" name = "l00108" > < / a > < span class = "lineno" > 108< / span > < span class = "keyword" > auto< / span > dst_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a6787efcdf7a898d5bafb48f2a2f1e555" > elem_to_loc_1< IdxT> < / a > (index, dst_stride);< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00109" name = "l00109" > < / a > < span class = "lineno" > 109< / span > dst[dst_idx] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00110" name = "l00110" > < / a > < span class = "lineno" > 110< / span > }< / div >
2024-10-15 23:12:17 +08:00
< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00111" name = "l00111" > < / a > < span class = "lineno" > 111< / span > < / div >
< div class = "line" > < a id = "l00112" name = "l00112" > < / a > < span class = "lineno" > 112< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00113" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00113" name = "l00113" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#af0b06ac3a96852a64fa4274a94b58301" > 113< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#af0b06ac3a96852a64fa4274a94b58301" > copy_gg_nd2< / a > (< / div >
< div class = "line" > < a id = "l00114" name = "l00114" > < / a > < span class = "lineno" > 114< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00115" name = "l00115" > < / a > < span class = "lineno" > 115< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00116" name = "l00116" > < / a > < span class = "lineno" > 116< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00117" name = "l00117" > < / a > < span class = "lineno" > 117< / span > constant < span class = "keyword" > const< / span > int64_t* dst_strides [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00118" name = "l00118" > < / a > < span class = "lineno" > 118< / span > uint2 index [[thread_position_in_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00119" name = "l00119" > < / a > < span class = "lineno" > 119< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#aac0e227f82198021246aa91d8c427b3e" > elem_to_loc_2< IdxT> < / a > (index, src_strides);< / div >
< div class = "line" > < a id = "l00120" name = "l00120" > < / a > < span class = "lineno" > 120< / span > < span class = "keyword" > auto< / span > dst_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#aac0e227f82198021246aa91d8c427b3e" > elem_to_loc_2< IdxT> < / a > (index, dst_strides);< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00121" name = "l00121" > < / a > < span class = "lineno" > 121< / span > dst[dst_idx] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00122" name = "l00122" > < / a > < span class = "lineno" > 122< / span > }< / div >
2024-10-15 23:12:17 +08:00
< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00123" name = "l00123" > < / a > < span class = "lineno" > 123< / span > < / div >
< div class = "line" > < a id = "l00124" name = "l00124" > < / a > < span class = "lineno" > 124< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00125" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00125" name = "l00125" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a3f3836ad0b6545ec9b9e1864224f7a13" > 125< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a3f3836ad0b6545ec9b9e1864224f7a13" > copy_gg_nd3< / a > (< / div >
< div class = "line" > < a id = "l00126" name = "l00126" > < / a > < span class = "lineno" > 126< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00127" name = "l00127" > < / a > < span class = "lineno" > 127< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00128" name = "l00128" > < / a > < span class = "lineno" > 128< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00129" name = "l00129" > < / a > < span class = "lineno" > 129< / span > constant < span class = "keyword" > const< / span > int64_t* dst_strides [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00130" name = "l00130" > < / a > < span class = "lineno" > 130< / span > uint3 index [[thread_position_in_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00131" name = "l00131" > < / a > < span class = "lineno" > 131< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#ac8f4258ba306870b0280079f1c5eb23e" > elem_to_loc_3< IdxT> < / a > (index, src_strides);< / div >
< div class = "line" > < a id = "l00132" name = "l00132" > < / a > < span class = "lineno" > 132< / span > < span class = "keyword" > auto< / span > dst_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#ac8f4258ba306870b0280079f1c5eb23e" > elem_to_loc_3< IdxT> < / a > (index, dst_strides);< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00133" name = "l00133" > < / a > < span class = "lineno" > 133< / span > dst[dst_idx] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[src_idx]);< / div >
< div class = "line" > < a id = "l00134" name = "l00134" > < / a > < span class = "lineno" > 134< / span > }< / div >
2024-10-15 23:12:17 +08:00
< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00135" name = "l00135" > < / a > < span class = "lineno" > 135< / span > < / div >
< div class = "line" > < a id = "l00136" name = "l00136" > < / a > < span class = "lineno" > 136< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keywordtype" > int< / span > N = 1, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00137" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00137" name = "l00137" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#ade9a9eea9b8262a854a11721fe2bb9fa" > 137< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#ade9a9eea9b8262a854a11721fe2bb9fa" > copy_gg< / a > (< / div >
< div class = "line" > < a id = "l00138" name = "l00138" > < / a > < span class = "lineno" > 138< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00139" name = "l00139" > < / a > < span class = "lineno" > 139< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00140" name = "l00140" > < / a > < span class = "lineno" > 140< / span > constant < span class = "keyword" > const< / span > < span class = "keywordtype" > int< / span > * src_shape [[buffer(2)]],< / div >
< div class = "line" > < a id = "l00141" name = "l00141" > < / a > < span class = "lineno" > 141< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00142" name = "l00142" > < / a > < span class = "lineno" > 142< / span > constant < span class = "keyword" > const< / span > int64_t* dst_strides [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00143" name = "l00143" > < / a > < span class = "lineno" > 143< / span > constant < span class = "keyword" > const< / span > < span class = "keywordtype" > int< / span > & ndim [[buffer(5)]],< / div >
< div class = "line" > < a id = "l00144" name = "l00144" > < / a > < span class = "lineno" > 144< / span > uint3 index [[thread_position_in_grid]]) {< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00145" name = "l00145" > < / a > < span class = "lineno" > 145< / span > < span class = "keyword" > auto< / span > idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a97ea664406a270b34ff5a23815716730" > elem_to_loc_2_nd< IdxT> < / a > (< / div >
2024-11-23 04:24:16 +08:00
< div class = "line" > < a id = "l00146" name = "l00146" > < / a > < span class = "lineno" > 146< / span > {N * index.x, index.y, index.z},< / div >
< div class = "line" > < a id = "l00147" name = "l00147" > < / a > < span class = "lineno" > 147< / span > src_shape,< / div >
< div class = "line" > < a id = "l00148" name = "l00148" > < / a > < span class = "lineno" > 148< / span > src_strides,< / div >
< div class = "line" > < a id = "l00149" name = "l00149" > < / a > < span class = "lineno" > 149< / span > dst_strides,< / div >
< div class = "line" > < a id = "l00150" name = "l00150" > < / a > < span class = "lineno" > 150< / span > ndim);< / div >
< div class = "line" > < a id = "l00151" name = "l00151" > < / a > < span class = "lineno" > 151< / span > < span class = "keywordflow" > if< / span > (N == 1) {< / div >
< div class = "line" > < a id = "l00152" name = "l00152" > < / a > < span class = "lineno" > 152< / span > dst[idx.y] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[idx.x]);< / div >
< div class = "line" > < a id = "l00153" name = "l00153" > < / a > < span class = "lineno" > 153< / span > < span class = "keywordflow" > return< / span > ;< / div >
< div class = "line" > < a id = "l00154" name = "l00154" > < / a > < span class = "lineno" > 154< / span > }< / div >
< div class = "line" > < a id = "l00155" name = "l00155" > < / a > < span class = "lineno" > 155< / span > IdxT src_xstride = src_strides[ndim - 1];< / div >
< div class = "line" > < a id = "l00156" name = "l00156" > < / a > < span class = "lineno" > 156< / span > IdxT dst_xstride = dst_strides[ndim - 1];< / div >
< div class = "line" > < a id = "l00157" name = "l00157" > < / a > < span class = "lineno" > 157< / span > < span class = "keyword" > auto< / span > xshape = src_shape[ndim - 1];< / div >
< div class = "line" > < a id = "l00158" name = "l00158" > < / a > < span class = "lineno" > 158< / span > < span class = "keywordflow" > for< / span > (< span class = "keywordtype" > int< / span > i = 0; i < N & & (int(N * index.x) + i) < xshape; ++i) {< / div >
< div class = "line" > < a id = "l00159" name = "l00159" > < / a > < span class = "lineno" > 159< / span > dst[idx.y] = < span class = "keyword" > static_cast< < / span > U< span class = "keyword" > > < / span > (src[idx.x]);< / div >
< div class = "line" > < a id = "l00160" name = "l00160" > < / a > < span class = "lineno" > 160< / span > idx.x += src_xstride;< / div >
< div class = "line" > < a id = "l00161" name = "l00161" > < / a > < span class = "lineno" > 161< / span > idx.y += dst_xstride;< / div >
< div class = "line" > < a id = "l00162" name = "l00162" > < / a > < span class = "lineno" > 162< / span > }< / div >
< div class = "line" > < a id = "l00163" name = "l00163" > < / a > < span class = "lineno" > 163< / span > }< / div >
2024-10-15 23:12:17 +08:00
< / div >
2025-01-10 05:56:20 +08:00
< div class = "line" > < a id = "l00164" name = "l00164" > < / a > < span class = "lineno" > 164< / span > < / div >
< div class = "line" > < a id = "l00165" name = "l00165" > < / a > < span class = "lineno" > 165< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00166" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00166" name = "l00166" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a8548ea41cac179084ddd33d26921576f" > 166< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a8548ea41cac179084ddd33d26921576f" > copy_gg_dynamic_nd1< / a > (< / div >
< div class = "line" > < a id = "l00167" name = "l00167" > < / a > < span class = "lineno" > 167< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00168" name = "l00168" > < / a > < span class = "lineno" > 168< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00169" name = "l00169" > < / a > < span class = "lineno" > 169< / span > constant < span class = "keyword" > const< / span > int64_t& src_stride [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00170" name = "l00170" > < / a > < span class = "lineno" > 170< / span > constant < span class = "keyword" > const< / span > int64_t& dst_stride [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00171" name = "l00171" > < / a > < span class = "lineno" > 171< / span > constant < span class = "keyword" > const< / span > int64_t& src_offset [[buffer(6)]],< / div >
< div class = "line" > < a id = "l00172" name = "l00172" > < / a > < span class = "lineno" > 172< / span > constant < span class = "keyword" > const< / span > int64_t& dst_offset [[buffer(7)]],< / div >
< div class = "line" > < a id = "l00173" name = "l00173" > < / a > < span class = "lineno" > 173< / span > uint index [[thread_position_in_grid]]) {< / div >
< div class = "line" > < a id = "l00174" name = "l00174" > < / a > < span class = "lineno" > 174< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a6787efcdf7a898d5bafb48f2a2f1e555" > elem_to_loc_1< IdxT> < / a > (index, src_stride);< / div >
< div class = "line" > < a id = "l00175" name = "l00175" > < / a > < span class = "lineno" > 175< / span > < span class = "keyword" > auto< / span > dst_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a6787efcdf7a898d5bafb48f2a2f1e555" > elem_to_loc_1< IdxT> < / a > (index, dst_stride);< / div >
< div class = "line" > < a id = "l00176" name = "l00176" > < / a > < span class = "lineno" > 176< / span > dst[dst_idx + dst_offset] = src[src_idx + src_offset];< / div >
< div class = "line" > < a id = "l00177" name = "l00177" > < / a > < span class = "lineno" > 177< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00178" name = "l00178" > < / a > < span class = "lineno" > 178< / span > < / div >
< div class = "line" > < a id = "l00179" name = "l00179" > < / a > < span class = "lineno" > 179< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00180" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00180" name = "l00180" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#a9b9266ee25a4dbcbe4fde883b40170f1" > 180< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#a9b9266ee25a4dbcbe4fde883b40170f1" > copy_gg_dynamic_nd2< / a > (< / div >
< div class = "line" > < a id = "l00181" name = "l00181" > < / a > < span class = "lineno" > 181< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00182" name = "l00182" > < / a > < span class = "lineno" > 182< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00183" name = "l00183" > < / a > < span class = "lineno" > 183< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00184" name = "l00184" > < / a > < span class = "lineno" > 184< / span > constant < span class = "keyword" > const< / span > int64_t* dst_strides [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00185" name = "l00185" > < / a > < span class = "lineno" > 185< / span > constant < span class = "keyword" > const< / span > int64_t& src_offset [[buffer(6)]],< / div >
< div class = "line" > < a id = "l00186" name = "l00186" > < / a > < span class = "lineno" > 186< / span > constant < span class = "keyword" > const< / span > int64_t& dst_offset [[buffer(7)]],< / div >
< div class = "line" > < a id = "l00187" name = "l00187" > < / a > < span class = "lineno" > 187< / span > uint2 index [[thread_position_in_grid]]) {< / div >
< div class = "line" > < a id = "l00188" name = "l00188" > < / a > < span class = "lineno" > 188< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#aac0e227f82198021246aa91d8c427b3e" > elem_to_loc_2< IdxT> < / a > (index, src_strides);< / div >
< div class = "line" > < a id = "l00189" name = "l00189" > < / a > < span class = "lineno" > 189< / span > < span class = "keyword" > auto< / span > dst_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#aac0e227f82198021246aa91d8c427b3e" > elem_to_loc_2< IdxT> < / a > (index, dst_strides);< / div >
< div class = "line" > < a id = "l00190" name = "l00190" > < / a > < span class = "lineno" > 190< / span > dst[dst_idx + dst_offset] = src[src_idx + src_offset];< / div >
< div class = "line" > < a id = "l00191" name = "l00191" > < / a > < span class = "lineno" > 191< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00192" name = "l00192" > < / a > < span class = "lineno" > 192< / span > < / div >
< div class = "line" > < a id = "l00193" name = "l00193" > < / a > < span class = "lineno" > 193< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00194" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00194" name = "l00194" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#af33ccc02f10bcb5c19ea7b1dd0af4956" > 194< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#af33ccc02f10bcb5c19ea7b1dd0af4956" > copy_gg_dynamic_nd3< / a > (< / div >
< div class = "line" > < a id = "l00195" name = "l00195" > < / a > < span class = "lineno" > 195< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00196" name = "l00196" > < / a > < span class = "lineno" > 196< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00197" name = "l00197" > < / a > < span class = "lineno" > 197< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00198" name = "l00198" > < / a > < span class = "lineno" > 198< / span > constant < span class = "keyword" > const< / span > int64_t* dst_strides [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00199" name = "l00199" > < / a > < span class = "lineno" > 199< / span > constant < span class = "keyword" > const< / span > int64_t& src_offset [[buffer(6)]],< / div >
< div class = "line" > < a id = "l00200" name = "l00200" > < / a > < span class = "lineno" > 200< / span > constant < span class = "keyword" > const< / span > int64_t& dst_offset [[buffer(7)]],< / div >
< div class = "line" > < a id = "l00201" name = "l00201" > < / a > < span class = "lineno" > 201< / span > uint3 index [[thread_position_in_grid]]) {< / div >
< div class = "line" > < a id = "l00202" name = "l00202" > < / a > < span class = "lineno" > 202< / span > < span class = "keyword" > auto< / span > src_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#ac8f4258ba306870b0280079f1c5eb23e" > elem_to_loc_3< IdxT> < / a > (index, src_strides);< / div >
< div class = "line" > < a id = "l00203" name = "l00203" > < / a > < span class = "lineno" > 203< / span > < span class = "keyword" > auto< / span > dst_idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#ac8f4258ba306870b0280079f1c5eb23e" > elem_to_loc_3< IdxT> < / a > (index, dst_strides);< / div >
< div class = "line" > < a id = "l00204" name = "l00204" > < / a > < span class = "lineno" > 204< / span > dst[dst_idx + dst_offset] = src[src_idx + src_offset];< / div >
< div class = "line" > < a id = "l00205" name = "l00205" > < / a > < span class = "lineno" > 205< / span > }< / div >
< / div >
< div class = "line" > < a id = "l00206" name = "l00206" > < / a > < span class = "lineno" > 206< / span > < / div >
< div class = "line" > < a id = "l00207" name = "l00207" > < / a > < span class = "lineno" > 207< / span > < span class = "keyword" > template< / span > < < span class = "keyword" > typename< / span > T, < span class = "keyword" > typename< / span > U, < span class = "keywordtype" > int< / span > N = 1, < span class = "keyword" > typename< / span > IdxT = < span class = "keywordtype" > int< / span > 64_t> < / div >
< div class = "foldopen" id = "foldopen00208" data-start = "{" data-end = "}" >
< div class = "line" > < a id = "l00208" name = "l00208" > < / a > < span class = "lineno" > < a class = "line" href = "metal_2kernels_2copy_8h.html#ad0f05a73165d4ee38c9f02c705ea6ca8" > 208< / a > < / span > [[kernel]] < span class = "keywordtype" > void< / span > < a class = "code hl_function" href = "metal_2kernels_2copy_8h.html#ad0f05a73165d4ee38c9f02c705ea6ca8" > copy_gg_dynamic< / a > (< / div >
< div class = "line" > < a id = "l00209" name = "l00209" > < / a > < span class = "lineno" > 209< / span > device < span class = "keyword" > const< / span > T* src [[buffer(0)]],< / div >
< div class = "line" > < a id = "l00210" name = "l00210" > < / a > < span class = "lineno" > 210< / span > device U* dst [[buffer(1)]],< / div >
< div class = "line" > < a id = "l00211" name = "l00211" > < / a > < span class = "lineno" > 211< / span > constant < span class = "keyword" > const< / span > < span class = "keywordtype" > int< / span > * src_shape [[buffer(2)]],< / div >
< div class = "line" > < a id = "l00212" name = "l00212" > < / a > < span class = "lineno" > 212< / span > constant < span class = "keyword" > const< / span > int64_t* src_strides [[buffer(3)]],< / div >
< div class = "line" > < a id = "l00213" name = "l00213" > < / a > < span class = "lineno" > 213< / span > constant < span class = "keyword" > const< / span > int64_t* dst_strides [[buffer(4)]],< / div >
< div class = "line" > < a id = "l00214" name = "l00214" > < / a > < span class = "lineno" > 214< / span > constant < span class = "keyword" > const< / span > < span class = "keywordtype" > int< / span > & ndim [[buffer(5)]],< / div >
< div class = "line" > < a id = "l00215" name = "l00215" > < / a > < span class = "lineno" > 215< / span > constant < span class = "keyword" > const< / span > int64_t& src_offset [[buffer(6)]],< / div >
< div class = "line" > < a id = "l00216" name = "l00216" > < / a > < span class = "lineno" > 216< / span > constant < span class = "keyword" > const< / span > int64_t& dst_offset [[buffer(7)]],< / div >
< div class = "line" > < a id = "l00217" name = "l00217" > < / a > < span class = "lineno" > 217< / span > uint3 index [[thread_position_in_grid]]) {< / div >
< div class = "line" > < a id = "l00218" name = "l00218" > < / a > < span class = "lineno" > 218< / span > src += src_offset;< / div >
< div class = "line" > < a id = "l00219" name = "l00219" > < / a > < span class = "lineno" > 219< / span > dst += dst_offset;< / div >
< div class = "line" > < a id = "l00220" name = "l00220" > < / a > < span class = "lineno" > 220< / span > < span class = "keyword" > auto< / span > idx = < a class = "code hl_function" href = "backend_2metal_2kernels_2utils_8h.html#a97ea664406a270b34ff5a23815716730" > elem_to_loc_2_nd< IdxT> < / a > (< / div >
< div class = "line" > < a id = "l00221" name = "l00221" > < / a > < span class = "lineno" > 221< / span > {N * index.x, index.y, index.z},< / div >
< div class = "line" > < a id = "l00222" name = "l00222" > < / a > < span class = "lineno" > 222< / span > src_shape,< / div >
< div class = "line" > < a id = "l00223" name = "l00223" > < / a > < span class = "lineno" > 223< / span > src_strides,< / div >
< div class = "line" > < a id = "l00224" name = "l00224" > < / a > < span class = "lineno" > 224< / span > dst_strides,< / div >
< div class = "line" > < a id = "l00225" name = "l00225" > < / a > < span class = "lineno" > 225< / span > ndim);< / div >
< div class = "line" > < a id = "l00226" name = "l00226" > < / a > < span class = "lineno" > 226< / span > < span class = "keywordflow" > if< / span > (N == 1) {< / div >
< div class = "line" > < a id = "l00227" name = "l00227" > < / a > < span class = "lineno" > 227< / span > dst[idx.y] = src[idx.x];< / div >
< div class = "line" > < a id = "l00228" name = "l00228" > < / a > < span class = "lineno" > 228< / span > < span class = "keywordflow" > return< / span > ;< / div >
< div class = "line" > < a id = "l00229" name = "l00229" > < / a > < span class = "lineno" > 229< / span > }< / div >
< div class = "line" > < a id = "l00230" name = "l00230" > < / a > < span class = "lineno" > 230< / span > IdxT src_xstride = src_strides[ndim - 1];< / div >
< div class = "line" > < a id = "l00231" name = "l00231" > < / a > < span class = "lineno" > 231< / span > IdxT dst_xstride = dst_strides[ndim - 1];< / div >
< div class = "line" > < a id = "l00232" name = "l00232" > < / a > < span class = "lineno" > 232< / span > < span class = "keyword" > auto< / span > xshape = src_shape[ndim - 1];< / div >
< div class = "line" > < a id = "l00233" name = "l00233" > < / a > < span class = "lineno" > 233< / span > < span class = "keywordflow" > for< / span > (< span class = "keywordtype" > int< / span > i = 0; i < N & & (int(N * index.x) + i) < xshape; ++i) {< / div >
< div class = "line" > < a id = "l00234" name = "l00234" > < / a > < span class = "lineno" > 234< / span > dst[idx.y] = src[idx.x];< / div >
< div class = "line" > < a id = "l00235" name = "l00235" > < / a > < span class = "lineno" > 235< / span > idx.x += src_xstride;< / div >
< div class = "line" > < a id = "l00236" name = "l00236" > < / a > < span class = "lineno" > 236< / span > idx.y += dst_xstride;< / div >
< div class = "line" > < a id = "l00237" name = "l00237" > < / a > < span class = "lineno" > 237< / span > }< / div >
< div class = "line" > < a id = "l00238" name = "l00238" > < / a > < span class = "lineno" > 238< / span > }< / div >
< / div >
< div class = "ttc" id = "abackend_2metal_2kernels_2utils_8h_html_a497dd9f1a00c8a4303d8782158a0812a" > < div class = "ttname" > < a href = "backend_2metal_2kernels_2utils_8h.html#a497dd9f1a00c8a4303d8782158a0812a" > elem_to_loc< / a > < / div > < div class = "ttdeci" > METAL_FUNC IdxT elem_to_loc(IdxT elem, constant const int *shape, constant const int64_t *strides, int ndim)< / div > < div class = "ttdef" > < b > Definition< / b > utils.h:93< / div > < / div >
< div class = "ttc" id = "abackend_2metal_2kernels_2utils_8h_html_a6787efcdf7a898d5bafb48f2a2f1e555" > < div class = "ttname" > < a href = "backend_2metal_2kernels_2utils_8h.html#a6787efcdf7a898d5bafb48f2a2f1e555" > elem_to_loc_1< / a > < / div > < div class = "ttdeci" > METAL_FUNC IdxT elem_to_loc_1(uint elem, constant const int64_t & stride)< / div > < div class = "ttdef" > < b > Definition< / b > utils.h:126< / div > < / div >
< div class = "ttc" id = "abackend_2metal_2kernels_2utils_8h_html_a97ea664406a270b34ff5a23815716730" > < div class = "ttname" > < a href = "backend_2metal_2kernels_2utils_8h.html#a97ea664406a270b34ff5a23815716730" > elem_to_loc_2_nd< / a > < / div > < div class = "ttdeci" > METAL_FUNC vec< IdxT, 2 > elem_to_loc_2_nd(uint3 elem, constant const int *shape, constant const int64_t *a_strides, constant const int64_t *b_strides, int ndim)< / div > < div class = "ttdef" > < b > Definition< / b > utils.h:145< / div > < / div >
< div class = "ttc" id = "abackend_2metal_2kernels_2utils_8h_html_aac0e227f82198021246aa91d8c427b3e" > < div class = "ttname" > < a href = "backend_2metal_2kernels_2utils_8h.html#aac0e227f82198021246aa91d8c427b3e" > elem_to_loc_2< / a > < / div > < div class = "ttdeci" > METAL_FUNC IdxT elem_to_loc_2(uint2 elem, constant const int64_t strides[2])< / div > < div class = "ttdef" > < b > Definition< / b > utils.h:131< / div > < / div >
< div class = "ttc" id = "abackend_2metal_2kernels_2utils_8h_html_ac8f4258ba306870b0280079f1c5eb23e" > < div class = "ttname" > < a href = "backend_2metal_2kernels_2utils_8h.html#ac8f4258ba306870b0280079f1c5eb23e" > elem_to_loc_3< / a > < / div > < div class = "ttdeci" > METAL_FUNC IdxT elem_to_loc_3(uint3 elem, constant const int64_t strides[3])< / div > < div class = "ttdef" > < b > Definition< / b > utils.h:136< / div > < / div >
2024-12-07 05:22:39 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a232c5c6b8386cf8ecbf4cdadb6e4176e" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a232c5c6b8386cf8ecbf4cdadb6e4176e" > copy_g_nd1< / a > < / div > < div class = "ttdeci" > void copy_g_nd1(device const T *src, device U *dst, constant const int64_t & src_stride, uint index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:40< / div > < / div >
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a370d7bbba1a4b0d64da873bafd29a78b" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a370d7bbba1a4b0d64da873bafd29a78b" > copy_gg_nd1< / a > < / div > < div class = "ttdeci" > void copy_gg_nd1(device const T *src, device U *dst, constant const int64_t & src_stride, constant const int64_t & dst_stride, uint index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:101< / div > < / div >
2024-11-23 04:24:16 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a39ec5b7b8351e4332b842982a2ee6260" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a39ec5b7b8351e4332b842982a2ee6260" > copy_g_nd2< / a > < / div > < div class = "ttdeci" > void copy_g_nd2(device const T *src, device U *dst, constant const int64_t *src_strides, uint2 index, uint2 grid_dim)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:50< / div > < / div >
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a3f3836ad0b6545ec9b9e1864224f7a13" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a3f3836ad0b6545ec9b9e1864224f7a13" > copy_gg_nd3< / a > < / div > < div class = "ttdeci" > void copy_gg_nd3(device const T *src, device U *dst, constant const int64_t *src_strides, constant const int64_t *dst_strides, uint3 index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:125< / div > < / div >
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a71e4103db4689d90ef6f9d5ba93604cf" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a71e4103db4689d90ef6f9d5ba93604cf" > copy_g< / a > < / div > < div class = "ttdeci" > void copy_g(device const T *src, device U *dst, constant const int *src_shape, constant const int64_t *src_strides, constant const int & ndim, uint3 index, uint3 grid_dim)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:75< / div > < / div >
2024-10-15 23:12:17 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a8023e9335cc5334847a8d315042be3a3" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a8023e9335cc5334847a8d315042be3a3" > copy_s2< / a > < / div > < div class = "ttdeci" > void copy_s2(device const T *src, device U *dst, uint2 index, uint2 grid_dim)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:20< / div > < / div >
2025-01-10 05:56:20 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a8548ea41cac179084ddd33d26921576f" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a8548ea41cac179084ddd33d26921576f" > copy_gg_dynamic_nd1< / a > < / div > < div class = "ttdeci" > void copy_gg_dynamic_nd1(device const T *src, device U *dst, constant const int64_t & src_stride, constant const int64_t & dst_stride, constant const int64_t & src_offset, constant const int64_t & dst_offset, uint index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:166< / div > < / div >
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_a9b9266ee25a4dbcbe4fde883b40170f1" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#a9b9266ee25a4dbcbe4fde883b40170f1" > copy_gg_dynamic_nd2< / a > < / div > < div class = "ttdeci" > void copy_gg_dynamic_nd2(device const T *src, device U *dst, constant const int64_t *src_strides, constant const int64_t *dst_strides, constant const int64_t & src_offset, constant const int64_t & dst_offset, uint2 index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:180< / div > < / div >
2024-11-23 04:24:16 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_aab82689380897ff4716b5eafd6ef3ecc" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#aab82689380897ff4716b5eafd6ef3ecc" > copy_g_nd3< / a > < / div > < div class = "ttdeci" > void copy_g_nd3(device const T *src, device U *dst, constant const int64_t *src_strides, uint3 index, uint3 grid_dim)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:62< / div > < / div >
2025-01-10 05:56:20 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_ad0f05a73165d4ee38c9f02c705ea6ca8" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#ad0f05a73165d4ee38c9f02c705ea6ca8" > copy_gg_dynamic< / a > < / div > < div class = "ttdeci" > void copy_gg_dynamic(device const T *src, device U *dst, constant const int *src_shape, constant const int64_t *src_strides, constant const int64_t *dst_strides, constant const int & ndim, constant const int64_t & src_offset, constant const int64_t & dst_offset, uint3 index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:208< / div > < / div >
2024-11-23 04:24:16 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_ade9a9eea9b8262a854a11721fe2bb9fa" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#ade9a9eea9b8262a854a11721fe2bb9fa" > copy_gg< / a > < / div > < div class = "ttdeci" > void copy_gg(device const T *src, device U *dst, constant const int *src_shape, constant const int64_t *src_strides, constant const int64_t *dst_strides, constant const int & ndim, uint3 index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:137< / div > < / div >
2024-10-15 23:12:17 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_ae26a13e0c8e6c15f7b10078e65970659" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#ae26a13e0c8e6c15f7b10078e65970659" > copy_v< / a > < / div > < div class = "ttdeci" > void copy_v(device const T *src, device U *dst, uint index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:12< / div > < / div >
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_aee14a5326f53d9b30b0b38e27d180ef3" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#aee14a5326f53d9b30b0b38e27d180ef3" > copy_v2< / a > < / div > < div class = "ttdeci" > void copy_v2(device const T *src, device U *dst, uint2 index, uint2 grid_dim)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:30< / div > < / div >
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_aef09f9b9475345b1bba121d037d222ea" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#aef09f9b9475345b1bba121d037d222ea" > copy_s< / a > < / div > < div class = "ttdeci" > void copy_s(device const T *src, device U *dst, uint index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:4< / div > < / div >
2024-11-23 04:24:16 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_af0b06ac3a96852a64fa4274a94b58301" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#af0b06ac3a96852a64fa4274a94b58301" > copy_gg_nd2< / a > < / div > < div class = "ttdeci" > void copy_gg_nd2(device const T *src, device U *dst, constant const int64_t *src_strides, constant const int64_t *dst_strides, uint2 index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:113< / div > < / div >
2025-01-10 05:56:20 +08:00
< div class = "ttc" id = "ametal_2kernels_2copy_8h_html_af33ccc02f10bcb5c19ea7b1dd0af4956" > < div class = "ttname" > < a href = "metal_2kernels_2copy_8h.html#af33ccc02f10bcb5c19ea7b1dd0af4956" > copy_gg_dynamic_nd3< / a > < / div > < div class = "ttdeci" > void copy_gg_dynamic_nd3(device const T *src, device U *dst, constant const int64_t *src_strides, constant const int64_t *dst_strides, constant const int64_t & src_offset, constant const int64_t & dst_offset, uint3 index)< / div > < div class = "ttdef" > < b > Definition< / b > copy.h:194< / div > < / div >
2024-10-15 23:12:17 +08:00
< / div > <!-- fragment --> < / div > <!-- contents -->
< / div > <!-- doc - content -->
2025-01-10 05:56:20 +08:00
<!-- start footer part -->
< div id = "nav-path" class = "navpath" > <!-- id is needed for treeview function! -->
< ul >
< li class = "navelem" > < a class = "el" href = "dir_938ab0ecf10b8b860ff766c820f665fd.html" > mlx< / a > < / li > < li class = "navelem" > < a class = "el" href = "dir_1d446c9bd3c99228254c9484e0bc5c06.html" > backend< / a > < / li > < li class = "navelem" > < a class = "el" href = "dir_d0c977ea65824390717cdb7efc36c157.html" > metal< / a > < / li > < li class = "navelem" > < a class = "el" href = "dir_70a37effa88bcbd6b791977fa1e64356.html" > kernels< / a > < / li > < li class = "navelem" > < a class = "el" href = "metal_2kernels_2copy_8h.html" > copy.h< / a > < / li >
2025-02-07 04:16:29 +08:00
< li class = "footer" > Generated by < a href = "https://www.doxygen.org/index.html" > < img class = "footer" src = "doxygen.svg" width = "104" height = "31" alt = "doxygen" / > < / a > 1.13.2 < / li >
2025-01-10 05:56:20 +08:00
< / ul >
< / div >
2024-10-15 23:12:17 +08:00
< / body >
< / html >