Clarify matrix copy CUDA starter kernel size #266

Open
Ryannnice wants to merge 1 commit into AlphaGPU:main from Ryannnice:matrix-copy-kernel-total
@Ryannnice
Summary

  • Pass the total element count (N * N) to the Matrix Copy CUDA
    starter kernel instead of the side length N
  • Keep the solve signature unchanged and compute N * N on the
    host side before launching the kernel

Notes

  • This keeps the starter aligned with a linear, element-wise
    copy model: each thread copies one element at a flat index,
    so the kernel only needs the total element count
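
For reviewers, the change described above could look roughly like the sketch below. The actual starter code is not shown in this PR description, so the kernel name copy_matrix_kernel, the parameter name total, the block size of 256, and the assumption that A and B are device pointers are all hypothetical; only the unchanged solve signature and the host-side N * N come from the summary.

```cuda
#include <cuda_runtime.h>

// Hypothetical starter kernel: `total` is the number of elements (N * N),
// not the matrix side length N.
__global__ void copy_matrix_kernel(const float* A, float* B, int total) {
    // One thread per element, addressed by a flat (linear) index.
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < total) {
        B[idx] = A[idx];
    }
}

// solve keeps taking N; A and B are assumed to be device pointers
// to N * N floats each.
extern "C" void solve(const float* A, float* B, int N) {
    int total = N * N;  // computed on the host side
    if (total <= 0) {
        return;  // nothing to copy; also avoids a zero-block launch
    }
    int threadsPerBlock = 256;  // assumed block size
    int blocksPerGrid = (total + threadsPerBlock - 1) / threadsPerBlock;
    copy_matrix_kernel<<<blocksPerGrid, threadsPerBlock>>>(A, B, total);
    cudaDeviceSynchronize();
}
```

With this shape, the bounds check in the kernel compares a flat index against a flat count, so a reader no longer has to infer that the guard should be N * N rather than N.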
