docs: Document min_cuda_version parameter for Flash GPU endpoints#593
docs: Document min_cuda_version parameter for Flash GPU endpoints#593promptless[bot] wants to merge 2 commits intomainfrom
Conversation
There was a problem hiding this comment.
(Line 474)
Citation: New min_cuda_version parameter added to Endpoint class in src/runpod_flash/endpoint.py. Default value of "12.8" set in src/runpod_flash/core/resources/serverless.py. CPU endpoints clear this value via _sync_cpu_fields() in serverless_cpu.py.
View source
There was a problem hiding this comment.
(Line 500)
Citation: Valid CUDA versions are validated against the CudaVersion enum via validate_min_cuda_version() in serverless.py. The error message format and validation logic are defined in the PR.
View source
|
Preview deployment for your docs. Learn more about Mintlify Previews.
|
flash/configuration/parameters.mdx
Outdated
|
|
||
| ### min_cuda_version | ||
|
|
||
| **Type**: `str` |
There was a problem hiding this comment.
Citation: Updated type to include CudaVersion enum per reviewer comment from @deanq requesting docs reflect that min_cuda_version accepts the CudaVersion type.
View source
Open this suggestion in Promptless to view citations and reasoning process
Documents the new
min_cuda_versionparameter for Flash endpoints. GPU endpoints now default to CUDA 12.8 to ensure workers run on hosts with recent drivers. Users can override this value to allow older hosts if needed. CPU endpoints are unaffected.Trigger Events
runpod/flash PR #277: feat: default GPU endpoints to minCudaVersion 12.8
Promptless Research (5 files, 1 GitHub PR)
.long_term_context/product_knowledge/product_overview.md.long_term_context/doc_workflow/client_instructions.md.long_term_context/style/client_style_guide.mdflash/configuration/parameters.mdxflash/create-endpoints.mdxAgent Response
Tip: Add or adjust Promptless's style guide in Agent Knowledge Base ✍️