[Doc][Misc] Correcting the document and uploading the model deployment template (#8287)
<!-- Thanks for sending a pull request! BEFORE SUBMITTING, PLEASE READ https://docs.vllm.ai/en/latest/contributing/overview.html --> ### What this PR does / why we need it? Correcting the document and uploading the model deployment template ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? --------- Signed-off-by: herizhen <1270637059@qq.com> Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>
This commit is contained in:
@@ -163,7 +163,7 @@ export ASCEND_ENABLE_USE_FABRIC_MEM=1
|
||||
#A2
|
||||
#export HCCL_INTRA_ROCE_ENABLE=1
|
||||
|
||||
#Minimum retransmission timeout of the RDMA,equals 4.096 μs * 2 ^ timeout.
|
||||
#Minimum retransmission timeout of the RDMA, equals 4.096 μs * 2 ^ timeout.
|
||||
#Needs to satisfy the equation: ASCEND_TRANSFER_TIMEOUT > RDMA_TIMEOUT * 7, where 7 is the default number of retry for RDMA transfer.
|
||||
#HCCL_RDMA_TIMEOUT also affects collective communication behavior and should be configured carefully.
|
||||
export HCCL_RDMA_TIMEOUT=17
|
||||
|
||||
Reference in New Issue
Block a user