'ptxas' died due to signal 11 (Invalid memory reference) #322
Comments
I'm confused, do you want a container or a binary?
I want to install TEI in a container image for future use.
These are the instructions from the official Docker image.
For clarity, the code being executed looks exactly like this (from the official Docker image):
export CUDA_COMPUTE_CAP=86
export CUDA_HOME=/usr/local/cuda-12.1
export PATH=${PATH}:/usr/local/cuda-12.1/bin
# Limit parallelism
export CARGO_BUILD_JOBS=1
export RAYON_NUM_THREADS=1
export CARGO_BUILD_INCREMENTAL=true
# Prune libcublas_static.a down to the target architecture(s)
if [ ${CUDA_COMPUTE_CAP} -ge 75 -a ${CUDA_COMPUTE_CAP} -lt 80 ];
then
    nvprune \
        --generate-code code=sm_${CUDA_COMPUTE_CAP} \
        /usr/local/cuda/lib64/libcublas_static.a -o /usr/local/cuda/lib64/libcublas_static.a;
elif [ ${CUDA_COMPUTE_CAP} -ge 80 -a ${CUDA_COMPUTE_CAP} -lt 90 ];
then
    nvprune \
        --generate-code code=sm_80 \
        --generate-code code=sm_${CUDA_COMPUTE_CAP} \
        /usr/local/cuda/lib64/libcublas_static.a -o /usr/local/cuda/lib64/libcublas_static.a;
elif [ ${CUDA_COMPUTE_CAP} -eq 90 ];
then
    nvprune \
        --generate-code code=sm_90 \
        /usr/local/cuda/lib64/libcublas_static.a -o /usr/local/cuda/lib64/libcublas_static.a;
else
    echo "cuda compute cap ${CUDA_COMPUTE_CAP} is not supported"; exit 1;
fi;
# Build dependencies with the feature set matching the architecture
if [ ${CUDA_COMPUTE_CAP} -ge 75 -a ${CUDA_COMPUTE_CAP} -lt 80 ];
then
    cargo chef cook --release \
        --features candle-cuda-turing \
        --features static-linking \
        --no-default-features \
        --recipe-path recipe.json && \
    sccache -s;
else
    cargo chef cook --release \
        --features candle-cuda \
        --features static-linking \
        --no-default-features \
        --recipe-path recipe.json && \
    sccache -s;
fi;
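To check which branch the script above takes for a given CUDA_COMPUTE_CAP without invoking nvprune or cargo, the selection logic can be pulled out into a dry run. This is only a sketch: the function names tei_prune_archs and tei_cargo_feature are hypothetical helpers invented for illustration, not part of TEI or its Dockerfile, and the thresholds are copied from the script as posted.

```shell
#!/usr/bin/env bash
# Dry-run sketch of the branch selection in the script above.
# tei_prune_archs and tei_cargo_feature are hypothetical names;
# they only print what would be selected and run no CUDA tooling.

# Print the sm_XX targets that nvprune would keep for a compute cap.
tei_prune_archs() {
    local cap="$1"
    if [ "$cap" -ge 75 ] && [ "$cap" -lt 80 ]; then
        echo "sm_${cap}"
    elif [ "$cap" -ge 80 ] && [ "$cap" -lt 90 ]; then
        # Mirrors the original: sm_80 is always kept alongside the target.
        echo "sm_80 sm_${cap}"
    elif [ "$cap" -eq 90 ]; then
        echo "sm_90"
    else
        echo "cuda compute cap ${cap} is not supported" >&2
        return 1
    fi
}

# Print the cargo feature that would be passed to cargo chef cook.
tei_cargo_feature() {
    local cap="$1"
    if [ "$cap" -ge 75 ] && [ "$cap" -lt 80 ]; then
        echo "candle-cuda-turing"
    else
        echo "candle-cuda"
    fi
}

cap="${CUDA_COMPUTE_CAP:-86}"
echo "prune targets: $(tei_prune_archs "$cap")"
echo "cargo feature: $(tei_cargo_feature "$cap")"
```

With CUDA_COMPUTE_CAP=86, as in this report, the dry run selects sm_80 and sm_86 for pruning and the candle-cuda feature for the build.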
System Info
Version: v1.4.0
Cargo version: cargo 1.79.0 (ffa9cf99a 2024-06-03)
GCC version: 11.4.1
GPU: Compile with CUDA_COMPUTE_CAP=86 on machine without GPU (but with CUDA 12.1).
I plan to use this container with A40, but I don't have a GPU to build it.
Reproduction
I start this script:
I get this error:
Expected behavior
TEI compiled.