MNN not working with MPS partitioned Nvidia GPU #3008

Open
amarflybot opened this issue Aug 27, 2024 · 1 comment
Labels: bug, CUDA

amarflybot commented Aug 27, 2024

Background

The Multi-Process Service (MPS) is an alternative, binary-compatible implementation of the CUDA Application Programming Interface (API). The MPS runtime architecture is designed to transparently enable co-operative multi-process CUDA applications, typically MPI jobs, to utilize Hyper-Q capabilities on A10 GPUs.
MPS Link

Platform (include target platform as well if cross-compiling):

CUDA 12.4
Platform: amd (x86-64)
GPU: Nvidia

GitHub version:

version_mnn= 2.9.0

Compiling method:

cmake -DCMAKE_BUILD_TYPE=Release -DMNN_USE_THREAD_POOL=OFF -DMNN_CUDA=ON -DMNN_OPENMP=ON ..

Build log:

Build is fine, and I can integrate the built MNN binary with a http

Issue:

MNN cannot obtain CUDA memory when MPS is enabled. The process aborts at startup on a failed assertion in the CUDARuntime constructor.

Execution Log:

The device support i8sdot:0, support fp16:0, support i8mm: 0
Error for /build/MNN/source/backend/cuda/core/runtime/CUDARuntime.cpp, 47
face_detection: /build/MNN/source/backend/cuda/core/runtime/CUDARuntime.cpp:47: MNN::CUDARuntime::CUDARuntime(int): Assertion `res' failed.
Aborted (core dumped)
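
To help narrow down whether the failure is specific to MNN or affects any CUDA client under MPS, a small standalone probe could be run in the same environment as the failing binary. The following is a minimal sketch, not MNN code: it assumes libcudart can be found on the loader path, it probes device 0 only, and the helper names load_cudart and probe_cuda are hypothetical. It does not reproduce whatever check sits at CUDARuntime.cpp line 47; it only exercises a few basic CUDA runtime calls and reports the first one that fails.

```python
import ctypes
import ctypes.util
import os


def load_cudart():
    """Try to load the CUDA runtime library; return None if it is unavailable."""
    # Candidate names are assumptions; adjust them to the local CUDA install.
    candidates = [ctypes.util.find_library("cudart"), "libcudart.so", "libcudart.so.12"]
    for name in candidates:
        if not name:
            continue
        try:
            return ctypes.CDLL(name)
        except OSError:
            continue
    return None


def probe_cuda():
    """Call a few CUDA runtime functions and report the first failing step."""
    # Record the MPS-related environment so it can be compared with the daemon's.
    info = {"mps_pipe_dir": os.environ.get("CUDA_MPS_PIPE_DIRECTORY")}

    lib = load_cudart()
    if lib is None:
        info.update(ok=False, step="load", error="libcudart not found")
        return info

    lib.cudaGetErrorString.restype = ctypes.c_char_p
    lib.cudaGetErrorString.argtypes = [ctypes.c_int]
    lib.cudaSetDevice.argtypes = [ctypes.c_int]

    def failed(step, code):
        # cudaSuccess is 0; any other value is an error code.
        if code == 0:
            return False
        message = lib.cudaGetErrorString(code).decode(errors="replace")
        info.update(ok=False, step=step, code=code, error=message)
        return True

    count = ctypes.c_int(0)
    if failed("cudaGetDeviceCount", lib.cudaGetDeviceCount(ctypes.byref(count))):
        return info
    if failed("cudaSetDevice", lib.cudaSetDevice(0)):
        return info

    free_bytes = ctypes.c_size_t(0)
    total_bytes = ctypes.c_size_t(0)
    code = lib.cudaMemGetInfo(ctypes.byref(free_bytes), ctypes.byref(total_bytes))
    if failed("cudaMemGetInfo", code):
        return info

    info.update(ok=True, devices=count.value,
                free_bytes=free_bytes.value, total_bytes=total_bytes.value)
    return info


if __name__ == "__main__":
    print(probe_cuda())
```

If the probe fails with MPS enabled and succeeds with it disabled, the problem is more likely in the MPS setup (for example, the client not seeing the same CUDA_MPS_PIPE_DIRECTORY as the control daemon) than in MNN. If it succeeds in both cases, the failing call is more likely specific to how MNN initializes the CUDA runtime.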
jxt1234 added the bug and CUDA labels on Sep 6, 2024