Failed to discover NVIDIA GPU in the running container started by buildah (vfs + chroot) #5227
Comments
Isn't the GPU a device? Say /dev/gpu? Could you try ctr=$(buildah from --device /dev/gpu ...)
A friendly reminder that this issue had no activity for 30 days.
Sorry for the late reply. I tried the following:
ctr=$(buildah --device /dev/nvidia0 from for.example.com/gpu_image_for_test)
buildah run $ctr /bin/bash
and then
By the way, this container is run inside another container in vfs+chroot mode.
Could you try buildah --device=nvidia.com/gpu=all from ...
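The name nvidia.com/gpu=all follows the CDI (Container Device Interface) naming scheme of vendor/class=device, and such names are resolved against CDI spec files rather than against device nodes directly. As a rough aid for checking whether any spec files are visible from inside the outer container, here is a minimal shell sketch. The function name find_cdi_specs is hypothetical, and the default directories /etc/cdi and /var/run/cdi are the usual CDI locations, assumed here rather than taken from this thread.

```shell
#!/bin/sh
# Sketch only: list CDI spec files found in the given directories.
# find_cdi_specs is a hypothetical helper name, not part of buildah.
# With no arguments it looks in /etc/cdi and /var/run/cdi (assumed defaults).
find_cdi_specs() {
    if [ "$#" -eq 0 ]; then
        set -- /etc/cdi /var/run/cdi
    fi
    for dir in "$@"; do
        # Skip directories that do not exist in this environment
        [ -d "$dir" ] || continue
        for spec in "$dir"/*.yaml "$dir"/*.json; do
            # An unmatched glob stays literal, so test that the file exists
            [ -f "$spec" ] && printf '%s\n' "$spec"
        done
    done
    return 0
}

specs=$(find_cdi_specs)
if [ -n "$specs" ]; then
    printf 'CDI spec files visible here:\n%s\n' "$specs"
else
    printf 'No CDI spec files found in /etc/cdi or /var/run/cdi\n'
fi
```

If nothing is listed, one possibility is that the spec files exist only on the host and were never made available inside the outer container, in which case a CDI device name would have nothing to resolve against.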
What version of buildah are you using?
~ # buildah version
Version: 1.33.7
Go Version: go1.21.9 (Red Hat 1.21.9-1.module+el8.8.0+632+2dde9914)
Image Spec: 1.1.0-rc.5
Runtime Spec: 1.1.0
CNI Spec: 1.0.0
libcni Version: v1.1.2
image Version: 5.29.2
Git Commit:
Built: Tue Jun 18 11:12:42 2024
OS/Arch: linux/amd64
BuildPlatform: linux/amd64
~ # env | grep BUILDAH
BUILDAH_FORMAT=docker
BUILDAH_ISOLATION=chroot
~ # env | grep STORAGE
STORAGE_DRIVER=vfs
Any chance you can update the version?
Yes 1.36 has the patch.
https://github.com/containers/buildah/blob/release-1.36/pkg/parse/parse_unix.go It seems like the patch is missing. Could you confirm? Thanks.
Hello all, any update here? I don't see parse_unix.go having the patch that was mentioned.
@rhatdan your input is needed.
Does the container have access to the necessary CDI configuration in its
Any workaround before the PR is merged?
I think the current expectation is that, if the data in
Description
Failed to discover NVIDIA GPU in the running container started by buildah (vfs + chroot)
Steps to reproduce the issue:
buildah
export STORAGE_DRIVER=vfs
and isolation export BUILDAH_ISOLATION=chroot
buildah
and run with buildah
Describe the results you received:

Describe the results you expected:
PyTorch finds the GPU and runs the code successfully.
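One way to narrow down where discovery fails is to check, from inside the buildah run shell, whether any NVIDIA device nodes are visible at all before involving PyTorch. The sketch below is an illustration only: list_nvidia_nodes is a hypothetical helper name, and it assumes the driver exposes nodes named /dev/nvidia*, as with the /dev/nvidia0 mentioned in the comments.

```shell
#!/bin/sh
# Sketch only: print NVIDIA device nodes under a directory (default /dev).
# list_nvidia_nodes is a hypothetical helper name used for illustration.
list_nvidia_nodes() {
    devdir=${1:-/dev}
    for node in "$devdir"/nvidia*; do
        # An unmatched glob stays literal, so test that the entry exists
        [ -e "$node" ] && printf '%s\n' "$node"
    done
    return 0
}

nodes=$(list_nvidia_nodes)
if [ -n "$nodes" ]; then
    printf 'NVIDIA device nodes visible:\n%s\n' "$nodes"
else
    printf 'No /dev/nvidia* nodes visible in this environment\n'
fi
```

If nodes are listed, a follow-up check such as python3 -c 'import torch; print(torch.cuda.is_available())' shows whether PyTorch can actually use them. If none are listed, the device was probably never passed into the container, which points at the --device handling rather than at PyTorch.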
Output of rpm -q buildah or apt list buildah:
# rpm -q buildah
buildah-1.30.0-1.tl4.x86_64
Output of buildah version:
# buildah version
Version: 1.30.0
Go Version: go1.19
Image Spec: 1.0.2-dev
Runtime Spec: 1.1.0-rc.1
CNI Spec: 1.0.0
libcni Version: v1.1.2
image Version: 5.25.0
Git Commit:
Built: Fri Jul 14 19:36:27 2023
OS/Arch: linux/amd64
BuildPlatform: linux/amd64
Output of podman version if reporting a podman build issue:
Output of cat /etc/*release:
Output of uname -a:
Output of cat /etc/containers/storage.conf: