When gpu lost, scheduler will assign pod to wrong node #1782

Closed
zhiyu0729 opened this issue Oct 14, 2021 · 2 comments
Labels
area/scheduling, help wanted, kind/bug, priority/important-soon

Comments

@zhiyu0729
Contributor

zhiyu0729 commented Oct 14, 2021

What happened:
After the pod was scheduled, it failed to be created.

Status:         Failed
Reason:         UnexpectedAdmissionError
Message:        Pod Allocate failed due to requested number of devices unavailable for nvidia.com/gpu. Requested: 1, Available: 0, which is unexpected

What you expected to happen:
The pod shouldn't have been scheduled to this node.

How to reproduce it (as minimally and precisely as possible):

  1. A node with 4 GPUs:
Capacity:
  nvidia.com/gpu:     4
Allocatable:
  nvidia.com/gpu:     4
  2. Start a pod that requests 4 GPUs on this node.
  3. One GPU error is detected by the device plugin, after which the node reports:
Capacity:
  nvidia.com/gpu:     4
Allocatable:
  nvidia.com/gpu:     3
  4. Start the Volcano scheduler now.
  5. Create a pod that requests 1 GPU.
  6. The pod is scheduled to this node, even though the node has no idle GPU (see the sketch below).
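
A plain arithmetic sketch (illustrative only, not Volcano code; all names are made up) of the accounting I expect the scheduler to do for this node after the GPU failure:

package main

import "fmt"

func main() {
    allocatable := 3 // the device plugin now reports only 3 healthy GPUs
    inUse := 4       // the running pod was granted 4 GPUs before the failure
    request := 1     // the new pod asks for 1 GPU

    idle := allocatable - inUse // -1: the node is already over-committed
    if request > idle {
        fmt.Printf("expected: skip this node (idle=%d, requested=%d)\n", idle, request)
    }
}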

Anything else we need to know?:

Environment:

  • Volcano Version: e83119
  • Kubernetes version (use kubectl version):
  • Cloud provider or hardware configuration:
  • OS (e.g. from /etc/os-release):
  • Kernel (e.g. uname -a):
  • Install tools:
  • Others:
zhiyu0729 added the kind/bug label on Oct 14, 2021
zhiyu0729 changed the title from "gpu error occur, cause scheduling to the wrong node" to "When gpu lost, scheduler pod to the wrong node" on Oct 14, 2021
zhiyu0729 changed the title from "When gpu lost, scheduler pod to the wrong node" to "When gpu lost, scheduler will schedule pod to the wrong node" on Oct 14, 2021
zhiyu0729 changed the title from "When gpu lost, scheduler will schedule pod to the wrong node" to "When gpu lost, scheduler will assign pod to the wrong node" on Oct 14, 2021
zhiyu0729 changed the title from "When gpu lost, scheduler will assign pod to the wrong node" to "When gpu lost, scheduler will assign pod to wrong node" on Oct 14, 2021
@shinytang6
Member

I am a little confused: since the first pod (4 GPUs) occupies the node first, why can the pod (1 GPU) still be scheduled to that node? Did I miss something?

@zhiyu0729
Contributor Author

The problem is here:

When the scheduler starts, it syncs the scheduler cache, which triggers the AddPod handler; when the pod is bound to the node's task list, the following check runs:

func (ni *NodeInfo) allocateIdleResource(ti *TaskInfo) error {
    if ti.Resreq.LessEqual(ni.Idle, Zero) {
        ni.Idle.Sub(ti.Resreq)
        return nil
    }
    return fmt.Errorf("selected node NotReady")
}

The first pod's Resreq is 4 GPUs, but the node only has 3 allocatable GPUs, so this check fails and an error is returned.
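
To make this concrete, here is a minimal stand-in (illustrative only, not Volcano's Resource type; fits and allocateIdle are made-up names) for the check above. During the cache sync, the already-running pod requests 4 GPUs while Idle is only 3, so the check fails and nothing is subtracted:

package main

import (
    "errors"
    "fmt"
)

// resource is an illustrative stand-in for Volcano's Resource type.
type resource map[string]int

// fits mimics Resreq.LessEqual(Idle): every requested quantity must fit into idle.
func fits(req, idle resource) bool {
    for name, quantity := range req {
        if quantity > idle[name] {
            return false
        }
    }
    return true
}

// allocateIdle mimics allocateIdleResource: subtract only if the request fits.
func allocateIdle(req, idle resource) error {
    if !fits(req, idle) {
        return errors.New("selected node NotReady")
    }
    for name, quantity := range req {
        idle[name] -= quantity
    }
    return nil
}

func main() {
    idle := resource{"nvidia.com/gpu": 3} // Allocatable after one GPU was lost
    req := resource{"nvidia.com/gpu": 4}  // Resreq of the pod already running on the node

    fmt.Println(allocateIdle(req, idle)) // selected node NotReady
    fmt.Println(idle["nvidia.com/gpu"])  // still 3: nothing was subtracted
}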

// AddPod add pod to scheduler cache
func (sc *SchedulerCache) AddPod(obj interface{}) {
    pod, ok := obj.(*v1.Pod)
    if !ok {
        klog.Errorf("Cannot convert to *v1.Pod: %v", obj)
        return
    }

    sc.Mutex.Lock()
    defer sc.Mutex.Unlock()

    err := sc.addPod(pod)
    if err != nil {
        klog.Errorf("Failed to add pod <%s/%s> into cache: %v",
            pod.Namespace, pod.Name, err)
        return
    }

    klog.V(3).Infof("Added pod <%s/%v> into cache.", pod.Namespace, pod.Name)
}

But in AddPod the error from addPod is only logged and the pod is simply ignored, so the node's idle resource stays at 3 GPUs.
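
Continuing the illustration (again not Volcano code): because the error is only logged, the 4-GPU pod is never charged against the node's Idle, so a later 1-GPU request still passes the fit check and the pod is bound to a node that has no healthy free GPU:

package main

import "fmt"

func main() {
    // Idle was never reduced, because addPod's error was only logged.
    idle := map[string]int{"nvidia.com/gpu": 3}
    newReq := map[string]int{"nvidia.com/gpu": 1}

    // The fit check for the new 1-GPU pod now succeeds against stale cache state...
    fmt.Println(newReq["nvidia.com/gpu"] <= idle["nvidia.com/gpu"]) // true
    // ...but on the node there is no healthy free GPU, so admission fails
    // with UnexpectedAdmissionError, as shown in the report above.
}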

Thor-wl added the area/scheduling, priority/important-soon, and help wanted labels on Oct 15, 2021