fix(mm): premature insufficient VRAM reporting #6156

psychedelicious · 2024-04-05T07:58:28Z

Summary

Remove the new VRAM checking logic from the ModelCache, which prematurely reports insufficient VRAM on Windows.

See #6106 for details.

Related Issues / Discussions

QA Instructions

Use the generation settings in #6106 on a 12GB GPU on Windows. That combination will trigger the premature OOM on main, but will work on this branch (assuming the system has ~3GB free RAM).

Merge Plan

n/a

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
Documentation added / updated (if applicable)

This check prematurely reports insufficient VRAM on Windows. See #6106 for details.

psychedelicious requested review from lstein, blessedcoolant, GreggHelt2, brandonrising, RyanJDick and hipsterusername as code owners April 5, 2024 07:58

github-actions bot added python PRs that change python files backend PRs that change backend files docs PRs that change docs labels Apr 5, 2024

hipsterusername approved these changes Apr 5, 2024

View reviewed changes

psychedelicious enabled auto-merge (rebase) April 6, 2024 03:27

psychedelicious added 3 commits April 6, 2024 14:27

fix(mm): remove vram check

31849ec

This check prematurely reports insufficient VRAM on Windows. See #6106 for details.

fix(mm): typing issues in model cache

5b3a9d3

docs: update FAQ.md (shared GPU memory)

4473746

psychedelicious force-pushed the psyche/fix/mm/remove-vram-check branch from ba2274e to 4473746 Compare April 6, 2024 03:27

psychedelicious merged commit a95756f into main Apr 6, 2024
14 checks passed

psychedelicious deleted the psyche/fix/mm/remove-vram-check branch April 6, 2024 03:35

psychedelicious mentioned this pull request Apr 6, 2024

Add a config variable that disable VRAM OOM conditions #6124

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(mm): premature insufficient VRAM reporting #6156

fix(mm): premature insufficient VRAM reporting #6156

psychedelicious commented Apr 5, 2024

fix(mm): premature insufficient VRAM reporting #6156

fix(mm): premature insufficient VRAM reporting #6156

Conversation

psychedelicious commented Apr 5, 2024

Summary

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist