Fix edge cases in (de)serialize_torch_tensor #591
Conversation
Codecov Report
@@ Coverage Diff @@
## master #591 +/- ##
==========================================
+ Coverage 85.20% 85.37% +0.17%
==========================================
Files 81 81
Lines 8009 8022 +13
==========================================
+ Hits 6824 6849 +25
+ Misses 1185 1173 -12
hivemind/compression/floating.py (outdated diff)
@@ -12,22 +12,28 @@ class Float16Compression(CompressionBase):
    FP16_MIN, FP16_MAX = torch.finfo(torch.float16).min, torch.finfo(torch.float16).max

    def compress(self, tensor: torch.Tensor, info: CompressionInfo, allow_inplace: bool = False) -> runtime_pb2.Tensor:
        assert torch.is_floating_point(tensor) and tensor.dtype != torch.bfloat16
Is there a reason why we should fail with an error in case of bf16 inputs? It is indeed not sensible, but if the user wants to do it anyway, it's probably better to issue a warning instead of flat-out refusing to pass it through quantization.
Added a ValueError with a more user-legible reason.
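A hypothetical sketch of the change being discussed, assuming hivemind's module layout from the diff above; the exact error message and the elided body are assumptions, not the actual implementation:

```python
import torch

from hivemind.compression.base import CompressionBase, CompressionInfo
from hivemind.proto import runtime_pb2


class Float16Compression(CompressionBase):
    FP16_MIN, FP16_MAX = torch.finfo(torch.float16).min, torch.finfo(torch.float16).max

    def compress(self, tensor: torch.Tensor, info: CompressionInfo, allow_inplace: bool = False) -> runtime_pb2.Tensor:
        # Replace the bare assert with a ValueError that explains why the dtype is rejected
        if not torch.is_floating_point(tensor) or tensor.dtype == torch.bfloat16:
            raise ValueError(
                f"Float16Compression does not support {tensor.dtype} tensors; "
                "cast to float32 or choose a different compression type"  # assumed wording
            )
        ...  # rest of the original compress logic unchanged
```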
hivemind/compression/quantization.py (outdated diff)
@@ -135,14 +138,15 @@ def quantize(
    except ImportError:
        raise ImportError(BNB_MISSING_MESSAGE)

-    quantized, (absmax, codebook) = quantize_blockwise(tensor)
+    quantized, (absmax, codebook, *extra_params) = quantize_blockwise(tensor, blocksize=4096, nested=False)
+    assert tuple(extra_params) == (4096, False, tensor.dtype, None, None)  # blocksize, nested, dtype, offset, s2
Maybe we can make that tuple on the right a module-level constant? It's used twice in the code, better to make it clear we're using some predefined values
done, thanks for the suggestion
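A minimal sketch of one way to apply the suggestion, mirroring the unpacking shown in the diff above; the constant and helper names are assumptions, and since the dtype element of the expected tuple depends on the input, only the fixed parameters are lifted into module-level constants:

```python
from bitsandbytes.functional import quantize_blockwise

# Fixed arguments passed to quantize_blockwise, reused in the sanity check below
BLOCKWISE_BLOCKSIZE, BLOCKWISE_NESTED = 4096, False


def blockwise_quantize(tensor):
    quantized, (absmax, codebook, *extra_params) = quantize_blockwise(
        tensor, blocksize=BLOCKWISE_BLOCKSIZE, nested=BLOCKWISE_NESTED
    )
    # Fail loudly if a newer bitsandbytes release returns different metadata
    # (blocksize, nested, dtype, offset, state2)
    assert tuple(extra_params) == (BLOCKWISE_BLOCKSIZE, BLOCKWISE_NESTED, tensor.dtype, None, None)
    return quantized, absmax, codebook
```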
* serialize with requires_grad
* ensure that all compression methods return a tensor of the original dtype
* test that all compression methods preserve dtype and requires_grad

---------

Co-authored-by: Your Name <you@example.com>
Co-authored-by: Max Ryabinin <mryabinin0@gmail.com>

(cherry picked from commit 2873252)
An earlier patch lost the requires_grad property during serialize_torch_tensor. This PR adds it back.
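A rough sketch of the kind of roundtrip check the PR adds, assuming the serialization helpers are exposed from hivemind.compression and CompressionType from the generated protobuf module; the test name, tensor shape, and tolerance are made up for illustration:

```python
import torch

from hivemind.compression import deserialize_torch_tensor, serialize_torch_tensor
from hivemind.proto.runtime_pb2 import CompressionType


def test_roundtrip_preserves_dtype_and_requires_grad():
    tensor = torch.randn(64, 8, dtype=torch.float32, requires_grad=True)
    for compression in (CompressionType.NONE, CompressionType.FLOAT16):
        # Serialize and deserialize, then check that metadata survives the roundtrip
        restored = deserialize_torch_tensor(serialize_torch_tensor(tensor, compression))
        assert restored.dtype == tensor.dtype, compression
        assert restored.requires_grad == tensor.requires_grad, compression
        assert torch.allclose(restored, tensor, rtol=0, atol=1e-2), compression
```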