Fix data type when getting a line offset for a segmented hrit_jma #2930

k3a · 2024-10-15T14:45:23Z

Fixes the following uint8 overflow when loading Himawari hrit data distributed by EumetCast:

Traceback (most recent call last):
  File "/Work/meteo/satpy-tests/./test5.py", line 18, in <module>
    scn = Scene(filenames=filenames, reader='ahi_hrit')
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/scene.py", line 155, in __init__
    self._readers = self._create_reader_instances(filenames=filenames,
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/scene.py", line 176, in _create_reader_instances
    return load_readers(filenames=filenames,
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/__init__.py", line 580, in load_readers
    reader_instance.create_storage_items(
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/yaml_reader.py", line 617, in create_storage_items
    return self.create_filehandlers(files, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/yaml_reader.py", line 1180, in create_filehandlers
    created_fhs = super(GEOSegmentYAMLReader, self).create_filehandlers(
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/yaml_reader.py", line 629, in create_filehandlers
    filehandlers = self._new_filehandlers_for_filetype(filetype_info,
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/yaml_reader.py", line 612, in _new_filehandlers_for_filetype
    return list(filtered_iter)
           ^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/yaml_reader.py", line 594, in filter_fh_by_metadata
    for filehandler in filehandlers:
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/yaml_reader.py", line 590, in _new_filehandler_instances
    yield filetype_cls(filename, filename_info, filetype_info, *req_fh, **fh_kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/hrit_jma.py", line 278, in __init__
    self.area = self._get_area_def()
                ^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/hrit_jma.py", line 365, in _get_area_def
    "loff": self._get_line_offset(),
            ^^^^^^^^^^^^^^^^^^^^^^^
  File "/Work/meteo/venv/lib/python3.12/site-packages/satpy/readers/hrit_jma.py", line 349, in _get_line_offset
    loff -= (self.mda["total_no_image_segm"] - segment_number - 1) * nlines
            ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~
OverflowError: Python integer 550 out of bounds for uint8

mraspaud · 2024-10-15T16:57:32Z

@k3a thanks for your contribution! Do you think you can add a little test for this (to ensure that the bug doesn't come back in the future), and add your name to the Authors files?

codecov · 2024-10-17T07:23:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.08%. Comparing base (b8991c0) to head (7adbcdd).
Report is 130 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff            @@
##             main    #2930    +/-   ##
========================================
  Coverage   96.07%   96.08%            
========================================
  Files         373      377     +4     
  Lines       54491    55067   +576     
========================================
+ Hits        52352    52909   +557     
- Misses       2139     2158    +19

Flag	Coverage Δ
behaviourtests	`3.94% <0.00%> (-0.05%)`	⬇️
unittests	`96.17% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

coveralls · 2024-10-17T07:36:29Z

Pull Request Test Coverage Report for Build 11416592783

Details

3 of 3 (100.0%) changed or added relevant lines in 2 files are covered.
8 unchanged lines in 3 files lost coverage.
Overall coverage decreased (-0.02%) to 96.187%

Files with Coverage Reduction	New Missed Lines	%
satpy/tests/utils.py	2	93.16%
satpy/tests/reader_tests/gms/test_gms5_vissr_l1b.py	3	98.67%
satpy/tests/reader_tests/gms/test_gms5_vissr_navigation.py	3	97.18%

Totals
Change from base Build 11403042925:	-0.02%
Covered Lines:	53153
Relevant Lines:	55260

💛 - Coveralls

k3a · 2024-10-19T10:14:24Z

No new tests seem necessary. I have just updated the _get_mda helper function to convert into the proper numpy types according to the dtype specs.

djhoese · 2024-10-19T14:09:26Z

If the existing tests didn't fail before your fix then we need new tests that would fail and now pass unless I'm misunderstanding something.

pnuu · 2024-10-19T14:20:04Z

I don't have an up-to-date environment at hand, but I think the modified data represents more closely the actual data, so might fail without the changes in the code. I'll see if I have the time to create a new env to test with.

k3a · 2024-10-19T14:26:06Z

I have, of course, tested that the current tests will fail without the fix. There are tests testing the case with segmented hrit but that _get_mda needs to retype args into proper np types to trigger this bug because when data is parsed it is parsed into those concrete types, not a generic python ints:

FAILED test_ahi_hrit.py::TestHRITJMAFileHandler::test_get_area_def - OverflowError: Python integer 275 out of bounds for uint8
FAILED test_ahi_hrit.py::TestHRITJMAFileHandler::test_get_dataset - OverflowError: Python integer 275 out of bounds for uint8
FAILED test_ahi_hrit.py::TestHRITJMAFileHandler::test_init - OverflowError: Python integer 11000 out of bounds for uint8
FAILED test_ahi_hrit.py::TestHRITJMAFileHandler::test_mask_space - OverflowError: Python integer 275 out of bounds for uint8

k3a requested review from sfinkens and mraspaud as code owners October 15, 2024 14:45

mraspaud assigned k3a Oct 15, 2024

mraspaud added bug component:readers labels Oct 15, 2024

k3a force-pushed the main branch from 6487bd4 to 7adbcdd Compare October 19, 2024 09:53

k3a requested a review from djhoese as a code owner October 19, 2024 09:53

Fix data type when getting a line offset for a segmented hrit_jma

876d3bb

k3a force-pushed the main branch from 7adbcdd to 876d3bb Compare October 20, 2024 12:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix data type when getting a line offset for a segmented hrit_jma #2930

Fix data type when getting a line offset for a segmented hrit_jma #2930

k3a commented Oct 15, 2024

mraspaud commented Oct 15, 2024 •

edited

Loading

codecov bot commented Oct 17, 2024 •

edited

Loading

coveralls commented Oct 17, 2024 •

edited

Loading

k3a commented Oct 19, 2024

djhoese commented Oct 19, 2024

pnuu commented Oct 19, 2024

k3a commented Oct 19, 2024 •

edited

Loading

Fix data type when getting a line offset for a segmented hrit_jma #2930

Are you sure you want to change the base?

Fix data type when getting a line offset for a segmented hrit_jma #2930

Conversation

k3a commented Oct 15, 2024

mraspaud commented Oct 15, 2024 • edited Loading

codecov bot commented Oct 17, 2024 • edited Loading

Codecov Report

coveralls commented Oct 17, 2024 • edited Loading

Pull Request Test Coverage Report for Build 11416592783

Details

💛 - Coveralls

k3a commented Oct 19, 2024

djhoese commented Oct 19, 2024

pnuu commented Oct 19, 2024

k3a commented Oct 19, 2024 • edited Loading

mraspaud commented Oct 15, 2024 •

edited

Loading

codecov bot commented Oct 17, 2024 •

edited

Loading

coveralls commented Oct 17, 2024 •

edited

Loading

k3a commented Oct 19, 2024 •

edited

Loading