Check if experiment_id attribute is present in current dataset when being processed by PrePARE #644

mauzey1 · 2022-01-21T22:57:03Z

Fixes #643

durack1

@mauzey1 thanks for catching this - @matthew-mizielinski are there any other obvious tweaks you suggest?

piotr-florek-mohc

Looks good to me.

piotr-florek-mohc · 2022-01-24T13:56:23Z

Just a quick note, this fix doesn't appear to resolve the whole issue raised in #643 entirely (at least it didn't do it for me), as there are other problems with the netCDF file, which cmor to fail because of weird characters appearing in the log file. The quick fix, as commented by @matthew.mizielinski, is to provide encoding parameter to the PrePARE.py script.

diff --git a/LibCV/PrePARE/PrePARE.py b/LibCV/PrePARE/PrePARE.py
index 6fcd570..97f1c9e 100755
--- a/LibCV/PrePARE/PrePARE.py
+++ b/LibCV/PrePARE/PrePARE.py
@@ -931,7 +931,7 @@ def main():
         remove_ansi = re.compile(r'\x1b\[[0-?]*[ -/]*[@-~]')
         for logfile in set(logfiles):
             if not os.stat(logfile).st_size == 0:
-                with open(logfile, 'r') as f:
+                with open(logfile, 'r', encoding='utf8', errors='ignore') as f:
                     log_text = f.read()
                     if args.no_text_color:
                         log_text = remove_ansi.sub('', log_text)
@@ -955,7 +955,7 @@ def main():
             logfile, rc = process(source)
             errors += rc
             if not os.stat(logfile).st_size == 0:
-                with open(logfile, 'r') as f:
+                with open(logfile, 'r', encoding='utf8', errors='ignore') as f:
                     log_text = f.read()
                     if args.no_text_color:
                         log_text = remove_ansi.sub('', log_text)

mauzey1 · 2022-01-24T18:27:52Z

@piotr-florek-mohc

Okay, I have added the suggested changes to PrePARE. I would still like to keep finding bugs in CMOR_CV code similar to the ones I have found, but this will at least reduce the chances of PrePARE crashing when processing erroneous NetCDF files.

durack1 · 2022-01-24T19:59:19Z

@mauzey1 @piotr-florek-mohc (and @matthew-mizielinski) thanks for this tweak!

mauzey1 · 2022-01-24T23:24:41Z

The MacOS Python 3.9 check failed due to a server error with conda. The last change committed would not have affected the tests anyway. I will merge this now.

Check if experiment_id attribute is present in current dataset

ffafb0b

mauzey1 requested review from durack1 and piotr-florek-mohc January 21, 2022 22:57

durack1 approved these changes Jan 22, 2022

View reviewed changes

piotr-florek-mohc approved these changes Jan 24, 2022

View reviewed changes

Ignore utf-8 character errors when reading log files in PrePARE

71abfdb

mauzey1 merged commit ea62700 into master Jan 24, 2022

mauzey1 deleted the 643_prepare_unicode_error branch January 24, 2022 23:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check if experiment_id attribute is present in current dataset when being processed by PrePARE #644

Check if experiment_id attribute is present in current dataset when being processed by PrePARE #644

mauzey1 commented Jan 21, 2022

durack1 left a comment

piotr-florek-mohc left a comment

piotr-florek-mohc commented Jan 24, 2022

mauzey1 commented Jan 24, 2022

durack1 commented Jan 24, 2022

mauzey1 commented Jan 24, 2022 •

edited

Loading

Check if experiment_id attribute is present in current dataset when being processed by PrePARE #644

Check if experiment_id attribute is present in current dataset when being processed by PrePARE #644

Conversation

mauzey1 commented Jan 21, 2022

durack1 left a comment

Choose a reason for hiding this comment

piotr-florek-mohc left a comment

Choose a reason for hiding this comment

piotr-florek-mohc commented Jan 24, 2022

mauzey1 commented Jan 24, 2022

durack1 commented Jan 24, 2022

mauzey1 commented Jan 24, 2022 • edited Loading

mauzey1 commented Jan 24, 2022 •

edited

Loading