Include the column name in the error message for an unexpected NULL #397

angusholder · 2024-09-19T08:52:40Z

Previously if you inserted a NULL into a column that isn't Nullable, the error you got was

Unable to create Python array. This is usually caused by trying to insert None values into a ClickHouse column that is not Nullable

which is unhelpful for working out which column is the problem. I've modified that error path so it can include the column name in the error, like so:

Failed to write column 'bus_voltage': Unable to create Python array. This is usually caused by trying to insert None values into a ClickHouse column that is not Nullable

I felt like this was a pretty small change so I didn't add tests or file an issue, I hope that's okay.

angusholder · 2024-09-19T09:07:48Z

clickhouse_connect/driver/insert.py

@@ -198,3 +199,12 @@ def _convert_numpy(self, np_array):
                data[ix] = data[ix].tolist()
        self.column_oriented = True
        return data
+
+    def start_column(self, name: str):


This gets called during the insert to tell us what the current column being inserted is, so I store its name here so we have it if we run into an error during inserting that column

angusholder · 2024-09-19T09:08:39Z

clickhouse_connect/driver/insert.py

+        self._column_name = name
+
+    def make_data_error(self, error_message: str) -> DataError:
+        if self._column_name is not None:


Here's where we use the column name that was stored by start_column(). I don't know if it's possible to reach here by doing a column insert without start_column() having been called, but I handled None just in case

It's not possible, so it's safe to remove the None check.

angusholder · 2024-09-19T09:09:59Z

clickhouse_connect/driver/common.py

@@ -54,8 +55,8 @@ def write_array(code: str, column: Sequence, dest: MutableSequence):
        buff = struct.Struct(f'<{len(column)}{code}')
        dest += buff.pack(*column)
    except (TypeError, OverflowError, struct.error) as ex:
-        raise DataError('Unable to create Python array.  This is usually caused by trying to insert None ' +
-                        'values into a ClickHouse column that is not Nullable') from ex
+        raise ctx.make_data_error('Unable to create Python array.  This is usually caused by trying to insert None ' +


This was the only error I really needed improving, but I modified the other places where DataError could be raised too, in string.py

genzgd

Thanks so much, I agree it's hard to track down obscure insert errors and this will help. Stashing the column name in InsertContext makes a lot of sense, btw -- I haven't looked deeply, but maybe we add that to the BaseQueryContext instead since it doesn't hurt to have it around for queries as well.

genzgd · 2024-09-19T15:43:01Z

clickhouse_connect/driver/common.py

@@ -38,12 +38,13 @@ def array_type(size: int, signed: bool):
    return code if signed else code.upper()


-def write_array(code: str, column: Sequence, dest: MutableSequence):
+def write_array(code: str, column: Sequence, dest: MutableSequence, ctx):


I think you can add the ctx Type here too?

genzgd · 2024-09-19T15:54:27Z

clickhouse_connect/driver/insert.py

+        self._column_name = name
+
+    def make_data_error(self, error_message: str) -> DataError:
+        if self._column_name is not None:


It's not possible, so it's safe to remove the None check.

CLAassistant · 2024-09-19T16:22:19Z

All committers have signed the CLA.

angusholder · 2024-09-19T16:26:25Z

Thanks so much, I agree it's hard to track down obscure insert errors and this will help. Stashing the column name in InsertContext makes a lot of sense, btw -- I haven't looked deeply, but maybe we add that to the BaseQueryContext instead since it doesn't hurt to have it around for queries as well.

Sounds like a good idea, I've moved it to BaseQueryContext as you suggest.

Should be ready for another CI run now

genzgd · 2024-09-19T17:04:07Z

Thanks again! I'm hoping to do another release next week that will include this.

angusholder · 2024-09-19T19:03:19Z

No problem! Great to hear, thanks

Improve error messages to include column name

9fa0ba7

angusholder commented Sep 19, 2024

View reviewed changes

genzgd reviewed Sep 19, 2024

View reviewed changes

angusholder added 3 commits September 19, 2024 17:19

Fix imports, add type

e065f05

Move column name into BaseQueryContext

527fde4

Remove unnecessary None check

35115e6

Fix import cycle

279e1a5

genzgd merged commit b90cdf9 into ClickHouse:main Sep 19, 2024
33 checks passed

angusholder deleted the better-column-errors branch September 19, 2024 19:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include the column name in the error message for an unexpected NULL #397

Include the column name in the error message for an unexpected NULL #397

angusholder commented Sep 19, 2024

angusholder Sep 19, 2024 •

edited

Loading

angusholder Sep 19, 2024 •

edited

Loading

genzgd Sep 19, 2024

angusholder Sep 19, 2024

genzgd left a comment

genzgd Sep 19, 2024

genzgd Sep 19, 2024

CLAassistant commented Sep 19, 2024 •

edited

Loading

angusholder commented Sep 19, 2024 •

edited

Loading

genzgd commented Sep 19, 2024

angusholder commented Sep 19, 2024

Include the column name in the error message for an unexpected NULL #397

Include the column name in the error message for an unexpected NULL #397

Conversation

angusholder commented Sep 19, 2024

angusholder Sep 19, 2024 • edited Loading

Choose a reason for hiding this comment

angusholder Sep 19, 2024 • edited Loading

Choose a reason for hiding this comment

genzgd Sep 19, 2024

Choose a reason for hiding this comment

angusholder Sep 19, 2024

Choose a reason for hiding this comment

genzgd left a comment

Choose a reason for hiding this comment

genzgd Sep 19, 2024

Choose a reason for hiding this comment

genzgd Sep 19, 2024

Choose a reason for hiding this comment

CLAassistant commented Sep 19, 2024 • edited Loading

angusholder commented Sep 19, 2024 • edited Loading

genzgd commented Sep 19, 2024

angusholder commented Sep 19, 2024

angusholder Sep 19, 2024 •

edited

Loading

angusholder Sep 19, 2024 •

edited

Loading

CLAassistant commented Sep 19, 2024 •

edited

Loading

angusholder commented Sep 19, 2024 •

edited

Loading