Not enough resources for query planning - BigQuery connection breaks with too many tests #232

Closed
alberto-maurel opened this issue Oct 27, 2022 · 6 comments
Labels: bug, priority

@alberto-maurel

Hi

We are facing the following problem when running dbt_artifacts.upload_results(results) in the on-run-end hook:

11:39:18  Running 2 on-run-end hooks
11:39:22  Uploading model executions
11:39:29  Uploading seed executions
11:39:31  Uploading snapshot executions
11:39:33  Uploading test executions
11:40:07  Database error while running on-run-end
11:40:07  Encountered an error:
Database Error
  Resources exceeded during query execution: Not enough resources for query planning - too many subqueries or query is too complex.
...
google.api_core.exceptions.BadRequest: 400 Resources exceeded during query execution: Not enough resources for query planning - too many subqueries or query is too complex.

Location: us-east1
Job ID: 8eedde3d-33b8-4157-9dc8-c8130579f75b

...
  File "/opt/hostedtoolcache/Python/3.8.13/x64/lib/python3.8/site-packages/dbt/adapters/bigquery/connections.py", line 186, in handle_error
    raise DatabaseException(error_msg)
dbt.exceptions.DatabaseException: Database Error
  Resources exceeded during query execution: Not enough resources for query planning - too many subqueries or query is too complex.
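For reference, the hook is wired up in dbt_project.yml along these lines (a minimal sketch; the upload_results call is the one referenced above, everything else in the file is omitted):

# dbt_project.yml (minimal sketch; only the relevant key shown)
on-run-end:
  - "{{ dbt_artifacts.upload_results(results) }}"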

Looking up that job ID, the query that was executed is the following:

insert into dataset.dbt_ci.artifacts_src__test_executions
    VALUES 
        ('run_id', 'test.dataset.test_1', '2022-10-27 11:25:36.300150+00:00', False, 'Thread-7', 'pass', '2022-10-27 11:28:22.783830', '2022-10-27 11:28:23.692153', 0.9184236526489258, null, 0),
        ('run_id', 'test.dataset.test_2', '2022-10-27 11:25:36.300150+00:00', False, 'Thread-9', 'pass', '2022-10-27 11:28:22.766246', '2022-10-27 11:28:23.897758', 1.1385383605957031, null, 0),
        ...
        ('run_id', 'test.dataset.test_1069', '2022-10-27 11:25:36.300150+00:00', False, 'Thread-9', 'pass', '2022-10-27 11:28:22.766246', '2022-10-27 11:28:23.897758', 1.1385383605957031, null, 0)

Since we have 1050+ tests, BigQuery cannot process the query and returns that error. From some manual testing, it seems to start failing somewhere between 850 and 900 tests.
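One generic way to stay under that limit without changing the statement shape would be to chunk the rows into several smaller inserts. A hypothetical Jinja sketch (not the package's actual code; `rows` stands for a list of already-rendered value tuples, and 500 is an arbitrary batch size):

{# Hypothetical sketch: split the rendered value tuples into batches of
   500 via Jinja's built-in `batch` filter and run one insert per batch. #}
{% for chunk in rows | batch(500) %}
insert into dataset.dbt_ci.artifacts_src__test_executions
    VALUES
        {{ chunk | join(', ') }};
{% endfor %}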

Would it be possible to change the way the data is inserted into the BigQuery tables to something more efficient? As an alternative, I've considered replacing the statement above with something like this:

INSERT INTO capchase.dbt_ci.artifacts_src__test_executions
SELECT 
        tests_data.c1,
        tests_data.c2,
        CAST(tests_data.c3 AS TIMESTAMP),
        tests_data.c4,
        tests_data.c5,
        tests_data.c6,
        CAST(tests_data.c7 AS TIMESTAMP),
        CAST(tests_data.c8 AS TIMESTAMP),
        tests_data.c9,
        tests_data.c10,
        tests_data.c11
FROM UNNEST([STRUCT('run_id' AS c1, 'test.dataset.test_1' AS c2, '2022-10-27 11:25:36.300150+00:00' AS c3, False AS c4, 'Thread-7' AS c5, 'pass' AS c6, '2022-10-27 11:28:22.783830' AS c7, '2022-10-27 11:28:23.692153' AS c8, 0.9184236526489258 AS c9, null AS c10, 0 AS c11),
        ('run_id', 'test.dataset.test_2', '2022-10-27 11:25:36.300150+00:00', False, 'Thread-9', 'pass', '2022-10-27 11:28:22.766246', '2022-10-27 11:28:23.897758', 1.1385383605957031, null, 0),
        ...
        ('run_id', 'test.dataset.test_1069', '2022-10-27 11:25:36.300150+00:00', False, 'Thread-9', 'pass', '2022-10-27 11:28:22.766246', '2022-10-27 11:28:23.897758', 1.1385383605957031, null, 0)]) tests_data

This second approach handles the 1050 tests seamlessly and, as a side effect, runs about 7 times faster (current approach on the left, proposed on the right):
[Screenshot: query runtime comparison, current approach (left) vs. proposed approach (right)]
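For anyone who wants to try the pattern in isolation, here is a complete, runnable two-row version of the statement above (same table and example values; note that BigQuery infers INT64 for the untyped null struct field, so add a cast if the target column has a different type):

-- Complete two-row version of the proposed INSERT ... SELECT FROM UNNEST
-- pattern (values are the example rows quoted above)
INSERT INTO capchase.dbt_ci.artifacts_src__test_executions
SELECT
        tests_data.c1,
        tests_data.c2,
        CAST(tests_data.c3 AS TIMESTAMP),
        tests_data.c4,
        tests_data.c5,
        tests_data.c6,
        CAST(tests_data.c7 AS TIMESTAMP),
        CAST(tests_data.c8 AS TIMESTAMP),
        tests_data.c9,
        tests_data.c10,
        tests_data.c11
FROM UNNEST([
        STRUCT('run_id' AS c1, 'test.dataset.test_1' AS c2, '2022-10-27 11:25:36.300150+00:00' AS c3, False AS c4, 'Thread-7' AS c5, 'pass' AS c6, '2022-10-27 11:28:22.783830' AS c7, '2022-10-27 11:28:23.692153' AS c8, 0.9184236526489258 AS c9, null AS c10, 0 AS c11),
        ('run_id', 'test.dataset.test_2', '2022-10-27 11:25:36.300150+00:00', False, 'Thread-9', 'pass', '2022-10-27 11:28:22.766246', '2022-10-27 11:28:23.897758', 1.1385383605957031, null, 0)]) AS tests_data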

Thanks a lot!

@jens-koster

We're running into this as well.

@kieronellis

Same here. Is there any fix coming?

@ghost

ghost commented Apr 14, 2023

We have this issue as well, including with the upload_models step. If anyone knows a workaround besides skipping this step, I'll take it 😀

@samw430 (Contributor)

samw430 commented Apr 26, 2023

++

@glsdown (Contributor)

glsdown commented Apr 28, 2023

Hi @alberto-maurel.

Thanks for raising this, and I'm really sorry for how long it has taken for someone to get back to you. We will look into this and get a fix in place. Thank you for your suggested approach - this is really helpful, and will definitely speed up a resolution 🤞

@glsdown added the bug and priority labels on Apr 28, 2023
@glsdown (Contributor)

glsdown commented May 25, 2023

Hi @alberto-maurel

Thanks for your patience with this. We have just released 2.4.0, which should include a fix for this situation. Please let us know if it hasn't fixed it, and feel free to reopen the issue.

@glsdown closed this as completed on May 25, 2023