Handle errors better in tables.scala #243

Closed · larsgeorge-db opened this issue Sep 20, 2023 · 0 comments · Fixed by #251
Labels: enhancement (New feature or request)

larsgeorge-db (Contributor) commented:
URL: https://e2-demo-field-eng.cloud.databricks.com/?o=1444828305810485#job/18199892189402/run/374920315594814

The run fails while crawling table metadata: `listTables` raises a `NoSuchDatabaseException`, apparently because the schema was dropped after it was enumerated, and `tables.scala` does not handle the error:

NoSuchDatabaseException: [SCHEMA_NOT_FOUND] The schema `tomasz_kurzydym_002i_da_sml` cannot be found. Verify the spelling and correctness of the schema and catalog.
If you did not qualify the name with a catalog, verify the current_schema() output, or qualify the name with the correct catalog.
To tolerate the error on drop use DROP SCHEMA IF EXISTS.
	at org.apache.spark.sql.catalyst.catalog.ExternalCatalog.requireDbExists(ExternalCatalog.scala:42)
	at org.apache.spark.sql.catalyst.catalog.ExternalCatalog.requireDbExists$(ExternalCatalog.scala:40)
	at org.apache.spark.sql.hive.HiveExternalCatalog.requireDbExists(HiveExternalCatalog.scala:68)
	at org.apache.spark.sql.hive.HiveExternalCatalog.$anonfun$listTables$1(HiveExternalCatalog.scala:1055)
	at com.databricks.spark.util.FrameProfiler$.record(FrameProfiler.scala:94)
	at org.apache.spark.sql.hive.HiveExternalCatalog.$anonfun$withClient$2(HiveExternalCatalog.scala:155)
	at org.apache.spark.sql.hive.HiveExternalCatalog.maybeSynchronized(HiveExternalCatalog.scala:116)
	at org.apache.spark.sql.hive.HiveExternalCatalog.$anonfun$withClient$1(HiveExternalCatalog.scala:154)
	at com.databricks.backend.daemon.driver.ProgressReporter$.withStatusCode(ProgressReporter.scala:364)
	at com.databricks.spark.util.SparkDatabricksProgressReporter$.withStatusCode(ProgressReporter.scala:34)
	at org.apache.spark.sql.hive.HiveExternalCatalog.withClient(HiveExternalCatalog.scala:153)
	at org.apache.spark.sql.hive.HiveExternalCatalog.listTables(HiveExternalCatalog.scala:1054)
	at com.databricks.sql.acl.SecuredHiveExternalCatalog.com$databricks$sql$acl$SecuredExternalCatalog$$super$listTables(SecuredHiveExternalCatalog.scala:26)
	at com.databricks.sql.acl.SecuredExternalCatalog.$anonfun$listTables$3(SecuredExternalCatalog.scala:85)
	at com.databricks.backend.daemon.driver.ProgressReporter$.withStatusCode(ProgressReporter.scala:364)
	at com.databricks.spark.util.SparkDatabricksProgressReporter$.withStatusCode(ProgressReporter.scala:34)
	at com.databricks.sql.acl.SecuredExternalCatalog.listTables(SecuredExternalCatalog.scala:85)
	at com.databricks.sql.acl.SecuredExternalCatalog.listTables$(SecuredExternalCatalog.scala:80)
	at com.databricks.sql.acl.SecuredHiveExternalCatalog.listTables(SecuredHiveExternalCatalog.scala:26)
	at org.apache.spark.sql.catalyst.catalog.ExternalCatalogWithListener.$anonfun$listTables$1(ExternalCatalogWithListener.scala:188)
	at org.apache.spark.sql.catalyst.MetricKeyUtils$.measure(MetricKey.scala:311)
	at org.apache.spark.sql.catalyst.catalog.ExternalCatalogWithListener.$anonfun$profile$1(ExternalCatalogWithListener.scala:54)
	at com.databricks.spark.util.FrameProfiler$.record(FrameProfiler.scala:94)
	at org.apache.spark.sql.catalyst.catalog.ExternalCatalogWithListener.profile(ExternalCatalogWithListener.scala:53)
	at org.apache.spark.sql.catalyst.catalog.ExternalCatalogWithListener.listTables(ExternalCatalogWithListener.scala:188)
	at $linedaa40f8f4b024c098bbf51b3db71961f25.$read$$iw$$iw$$iw$$iw$$iw$$iw.$anonfun$metadataForAllTables$1(command-3121298695456685:13)
	at scala.collection.parallel.mutable.ParArray$ParArrayIterator.flatmap2combiner(ParArray.scala:419)
	at scala.collection.parallel.ParIterableLike$FlatMap.leaf(ParIterableLike.scala:1082)
	at scala.collection.parallel.Task.$anonfun$tryLeaf$1(Tasks.scala:53)
	at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
	at scala.util.control.Breaks$$anon$1.catchBreak(Breaks.scala:67)
	at scala.collection.parallel.Task.tryLeaf(Tasks.scala:56)
	at scala.collection.parallel.Task.tryLeaf$(Tasks.scala:50)
	at scala.collection.parallel.ParIterableLike$FlatMap.tryLeaf(ParIterableLike.scala:1078)
	at scala.collection.parallel.AdaptiveWorkStealingTasks$WrappedTask.internal(Tasks.scala:170)
	at scala.collection.parallel.AdaptiveWorkStealingTasks$WrappedTask.internal$(Tasks.scala:157)
	at scala.collection.parallel.AdaptiveWorkStealingForkJoinTasks$WrappedTask.internal(Tasks.scala:440)
	at scala.collection.parallel.AdaptiveWorkStealingTasks$WrappedTask.compute(Tasks.scala:150)
	at scala.collection.parallel.AdaptiveWorkStealingTasks$WrappedTask.compute$(Tasks.scala:149)
	at scala.collection.parallel.AdaptiveWorkStealingForkJoinTasks$WrappedTask.compute(Tasks.scala:440)
	at java.util.concurrent.RecursiveAction.exec(RecursiveAction.java:189)
	at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
	at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
	at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
	at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175)
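
For reference, a minimal sketch of the kind of guard that could address this. `metadataForAllTables` and the parallel `listTables` loop come from the trace above, but the signature, logging, and exact handling are illustrative assumptions, not necessarily the fix that landed in #251:

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.catalyst.analysis.NoSuchDatabaseException

// Hypothetical sketch: enumerate tables per database in parallel, as the
// trace shows, but tolerate schemas that are dropped mid-scan instead of
// failing the whole job.
def metadataForAllTables(spark: SparkSession, databases: Seq[String]): Seq[(String, String)] = {
  val catalog = spark.sharedState.externalCatalog
  databases.par.flatMap { db =>
    try {
      // listTables calls requireDbExists internally and throws
      // NoSuchDatabaseException if the schema disappeared after enumeration.
      catalog.listTables(db).map(table => (db, table))
    } catch {
      case _: NoSuchDatabaseException =>
        println(s"Schema `$db` was dropped while scanning, skipping it")
        Seq.empty[(String, String)]
    }
  }.seq
}
```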
larsgeorge-db added the `bug` label on Sep 20, 2023
larsgeorge-db added this to the 1 week milestone on Sep 20, 2023
larsgeorge-db self-assigned this on Sep 20, 2023
nfx added the `enhancement` label and removed the `bug` label on Sep 20, 2023
nfx closed this as completed in #251 on Sep 21, 2023
nfx mentioned this issue on Sep 21, 2023
nfx added a commit that referenced this issue on Sep 21, 2023:
* Added batched iteration for `INSERT INTO` queries in
`StatementExecutionBackend` with default `max_records_per_batch=1000`
([#237](#237)).
* Added crawler for mount points
([#209](#209)).
* Added crawlers for compatibility of jobs and clusters, along with
basic recommendations for external locations
([#244](#244)).
* Added safe return on grants
([#246](#246)).
* Added ability to specify empty group filter in the installer script
([#216](#216))
([#217](#217)).
* Added ability to install application by multiple different users on
the same workspace ([#235](#235)).
* Added dashboard creation on installation and a requirement for
`warehouse_id` in config, so that the assessment dashboards are
refreshed automatically after job runs
([#214](#214)).
* Added reliance on rate limiting from Databricks SDK for listing
workspace ([#258](#258)).
* Fixed errors in corner cases where Azure Service Principal Credentials
were not available in Spark context
([#254](#254)).
* Fixed `DESCRIBE TABLE` throwing errors when listing Legacy Table ACLs
([#238](#238)).
* Fixed `file already exists` error in the installer script
([#219](#219))
([#222](#222)).
* Fixed `guess_external_locations` failure with `AttributeError:
as_dict` and added an integration test
([#259](#259)).
* Fixed error handling edge cases in `crawl_tables` task
([#243](#243))
([#251](#251)).
* Fixed `crawl_permissions` task failure on folder names containing a
forward slash ([#234](#234)).
* Improved `README` notebook documentation
([#260](#260),
[#228](#228),
[#252](#252),
[#223](#223),
[#225](#225)).
* Removed redundant `.python-version` file
([#221](#221)).
* Removed discovery of account groups from `crawl_permissions` task
([#240](#240)).
* Updated databricks-sdk requirement from ~=0.8.0 to ~=0.9.0
([#245](#245)).
larsgeorge-db pushed a commit that referenced this issue on Sep 23, 2023 (same release notes as the commit above).
nfx removed this from the 1 week milestone on Sep 25, 2023