Flesh out bigquery API #1045

tseaver · 2015-08-07T18:23:19Z

Add missing top-level convenience imports
Add missing Client.list_datasets and Dataset.list_tables methods.

gcloud/bigquery/dataset.py

+        :rtype: :class:`gcloud.bigquery.dataset.Dataset`
+        :returns: Dataset parsed from ``resource``.
+        """
+        name = resource['datasetReference']['datasetId']


dhermes · 2015-08-08T00:17:04Z

Really easy to review! Thanks.

Only a few hangups: pubsub in docstrings, KeyError questions in the factories, and use of ? in a docstring.

Addresses: https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571762 https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571777 https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571768 https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571771

Addresses: https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571626

Addresses: https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571658

tseaver · 2015-08-10T14:00:38Z

@dhermes I think everything is resolved, except the question about KeyErrors for missing datasetReference / tableReference entries: my take is that we should go forward as is:

We have no evidence that the back-end fails to set those values.
Without them, we have no way to construct a Dataset / Table which could be used to make API requests (without a name), which means the KeyError is the earliest possible detection for the useless resource.
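
To make the second point concrete, here is a minimal sketch of the unguarded lookup. The `Dataset` class is a hypothetical stand-in rather than the library's actual class, and `describe_failure` is an illustrative helper; only the lookup line itself comes from the diff.

```python
class Dataset(object):
    """Hypothetical stand-in for gcloud.bigquery.dataset.Dataset."""

    def __init__(self, name, client=None):
        self.name = name
        self._client = client

    @classmethod
    def from_api_repr(cls, resource, client=None):
        # Same lookup as in the diff: a missing key raises KeyError here,
        # before any Dataset object exists.
        name = resource['datasetReference']['datasetId']
        return cls(name, client=client)


def describe_failure(resource):
    """Return the text of the KeyError raised for ``resource``, if any."""
    try:
        Dataset.from_api_repr(resource)
    except KeyError as exc:
        return str(exc)
    return None
```

The error text is just the quoted name of the missing key, which is the behavior under discussion in the rest of the thread.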

gcloud/bigquery/client.py

+        """List datasets for the project associated with this client.
+
+        See:
+        https://cloud.google.com/bigquery/reference/rest/v1beta2/projects/datasets/list


dhermes · 2015-08-10T17:21:28Z

I agree that we can be mostly secure in the belief that the backend will send good data and that without those keys, we can't do anything anyhow.

I just wanted to discuss the possibility that we would provide a more specific error message than KeyError: 'datasetReference' or KeyError: 'datasetId'.

Addresses: #1045 (comment) #1045 (comment)

Addresses: #1045 (comment)

tseaver · 2015-08-10T17:36:16Z

FWIW, I'd generally prefer not to re-wrap exceptions (losing the original traceback really damages debuggability, for instance).

dhermes · 2015-08-10T17:37:44Z

If we raise one line after the KeyError would have occurred, how does that damage debuggability? If the method fails and the user sees an error in a method they've never heard of, isn't that also pretty low quality debuggability?

tseaver · 2015-08-10T17:58:18Z

Hiding the KeyError is the harm I'm talking about: the user won't be able to debug some other error any more easily than "that key isn't in the resource as expected".
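
For reference on the "hiding" concern, a sketch of one way to re-wrap without losing the original error, assuming Python 3 (Python 2 has no `raise ... from` syntax). This is an illustration only, not something the PR does; `parse_dataset_name` and `original_error` are hypothetical names.

```python
def parse_dataset_name(resource):
    """Hypothetical helper that re-wraps a KeyError with explicit chaining."""
    try:
        return resource['datasetReference']['datasetId']
    except KeyError as exc:
        # 'from exc' stores the original KeyError on __cause__, so the
        # printed traceback shows both exceptions.
        raise ValueError('resource is missing its dataset name') from exc


def original_error(resource):
    """Return the chained KeyError behind the wrapper, or None on success."""
    try:
        parse_dataset_name(resource)
    except ValueError as err:
        return err.__cause__
    return None
```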

dhermes · 2015-08-10T18:20:09Z

Maybe we are thinking of different things.

This is what I have in mind:

if ('datasetReference' not in resource or
        'datasetId' not in resource['datasetReference']):
    raise KeyError('The resource returned from the server did not contain '
                   'the necessary information to create a Dataset '
                   'object. The resource needs to contain a dictionary value '
                   'at the datasetReference key and within that dictionary '
                   'needs the datasetId.')
name = resource['datasetReference']['datasetId']

What harm does this cause? I'm just suggesting we provide more information than what a KeyError would on its own.
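
A self-contained version of that pre-check, as a sketch: the function name `dataset_name_from_resource` and the shortened message are assumptions for illustration, not the code that landed in the PR.

```python
def dataset_name_from_resource(resource):
    """Hypothetical helper: return the dataset name or fail descriptively."""
    if ('datasetReference' not in resource or
            'datasetId' not in resource['datasetReference']):
        # Still a KeyError, so existing 'except KeyError' callers keep working.
        raise KeyError('Resource lacks required identity information: '
                       '["datasetReference"]["datasetId"]')
    return resource['datasetReference']['datasetId']
```

One detail to keep in mind: a KeyError with a single argument displays the repr of that argument, so the message appears wrapped in quotes in the traceback.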

tseaver · 2015-08-10T18:40:44Z

When a Python programmer sees a traceback for a KeyError where the bottommost line is:

name = resource['datasetReference']['datasetId']

doesn't she already know the same information you typed into that waaay-long error message? If debugging it, she will still dump out the contents of resource and try to figure out why those keys are missing.

dhermes · 2015-08-10T19:43:22Z

Yes that's the root of my original question.

Do we want that Python programmer to just see that KeyError and try to figure out what resource was and where it came from, or do we want to give them more information that explains why the error occurred? I was leaning towards the latter, since it is not a method users would ever invoke directly, so their knowledge of its inputs and failure modes would be minimal.

Addresses: #1045 (comment)

tseaver · 2015-08-10T20:31:44Z

@dhermes 34a747c adds a check such as you suggested.

dhermes · 2015-08-10T21:31:08Z

LGTM

Flesh out bigquery API

tseaver added 6 commits August 7, 2015 11:52

Add public API entities from 'bigquery.table'.

58ae075

Add 'Dataset.from_api_repr' factory.

f46de66

Add 'bigquery.client.Client.list_datasets' API method.

e427864

Avoid all-uppercase for non-constant variable name.

f3089d5

Add 'Table.from_api_repr' factory.

853d1f9

Add 'Dataset.list_tables' API method.

e409492

tseaver added the api: bigquery label Aug 7, 2015

googlebot added the cla: yes label Aug 7, 2015

tseaver mentioned this pull request Aug 7, 2015

Add Dataset access support #1046

Merged

dhermes reviewed Aug 8, 2015

tseaver added 3 commits August 8, 2015 12:22

Explain boolean value w/o question.

5314507

Addresses: https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571626

Reword :returns: for clarity.

8b4d6f3

Addresses: https://github.com/GoogleCloudPlatform/gcloud-python/pull/1045/files#r36571658

dhermes reviewed Aug 10, 2015


tseaver added 2 commits August 10, 2015 13:30

Fix API docs URLs.

9a73871

Addresses: #1045 (comment) #1045 (comment)

Fix typo.

2cd2cd2

Addresses: #1045 (comment)

Pre-check for missing dataset/table name when parsing resource.

34a747c

Addresses: #1045 (comment)

tseaver added a commit that referenced this pull request Aug 10, 2015

Merge pull request #1045 from tseaver/bigquery-flesh_out_api

3357530

Flesh out bigquery API

tseaver merged commit 3357530 into googleapis:master Aug 10, 2015

tseaver deleted the bigquery-flesh_out_api branch August 10, 2015 21:33

dhermes mentioned this pull request Aug 12, 2015

Upgrading version to 0.7.1. #1057

Merged
