Theme 2 Scenario 2A, Update usgs notebook #177
Conversation
#file_name_list.append(processTextFile(url,html_link,'wl'))
wl_count+=1
print "num water level:",wl_count
if (os.path.isdir("./data_files/")):
@birdage This will break on windows machines. Use os.path.join
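For illustration, a minimal sketch of the portable version (the `data_files` directory name is taken from the diff above):

```python
import os

# Hard-coded "./data_files/" assumes POSIX separators; os.path.join
# builds the path with the separator of the host OS instead.
data_dir = os.path.join('.', 'data_files')

if os.path.isdir(data_dir):
    print('data directory found:', data_dir)
```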
@ocefpaf want me to correct it before merge?
IMHO it is OK for the merge @birdage, but I do not use windows 😁 Maybe we can open an issue after the merge as a reminder that the paths need to be fixed.
@ocefpaf sounds like a good idea, thanks!
Almost done reviewing. I will merge as soon as I finish running it here.
for file_name in files:
print count,file_name
display(pb)
for fi,file_name in enumerate(files):
This loop is too complex and easily breakable (in fact I am having trouble running it right now). Maybe an abstraction to read the metadata is in order.
Also, the actual data and dates can easily be read with pandas and stored in the dictionary, instead of the numpy array in the next cell.
import os
import re

from pandas import read_csv

# Assumption: `non_decimal` strips everything but digits and decimal
# points before the float conversion; defined here for completeness.
non_decimal = re.compile(r'[^\d.]+')

full_data = {}

def parse_metadata(fname):
    meta_data = {}
    fields = {'Sensor location latitude': 'lat',
              'Sensor location longitude': 'lon',
              'Site id =': 'name',
              'Sensor elevation above NAVD 88 =': 'elevation',
              'Barometric sensor site (source of bp) =': 'bp_source',
              'Lowest recordable water elevation is': 'lowest_wl'}
    with open(os.path.join('data_files', fname)) as f:
        content = f.readlines()
    for k, ln in enumerate(content):
        content[k] = ln.strip()
        if content[k].startswith('#'):
            for field in fields:
                if field in content[k]:
                    if fields[field] == 'name':
                        meta_data[fields[field]] = content[k].split(field)[-1]
                    else:
                        val = content[k].split(field)[-1]
                        meta_data[fields[field]] = float(non_decimal.sub('', val))
                        if fields[field] == 'lon':
                            meta_data[fields[field]] = -meta_data[fields[field]]
    return meta_data

for count, fname in enumerate(files):
    print('{}: {} of {} '.format(fname, count+1, len(files)))
    meta_data = parse_metadata(fname)
    kw = dict(parse_dates=True, sep='\t', skiprows=29, index_col=0)
    actual_data = read_csv(os.path.join('data_files', fname), **kw)
    full_data[fname] = {'meta': meta_data,
                        'data': actual_data}

Done reviewing. Over to you @birdage.
@ocefpaf regarding the comments:
I would move away from %pylab, but that is just me...
OK. I missed the header info. That is what I would like to see, a link to the original data. Thanks!
I will prepare a PR to your branch tomorrow and let's take a look at the results then.
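As a sketch of the alternative to `%pylab`: explicit imports keep the notebook namespace clean (in a notebook, `%matplotlib inline` would still be used for inline figures):

```python
# Instead of `%pylab inline`, which floods the namespace with numpy and
# pylab names, import the libraries explicitly so it is clear where
# every name comes from:
import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 2 * np.pi, 100)
fig, ax = plt.subplots()
ax.plot(x, np.sin(x))
ax.set_xlabel('x')
```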
@ocefpaf Thanks for the comments, we'll sort it tomorrow.
Hi John,
@ocefpaf thanks for the updates; the processing code runs a bit slower, but I think the refactor makes it easier to read. I've made some simple changes and will rebase and commit.
@ocefpaf looks like we're ready to merge; could you do it if possible, as I don't want to self-merge.
May I ask one last thing? Can you restart the kernel and re-run the notebook so we have a meaningful view at nbviewer:
@ocefpaf once Travis has run, I think we're good.
Nice!
@Bobfrat let's keep this open, and I will be pushing changes to it. Will note when complete.