Config file algorithm specification #29

ceholden · 2015-08-09T02:58:25Z

Make way for more timeseries algorithms within module by changing configuration file to be able to point to many different algorithms:

Add new submodule, algorithms
Rename yatsm to ccdc and place into algorithms submodule. YATSM class to CCDCesque
Change configuration file by adding algorithm key under YATSM section. The algorithm specified by algorithm key will be searched for as the section title from which to extract algorithm parameterization information.
Add new YATSM section for options generic to all timeseries algorithms, like reverse or robust.
Remove robust results and omission and commission tests from current YATSM (future, CCDCesque) and place into yatsm.algorithms.yatsm. These will be parameterized in YATSM metadata section.
Add section for regression/predictive model method configuration (see Model prediction / regression selection #26)

Propose change example:

[metadata]
version = 0.5

[YATSM]
algorithm = CCDCesque
regression = LassoCV
design_matrix = 1 + x + harm(x, 1)
reverse = False
robust = False
commission_alpha = 
...

[CCDCesque]
consecutive = 5

[LassoCV]
pickle = somefile.pkl
...

It is very difficult to imagine specifying all arguments to a sklearn classifier or regression estimator via a config file. Things like n_alpha could play well, but how would we specify alphas = np.logspace(0.001, 30, 50)? This proposed format sidesteps these concerns by requiring that regression options provide a pickled file from sklearn.external.joblib that already contains the parameterization desired. If the pickle item is not provided, but the section is labeled, default to a pickle of an existing regression object packaged with yatsm.

Target v0.5.0 as milestone to coincide with another major rehaul (#28).

The text was updated successfully, but these errors were encountered:

- YATSM is now a baseclass for code reuse (plotting, predictions, etc) - YATSM also defines timeseries model interface resembling sklearn - `__init__` contains 'hyperparameters' - `fit` runs model; predict/plot/score methods for results - `__iter__` yields segment records over all segments - `__len__` defines how many segments in model - Move comission/omission/robust re-fits to postprocess.py - Temporarily breaks postprocessing in line/pixel CLI

ceholden · 2015-08-21T19:37:07Z

Also now using YAML! See #30

See issues #26, #29, #30

- Add min_values/max_values in place of valid_range - CSV file has header - Use Pandas to parse CSV (so add as requirement) - Update examples - Bump version - Check for implementation of YATSM algorithm - Put YATSM algo class in config

See issues #26, #29, #30

- fit_indices were never used; fit all of Y as it is passed for a reason - pass `dates` to fit() rather than relying on ordinal dates in X - should be faster and less confusing - design_info isn't needed anymore; remove tie to X - test_indices lingers as not so hyper hyperparameter

ceholden added enhancement inprogress labels Aug 9, 2015

ceholden added this to the v0.5.0 milestone Aug 9, 2015

ceholden added a commit that referenced this issue Aug 21, 2015

Fix line/pixel runners; fix postprocessors #29

2660f47

ceholden added the backward_incompatible label Aug 21, 2015

ceholden added a commit that referenced this issue Aug 21, 2015

Switch to YAML config files; add model pickles

0666937

See issues #26, #29, #30

ceholden added a commit that referenced this issue Aug 25, 2015

Refactor line CLI; updates per #29, #30

7e4da0c

ceholden added a commit that referenced this issue Aug 25, 2015

Implement sklearn pickles for YATSM algos & CLI

789e3e9

See issues #26, #29, #30

ceholden mentioned this issue Sep 4, 2015

Better requirements specification #32

Closed

ceholden closed this as completed Sep 9, 2015

ceholden removed the inprogress label Sep 9, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Config file algorithm specification #29

Config file algorithm specification #29

ceholden commented Aug 9, 2015

ceholden commented Aug 21, 2015

Config file algorithm specification #29

Config file algorithm specification #29

Comments

ceholden commented Aug 9, 2015

ceholden commented Aug 21, 2015