Enable in-memory feature caching for properties #472
base: master
Conversation
One big concern I have is the confusion it might create with …
```diff
- val a = new BooleanProperty[T](name, cachedF) with NodeProperty[T] { override def node: Node[T] = papply.node }
+ val a = new BooleanProperty[T](name, cachedF) with NodeProperty[T] {
+   override def node: Node[T] = papply.node
+   override val isCachingEnabled: Boolean = isStatic
+ }
```
Maybe change `isCachingEnabled` to something closer to "static"? `isStatic` (or `isStaticEnabled`) itself sounds clearer.
On the implementation, I'd suggest a different approach: we can reuse the existing caching. The only thing we have to do is make sure we don't clear it between training iterations; that is handled here by this function. So the only change needed is not calling the …
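The "reuse the existing caching, just skip the clear for static features" idea could look roughly like this. This is a minimal sketch of the concept only; `CachedProperty`, `endIteration`, and the constructor shape are illustrative names, not Saul's actual API:

```scala
import scala.collection.mutable

// Sketch (illustrative, not Saul's implementation): wrap a property's sensor
// `f` so its value is memoized per instance. For "static" features the cache
// survives across iterations; for the rest it is cleared at iteration end.
class CachedProperty[T, V](f: T => V, val isStatic: Boolean) {
  private val cache = mutable.Map.empty[T, V]

  // Return the cached value if present; otherwise compute and remember it.
  def apply(t: T): V = cache.getOrElseUpdate(t, f(t))

  // Hook called between training iterations: only non-static features
  // drop their cached values, so static ones are computed exactly once.
  def endIteration(): Unit = if (!isStatic) cache.clear()
}
```

With this split, the per-iteration clear that already exists in the training loop simply becomes a no-op for static features, which is the whole change being proposed.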
Branch updated: c17096e → 699f238; ce9450a → 08dec8b.
This is ready to be reviewed.
Just one renaming suggestion; otherwise this is good to merge from my side.
If you want to cache the value of a feature during a single iteration, use the `cache` parameter.

The `cache` parameter allows the value to be cached within a training/testing iteration. This is useful if you one of your features depends on evaluation of a Classifier on other instances as well. This recursive evaluation of the Classifier might be expensive and caching would speed-up performance. Look at a sample usage of this parameter in the [POSTagging Example](../../saul-examples/src/main/scala/edu/illinois/cs/cogcomp/saulexamples/nlp/POSTagger/POSDataModel.scala#L66).
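To see why caching a recursively evaluated feature pays off, here is a toy sketch. It is not Saul code: `RecursiveFeature` and the Fibonacci-style dependency are invented purely to show that, with a memo cache, each instance is evaluated once instead of exponentially many times:

```scala
import scala.collection.mutable

// Toy sketch (illustrative, not Saul): a "feature" whose value for instance n
// depends on the feature of instances n-1 and n-2, mimicking a property that
// recursively evaluates a classifier on neighboring instances.
object RecursiveFeature {
  private val cache = mutable.Map.empty[Int, Int]
  var evaluations = 0 // counts how many times the expensive body actually runs

  def feature(n: Int): Int = cache.get(n) match {
    case Some(v) => v // cache hit: no recursive re-evaluation
    case None =>
      evaluations += 1
      val v = if (n <= 1) 1 else feature(n - 1) + feature(n - 2)
      cache.put(n, v)
      v
  }
}
```

Without the cache, `feature(n)` would run its body a number of times exponential in `n`; with it, the body runs once per distinct instance, which is the speed-up the `cache` parameter is after.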
if you one of your => if one of your
```scala
val posWindow = property(token, cache = true) {
  (t: ConllRawToken) => t.getNeighbors.map(n => posWindow(n))
}
```
Maybe `iterationCache` or `perIterationCache`? (instead of `cache`)
Work Items
- Verify/reason whether we need a function to explicitly clear the cache.

Idea: Cache static features that do not change across learning iterations. This improves training speed at the cost of using more memory to cache the features. Tested this on the Chunker app and there is a significant improvement to training time.