SRL Annotator #473

bhargav · 2017-04-14T06:11:13Z

Annotator for the Semantic Role Labeler

~~Change the DataModel to be an object~~
Add Annotator class
Remove redundant SRLConfigurator.java
~~Train models for Predicate Sense and populate Sense information in annotator~~
Unit Test for Annotator
~~Move constants used in Annotator to someplace better~~ (Dropping this)
Retrain and deploy all the models.

I'm dropping the requirement for training Predicate Sense for this PR. I can see if the illinois-verbsense package can be integrated as a separate PR.

kordjamshidi · 2017-04-16T23:13:30Z

@bhargav this sounds great. However, do you think it is possible to populate all the data in a single datamodel object efficiently? the reason that I defined a class for SRL datamodel was to be able to to have small graphs and then integrate them.

bhargav · 2017-04-17T16:51:38Z

@kordjamshidi Population works fine for small datasets (examples the test set has ~2000 sentences) but it takes much longer to populate large number of sentences.

I did some profiling and most of the time during population of large graphs is spent in the textAnnotationToRelationMatch matching function. Trying to debug the issue with this behavior.

kordjamshidi · 2017-04-17T16:53:45Z

Yes, right. I am sure the issue was related to establishing matching edges. I hope you can see what is the actual issue.

bhargav · 2017-05-02T06:07:15Z

...a/edu/illinois/cs/cogcomp/saulexamples/nlp/SemanticRoleLabeling/SRLMultiGraphDataModel.scala

-  sentencesToRelations.addSensor(textAnnotationToRelationMatch _)
+
+  // TODO - Verify if we really need this matching sensor. This is very inefficient.
+  //  sentencesToRelations.addSensor(textAnnotationToRelationMatch _)


@kordjamshidi Commenting out this matching sensor mitigates the data model population inefficiency. I have tried training models after commenting this out and training/testing works fine. Let me know if you have any concerns about this or any use-case that I didn't verify.

The root problem is still not fixed. The matching logic iterates over all possible instantiations of the edges while adding a single instance. Source

@bhargav we populate the relations two times, one with generating sensor that populates relations in the very beginning when we populate the sentences. Later we add candidate relations from XuePalmer candidate generator and we added matching sensor to make sure the generated Xue palmer candidates are also connected to the right sentences. I am not sure why it should work correctly without the matching sensor, especially in the case that we use constraints.

Okay. Let me have a look again. I ran aTr, bTr, cTr and dTr and the results were similar to the ones in the readme. Can check again.

Will investigate the matching sensor issue.

bhargav · 2017-05-02T06:08:03Z

I have trained and deployed models trained with PARSE_STANFORD.

kordjamshidi · 2017-05-02T11:35:11Z

@bhargav thanks, I will review this, but I hoped we can find a better solution for the population with the matching sensor. Maybe in another PR then.

bhargav · 2017-05-02T21:42:56Z

Semaphore failed with the following exception related to MapDB.

[error] Uncaught exception when running edu.illinois.cs.cogcomp.saulexamples.nlp.SemanticRoleLabeling.ModelsTest: org.mapdb.DBException$DataCorruption: Header checksum broken. Store was not closed correctly, or is corrupted
sbt.ForkMain$ForkError: Header checksum broken. Store was not closed correctly, or is corrupted

danyaljj · 2017-05-03T05:56:45Z

...st/scala/edu/illinois/cs/cogcomp/saulexamples/nlp/SemanticRoleLabeling/ConstraintsTest.scala

 import edu.illinois.cs.cogcomp.saul.classifier.{ ConstrainedClassifier, Learnable }
 import edu.illinois.cs.cogcomp.saul.constraint.ConstraintTypeConversion._
 import edu.illinois.cs.cogcomp.saul.datamodel.DataModel
 import edu.illinois.cs.cogcomp.saulexamples.nlp.CommonSensors._
 import edu.illinois.cs.cogcomp.saulexamples.nlp.SemanticRoleLabeling.SRLClassifiers.argumentTypeLearner
 import edu.illinois.cs.cogcomp.saulexamples.nlp.SemanticRoleLabeling.SRLSensors._
 import org.scalatest.{ FlatSpec, Matchers }
+
+import scala.collection.JavaConverters._
 import scala.collection.JavaConversions._


Not very important for now: we should try NOT to use JavaConversions, and instead use JavaConverters

danyaljj · 2017-05-03T05:57:28Z

Thanks! Looks good to me!

Merge call with @kordjamshidi

kordjamshidi · 2017-05-03T11:51:13Z

Did any result change? @bhargav

bhargav · 2017-05-10T18:15:14Z

Reverting to the previous data models. Models trained with PARSE_STANFORD were couple of points lower than PARSE_GOLD on per-argument evaluation. But performance of the pipeline model using the PredicateArgumentEvaluator leads to similar results.

Model	Precision	Recall	F1
VerbSRL (PARSE_GOLD)	65.56	64.08	64.81
Verb SRL (PARSE_STANFORD)	65.52	63.48	64.48

Bhargav Mangipudi added 7 commits April 13, 2017 22:54

Add an SRL Annotator.

5c80a34

Fix prerequisites.

246fb33

SRL DataModel change to object instead of a class.

c429ea6

Formatting fixes.

02a0e96

Fix some unit tests.

b28e070

Add a unit test for the annotator.

78ca26c

Update dependencies.

0abd4d4

Bhargav Mangipudi added 4 commits April 17, 2017 19:15

Some fixes to SRLApps and Pipeline usage.

26e8907

Some fixes.

0a11468

Update SRL Model and minor changes to annotator.

a579ecb

Merge remote-tracking branch 'upstream/master' into srl_annotator

84ec2cb

bhargav changed the title ~~[WIP] SRL Annotator~~ SRL Annotator May 2, 2017

bhargav requested review from danyaljj and kordjamshidi May 2, 2017 06:02

bhargav commented May 2, 2017

View reviewed changes

Bhargav Mangipudi added 2 commits May 2, 2017 13:14

Replace Gurobi -> OJAlgo so that unit test passes.

130f0b7

Some fixes for unit tests.

7b038d4

Disable Pipeline caching for an SRL unit test.

ac3b3d6

danyaljj reviewed May 3, 2017

View reviewed changes

danyaljj approved these changes May 3, 2017

View reviewed changes

Add the matching sensor.

8523221

bhargav changed the title ~~SRL Annotator~~ [WIP] SRL Annotator May 3, 2017

Bhargav Mangipudi added 2 commits May 3, 2017 19:35

Make SRLMultiGraphDataModel a class again.

fda3710

Mark SRL Annotator unit test as HighMemoryTest

a6f92bf

bhargav changed the title ~~[WIP] SRL Annotator~~ SRL Annotator May 5, 2017

Bhargav Mangipudi added 12 commits June 27, 2017 16:45

Minor updates.

683e4f5

Fixes to population.

33b40f6

Some updates to the annotator.

25bfbc7

Add an SRL Evaluation using the SRL Annotator.

d805a9e

Add some comments to the SRL Evaluator.

cb3367f

Updates to annotator: Add configuration.

a148186

Fix some configuration: Remove PARSE_VIEW option.y

ac12007

Fix a SRL DataModel unit test.

1043287

Add a Greedy Decoder for SRL Arguments.

745dd60

Update SRLEvaluation to skip the "V" class.

4e12063

Add changes to support PARSE_CHARNIAK

dee019e

Fix incorrect count of data model.

b85a5b4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SRL Annotator #473

SRL Annotator #473

bhargav commented Apr 14, 2017 •

edited

Loading

kordjamshidi commented Apr 16, 2017

bhargav commented Apr 17, 2017

kordjamshidi commented Apr 17, 2017

bhargav May 2, 2017

kordjamshidi May 3, 2017

bhargav May 3, 2017

bhargav commented May 2, 2017

kordjamshidi commented May 2, 2017

bhargav commented May 2, 2017

danyaljj May 3, 2017

danyaljj commented May 3, 2017 •

edited

Loading

kordjamshidi commented May 3, 2017

bhargav commented May 10, 2017

SRL Annotator #473

Are you sure you want to change the base?

SRL Annotator #473

Conversation

bhargav commented Apr 14, 2017 • edited Loading

kordjamshidi commented Apr 16, 2017

bhargav commented Apr 17, 2017

kordjamshidi commented Apr 17, 2017

bhargav May 2, 2017

Choose a reason for hiding this comment

kordjamshidi May 3, 2017

Choose a reason for hiding this comment

bhargav May 3, 2017

Choose a reason for hiding this comment

bhargav commented May 2, 2017

kordjamshidi commented May 2, 2017

bhargav commented May 2, 2017

danyaljj May 3, 2017

Choose a reason for hiding this comment

danyaljj commented May 3, 2017 • edited Loading

kordjamshidi commented May 3, 2017

bhargav commented May 10, 2017

bhargav commented Apr 14, 2017 •

edited

Loading

danyaljj commented May 3, 2017 •

edited

Loading