Tolerance expressed in percentage now computes correctly. [BLD-522] #3489
Conversation
Adding tests is always good!
@auraz @valera-rozuvan @singingwolfboy This is an example of a test that did not pick up an error in the source code. Here are the details: the erroneous code in src was (with complex1: student result; complex2: instructor result):
In the original tests, the answer was 4.0 and the tolerance 10%. The range of correct answers is then [3.6, 4.4]. But 4.44 would be marked as a correct answer, since tolerance = 0.1 * max(4, 4.44) = 0.444, so the erroneous range of correct answers was set to [3.556, 4.444]. The tests only checked 4.5, not a value closer to the bounds of correctness. I modified this, but the incorrectness test arbitrarily uses 4.4000001. Should something even tighter be put in place? If so, what do you think it should be?
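The behavior described above can be sketched as follows. This is an illustrative reconstruction, not the actual capa code: the function names and exact formulas are hypothetical, chosen to match the ranges quoted in the discussion ([3.556, 4.444] for the bug, [3.6, 4.4] for the intended behavior).

```python
# Hypothetical sketch of the tolerance bug discussed above.

def compare_buggy(student, instructor, percent):
    # Buggy: the bound scales with max(|student|, |instructor|), so a
    # student answer just outside the intended range widens its own bound.
    # For 4 +- 10%, a student answer of 4.44 gives tol = 0.444.
    tol = percent / 100.0 * max(abs(student), abs(instructor))
    return abs(student - instructor) <= tol


def compare_fixed(student, instructor, percent):
    # Intended: the bound is relative to the instructor's answer only,
    # giving the range [3.6, 4.4] for 4 +- 10%.
    tol = percent / 100.0 * abs(instructor)
    return abs(student - instructor) <= tol
```

With these definitions, 4.44 is wrongly accepted by the buggy version but rejected by the fixed one.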
@jmclaus this is not a copy-paste error. This code was originally there. Please also fix https://github.com/edx/edx-platform/blob/master/common/lib/capa/capa/util.py#L33, as it is a copy-paste of https://github.com/edx/edx-platform/blob/master/common/lib/capa/capa/util.py#L36
Also, I suggest renaming complex1 and complex2 to something like student_complex and teacher_complex.
Regarding tighter tests: I can't think of anything that needs tighter testing right now.
# Mixed negative/positive range
problem = self.build_problem(answer=0, tolerance="10%")
correct_responses = ["0", "0.1", "-0.1", "0.10", "-0.10"]
incorrect_responses = ["", "-0.1000001", "0.1000001", "0"]
Why is zero in the incorrect responses?
@auraz No reason, removed.
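The answer=0 case in the snippet above is interesting: a purely relative bound would be 10% of 0, i.e. 0, and would reject everything except 0 itself, yet 0.1 is listed as correct and 0.1000001 as incorrect. That suggests the percentage fraction (0.1 for "10%") acts as an absolute bound when the answer is zero. A hedged sketch of that reading, not the real capa implementation:

```python
# Hypothetical sketch of the zero-answer behavior implied by the test
# above; inferred from the correct/incorrect response lists, not taken
# from the actual capa code.

def within_percent_tolerance(student, answer, percent):
    frac = percent / 100.0
    # 10% of 0 is 0, so fall back to the raw fraction as an absolute
    # bound when the answer is zero (assumption inferred from the test).
    tol = frac * abs(answer) if answer != 0 else frac
    return abs(student - answer) <= tol
```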
@jmclaus for some reason the Jenkins tests are not running for this PR.
@auraz
@auraz
@auraz Fixed.
@auraz It seems like a lot of current PRs do not have Jenkins tests running.
@jmclaus do you know how to run a manual build?
@olmar (since @alex is out for the week), @valera-rozuvan @polesye The tests now all pass locally; please review. The initial fix I did solved BLD-522 but was breaking other things. See http://randomascii.wordpress.com/2012/02/25/comparing-floating-point-numbers-2012-edition/
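The floating-point pitfalls behind the linked article are easy to demonstrate in Python; this is why tolerant comparison (and careful test bounds like 4.4000001) matters here. math.isclose is the Python 3.5+ stdlib helper for combined relative/absolute comparison, shown only as a point of reference, not as what this PR uses:

```python
import math

# Naive equality on floats is fragile: 0.1 + 0.2 is not exactly 0.3.
print(0.1 + 0.2 == 0.3)                             # False
print(math.isclose(0.1 + 0.2, 0.3, rel_tol=1e-9))   # True

# At large magnitudes, nearby representable doubles are far apart,
# so two ways of writing "the same" number can differ by a lot.
print(1.9e24 - 1.9 * 10**24)                        # nonzero
```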
I am not the @alex you are looking for :-) |
@jmclaus I reviewed and also tested your PR manually. It is OK for the issue described in the ticket,
but I am not sure about your concern regarding what the instructor expects in Studio. @valera-rozuvan please review also.
@olmar Thanks for the review. Sorry, I wasn't clear enough. This fix actually gives the instructor what they expect (so no concern at all): 4 +- 10% will now give the following range of correct answers: [3.6, 4.4].
👍 |
@auraz will finish code review on Monday. |
@@ -29,23 +29,28 @@ def compare_with_tolerance(complex1, complex2, tolerance=default_tolerance, rela
In [212]: 1.9e24 - 1.9*10**24
Out[212]: 268435456.0
"""
if isinstance(tolerance, str): |
Do we have a non-string tolerance anywhere?
Oh, I've found one place.
In the previous code, when tolerance did not end with '%', it went through 'evaluator(dict(), dict(), tolerance)'. Now it goes through only if it is a string. Why is that?
@auraz Is there any reason to evaluate anything other than a string?
I do not know; it should be investigated.
@auraz compare_with_tolerance is used in only one file, responsetypes.py, 5 times (if we don't count tests). Its value is either float_info.epsilon, the default value (0.001%), or whatever is specified in the XML. Do you think this might need evaluation? I guess to be on the safe side, we should let it go through evaluator(dict(), dict(), tolerance) every time it's not a percentage. Thanks. Will do.
@auraz I take this back: if you pass a float through the evaluator, the following error is raised:
Traceback (most recent call last):
common/lib/capa/capa/tests/test_util.py line 44 in test_compare_with_tolerance
result = compare_with_tolerance(109.9, 100.0, 10.0, False)
common/lib/capa/capa/util.py line 37 in compare_with_tolerance
tolerance = evaluator(dict(), dict(), tolerance)
common/lib/calc/calc/calc.py line 228 in evaluator
if math_expr.strip() == "":
AttributeError: 'float' object has no attribute 'strip'
I propose we leave things as is. I tested the code manually and all tests pass. What do you think?
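The guard under discussion can be sketched as follows. parse_tolerance and its return shape are hypothetical stand-ins, with float() standing in for evaluator(dict(), dict(), tolerance); the point is only that evaluation must be restricted to strings, since a numeric tolerance like float_info.epsilon would hit the AttributeError shown in the traceback above.

```python
# Hypothetical sketch of the string-only evaluation guard; not the
# actual capa util.py code.

def parse_tolerance(tolerance):
    """Return (value, is_relative) for a tolerance that may be a number,
    a numeric string, or a percentage string like '10%'."""
    if isinstance(tolerance, str):
        if tolerance.endswith('%'):
            # Percentage: convert to a fraction, to be applied
            # relative to the instructor's answer.
            return float(tolerance[:-1]) / 100.0, True
        # Plain string expression: evaluate it (float() stands in
        # for the real evaluator here).
        return float(tolerance), False
    # Already numeric (e.g. sys.float_info.epsilon): use as-is.
    # Passing it to a string evaluator would raise AttributeError.
    return float(tolerance), False
```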
Thank you for looking into it. I agree with you.
I also suggest adding a small test specifically for the compare_with_tolerance function, because we have no tests for float tolerances.
@auraz I added unit tests for
@jmclaus a new file is the better decision.
@auraz I put the tests in a new file called
@auraz I have addressed all your comments, then. Please finish the review. Thanks.
@auraz Good to merge, then? All tests are back to green.
👍 |
…olerance Tolerance expressed in percentage now computes correctly. [BLD-522]
@auraz @valera-rozuvan It fixes the issue; I tested it with -100 +- 10%, 100 +- 10%, 0 +- 10%, 10 +- 100%, and -10 +- 100%. I guess we should add tests.