Need acceptance test suite #61

AlexanderYastrebov · 2024-01-03T21:30:06Z

E.g. current "winner" produces garbage output on trivial input:

$ ./create_measurements.sh 2
Created file with 2 measurements in 18 ms
$ cat measurements.txt 
Calgary;1.6
Tamanrasset;13.7
$ ./calculate_average_ebarlas.sh 
{=2.147483647E8/0.0/-2.147483648E8, manrasset=13.7/13.7/13.7, y=2.147483647E8/0.0/-2.147483648E8}

real    0m0,206s
user    0m0,415s
sys     0m0,048s

The text was updated successfully, but these errors were encountered:

AlexanderYastrebov · 2024-01-03T21:30:37Z

@ebarlas FYI

AlexanderYastrebov · 2024-01-03T21:42:56Z

Something like

$ for n in 1 2 5 10 100 1000 10000; do ./create_measurements.sh $n && diff <(./calculate_average_AlexanderYastrebov.sh 2>/dev/null | ./tocsv.sh) <(./calculate_average.sh 2>/dev/null | ./tocsv.sh) && echo "OK $n" || echo "FAIL $n"; done

Created file with 1 measurements in 19 ms
OK 1
Created file with 2 measurements in 17 ms
OK 2
Created file with 5 measurements in 21 ms
OK 5
Created file with 10 measurements in 18 ms
OK 10
Created file with 100 measurements in 22 ms
OK 100
Created file with 1,000 measurements in 29 ms
107c107
< Dolisie;5.1;22.5;39.5
---
> Dolisie;5.1;22.6;39.5
124c124
< Gagnoa;17.7;22.4;25.4
---
> Gagnoa;17.7;22.3;25.4
126c126
< Garissa;21.6;41.5;72.0
---
> Garissa;21.6;41.4;72.0
135c135
< Hamburg;-0.9;9.6;17.9
---
> Hamburg;-0.9;9.5;17.9
FAIL 1000
Created file with 10,000 measurements in 62 ms
369c369
< Timbuktu;10.6;29.9;53.7
---
> Timbuktu;10.6;30.0;53.7
FAIL 10000

could be used to verify a given implementation.

ddimtirov · 2024-01-04T10:35:07Z

I manually run the reference impl on my generated file and pipe it to a file (name needs to start with measurements* or the license checker would complain).

Then in my code I have:

assert Files.readAllLines(Path.of("measurements_result.txt")).getFirst().equals(result);

It is crude, but it gets the job done... also I find assertions useful alternative to defensive coding when you cannot aford the extra checks on a normal run.

gunnarmorling · 2024-01-04T11:32:08Z

Yeah, if someone wanted to create this kind of test (bonus points for automatically running it against all provided entries), that would be awesome and highly welcome. A test runner which sets System.out to a stream which allows to capture all the output written by a submission and assert that one should do the trick.

gunnarmorling · 2024-01-04T11:35:55Z

@AlexanderYastrebov, I feel a bit dense, how do you define the expected values in your proposal? Where is tocsv.sh coming from?

Adds test samples that can be used for unit tests or to verify implementations via: ```sh for sample in $(ls src/test/resources/samples/*.txt) do echo "Validating $sample" rm -f measurements.txt ln -s $sample measurements.txt diff <(./calculate_average.sh) ${sample%.txt}.out done rm measurements.txt ``` For gunnarmorling#61

AlexanderYastrebov · 2024-01-04T12:12:53Z

@gunnarmorling

We can create a set of test measurement and expected output files (aka golden files) and check each implementation against it.
This set can contain edge cases like 1, 2, 10, 100 records.
It can also contain files created manually e.g. using station ids not used by the baseline impl.
Users can write unit-test to check their implementation.

I've created #82 for that.

Additionally we can have a test like proposed in #61 (comment) that verifies implementation against baseline implementation.

Where is tocsv.sh coming from?

Was added by #36 but its just to simplify debugging of failures, the validator may simply compare impl output to the expected output.

Adds test samples that can be used for unit tests or to verify implementations via: ```bash for sample in $(ls src/test/resources/samples/*.txt) do echo "Validating $sample" rm -f measurements.txt ln -s $sample measurements.txt diff <(./calculate_average.sh) ${sample%.txt}.out done rm measurements.txt ``` For gunnarmorling#61

This is just a fun excersise, the real validator should use shell script like in gunnarmorling#82 since many implementations depend on different jdks and command line arguments. For gunnarmorling#61

radughiorma · 2024-01-04T17:22:54Z

I added a Unit test in this PR #89
What made it difficult is that the FILE is not passed as an argument to the main method.
And also finding the classes to test.

Here is a list with the submissions that I also found to be failing the rules.

AlexanderYastrebov · 2024-01-04T17:35:54Z

I added a Unit test in this PR #89

I've also created #84 earlier but since implementations rely on various jdks, modules and parameters the validation should be performed by running calculate_average_*.sh scripts.

Adds test samples that can be used for unit tests or to verify implementations via: ```bash for sample in $(ls src/test/resources/samples/*.txt) do echo "Validating $sample" rm -f measurements.txt ln -s $sample measurements.txt diff <(./calculate_average.sh) ${sample%.txt}.out done rm measurements.txt ``` For gunnarmorling#61

The script test all implementations and prints PASS or FAIL. In case if failure it also prints implementation output to stderr. This will be handy for adding new test samples. Example usages: ```sh $ ./test_all.sh 2>/dev/null PASS artsiomkorzun PASS baseline PASS bjhara PASS criccomini FAIL ddimtirov FAIL ebarlas PASS filiphr FAIL itaske PASS khmarbaise FAIL kuduwa-keshavram FAIL lawrey PASS padreati FAIL palmr PASS richardstartin FAIL royvanrijn FAIL seijikun PASS spullara PASS truelive $ ./test_all.sh 2>/dev/null | grep PASS | cut -d' ' -f2 artsiomkorzun baseline bjhara criccomini filiphr khmarbaise padreati richardstartin spullara truelive ``` For gunnarmorling#61

The script tests all implementations and prints PASS or FAIL status. In case of failure it also prints implementation output to stderr. This will be handy for adding new test samples. Example usages: ```sh $ ./test_all.sh 2>/dev/null PASS artsiomkorzun PASS baseline PASS bjhara PASS criccomini FAIL ddimtirov FAIL ebarlas PASS filiphr FAIL itaske PASS khmarbaise FAIL kuduwa-keshavram FAIL lawrey PASS padreati FAIL palmr PASS richardstartin FAIL royvanrijn FAIL seijikun PASS spullara PASS truelive $ ./test_all.sh 2>/dev/null | grep PASS | cut -d' ' -f2 artsiomkorzun baseline bjhara criccomini filiphr khmarbaise padreati richardstartin spullara truelive ``` For gunnarmorling#61

The script tests all implementations and prints PASS or FAIL status. In case of failure it also prints implementation output to stderr. This will be handy for adding new test samples. Show test statuses and omit failing output: ```sh $ ./test_all.sh 2>/dev/null PASS artsiomkorzun PASS baseline PASS bjhara PASS criccomini FAIL ddimtirov FAIL ebarlas PASS filiphr FAIL itaske PASS khmarbaise FAIL kuduwa-keshavram FAIL lawrey PASS padreati FAIL palmr PASS richardstartin FAIL royvanrijn FAIL seijikun PASS spullara PASS truelive ``` Show only passing implementations: ``` $ ./test_all.sh 2>/dev/null | grep PASS | cut -d' ' -f2 artsiomkorzun baseline bjhara criccomini filiphr khmarbaise padreati richardstartin spullara truelive ``` For gunnarmorling#61

The script tests all implementations and prints PASS or FAIL status. In case of failure it also prints implementation output to stderr. This will be handy for adding new test samples. Show test statuses and omit failing output: ```sh $ ./test_all.sh 2>/dev/null PASS artsiomkorzun PASS baseline PASS bjhara PASS criccomini FAIL ddimtirov FAIL ebarlas PASS filiphr FAIL itaske PASS khmarbaise FAIL kuduwa-keshavram FAIL lawrey PASS padreati FAIL palmr PASS richardstartin FAIL royvanrijn FAIL seijikun PASS spullara PASS truelive ``` Show only passing implementations: ``` $ ./test_all.sh 2>/dev/null | grep PASS | cut -d' ' -f2 artsiomkorzun baseline bjhara criccomini filiphr khmarbaise padreati richardstartin spullara truelive ``` For #61

AlexanderYastrebov · 2024-01-07T22:10:27Z

@gunnarmorling I guess we can close this and work on additional tests case by case.

gunnarmorling · 2024-01-13T22:24:19Z

Makes sense. Thx for improving the test suite that much, @AlexanderYastrebov!

The script tests all implementations and prints PASS or FAIL status. In case of failure it also prints implementation output to stderr. This will be handy for adding new test samples. Show test statuses and omit failing output: ```sh $ ./test_all.sh 2>/dev/null PASS artsiomkorzun PASS baseline PASS bjhara PASS criccomini FAIL ddimtirov FAIL ebarlas PASS filiphr FAIL itaske PASS khmarbaise FAIL kuduwa-keshavram FAIL lawrey PASS padreati FAIL palmr PASS richardstartin FAIL royvanrijn FAIL seijikun PASS spullara PASS truelive ``` Show only passing implementations: ``` $ ./test_all.sh 2>/dev/null | grep PASS | cut -d' ' -f2 artsiomkorzun baseline bjhara criccomini filiphr khmarbaise padreati richardstartin spullara truelive ``` For gunnarmorling#61

AlexanderYastrebov mentioned this issue Jan 4, 2024

test: add test samples #82

Merged

AlexanderYastrebov mentioned this issue Jan 4, 2024

test: Add samples unit test #84

Closed

AlexanderYastrebov mentioned this issue Jan 4, 2024

memory mapped files, branchless parsing, bitwiddle magic #5

Merged

AlexanderYastrebov mentioned this issue Jan 4, 2024

test: add script to test all implementations #96

Merged

gunnarmorling closed this as completed Jan 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need acceptance test suite #61

Need acceptance test suite #61

AlexanderYastrebov commented Jan 3, 2024

AlexanderYastrebov commented Jan 3, 2024

AlexanderYastrebov commented Jan 3, 2024

ddimtirov commented Jan 4, 2024 •

edited

Loading

gunnarmorling commented Jan 4, 2024

gunnarmorling commented Jan 4, 2024

AlexanderYastrebov commented Jan 4, 2024 •

edited

Loading

radughiorma commented Jan 4, 2024

AlexanderYastrebov commented Jan 4, 2024 •

edited

Loading

AlexanderYastrebov commented Jan 7, 2024 •

edited

Loading

gunnarmorling commented Jan 13, 2024

Need acceptance test suite #61

Need acceptance test suite #61

Comments

AlexanderYastrebov commented Jan 3, 2024

AlexanderYastrebov commented Jan 3, 2024

AlexanderYastrebov commented Jan 3, 2024

ddimtirov commented Jan 4, 2024 • edited Loading

gunnarmorling commented Jan 4, 2024

gunnarmorling commented Jan 4, 2024

AlexanderYastrebov commented Jan 4, 2024 • edited Loading

radughiorma commented Jan 4, 2024

AlexanderYastrebov commented Jan 4, 2024 • edited Loading

AlexanderYastrebov commented Jan 7, 2024 • edited Loading

gunnarmorling commented Jan 13, 2024

ddimtirov commented Jan 4, 2024 •

edited

Loading

AlexanderYastrebov commented Jan 4, 2024 •

edited

Loading

AlexanderYastrebov commented Jan 4, 2024 •

edited

Loading

AlexanderYastrebov commented Jan 7, 2024 •

edited

Loading