MYSTRAN 15.0 Update #9

Bruno02468 · 2023-12-19T07:56:06Z

This update contains a lot of commits, and I'll do my best to describe what has been done.

Bug fixes

Most of this update is bug fixes, and most of the bug fixes have been memory-related. As in, MYSTRAN was doing illegal memory operations that caused crashes and/or undefined behaviour. Thorough debugging with valgrind was key! Here's every relevant commit and the rationale behind the code changes:

ae8404e: ELMOFF was comparing QUAD4 areas to an uninitialized EPS1, causing it to think positive areas were negative. The fix: initialize EPS1 to be EPSIL(1) as everywhere else does.
de46a9d: GPWG had some debug prints that tried to write the value of ITABLE before it was set. The fix: move the initialization of ITABLE a couple lines up.
8930eb8: ANY_U_P_OUTPUT, a variable with all assignments and uses commented out, appeared on an IF statement, thus sometimes being read even though it was never set. The fix: comment it out of the IF statement as well.
854b5a1: there was a faulty conditional in the SuperLU version we were using, causing crashes when the global stiffness matrix was singular. The fix: so we changed to a more recent state of their repository since the fix was recent.
aa1ef5d: a small oversight in indexes used to print CSHEAR engineering forces in WRITE_ELEM_ENGR_FORCE caused a buffer to be read past the part that actually held values. The fix: make the indexes consistent with the initialized part.
6d7aaba: OP2 code in WRITE_ELEM_STRESSES forgot to set a value called STRESS_CODE, used to indicate what's about to be written. The fix: put in a placeholder value for now -- there's other placeholder values in that same subroutine anyway.
5ae0520: a missing ELSE was causing a fatal error to be issued for valid models containing CSHEAR cards with a nonzero OCID. The fix: added the ELSE.
e8f5ca0: the subroutine GET_KE_OFFSET was communicating the wrong shape of BUSH element parameters (KEO_BUSH) to a matrix operation (MATPUT). The fix: communicate the correct shape.
8b9b03e: a post-deallocation subroutine was being called for the matrix I2_GMN regardless of whether it had been just deallocated or not, triggering a bad read. The fix: move the call into the conditional for whether I2_GMN was just deallocated.
5777eb1: the SCNUM vector, containing subcase numbers, was being allocated with the wrong size, resulting in an overflow when it was written to. In MODES solutions, the "subcases" are the number of requested eigenvectors. The fix: set LSUB, which determines the maximum number of subcases in the solution, as soon was we see the EIGR bulk data card "chosen" by the METHOD case control card during LOADB. This way, all subsequent (i.e. after LINK0) allocations of SCNUM will be the correct size. There are no writes to SCNUM that could overflow before, since we set the size before the limit is even communicated to the rest of the program.
215f75c: same as above, but for EIGRL cards and the OGROUT vector. The fix: same as above, but setting LSUB as soon as we see the right EIGRL card during LOADB.
ebf53f1: a debug write was reading from an uninitialized variable depending on the requested data. The fix: initialize it just like in similar requests.
8b874e4: okay, so, when a BUSH element has negative OCID (or the continuation card is absent), that means we use the GA-GB line as the x-axis for computing offsets. However, when GA and GB coincide, that line can't be computed, but an attempt was being made to compute them anyway, resulting in some nasty memory bugs. The fix: detect zero-length BUSH elements with negative OCID and mark them as non-offset, thus avoiding the problematic computations.

Almost all of these were detected because they manifested as (usually) silent illegal memory operations. Fixing them helps ensure MYSTRAN behavior is deterministic and prevents crashes or garbled output.

New features

This release was focused on fixing bugs. I just added a NOCOUNTS parameter (968909c and 88c294a) to disable those "counter" progress-indicating writes to standard output. Why? Because not all terminals can handle it (see: VSCode's debugger terminal), and neither can files. It's disabled by default, so counters are still there if you don't set it. Counters also make some runs longer, since every operation is punctuated by a write syscall. Being able to disable them is good if you're debugging, writing standard output to a log file, or running a large model where 10% time savings mean hours.

If you see any counter getting past NOCOUNTS, sorry, they're really hard to find in the code! Tell us via Discord, a GitHub issue, a forum post, whatever you prefer.

Build and documentation changes

I updated the build instructions (20afd36 and c53e591) to make them more consistent with the way our build script handles libraries.

Also, we bumped our SuperLU version to commit xiaoyeli/superlu@76b2c9a in order to integrate a fix for an invalid read while trying to factor a singular matrix.

Oh, and I added the --fcheck=all flag to enable runtime checks. This way, memory bugs are less silent -- as opposed to lurking around until someone decides to run on valgrind. There's a small performance hit, comparable to enabling NOCOUNTS, but the benefits far outweigh the cost. Besides, there are more pressing bottlenecks, and one can always disable that flag by editing CMakeLists.txt if they're so inclined. But I do not recommend that. At all.

And the manual needs updating, of course. So do some other documents. That's in the works.

Other changes

BANDIT is now disabled by default (8e1050d), regardless of what solver you choose to use. Why? Because it's broken. It's written in Fortran 77 with tons of nonstandard stuff, and getting it to work would be moot: if you're running a model large enough for the banded solver to need BANDIT, you shouldn't be using the banded solver. Use SuperLU: PARAM,SOLLIB,SPARSE.

Extricating BANDIT from the code is low-priority due to its very high difficulty-impact ratio. That means you can still enable it, but it won't work. Don't do it.

Finally, this update is a bump to the 15.0 version, so it was also set on the code (d31a1a6).

A quick warning

The bump in the SuperLU version might require you do a clean build. If you get random linker errors, run a make clean, delete superlu/, Binaries/, CMakeCache.txt, and CMakeFiles/. Then, re-run cmake with the appropriate arguments. Sorry about that, it's got to do with how CMake handles Git submodules.

Results and final remarks

Phew, that was a lot of commits. Let me summarize what this update means.

All models in our current benchmark set now run without any illegal memory operations. So do other models that used to cause trouble, like cshear.bdf (part of the build verification suite) and large_shelled_beam.bdf (user-reported, I think).

That doesn't mean results are necessarily correct. Not all bugs are memory bugs! But this update means that many models that used to trigger nondeterministic behaviour and/or crashes now run to completion. This way, we can actually get the results to verify they're correct, and also work on new features unencumbered by crashes. Not bad for a month's work, eh?

Feedback is very much welcome, and there's more on the way!

Merging upstream updates

made compliant with the rest of the files as per Ceans observation

… when its name may be uninit'd

Bruno02468 and others added 30 commits July 26, 2023 21:12

Merge pull request #1 from MYSTRANsolver/main

65f1222

Merging upstream updates

changes to the .gitignore to make IDE work easier

cac0788

first attempt at adding K6ROT

72d6d4d

fix uninitialized EPS1 causing ELMOFF to think 1.0 is negative

ae8404e

comment unused K6ROT debug format

0caf80f

disable BANDIT by default

8e1050d

add warning for K6ROT=0

8169780

fixed failure to build when another build directory was specified

23d8fc7

fixed memory bug (printing ITABLE before initialising it)

de46a9d

fixed memory bug: testing an unused and unset variable

8930eb8

update superlu

854b5a1

Rename GET_INI_FILNAM.F90 to GET_INI_FILNAM.f90

e41d09e

made compliant with the rest of the files as per Ceans observation

fix cmake command syntax error

e3c7365

sorry, I broke the f2c download URL

64637a6

Merge branch 'main' of github.com:MYSTRANsolver/MYSTRAN

3c098c5

fixed memory bug: overrun when printing SHEAR engineering forces

aa1ef5d

set sensible compiler warnings

d35d991

fix memory bug: uninit'd stress_code. added a placeholder.

6d7aaba

explicit Wmaybe-uninitialized

ea6fc5d

fixed missing ELSE breaking CBUSHes with nonzero OCID

5ae0520

added PARAM,NOCOUNTS to suppress progress counters

968909c

add --fcheck=all compiler flag

3ed71a0

fixed memory bug caused by incorrect KEO_BUSH shape

e8f5ca0

caught another counter that escaped NOCOUNTS

88c294a

fixed memory bug due to incorrectly reporting an array's deallocation…

8b9b03e

… when its name may be uninit'd

fix memory bug by properly allocating SCNUM in MODES solutions

5777eb1

fixed memory bug: uninit'd PCOMP_PLIES when WHAT=23

ebf53f1

fixed memory bug: another LSUB-based array with bad size.

215f75c

restrict K6ROT computations to QUAD4/TRIA3 shells

b8a7572

stopped trying to compute offsets for zero-length BUSHes with no OCID

8b874e4

Bruno02468 added 3 commits December 18, 2023 21:57

build instructions needed some updates

20afd36

include g++ in the linux requirements as per zach

c53e591

bump the version and date

d31a1a6

MYSTRANsolver merged commit d909724 into MYSTRANsolver:main Dec 19, 2023
1 check passed

This was referenced Dec 19, 2023

MYSTRAN 15.1 #10

Merged

MYSTRAN Update 15.1.1 #11

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MYSTRAN 15.0 Update #9

MYSTRAN 15.0 Update #9

Bruno02468 commented Dec 19, 2023 •

edited

Loading

MYSTRAN 15.0 Update #9

MYSTRAN 15.0 Update #9

Conversation

Bruno02468 commented Dec 19, 2023 • edited Loading

Bug fixes

New features

Build and documentation changes

Other changes

A quick warning

Results and final remarks

Bruno02468 commented Dec 19, 2023 •

edited

Loading