Skip to content

Patch release v2.3.2

Compare
Choose a tag to compare
@khuck khuck released this 13 Apr 18:07
· 737 commits to develop since this release

Patch release for bug fixes.

Commits in this release:

  • view commit • Updating documentation
  • view commit • Merge branch 'develop' of github.com:khuck/xpress-apex into develop
  • view commit • Checking for nvcc 10 and gcc 8 and setting flags accordingly
  • view commit • Adding periodic plugin example, enabling static global constructors and destructors
  • view commit • Adding pthread wrapper and screen_output to policy plugin example
  • view commit • Update README.md
  • view commit • Re-enablling ability to get vector of available profiles, updated periodic example
  • view commit • Don't pin threads by default, it's kind of broken on summit
  • view commit • Fixing HPX build due to static global constructor
  • view commit • Fixing bug #134. Changing from pthread_setaffinity_np() to sched_get/setaffinity()
  • view commit • Fixing issue #135 When tracking CPU/GPU activity, the memory allocation counters should be associated with the thread making the call, when writing to OTF2 traces. This change allows for an optional argument to the apex::sample_value call that indicates whether the counter is assocaited with the specific thread or the process as a whole (the default).
  • view commit • Fixing #137. Now explicitly tracking all memory allocations and frees on both the host and the device.
  • view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
  • view commit • Re-enable pinning by default
  • view commit • Fixing #136. Now have the ability to capture task tree, not just graph. No more cycles!
  • view commit • Adding dependency_tree class
  • view commit • Fixing build errors for -std=c++11 compliance
  • view commit • Initial memory wrapper, bugs everywhere
  • view commit • Adding additional MPI rank detection support
  • view commit • Fixing build issue with HPX due to modified sample_value function
  • view commit • Fixing cuda 10.1 build errors.
  • view commit • Fixing gperftool config by finding correct include location
  • view commit • Fixing gperftool config by finding correct include location
  • view commit • Removing some high-overhead and useless counters
  • view commit • Working memory wrapper for malloc/free, removing pointers from name demangling due to instability
  • view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
  • view commit • Adding support for calloc and realloc
  • view commit • Fixing comment
  • view commit • Adding memory wrapper code for HPX configurations
  • view commit • Updating copyright to 2021
  • view commit • Fixing measurement output when dump is called multiple times.
  • view commit • Fixing tasktree processing for non-timers, adding to apex_exec script
  • view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
  • view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
  • view commit • Fixing elapsed time in graphs and shortening timer names by not including full file name and path by default
  • view commit • Fixing concurrency handler static global variable
  • view commit • Fix HPX barriers in OTF2 output
  • view commit • Merge pull request #143 from severinstrobl/otf2_hpx_barriers
  • view commit • Enabling LLVM 11 to build cuda examples
  • view commit • Forgot to set profiler to "stopped" when adding async activity.
  • view commit • Removing APEX counters (llvm won't link them?)
  • view commit • Cleaning up timers. We had been using a custom clock in order to use rdtsc on Intel platforms, but that's kind of pointless. It becomes a nightmare when trying to convert for OTF2 traces, and CUDA (and other GPUs) only provide timestamps in nanoseconds. Therefore, all timing is assumed to be done in nanoseconds now.
  • view commit • Flush CUPTI before dumping.
  • view commit • Need to move forward declaration.
  • view commit • Only override the rank if suspect it's wrong
  • view commit • Updating version number
  • view commit • Updating version number.
  • view commit • Merge branch 'develop'