allow external absint to hold custom data in `codeinst.inferred` #53300

aviatesk · 2024-02-12T16:40:00Z

It has been possible for external abstract interpreters to keep custom data in codeinst.inferred (together /w overloading inlining_policy). After #52233, when such external absint uses InternalCodeCache, this data is passed to jl_ir_flag_inferred, leading to segfaults in assertion builds.

This commit resolves the issue by omitting jl_ir_flag_inferred checks when the cache_owner is external. Nonetheless, a better resolution might be necessary. It suggests that the current design of code_owner and InternalCodeCache for the external cache system is somewhat flawed. A conceivable approach could involve:

Adding a layer similar to inlining_policy in CC.get(::WorldView{InternalCodeCache}) to enable safe redirection of custom data to the native interpreter's implementation.
Prohibiting custom data in the inferred field and directing such data to be kept in analysis_results.

vchuravy · 2024-02-12T16:53:12Z

Does #53219 resolve this? It removes the need to handle code-instances in this location.

#52964 does require that code.inferred is something codegen and the interpreter can understand, so stashing additional data in .inferred is problematic and I would prefer custom data to go into analysis_results.

base/compiler/types.jl

vchuravy · 2024-02-12T16:55:02Z

In JuliaDebug/Cthulhu.jl#540 I continued using the external code cache instead of using the internal code cache (but the external code cache no longer participates in invalidation logic)

aviatesk · 2024-02-12T17:01:03Z

Does #53219 resolve this? It removes the need to handle code-instances in this location.

Probably. I'm looking into it.

In JuliaDebug/Cthulhu.jl#540 I continued using the external code cache instead of using the internal code cache (but the external code cache no longer participates in invalidation logic)

Cthulhu has traditionally created a new cache for each instance of CthulhuInterpreter, which meant there was no inherent support for invalidation required. On a personal note, I'm inclined to retain it as it facilitates easier reflection of outcomes following modifications to the base abstract interpretation's implementation.

Keno · 2024-02-12T17:44:45Z

I think it's fine to have non-CodeInfo objects in (::CodeInstance).inferred. That's an explicit goal of #53219. #53219 also removes the (::CodeInfo).inferred field, which fixes the issue in this PR. It also fixes the underlying issue where the compiler is looking at the wrong CodeInstance in some cases. The only constraint we do need is that for owner==nothing, the CodeInstance's inferred field needs to be something the compiler understands.

vtjnash

Waiting on resolution of #53219, since that already addresses the .inferred field issue here, and then this will need to be rebased

aviatesk · 2024-02-13T04:43:58Z

Yeah I'll work on the PR first and then we only need the test cases from this PR.

aviatesk · 2024-02-13T12:58:12Z

I know it might sound like I'm flip-flopping, but I want to move forward with this PR as is. This is because, as things stand post-#52233, virtually all external abstract interpreters utilizing the external code cache with custom data are rendered inoperative on assertion builds. Given that the code in question will be eliminated by #53219 anyway in the future, I think this PR doesn't make any harm and merging this for now alone can be justified with the added test cases.

Leverages JuliaLang/julia#52233 to use the internal code cache that comes with the inherent invalidation support. Still requires: - JuliaLang/julia#53300 (or JuliaLang/julia#53219) - JuliaLang/julia#53318

vchuravy

There are currently more places that make assumptions about the content of this field.

If the relocatability flag on the codeinstance is 0x0 we do not safe it as part of a package image. This is determined here

julia/base/compiler/typeinfer.jl

Lines 314 to 328 in 288912a

    
           relocatability = 0x0 
        
           if const_flags == 0x3 && may_discard_trees(interp) 
        
               inferred_result = nothing 
        
               relocatability = 0x1 
        
           else 
        
               inferred_result = transform_result_for_cache(interp, result.linfo, valid_worlds, result) 
        
               if isa(inferred_result, String) 
        
                   t = @_gc_preserve_begin inferred_result 
        
                   relocatability = unsafe_load(unsafe_convert(Ptr{UInt8}, inferred_result), Core.sizeof(inferred_result)) 
        
                   @_gc_preserve_end t 
        
               elseif inferred_result === nothing 
        
                   relocatability = 0x1 
        
               end 
        
           end 
        
           # relocatability = isa(inferred_result, String) ? inferred_result[end] : UInt8(0)

and checked here

julia/src/staticdata_utils.c

Line 233 in a7b68cf

if (!ci->relocatability)

Even if I force relocatability to 0x1 we hit an assertion later

julia/src/ircode.c

Line 993 in a7b68cf

assert(jl_is_string(data));

from

julia/src/precompile_utils.c

Line 189 in a7b68cf

jl_ir_flag_inferred(inferred) &&

While we could thread everything through this does raise the question that we store inferred normally in a compressed format, using jl_compress_ir which of course doesn't know how to handle custom results.

For me this indicates that using analysis_results over additional data in inferred should be preferred. Or we need to figure out how to compress custom data.

Leverages JuliaLang/julia#52233 to use the internal code cache that comes with the inherent invalidation support. Still requires: - JuliaLang/julia#53300 (or JuliaLang/julia#53219) - JuliaLang/julia#53318

It has been possible for external abstract interpreters to keep custom data in `codeinst.inferred` (together /w overloading `inlining_policy`). After #52233, when such external absint uses `InternalCodeCache`, this data is passed to `jl_ir_flag_inferred`, leading to segfaults in assertion builds. This commit resolves the issue by omitting `jl_ir_flag_inferred` checks when the `cache_owner` is external. Nonetheless, a better resolution might be necessary. It suggests that the current design of `code_owner` and `InternalCodeCache` for the external cache system is somewhat flawed. A conceivable approach could involve: - Adding a layer similar to `inlining_policy` in `CC.get(::WorldView{InternalCodeCache})` to enable safe redirection of custom data to the native interpreter's implementation. - Prohibiting custom data in the `inferred` field and directing such data to be kept in `analysis_results`.

vchuravy

I am still not a big fan of using .inferred for these purposes,
but I am okay with this change to unblock current uses of AbstractInterpreter.

@aviatesk do you want to add an explicit serialization test, where you override relocatability?

vtjnash · 2024-02-17T17:41:24Z

Yeah, since jl_ir_flag_inferred is deleted in #53219 it may conflict somewhat with that PR content, but seems safe to merge this now as it is mostly just added tests

vtjnash · 2024-02-17T18:30:15Z

To address Valentin's comment: it looks like this doesn't really express an opinion specifically on whether it is acceptable to use this field, but simply attempts to address the possibility of a missing type assert, which would be valid to check there regardless of whether we later state that other content should be put here

Thinking more broadly, I wonder if we should restructure this layering of types to instead look like this, so that there is a more bright line distinction in the types what is volatile and what is constant:

struct MethodInstance #= mostly unchanged =# end

mutable struct CodeCache # renamed from CodeInstance to indicate this is volatile
    parent::MethodInstance # for show reflection
    #= most fields from existing CodeInstance =#
   ...
    @atomic inferred::Union{CodeInfo, CompressedCodeInfo}
end

struct CodeInfo # partially kept as-is, but extended to allow external stmt representations
    #= fields from existing CodeInstance that are properties of the whole function =#
    ...
    parent::MethodInstance # for show reflection
    stmts::Union{StmtInfo, OpaqueExternalObjects, Nothing} # custom info goes here
end

struct StmtInfo # split from CodeInfo
    #= fields from existing CodeInstance that are arrays of per stmt info =#
   ...
end

) It has been possible for external abstract interpreters to keep custom data in `codeinst.inferred` (together /w overloading `inlining_policy`). After #52233, when such external absint uses `InternalCodeCache`, this data is passed to `jl_ir_flag_inferred`, leading to segfaults in assertion builds. This commit resolves the issue by omitting `jl_ir_flag_inferred` checks when the `cache_owner` is external. Nonetheless, a better resolution might be necessary. It suggests that the current design of `code_owner` and `InternalCodeCache` for the external cache system is somewhat flawed. A conceivable approach could involve: - Adding a layer similar to `inlining_policy` in `CC.get(::WorldView{InternalCodeCache})` to enable safe redirection of custom data to the native interpreter's implementation. - Prohibiting custom data in the `inferred` field and directing such data to be kept in `analysis_results`. (cherry picked from commit 93876c9)

While experimenting with precompilation for external absints on builds just after #53300 was merged, I found that the test case for `CustomAbstractInterpreterCaching2.jl` fails if the test case for `CustomAbstractInterpreterCaching1.jl` isn't run in the same session beforehand. That is probably because of the previous lack of support for proper `CodeInstance` caching. To address this, I've changed the tests to run in separate processes in this commit. Note that it appears that a recent refactor concerning `CodeInstance` might have resolved this issue, so the new test cases runs successfully on master. However, I suspect the fix hasn't been applied to v1.11 yet, we would need more research.

As mentioned in #53478, the precompilation support for external abstract interpreters in v1.11 isn't perfect, and directly cherry-picking the refined test cases from #53478 into the v1.11 backport branch leads to a test failure (note that this particular problem has likely been fixed in the master branch, probably thanks to #53300). To address this, this commit does more than just cherry-pick the test case, and it also modifies the `CodeInstance(::AbstractInterpreter, ::InferenceResult)` constructor to allow precompilation for external abstract interpreters in v1.11.

Backported PRs: - [x] #53361  - [x] #53300  - [x] #53342  - [x] #53372  - [x] #53357  - [x] #53373  - [x] #53333  - [x] #53354  - [x] #53407  - [x] #53388  - [x] #53355  - [x] #53429  - [x] #53437  - [x] #53284  - [x] #53466  - [x] #53467  - [x] #53326  - [x] #53332 - [x] #53320  - [x] #53476 Contains multiple commits, manual intervention needed: - [ ] #53285  Non-merged PRs with backport label: - [ ] #53424  - [ ] #53408  - [ ] #53403  - [ ] #53402  - [ ] #53391  - [ ] #53125  - [ ] #52694

…iaLang#53300) It has been possible for external abstract interpreters to keep custom data in `codeinst.inferred` (together /w overloading `inlining_policy`). After JuliaLang#52233, when such external absint uses `InternalCodeCache`, this data is passed to `jl_ir_flag_inferred`, leading to segfaults in assertion builds. This commit resolves the issue by omitting `jl_ir_flag_inferred` checks when the `cache_owner` is external. Nonetheless, a better resolution might be necessary. It suggests that the current design of `code_owner` and `InternalCodeCache` for the external cache system is somewhat flawed. A conceivable approach could involve: - Adding a layer similar to `inlining_policy` in `CC.get(::WorldView{InternalCodeCache})` to enable safe redirection of custom data to the native interpreter's implementation. - Prohibiting custom data in the `inferred` field and directing such data to be kept in `analysis_results`.

While experimenting with precompilation for external absints on builds just after #53300 was merged, I found that the test case for `CustomAbstractInterpreterCaching2.jl` fails if the test case for `CustomAbstractInterpreterCaching1.jl` isn't run in the same session beforehand. That is probably because of the previous lack of support for proper `CodeInstance` caching. To address this, I've changed the tests to run in separate processes in this commit. Note that it appears that a recent refactor concerning `CodeInstance` might have resolved this issue, so the new test cases runs successfully on master. However, I suspect the fix hasn't been applied to v1.11 yet, we would need more research.

As mentioned in #53478, the precompilation support for external abstract interpreters in v1.11 isn't perfect, and directly cherry-picking the refined test cases from #53478 into the v1.11 backport branch leads to a test failure (note that this particular problem has likely been fixed in the master branch, probably thanks to #53300). To address this, this commit does more than just cherry-pick the test case, and it also modifies the `CodeInstance(::AbstractInterpreter, ::InferenceResult)` constructor to allow precompilation for external abstract interpreters in v1.11.

) While experimenting with precompilation for external absints on builds just after #53300 was merged, I found that the test case for `CustomAbstractInterpreterCaching2.jl` fails if the test case for `CustomAbstractInterpreterCaching1.jl` isn't run in the same session beforehand. That is probably because of the previous lack of support for proper `CodeInstance` caching. To address this, I've changed the tests to run in separate processes in this commit. Note that it appears that a recent refactor concerning `CodeInstance` might have resolved this issue, so the new test cases runs successfully on master. However, I suspect the fix hasn't been applied to v1.11 yet, we would need more research.

aviatesk mentioned this pull request Feb 12, 2024

minor follow up on the tagged code instance change JuliaDebug/Cthulhu.jl#543

Merged

vchuravy reviewed Feb 12, 2024

View reviewed changes

base/compiler/types.jl Outdated Show resolved Hide resolved

aviatesk force-pushed the avi/ext-interp-custom-data branch from 0ec75bb to e52761e Compare February 12, 2024 17:02

vtjnash requested changes Feb 13, 2024

View reviewed changes

aviatesk mentioned this pull request Feb 13, 2024

switch to using the internal code cache aviatesk/JET.jl#611

Open

vchuravy requested changes Feb 13, 2024

View reviewed changes

aviatesk force-pushed the avi/ext-interp-custom-data branch 2 times, most recently from 5bd972a to 31bebd3 Compare February 15, 2024 14:22

aviatesk added 2 commits February 17, 2024 01:18

add more external absint precompile test

610f27a

aviatesk force-pushed the avi/ext-interp-custom-data branch from 31bebd3 to 610f27a Compare February 16, 2024 16:19

vchuravy requested review from vchuravy and vtjnash February 16, 2024 18:19

vchuravy added this to the 1.11 milestone Feb 16, 2024

vchuravy added the backport 1.11 Change should be backported to release-1.11 label Feb 16, 2024

vchuravy approved these changes Feb 16, 2024

View reviewed changes

vtjnash approved these changes Feb 17, 2024

View reviewed changes

vtjnash merged commit 93876c9 into master Feb 17, 2024
8 checks passed

vtjnash deleted the avi/ext-interp-custom-data branch February 17, 2024 17:41

KristofferC mentioned this pull request Feb 26, 2024

Backports release 1.11 #53472

Merged

28 tasks

aviatesk mentioned this pull request Feb 26, 2024

enhance the effectiveness of the test cases introduced in #53300 #53478

Merged

aviatesk mentioned this pull request Feb 27, 2024

1.11: allow external abstract interpreter compilation #53488

Open

KristofferC removed the backport 1.11 Change should be backported to release-1.11 label Mar 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

allow external absint to hold custom data in `codeinst.inferred` #53300

allow external absint to hold custom data in `codeinst.inferred` #53300

aviatesk commented Feb 12, 2024

vchuravy commented Feb 12, 2024

vchuravy commented Feb 12, 2024

aviatesk commented Feb 12, 2024

Keno commented Feb 12, 2024

vtjnash left a comment

aviatesk commented Feb 13, 2024

aviatesk commented Feb 13, 2024

vchuravy left a comment

vchuravy left a comment

vtjnash commented Feb 17, 2024

vtjnash commented Feb 17, 2024 •

edited

Loading

	relocatability = 0x0
	if const_flags == 0x3 && may_discard_trees(interp)
	inferred_result = nothing
	relocatability = 0x1
	else
	inferred_result = transform_result_for_cache(interp, result.linfo, valid_worlds, result)
	if isa(inferred_result, String)
	t = @_gc_preserve_begin inferred_result
	relocatability = unsafe_load(unsafe_convert(Ptr{UInt8}, inferred_result), Core.sizeof(inferred_result))
	@_gc_preserve_end t
	elseif inferred_result === nothing
	relocatability = 0x1
	end
	end
	# relocatability = isa(inferred_result, String) ? inferred_result[end] : UInt8(0)

allow external absint to hold custom data in codeinst.inferred #53300

allow external absint to hold custom data in codeinst.inferred #53300

Conversation

aviatesk commented Feb 12, 2024

vchuravy commented Feb 12, 2024

vchuravy commented Feb 12, 2024

aviatesk commented Feb 12, 2024

Keno commented Feb 12, 2024

vtjnash left a comment

Choose a reason for hiding this comment

aviatesk commented Feb 13, 2024

aviatesk commented Feb 13, 2024

vchuravy left a comment

Choose a reason for hiding this comment

vchuravy left a comment

Choose a reason for hiding this comment

vtjnash commented Feb 17, 2024

vtjnash commented Feb 17, 2024 • edited Loading

allow external absint to hold custom data in `codeinst.inferred` #53300

allow external absint to hold custom data in `codeinst.inferred` #53300

vtjnash commented Feb 17, 2024 •

edited

Loading