-
Notifications
You must be signed in to change notification settings - Fork 739
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LLVM and SPIRV-LLVM-Translator pulldown (WW45 2024) #16022
Commits on Oct 17, 2024
-
[clang][bytecode] Explicitly truncate in IntegralAP::from() (#112683)
Add Integral::toAPInt(), which truncates to the given BitWidth, similar to the toAPSInt() we already have.
Configuration menu - View commit details
-
Copy full SHA for 5b4071c - Browse repository at this point
Copy the full SHA 5b4071cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1251687 - Browse repository at this point
Copy the full SHA 1251687View commit details -
[mlir][vector] Escape variable usage in test
Otherwise the shell might expand this in the command line.
Configuration menu - View commit details
-
Copy full SHA for 4d228e1 - Browse repository at this point
Copy the full SHA 4d228e1View commit details -
[ARM] Fix -Wunused-variable in ARMFrameLowering.cpp (NFC)
/llvm-project/llvm/lib/Target/ARM/ARMFrameLowering.cpp:1028:9: error: unused variable 'FPOffset' [-Werror,-Wunused-variable] int FPOffset = MFI.getObjectOffset(FramePtrSpillFI); ^ 1 error generated.
Configuration menu - View commit details
-
Copy full SHA for 584e00a - Browse repository at this point
Copy the full SHA 584e00aView commit details -
[include-cleaner] Fix -Wpessimizing-move in IncludeCleaner.cpp (NFC)
/llvm-project/clang-tools-extra/include-cleaner/tool/IncludeCleaner.cpp:302:14: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move] return std::move(llvm::errorCodeToError(Err)); ^ /llvm-project/clang-tools-extra/include-cleaner/tool/IncludeCleaner.cpp:302:14: note: remove std::move call here return std::move(llvm::errorCodeToError(Err)); ^~~~~~~~~~ ~ 1 error generated.
Configuration menu - View commit details
-
Copy full SHA for 2954d1f - Browse repository at this point
Copy the full SHA 2954d1fView commit details -
[llvm][ARM]Add widen global arrays pass (#107120)
- Pass optimizes memcpy's by padding out destinations and sources to a full word to make backend generate full word loads instead of loading a single byte (ldrb) and/or half word (ldrh). Only pads destination when it's a stack allocated constant size array and source when it's constant array. Heuristic to decide whether to pad or not is very basic and could be improved to allow more examples to be padded. - Pass works within GlobalOpt but is disabled by default on all targets except ARM.
Configuration menu - View commit details
-
Copy full SHA for ab90d27 - Browse repository at this point
Copy the full SHA ab90d27View commit details -
Configuration menu - View commit details
-
Copy full SHA for 83953c7 - Browse repository at this point
Copy the full SHA 83953c7View commit details -
VisitIfStmt: Prune a redundant condition.
`S->isConsteval()` is evaluated at the top of this method. Likely mis-merging in #75425
Configuration menu - View commit details
-
Copy full SHA for 5bcc66d - Browse repository at this point
Copy the full SHA 5bcc66dView commit details -
InstrProfilingMerge.c: Fix potential misalignment in
SrcBitmapStart
Currently it is not an issue. It will be a problem if Bitmap is located after single byte counters.
Configuration menu - View commit details
-
Copy full SHA for 40d9561 - Browse repository at this point
Copy the full SHA 40d9561View commit details -
Silence -Wswitch after cb43021
lldb/source/Plugins/TypeSystem/Clang/TypeSystemClang.cpp:4885:13: warning: enumeration value 'SveMFloat8' not handled in switch [-Wswitch] 4885 | switch (llvm::cast<clang::BuiltinType>(qual_type)->getKind()) { |
Configuration menu - View commit details
-
Copy full SHA for 9c80eb7 - Browse repository at this point
Copy the full SHA 9c80eb7View commit details -
[flang][extension] support concatenation with absent optional (#112678)
Fix #112593 by adding support in lowering to concatenation with an absent optional _assumed length_ dummy argument because: 1. Most compilers seem to support it (most likely by accident). 2. This actually makes the compiler codegen simpler. Codegen was going out of its way to poke the LLVM optimizer bear by producing an undef argument for the length. I insist on the fact that no compiler support this with _explicit length_ optional arguments and the executable will segfault and I would discourage users from using that "feature" because runtime checks for bad optional dereference will kick when used (For instance, "nagfor -C=present" will produce an executable that abort with an error message . Flang does not have such runtime check option so far). Hence, I am not updating the Extensions.md document because this is not something I think we should advertise.
Configuration menu - View commit details
-
Copy full SHA for 2f0b4f4 - Browse repository at this point
Copy the full SHA 2f0b4f4View commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedOct 17, 2024 Configuration menu - View commit details
-
Copy full SHA for 9b3cc0c - Browse repository at this point
Copy the full SHA 9b3cc0cView commit details -
Configuration menu - View commit details
-
Copy full SHA for e21c80a - Browse repository at this point
Copy the full SHA e21c80aView commit details -
[lldb][docs] Add link to LoongArch tracking issue
llvm/llvm-project#112693 will be tracking the overall state of LoongArch support. This means anyone can check without having to track down an expert.
Configuration menu - View commit details
-
Copy full SHA for 2882bb1 - Browse repository at this point
Copy the full SHA 2882bb1View commit details -
[Polly] Remove unused variable 'IdentTy' in LoopGeneratorsKMP.cpp (NFC)
/llvm-project/polly/lib/CodeGen/LoopGeneratorsKMP.cpp:396:15: error: unused variable 'IdentTy' [-Werror,-Wunused-variable] StructType *IdentTy = ^ /llvm-project/polly/lib/CodeGen/LoopGeneratorsKMP.cpp:460:15: error: unused variable 'IdentTy' [-Werror,-Wunused-variable] StructType *IdentTy = ^ 2 errors generated.
Configuration menu - View commit details
-
Copy full SHA for f42785d - Browse repository at this point
Copy the full SHA f42785dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 067e8b8 - Browse repository at this point
Copy the full SHA 067e8b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8268bc4 - Browse repository at this point
Copy the full SHA 8268bc4View commit details -
[DAG] visitSINT_TO_FP/UINT_TO_FP - use FoldConstantArithmetic to atte…
…mpt to constant fold Don't rely on isConstantIntBuildVectorOrConstantInt followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us. Cleanup for #112682
Configuration menu - View commit details
-
Copy full SHA for 784c15a - Browse repository at this point
Copy the full SHA 784c15aView commit details -
[DAG] visitFP_TO_SINT/FP_TO_UINT - use FoldConstantArithmetic to atte…
…mpt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us. Cleanup for #112682
Configuration menu - View commit details
-
Copy full SHA for 5692a0c - Browse repository at this point
Copy the full SHA 5692a0cView commit details -
Configuration menu - View commit details
-
Copy full SHA for bf5cf82 - Browse repository at this point
Copy the full SHA bf5cf82View commit details -
Configuration menu - View commit details
-
Copy full SHA for cf046c8 - Browse repository at this point
Copy the full SHA cf046c8View commit details -
[AArch64][SVE] Enable max vector bandwidth for SVE (#109671)
Returns true for shouldMaximizeVectorBandwidth when the register type is a scalable vector and SVE or streaming SVE are available.
Configuration menu - View commit details
-
Copy full SHA for c980a20 - Browse repository at this point
Copy the full SHA c980a20View commit details -
[ARM] Fix -mno-omit-leaf-frame-pointer flag doesn't works on 32-bit A…
…RM (#109628) The -mno-omit-leaf-frame-pointer flag works on 32-bit ARM architectures and addresses the bug reported in #108019
Configuration menu - View commit details
-
Copy full SHA for 4a2bd78 - Browse repository at this point
Copy the full SHA 4a2bd78View commit details -
[InstCombine] Add shared helper for logical and bitwise and/or (NFC)
Add a helper for shared folds between logical and bitwise and/or and move the and/or of icmp and fcmp folds in there. This makes it easier to extend to more folds. A possible extension would be to base the current and/or of icmp reassociation logic on this helper, so that it for example also applies to fcmp.
Configuration menu - View commit details
-
Copy full SHA for 0f7d148 - Browse repository at this point
Copy the full SHA 0f7d148View commit details -
Revert "[llvm][ARM]Add widen global arrays pass" (#112701)
Reverts llvm/llvm-project#107120 Unexpected build failures in post-commit pipelines. Needs investigation
Configuration menu - View commit details
-
Copy full SHA for 370fd74 - Browse repository at this point
Copy the full SHA 370fd74View commit details -
[InstCombine] Set
samesign
when converting signed predicates into u……nsigned (#112642) Alive2: https://alive2.llvm.org/ce/z/6cqdt-
Configuration menu - View commit details
-
Copy full SHA for 095d49d - Browse repository at this point
Copy the full SHA 095d49dView commit details -
Different info in docs in AST methods (#112190)
[Here](https://github.com/llvm/llvm-project/blob/6a98c4a1602591c942f01dceb3aa29ffd4cf1e5b/clang/include/clang/ASTMatchers/ASTMatchers.h#L4188-L4203) and [here](https://github.com/llvm/llvm-project/blob/6a98c4a1602591c942f01dceb3aa29ffd4cf1e5b/clang/include/clang/ASTMatchers/ASTMatchers.h#L3679-L3695) we can see similar code samples and same examples: ``` cxxMemberCallExpr(on(callExpr())) ``` In the first case, it is [written](https://github.com/llvm/llvm-project/blob/6a98c4a1602591c942f01dceb3aa29ffd4cf1e5b/clang/include/clang/ASTMatchers/ASTMatchers.h#L4201) that the object must not be matched: ``` /// cxxMemberCallExpr(on(callExpr())) /// does not match `(g()).m()`, because the parens are not ignored. ``` In the second case, it is [written](https://github.com/llvm/llvm-project/blob/6a98c4a1602591c942f01dceb3aa29ffd4cf1e5b/clang/include/clang/ASTMatchers/ASTMatchers.h#L3693) that the object must be matched: ``` /// cxxMemberCallExpr(on(callExpr())) /// matches `(g()).m()`. ``` I think that parens are ignored
Configuration menu - View commit details
-
Copy full SHA for 388d7f1 - Browse repository at this point
Copy the full SHA 388d7f1View commit details -
[ARM] Prefer MUL to MULS on some implementations (#112540)
MULS adversely affects performance on many implementations. Where this is the case, we prefer not to shrink MUL to MULS.
Configuration menu - View commit details
-
Copy full SHA for ea796e5 - Browse repository at this point
Copy the full SHA ea796e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for d9cd607 - Browse repository at this point
Copy the full SHA d9cd607View commit details -
[MLIR] Enable pattern only for scf.forall op (#110230)
The init args shape might change in the loop body and hence the pattern doesn't hold true.
Configuration menu - View commit details
-
Copy full SHA for c1047ba - Browse repository at this point
Copy the full SHA c1047baView commit details -
Configuration menu - View commit details
-
Copy full SHA for 51b4ada - Browse repository at this point
Copy the full SHA 51b4adaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f17da1 - Browse repository at this point
Copy the full SHA 3f17da1View commit details -
[PGO] Initialize GCOV Writeout and Reset Functions in the Runtime on …
…AIX (#108570) This PR registers the writeout and reset functions for `gcov` for all modules in the PGO runtime, instead of registering them using global constructors in each module. The change is made for AIX only, but the same mechanism works on Linux on Power. When registering such functions using global constructors in each module without `-ffunction-sections`, the AIX linker cannot garbage collect unused undefined symbols, because such symbols are grouped in the same section as the `__sinit` symbol. Keeping such undefined symbols causes link errors (see test case https://github.com/llvm/llvm-project/pull/108570/files#diff-500a7e1ba871e1b6b61b523700d5e30987900002add306e1b5e4972cf6d5a4f1R1 for this scenario). This PR implements the initialization in the runtime, hence avoiding introducing `__sinit` into each module. The implementation adds a new global variable `__llvm_covinit_functions` to each module. This new global variable contains the function pointers to the `Writeout` and `Reset` functions. `__llvm_covinit_functions`'s section is the named section `__llvm_covinit`. The linker will aggregate all the `__llvm_covinit` sections from each module to form one single named section in the final binary. The pair of functions ``` const __llvm_gcov_init_func_struct *__llvm_profile_begin_covinit(); const __llvm_gcov_init_func_struct *__llvm_profile_end_covinit(); ``` are implemented to return the start and end address of this named section in the final binary, and they are used in function ``` __llvm_profile_gcov_initialize() ``` (which is a constructor function in the runtime) so the runtime knows the addresses of all the `Writeout` and `Reset` functions from all the modules. One noticeable implementation detail relevant to AIX is that to preserve the `__llvm_covinit` from the linker's garbage collection, a `.ref` pseudo instruction is inserted into them, referring to the section that contains the `__llvm_gcov_ctr` variables, which are used in the instrumented code. The `__llvm_gcov_ctr` variables did not belong to named sections before, but this PR added them to the `__llvm_gcov_ctr_section` named section, so we can add a `.ref` pseudo instruction that refers to them in the `__llvm_covinit` section.
Configuration menu - View commit details
-
Copy full SHA for f9d0789 - Browse repository at this point
Copy the full SHA f9d0789View commit details -
[InferAlignment][NFC] Unify Load/Store handling in tryToImproveAlign…
… (#112699) Removes code duplication in tryToImproveAlign by unifying load and store instruction handling with getLoadStore helper functions.
Configuration menu - View commit details
-
Copy full SHA for 8c60efe - Browse repository at this point
Copy the full SHA 8c60efeView commit details -
Finally formalise our defacto line-ending policy
Historically, we've not automatically enforced how git tracks line endings, but there are many, many commits that "undo" unintended CRLFs getting into history. `git log --pretty=oneline --grep=CRLF` shows nearly 100 commits involving reverts of CRLF making its way into the index and then history. As far as I can tell, there are none the other way round except for specific cases like `.bat` files or tests for parsers that need to accept such sequences. Of note, one of the earliest of those listed in that output is: ``` commit 9795860 Author: NAKAMURA Takumi <geek4civic@gmail.com> Date: Thu Feb 3 11:41:27 2011 +0000 cmake/*: Add svn:eol-style=native and fix CRLF. llvm-svn: 124793 ``` ...which introduced such a defacto policy for subversion. With old versions of git, it's been a bit of a crap-shoot whether enforcing storing line endings in the history will upset checkouts on machines where such line endings are the norm. Indeed many users have enforced that git checks out the working copy according to a global or per-user config via core crlf, or core autocrlf. For ~8 years now[1], however, git has supported the ability to "do as the Romans do" on checkout, but internally store subsets of text files with line-endings specified via a system of patterns in the `.gitattributes` file. Since we now have this ability, and we've been specifying attributes for various binary files, I think it makes sense to rid us of all that work converting things "back", and just let git handle the local checkout. Thus the new toplevel policy here is * text=auto In simple terms this means "unless otherwise specified, convert all files considered "text" files to LF in the project history, but check them out as expected on the local machine. What is "expected on the local machine" is dependent on configuration and default. For those files in the repository that *do* need CRLF endings, I've adopted a policy of `eol=crlf` which means that git will store them in history with LF, but regardless of user config, they'll be checked out in tree with CRLF. Finally, existing files have been "corrected" in history via `git add --renormalize .` End users should *not* need to adjust their local git config or workflow. [1]: git 2.10 was released with fixed support for fine-grained line-ending tracking that respects user-config *and* repo policy. This can be considered the point at which git will respect both the user's local working tree preference *and* the history as specified by the maintainers. See https://github.com/git/git/blob/master/Documentation/RelNotes/2.10.0.txt#L248 for the release note.
Configuration menu - View commit details
-
Copy full SHA for dccebdd - Browse repository at this point
Copy the full SHA dccebddView commit details -
Merge from 'main' to 'sycl-web' (153 commits)
CONFLICT (content): Merge conflict in clang/include/clang/Serialization/ASTBitCodes.h CONFLICT (content): Merge conflict in clang/test/Modules/no-external-type-id.cppm
Configuration menu - View commit details
-
Copy full SHA for e54df7d - Browse repository at this point
Copy the full SHA e54df7dView commit details -
Renormalize line endings whitespace only after dccebdd
Line ending policies were changed in the parent, dccebdd. To make it easier to resolve downstream merge conflicts after line-ending policies are adjusted this is a separate whitespace-only commit. If you have merge conflicts as a result, you can simply `git add --renormalize -u && git merge --continue` or `git add --renormalize -u && git rebase --continue` - depending on your workflow.
Configuration menu - View commit details
-
Copy full SHA for 9d98acb - Browse repository at this point
Copy the full SHA 9d98acbView commit details -
[OpenMP] Fix missing gtid argument in __kmp_print_tdg_dot function (#…
…111986) This patch modifies the signature of the `__kmp_print_tdg_dot` function in `kmp_tasking.cpp` to include the global thread ID (gtid) as an argument. The gtid is now correctly passed to the function. - Updated the function declaration to accept the gtid parameter. - Modified all calls to `__kmp_print_tdg_dot` to pass the correct gtid value. This change addresses issues encountered when compiling with `OMPX_TASKGRAPH` enabled. No functional changes are expected beyond successful compilation.
Configuration menu - View commit details
-
Copy full SHA for af1e9c8 - Browse repository at this point
Copy the full SHA af1e9c8View commit details -
[libc][setjmp] make x86_64 longjmp naked (#112581)
The generated asm for x86_64's longjmp has a function prolog and epilog. The epilog in particular is unreachable. Convert longjmp to a naked function to avoid these spurious instructions in longjmp. Link: https://github.com/llvm/llvm-project/pull/112437/files#r1802085511
Configuration menu - View commit details
-
Copy full SHA for 3764d0f - Browse repository at this point
Copy the full SHA 3764d0fView commit details -
[flang] Allow OpenMP declarations before type declarations (#112414)
Skip resolving implicit types for OpenMP declarative directives, to allow them to appear before type declarations, which is supported by several compilers. This was discussed in https://discourse.llvm.org/t/rfc-openmp-should-type-declaration-be-allowed-after-threadprivate/81345. This fixes the semantic errors of llvm/llvm-project#106021.
Configuration menu - View commit details
-
Copy full SHA for a1ac5a5 - Browse repository at this point
Copy the full SHA a1ac5a5View commit details -
[libc++] Make __libcpp_verbose_abort() noexcept like std::terminate()…
… (#109151) Make __libcpp_verbose_abort() noexcept (it is already noreturn), to match std::terminate(). Clang's function effect analysis can use this to ignore such functions as being beyond its scope. (See llvm/llvm-project#99656).
Configuration menu - View commit details
-
Copy full SHA for 9548366 - Browse repository at this point
Copy the full SHA 9548366View commit details -
Configuration menu - View commit details
-
Copy full SHA for db32924 - Browse repository at this point
Copy the full SHA db32924View commit details -
Configuration menu - View commit details
-
Copy full SHA for 91b2ac6 - Browse repository at this point
Copy the full SHA 91b2ac6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9173fd7 - Browse repository at this point
Copy the full SHA 9173fd7View commit details -
Configuration menu - View commit details
-
Copy full SHA for b47849b - Browse repository at this point
Copy the full SHA b47849bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b6764f - Browse repository at this point
Copy the full SHA 8b6764fView commit details -
[llvm][LSR] Fix where invariant on ScaledReg & Scale is violated (#11…
…2576) Comments attached to the `ScaledReg` field of `struct Formula` explains that, `ScaledReg` must be non-null when `Scale` is non-zero. This fixes up a code path where this invariant is violated. Also, add an assert to ensure this invariant holds true. Without this patch, compiler aborts with the attached test case. Fixes #76504
Configuration menu - View commit details
-
Copy full SHA for caa32e6 - Browse repository at this point
Copy the full SHA caa32e6View commit details -
[clang] Fix covariant cv-qualification check to require the override …
…function return type to have the same or less cv-qualification (#112713) This prevents changing cv-qualification from const to volatile or vice versa, for example. https://eel.is/c++draft/class.virtual#8.3 Previously, we checked that the new type is the same or more qualified to return an error, but the standard requires the new type to be the same or less qualified and since the cv-qualification is only partially ordered, we cannot rely on a check on whether it is more qualified to return an error. Now, we reversed the condition to check whether the old is at least as qualified, and return an error if it is not. Also, adjusted the error name and message to clarify the requirement and added a missing closing parenthesis. Added tests to cover different use cases for classes with different qualifications and also refactored them to make them easier to follow: 1. Use override to make sure the function names actually match. 2. Named the function in a more descriptive way to clarify what each use case is checking. Fixes: #111742
Configuration menu - View commit details
-
Copy full SHA for 8f25c0b - Browse repository at this point
Copy the full SHA 8f25c0bView commit details -
[NFC][AMDGPU] Auto-generate check lines for some test cases (#112426)
- `llvm/test/CodeGen/AMDGPU/andorbitset.ll` - `llvm/test/CodeGen/AMDGPU/andorxorinvimm.ll` - `llvm/test/CodeGen/AMDGPU/fabs.f64.ll` - `llvm/test/CodeGen/AMDGPU/llvm.amdgcn.raw.ptr.buffer.store.ll` - `llvm/test/CodeGen/AMDGPU/s_mulk_i32.ll`
Configuration menu - View commit details
-
Copy full SHA for 92663de - Browse repository at this point
Copy the full SHA 92663deView commit details -
[lldb] Disable warning about codecvt_utf8 deprecation (NFC) (#112446)
Disable -Wdeprecated-declarations for codecvt_utf8 in Editline. This is in preparation for #112276 which narrows the scope of -Wno-deprecated-declarations for building LLDB.
Configuration menu - View commit details
-
Copy full SHA for 8c7f80f - Browse repository at this point
Copy the full SHA 8c7f80fView commit details -
[LLVM] Make more use of IRBuilder::CreateIntrinsic. NFC. (#112706)
Convert many instances of: Fn = Intrinsic::getOrInsertDeclaration(...); CreateCall(Fn, ...) to the equivalent CreateIntrinsic call.
Configuration menu - View commit details
-
Copy full SHA for 85c17e4 - Browse repository at this point
Copy the full SHA 85c17e4View commit details -
[lldb] Narrow scope of -Wno-deprecated-declarations (NFC) (#112276)
Currently all of LLDB is being compiled with -Wno-deprecated-declarations. That's not desirable, especially as part of the LLVM monorepo, as we miss deprecation warnings from LLVM and clang. According to the git history, this was first introduced to suppress warnings related to auto_ptr. Since then, other things have been deprecated and gone unnoticed. This patch limits the flag to Host.mm which uses a handful of LSApplication headers that have no replacement. rdar://112040718
Configuration menu - View commit details
-
Copy full SHA for 4897fc4 - Browse repository at this point
Copy the full SHA 4897fc4View commit details -
[lldb] Don't exit the main loop when in runs out of things to listen …
…on (#112565) This behavior made sense in the beginning as the class was completely single threaded, so if the source count ever reached zero, there was no way to add new ones. In https://reviews.llvm.org/D131160, the class gained the ability to add events (callbacks) from other threads, which means that is no longer the case (and indeed, one possible use case for this class -- acting as a sort of arbiter for multiple threads wanting to run code while making sure it runs serially -- has this class sit in an empty Run call most of the time). I'm not aware of us having a use for such a thing right now, but one of my tests in another patch turned into something similar by accident. Another problem with the current approach is that, in a distributed/dynamic setup (multiple things using the main loop without a clear coordinator), one can never be sure whether unregistering a specific event will terminate the loop (it depends on whether there are other listeners). We had this problem in lldb-platform.cpp, where we had to add an additional layer of synchronization to avoid premature termination. We can remove this if we can rely on the loop terminating only when we tell it to.
Configuration menu - View commit details
-
Copy full SHA for 98b419c - Browse repository at this point
Copy the full SHA 98b419cView commit details -
[flang][cuda] Relax the verifier for cuf.register_kernel op (#112585)
Relax the verifier since the `gpu.func` might be converted to `llvm.func` before `cuf.register_kernel` is converted.
Configuration menu - View commit details
-
Copy full SHA for 834d001 - Browse repository at this point
Copy the full SHA 834d001View commit details -
[flang][docs] Update description of how to contribute (#112369)
It's my understanding that all code review pre-commit takes place on GitHub Pull Requests and that post-commit review is done either on the closed PR or the commit on GitHub.
Configuration menu - View commit details
-
Copy full SHA for ab208de - Browse repository at this point
Copy the full SHA ab208deView commit details -
[SimplifyLibCall][Attribute] Fix bug where we may keep
range
attr w……ith incompatible type (#112649) In a variety of places we change the bitwidth of a parameter but don't update the attributes. The issue in this case is from the `range` attribute when inlining `__memset_chk`. `optimizeMemSetChk` will replace an `i32` with an `i8`, and if the `i32` had a `range` attr assosiated it will cause an error. Fixes #112633
Configuration menu - View commit details
-
Copy full SHA for c85611e - Browse repository at this point
Copy the full SHA c85611eView commit details -
[NFC][LoopVectorize] Restructure simple early exit tests (#112721)
The previous simple_early_exit.ll was growing too large and difficult to manage. Instead I've decided to refactor the tests by splitting out into notional groups: 1. single_early_exit.ll: loops with a single uncountable exit that do not have live-outs from the loop. 2. single_early_exit_live_outs.ll: loops with a single uncountable exit with live-outs. 3. multi_early_exit.ll: loops with multiple early exits, i.e. a mixture of countable and uncountable exits, but with no live-outs from the loop. 4. multi_early_exit_live_outs.ll: as above, but with live-outs. 5. single_early_exit_unsafe_ptrs.ll: loops with a single uncountable exit, but with pointers that are not unconditionally dereferenceable. 6. unsupported_early_exit.ll: loops with uncountable exits that we cannot yet vectorise. 7. early_exit_legality.ll: tests the debug output from LoopVectorizationLegality to make sure we handle different scenarios correctly. Only the last test now requires asserts. Over time some of these tests should start vectorising as more support is added. I also tried to rename the multi early exit tests to make it clear there what mixture of countable and uncountable exits are present.
Configuration menu - View commit details
-
Copy full SHA for 76f3776 - Browse repository at this point
Copy the full SHA 76f3776View commit details -
[DAG] visitFCEIL/FTRUNC/FFLOOR/FNEG - use FoldConstantArithmetic to a…
…ttempt to constant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us. Cleanup for #112682
Configuration menu - View commit details
-
Copy full SHA for 256bbdb - Browse repository at this point
Copy the full SHA 256bbdbView commit details -
[clang-tidy] insert
static
keyword in correct position for misc-u……se-internal-linkage (#108792) Fixes: #108760 --------- Co-authored-by: Danny Mösch <danny.moesch@icloud.com>
Configuration menu - View commit details
-
Copy full SHA for 6f21a7b - Browse repository at this point
Copy the full SHA 6f21a7bView commit details -
Merge from 'sycl' to 'sycl-web' (6 commits)
iclsrc committedOct 17, 2024 Configuration menu - View commit details
-
Copy full SHA for ee0004a - Browse repository at this point
Copy the full SHA ee0004aView commit details -
[AArch64] Add armv9.6 features to AArch64AsmParser (#112722)
New features need to be added to ExtensionMap for .arch and .arch_extension to work.
Configuration menu - View commit details
-
Copy full SHA for 94643a4 - Browse repository at this point
Copy the full SHA 94643a4View commit details -
[LTO] Add function alias as function instead of data (#112599)
On AIX, for undefined functions, only the dotnamed symbols (the address of the function) are generated after linking (i.e., no named function symbol is generated). Currently, all alias symbols are added as defined data symbols when parsing symbols in LTOModule (the Link Time Optimization library used by linker to optimization code at link time). On AIX, if the function alias is used in the native object, and only its dotnamed symbol is generated, the linker will have problem to match the dotnamed symbol from the native object and the defined symbol marked as data from the bitcode at LTO linktime. This patch is to add function alias as function instead of data.
Configuration menu - View commit details
-
Copy full SHA for 0205667 - Browse repository at this point
Copy the full SHA 0205667View commit details -
Document the requirement that commits have a public email address (#1…
…09318) See https://discourse.llvm.org/t/hidden-emails-on-github-should-we-do-something-about-it/74223 for details about why this is important to the community. Note, we currently have soft enforcement for this requirement in the form of a bot which posts comments letting patch authors know their email is private, so we're already setting expectations in practice; this PR is documenting those expectations for clarity.
Configuration menu - View commit details
-
Copy full SHA for bf1a554 - Browse repository at this point
Copy the full SHA bf1a554View commit details -
Merge from 'main' to 'sycl-web' (47 commits)
CONFLICT (content): Merge conflict in .gitattributes
Configuration menu - View commit details
-
Copy full SHA for a02f06b - Browse repository at this point
Copy the full SHA a02f06bView commit details -
[RISCV][GISel] Correct RORIW patterns.
We had two rotl patterns and no rotr pattern. The order was such that the incorrect rotl pattern was being used.
Configuration menu - View commit details
-
Copy full SHA for feedb35 - Browse repository at this point
Copy the full SHA feedb35View commit details -
[libc][math][c23] Fix undefined behavior in expxf16.h (#112734)
Fixes the left-shifting of potentially negative signed integers.
Configuration menu - View commit details
-
Copy full SHA for 87645e9 - Browse repository at this point
Copy the full SHA 87645e9View commit details -
[clang-tidy][docs] Replace _not_ in reserved-identifier.rst with *not…
…* (#112162) Fixes a documentation formatting error where `_not_` was used which has no special meaning in reST and replaces it with `*not*`. Closes #111691.
Configuration menu - View commit details
-
Copy full SHA for 6d7712a - Browse repository at this point
Copy the full SHA 6d7712aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8f8d5f0 - Browse repository at this point
Copy the full SHA 8f8d5f0View commit details -
[AArch64][InstCombine] Eliminate redundant barrier intrinsics (#112023)
If there are no memory ops on the path from one dmb to another then one barrier can be eliminated.
Configuration menu - View commit details
-
Copy full SHA for 1a60905 - Browse repository at this point
Copy the full SHA 1a60905View commit details -
[libunwind][AIX] Remove weak declaration "__xlcxx_personality_v0" (#1…
…12436) `__xlcxx_personality_v0` is the personality routine in `libc++abi` for the EH of applications generated by the legacy IBM C++ compiler. Since the EH info generated by the legacy compiler does not provide the location of the personality routine, this routine is hard-coded as the handler for legacy EH in the unwinder. The symbol is resolved dynamically using `dlopen()` to avoid a hard dependency of `libunwind` on `libc++abi` for cases such as non-C++ applications. The weak declaration of `__xlcxx_personality_v0` was originally intended to bypass `dlopen()` if the C++ application generated by the legacy compiler is statically linked with the new LLVM C++ compiler. Unfortunately, this causes problems with runtime linking for Clang-compiled code using the unwinder that does not link with `libc++abi`. On the other hand, the C++ runtime libraries shipped for AIX are actually stripped and statically linking is not supported. So, we can fix the problem by removing the `__xlcxx_personality_v0` weak declaration. Besides, `dlopen()` would work as long as the libc++abi shared library is available.
Configuration menu - View commit details
-
Copy full SHA for 2ef24e0 - Browse repository at this point
Copy the full SHA 2ef24e0View commit details -
[clang][bytecode] Handle non-arrays in initElem{,Pop} (#112719)
... provided the given index is 0. Skip the atIndex() in that case.
Configuration menu - View commit details
-
Copy full SHA for 03888a9 - Browse repository at this point
Copy the full SHA 03888a9View commit details -
[HLSL] Simplify debug check in ResourceBindings::addDeclBindingInfo (…
…#112661) Follow-up for llvm/llvm-project#111203 (review).
Configuration menu - View commit details
-
Copy full SHA for f35a14d - Browse repository at this point
Copy the full SHA f35a14dView commit details -
Add atan2 test case for prior change in X86SelLowering.cpp (#112616)
When updating X86SelLowering.cpp for atan2, based on #96222, it was known that a needed change was missing which was merged later in #101268. However, the corresponding test update to `fp-strict-libcalls-msvc32.ll` was missed. This change rectifies that oversight. This also adds a missing label to the tanh test, since it's produced by update_llc_test_checks.py Part of: Implement the atan2 HLSL Function #70096.
Configuration menu - View commit details
-
Copy full SHA for dea213c - Browse repository at this point
Copy the full SHA dea213cView commit details -
[WebAssembly] Add atan2 to RuntimeLibcallSignatureTable (#112613)
This change is part of this proposal: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 - `WebAssemblyRuntimeLibcallSignatures.cpp`: Add `RTLIB::ATAN2*` to RuntimeLibcallSignatureTable - Add atan2 calls to `CodeGen/WebAssembly/libcalls-trig.ll` and update test checks Part of: Implement the atan2 HLSL Function #70096.
Configuration menu - View commit details
-
Copy full SHA for 2bebeea - Browse repository at this point
Copy the full SHA 2bebeeaView commit details -
[flang] Update printing values in dump-parse-tree (#112709)
Remove 'if std::string' that is covered by another branch of the if-statement. Add printing of 'bool' and 'int' values, since they have corresponding `GetNodeName` definitions.
Configuration menu - View commit details
-
Copy full SHA for 2c93598 - Browse repository at this point
Copy the full SHA 2c93598View commit details -
Configuration menu - View commit details
-
Copy full SHA for ed7868d - Browse repository at this point
Copy the full SHA ed7868dView commit details -
Configuration menu - View commit details
-
Copy full SHA for ed3d051 - Browse repository at this point
Copy the full SHA ed3d051View commit details -
[DirectX] Remove trivially dead functions at linkage finalize (#106146)
Functions are not removed even when made internal by DXILFinalizeLinkage. The removal code is called from alwaysinliner and globalopt, which are invoked too early to remove functions made internal by this pass. This adds a check similar to that in alwaysinliner that removes trivially dead functions after being marked internal. It refactors that code a bit to make it simpler including reversing what is stored int he work queue. Tests both the pass in isolation and the full i0nlining, linkage finalization and function removal steps. Fixes #106139
Configuration menu - View commit details
-
Copy full SHA for ab75180 - Browse repository at this point
Copy the full SHA ab75180View commit details -
Configuration menu - View commit details
-
Copy full SHA for a9d39ce - Browse repository at this point
Copy the full SHA a9d39ceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 29097dd - Browse repository at this point
Copy the full SHA 29097ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4e026b7 - Browse repository at this point
Copy the full SHA 4e026b7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 32b55f3 - Browse repository at this point
Copy the full SHA 32b55f3View commit details -
Merge from 'sycl' to 'sycl-web' (3 commits)
iclsrc committedOct 17, 2024 Configuration menu - View commit details
-
Copy full SHA for b583b80 - Browse repository at this point
Copy the full SHA b583b80View commit details -
[sanitizer] Large range support in IsAccessibleMemoryRange (#112665)
The comment stated that it's slow, but likely it's a deadlock, as write can be blocked. Also we can't be sure that `page_size * 10` is appropriate size. Still most likely this is NFC, as the max `size` we use is 32, and should fit in any buffer.
Configuration menu - View commit details
-
Copy full SHA for 7086584 - Browse repository at this point
Copy the full SHA 7086584View commit details -
[libc++] Remove obsolete Appveyor files (#112744)
We don't use Appveyor to run Windows CI anymore (in fact I don't know if we ever did). Our full Windows CI setup is based on Github actions, so these files are effectively dead code.
Configuration menu - View commit details
-
Copy full SHA for 2cd10f5 - Browse repository at this point
Copy the full SHA 2cd10f5View commit details -
[runtimes] Avoid cluttering the top-level build directory with test a…
…rtifacts (#112717) Instead of placing artifacts for testing the runtimes at <build>/test, place those artifacts at <build>/<project>/test. This prevents cluttering the build directory with the runtimes' test artifacts for everyone else. As a drive-by, remove LIBCXX_BINARY_INCLUDE_DIR which wasn't used anymore.
Configuration menu - View commit details
-
Copy full SHA for 8c77f4c - Browse repository at this point
Copy the full SHA 8c77f4cView commit details -
[libc++] Mark libc++ deallocation helpers as noexcept (#110884)
They already can't throw exceptions and they are called from noexcept functions, but they were not marked as noexcept. Depending on compiler inlining, this might not make a difference or this might improve the codegen a bit by removing the implicit try-catch block that Clang generates around non-noexcept functions called from noexcept functions. The original issue also mentioned that one occurrence of std::allocator::deallocate was missing noexcept, however it has since then been removed. Fixes #66100
Configuration menu - View commit details
-
Copy full SHA for e2d07fc - Browse repository at this point
Copy the full SHA e2d07fcView commit details -
[runtimes] Improve the documentation for LIBCXX_ADDITIONAL_COMPILE_FL…
…AGS (#112733) This clarifies how that option is meant to be used to avoid confusion. As a drive-by, also fix an incorrect usage in the recently-added GPU caches.
Configuration menu - View commit details
-
Copy full SHA for e674424 - Browse repository at this point
Copy the full SHA e674424View commit details -
[-Wunsafe-buffer-usage] Emit a warning if pointer returned by vector:…
…:data and array::data is cast to larger type (#111910) Emit a warning when the raw pointer retrieved from std::vector and std::array instances are cast to a larger type. Such a cast followed by a field dereference to the resulting pointer could cause an OOB access. This is similar to the existing span::data warning. (rdar://136704278) Co-authored-by: MalavikaSamak <malavika2@apple.com>
Configuration menu - View commit details
-
Copy full SHA for e913a33 - Browse repository at this point
Copy the full SHA e913a33View commit details -
[SCEVExpander] Expand UDiv avoiding UB when in seq_min/max. (#92177)
Update SCEVExpander to introduce an SafeUDivMode, which is set when expanding operands of SCEVSequentialMinMaxExpr. In this mode, the expander will make sure that the divisor of the expanded UDiv is neither 0 nor poison. Fixes llvm/llvm-project#89958. PR llvm/llvm-project#92177
Configuration menu - View commit details
-
Copy full SHA for b060661 - Browse repository at this point
Copy the full SHA b060661View commit details -
Configuration menu - View commit details
-
Copy full SHA for 871b2b3 - Browse repository at this point
Copy the full SHA 871b2b3View commit details -
[Clang] Disable use of the counted_by attribute for whole struct poin…
…ters (#112636) The whole struct is specificed in the __bdos. The calculation of the whole size of the structure can be done in two ways: 1) sizeof(struct S) + count * sizeof(typeof(fam)) 2) offsetof(struct S, fam) + count * sizeof(typeof(fam)) The first will add any remaining whitespace that might exist after allocation while the second method is more precise, but not quite expected from programmers. See [1] for a discussion of the topic. GCC isn't (currently) able to calculate __bdos on a pointer to the whole structure. Therefore, because of the above issue, we'll choose to match what GCC does for consistency's sake. [1] https://lore.kernel.org/lkml/ZvV6X5FPBBW7CO1f@archlinux/ Co-authored-by: Eli Friedman <efriedma@quicinc.com>
Configuration menu - View commit details
-
Copy full SHA for 8c62bf5 - Browse repository at this point
Copy the full SHA 8c62bf5View commit details -
[LLDB][Minidump] Add breakpoint stop reasons to the minidump. (#108448)
Recently my coworker @jeffreytan81 pointed out that Minidumps don't show breakpoints when collected. This was prior blocked because Minidumps could only contain 1 exception, now that we support N signals/sections we can save all the threads stopped on breakpoints.
Configuration menu - View commit details
-
Copy full SHA for 5033ea7 - Browse repository at this point
Copy the full SHA 5033ea7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4b82935 - Browse repository at this point
Copy the full SHA 4b82935View commit details -
[alpha.webkit.UncountedLocalVarsChecker] Recursive functions are erro…
…neously treated as non-trivial (#110973) This PR fixes the bug that alpha.webkit.UncountedLocalVarsChecker erroneously treats a trivial recursive function as non-trivial. This was caused by TrivialFunctionAnalysis::isTrivialImpl which takes a statement as an argument populating the cache with "false" while traversing the statement to determine its triviality within a recursive function in TrivialFunctionAnalysisVisitor's WithCachedResult. Because IsFunctionTrivial honors an entry in the cache, this resulted in the whole function to be treated as non-trivial. Thankfully, TrivialFunctionAnalysisVisitor::IsFunctionTrivial already handles recursive functions correctly so this PR applies the same logic to TrivialFunctionAnalysisVisitor::WithCachedResult by sharing code between the two functions. This avoids the cache to be pre-populated with "false" while traversing statements in a recurisve function.
Configuration menu - View commit details
-
Copy full SHA for 71b81e9 - Browse repository at this point
Copy the full SHA 71b81e9View commit details -
[sanitizer] Add TryMemCpy (#112668)
For posix implementation is similar to `IsAccessibleMemoryRange`, using `pipe`. We need this because we can't rely on non-atomic `IsAccessibleMemoryRange` + `memcpy`, as the protection or mapping may change and we may crash.
Configuration menu - View commit details
-
Copy full SHA for 46df20a - Browse repository at this point
Copy the full SHA 46df20aView commit details
Commits on Oct 18, 2024
-
[sanitizer] Add MemCpyAccessible (#112794)
A layer over `TryMemCpy` to copy only available pages.
Configuration menu - View commit details
-
Copy full SHA for 7106de9 - Browse repository at this point
Copy the full SHA 7106de9View commit details -
[HLSL] Add handle initialization for simple resource declarations (#1…
…11207) Adds `@_init_resource_bindings()` function to module initialization that includes `handle.fromBinding` intrinsic calls for simple resource declarations. Arrays of resources or resources inside user defined types are not supported yet. While this unblocks our progress on [Compile a runnable shader from clang](llvm/wg-hlsl#7) milestone, this is probably not the way we would like to handle resource binding initialization going forward. Ideally, it should be done via the resource class constructors in order to support dynamic resource binding or unbounded arrays if resources. Depends on PRs #110327 and #111203. Part 1 of #105076
Configuration menu - View commit details
-
Copy full SHA for 7dbfa7b - Browse repository at this point
Copy the full SHA 7dbfa7bView commit details -
[LoongArch] [CodeGen] Add options for Clang to generate LoongArch-spe…
…cific frecipe & frsqrte instructions (#109917) Two options: `-mfrecipe` & `-mno-frecipe`. Enable or Disable frecipe.{s/d} and frsqrte.{s/d} instructions. The default is `-mno-frecipe`.
Configuration menu - View commit details
-
Copy full SHA for e9eec14 - Browse repository at this point
Copy the full SHA e9eec14View commit details -
Reapply "[Inliner] Propagate more attributes to params when inlining …
…(#91101)" (2nd Attempt) (#112749) Root cause of the bug was code hanging onto `range` attr after changing BitWidth. This was fixed in PR #112633.
Configuration menu - View commit details
-
Copy full SHA for 69a798a - Browse repository at this point
Copy the full SHA 69a798aView commit details -
[BOLT] Fix logs with no hex convension (#112650)
Add `utohexstr` to ensure that offsets/addresses are correctly formatted as hexadecimal values.
Configuration menu - View commit details
-
Copy full SHA for c3bbc3a - Browse repository at this point
Copy the full SHA c3bbc3aView commit details -
[mlir][transforms] Add
signalPassFailure
in RemoveDeadValues (#112199)This PR adds `signalPassFailure` in RemoveDeadValues to ensure that a pipeline would stop here. Fixes #111757.
Configuration menu - View commit details
-
Copy full SHA for 70865c4 - Browse repository at this point
Copy the full SHA 70865c4View commit details -
[PowerPC][ISelLowering] Support -mstack-protector-guard=tls (#110928)
Add support for using a thread-local variable with a specified offset for holding the stack guard canary value. This supports both 32- and 64- bit PowerPC targets. This mirrors changes from #108942 but targeting PowerPC instead of RISCV. Because both of these PRs modify the same driver functions, this series is stack on top of the RISC-V one. --------- Signed-off-by: Keith Packard <keithp@keithp.com>
Configuration menu - View commit details
-
Copy full SHA for 44b020a - Browse repository at this point
Copy the full SHA 44b020aView commit details -
Configuration menu - View commit details
-
Copy full SHA for bbccc52 - Browse repository at this point
Copy the full SHA bbccc52View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c6f85f - Browse repository at this point
Copy the full SHA 9c6f85fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 67f576f - Browse repository at this point
Copy the full SHA 67f576fView commit details -
Configuration menu - View commit details
-
Copy full SHA for d989c24 - Browse repository at this point
Copy the full SHA d989c24View commit details -
[Flang][OpenMP] Add semantic checks for Worshare construct (#111358)
Add missing semantic checks for the Workshare construct: OpenMP 5.2: 11.4 Workshare Construct - The construct must not contain any user-defined function calls unless either the function is pure and elemental or the function call is contained inside a parallel construct that is nested inside the workshare construct. (Flang-new used to check only the elemental function, but now it needs to be an impure elemental function) - At most one NoWait clause can appear in the Workshare construct. - Add tests for the same.
Configuration menu - View commit details
-
Copy full SHA for 2526455 - Browse repository at this point
Copy the full SHA 2526455View commit details -
[NFC][Flang][Test] Add some missing tests (#110468)
- At most one Collapse clause in SIMD construct - A DO loop must follow the SIMD directive
Configuration menu - View commit details
-
Copy full SHA for b340310 - Browse repository at this point
Copy the full SHA b340310View commit details -
[Flang][Semantics] Add a semantic check for simd construct (#109089)
Add missing semantic check for the SAFELEN clause in the SIMD Order construct
Configuration menu - View commit details
-
Copy full SHA for e6321d9 - Browse repository at this point
Copy the full SHA e6321d9View commit details -
[VPlan] Use VPInstruction::Name when assigning names (NFCI).
This slightly improves the printing of VPInstructions. NFC except debug output.
Configuration menu - View commit details
-
Copy full SHA for b497010 - Browse repository at this point
Copy the full SHA b497010View commit details -
[llvm] Consistently respect
naked
fn attribute in `TargetFrameLower……ing::hasFP()` (#106014) Some targets (e.g. PPC and Hexagon) already did this. I think it's best to do this consistently so that frontend authors don't run into inconsistent results when they emit `naked` functions. For example, in Zig, we had to change our emit code to also set `frame-pointer=none` to get reliable results across targets. Note: I don't have commit access.
Configuration menu - View commit details
-
Copy full SHA for ad4a582 - Browse repository at this point
Copy the full SHA ad4a582View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d08625 - Browse repository at this point
Copy the full SHA 5d08625View commit details -
Configuration menu - View commit details
-
Copy full SHA for f225b07 - Browse repository at this point
Copy the full SHA f225b07View commit details -
[PAC] Fix address discrimination for type info vtable pointers (#102199)
In #99726, `-fptrauth-type-info-vtable-pointer-discrimination` was introduced, which is intended to enable type and address discrimination for type_info vtable pointers. However, some codegen logic for actually enabling address discrimination was missing. This patch addresses the issue. Fixes #101716
Configuration menu - View commit details
-
Copy full SHA for 6bb6300 - Browse repository at this point
Copy the full SHA 6bb6300View commit details -
[RISCV] Implement trampolines for rv64 (#96309)
This is implementation is based on what the X86 target does but emitting the instructions that GCC emits for rv64. --------- Co-authored-by: Pengcheng Wang <wangpengcheng.pp@bytedance.com>
Configuration menu - View commit details
-
Copy full SHA for 9d469b5 - Browse repository at this point
Copy the full SHA 9d469b5View commit details -
[TLI] Add support for the
ilogb
libcall. (#112725)This patch adds the `ilogb` libcall. Constant folding will be handled in subsequent patches.
Configuration menu - View commit details
-
Copy full SHA for 761fa58 - Browse repository at this point
Copy the full SHA 761fa58View commit details -
[libc][complex] Added support for CFP16 and CFP128 (#112594)
Fixes: #112217
Configuration menu - View commit details
-
Copy full SHA for 7be4ab0 - Browse repository at this point
Copy the full SHA 7be4ab0View commit details -
[MLIR][BUILD] Fix Unicode build issue on Windows. (#112300)
This issue is from llvm/llvm-project#77668. I encountered a build issue because it used Unicode. When I built MLIR on Windows with Visual Studio 2022, I faced a build failure. --------- Co-authored-by: Harrison Hao <harrison.hao@amd.com>
Configuration menu - View commit details
-
Copy full SHA for eb446eb - Browse repository at this point
Copy the full SHA eb446ebView commit details -
[RISCV] Check if v extension is enabled by the function features for …
…the builtins not in Zve64*. (#112827) Fixes: llvm/llvm-project#109694
Configuration menu - View commit details
-
Copy full SHA for e8509a4 - Browse repository at this point
Copy the full SHA e8509a4View commit details -
[clang-tidy] rewrite matchers in modernize-use-starts-ends-with (#112…
…101) Rewrite the AST matchers for slightly more composability. Furthermore, check that the `starts_with` and `ends_with` functions return a `bool`. There is one behavioral change, in that the methods of a class (and transitive classes) are searched once for a matching `starts_with`/`ends_with` function, picking the first it can find. Previously, the matchers would try to find `starts_with`, then `startsWith`, and finally, `startswith`. Now, the first of the three that is encountered will be the matched method. --------- Co-authored-by: Nicolas van Kempen <nvankemp@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 18b5018 - Browse repository at this point
Copy the full SHA 18b5018View commit details -
[OpenCL] Replace a CreatePointerCast call; NFC (#112676)
With opaque pointers, the only purpose of the cast here is to cast between address spaces, similar to the 4-argument case below.
Configuration menu - View commit details
-
Copy full SHA for 5a09ce9 - Browse repository at this point
Copy the full SHA 5a09ce9View commit details -
[APInt] Enable APInt ctor assertion by default (#112670)
This enables the assertion introduced in llvm/llvm-project#106524, which checks that the value passed to the APInt constructor is indeed a valid N-bit signed or unsigned integer. Places that previously violated the assertion were updated in advance, e.g. in llvm/llvm-project#80309. It is possible to opt-out of the check and restore the previous behavior by setting implicitTrunc=true.
Configuration menu - View commit details
-
Copy full SHA for ba1ee2b - Browse repository at this point
Copy the full SHA ba1ee2bView commit details -
[flang] add getElementType() to fir::SquenceType and fir::VectorType …
…(#112770) getElementType() was missing from Sequence and Vector types. Did a replace of the obvious places getEleTy() was used for these two types and updated to use this name instead. Co-authored-by: Scott Manley <scmanley@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e6a4346 - Browse repository at this point
Copy the full SHA e6a4346View commit details -
[flang][Driver] Add support for -f[no-]wrapv and -f[no]-strict-overfl…
…ow in the frontend (#110061) This patch introduces the options for integer overflow flags into Flang. The behavior is similar to that of Clang.
Configuration menu - View commit details
-
Copy full SHA for 9698e57 - Browse repository at this point
Copy the full SHA 9698e57View commit details -
[clang][bytecode] Ignore explicit calls to trivial dtors (#112841)
This is what the current interpreter does as well.
Configuration menu - View commit details
-
Copy full SHA for 9d0616c - Browse repository at this point
Copy the full SHA 9d0616cView commit details -
[mlir][vector] Clarify the semantics of masking maps (nfc) (#111383)
We use the term "masking map" throughout the Linalg vectorization logic, but we don't really define what it is and how it differs from Linalg indexing maps. This PR clarifies the differnces, makes sure that the new terminology is used consistenty and improves code re-use.
Configuration menu - View commit details
-
Copy full SHA for f7f51f2 - Browse repository at this point
Copy the full SHA f7f51f2View commit details -
Revert "[ARM] Fix frame chains with M-profile PACBTI (#110285)"
Reverting because this is causing failures with MSan: https://lab.llvm.org/buildbot/#/builders/169/builds/4378 This reverts commit e1f8f84.
Configuration menu - View commit details
-
Copy full SHA for 18ac017 - Browse repository at this point
Copy the full SHA 18ac017View commit details -
[CLANG][AArch64]Add SVE tuple types for mfloat8_t (#112687)
This patch adds scalable tuple types vectors for MFloat_8 type, according to the ACLE[1]. [1] https://github.com/ARM-software/acle.git
Configuration menu - View commit details
-
Copy full SHA for 508fd96 - Browse repository at this point
Copy the full SHA 508fd96View commit details -
[clang] Deduplicate the logic that only warns once when stack is almo…
…st full (#112552) Zero diff in behavior.
Configuration menu - View commit details
-
Copy full SHA for 09cc75e - Browse repository at this point
Copy the full SHA 09cc75eView commit details -
[clang] Fix C23 constexpr crashes (#112708)
Before using a constexpr variable that is not properly initialized check that it is valid. Fixes llvm/llvm-project#109095 Fixes llvm/llvm-project#112516
Configuration menu - View commit details
-
Copy full SHA for abfba7d - Browse repository at this point
Copy the full SHA abfba7dView commit details -
[FuncSpec] Update MinFunctionSize logic (#112711)
Always require functions to be larger than MinFunctionSize when SpecializeLiteralConstant is enabled, and increase MinFunctionSize to 500, to prevent excessive triggering of specialisations on small functions.
Configuration menu - View commit details
-
Copy full SHA for 0d1a91e - Browse repository at this point
Copy the full SHA 0d1a91eView commit details -
Configuration menu - View commit details
-
Copy full SHA for d1ee850 - Browse repository at this point
Copy the full SHA d1ee850View commit details -
[mlir][tensor] Fix off-by-one error in ReshapeOpsUtils (#112774)
This patch fixes an off-by-one error in `mlir::getReassociationIndicesForCollapse()` that occurs when the last two dims of the source tensor satisfy the while loop. This would cause an assertion failure due to out-of-bounds-access, which is now fixed.
Configuration menu - View commit details
-
Copy full SHA for 2f15d7e - Browse repository at this point
Copy the full SHA 2f15d7eView commit details -
Configuration menu - View commit details
-
Copy full SHA for bafc66e - Browse repository at this point
Copy the full SHA bafc66eView commit details -
Sink/test: increase coverage of invariant-load (#112690)
Tests adapted from: https://discourse.llvm.org/t/sinking-does-any-llvm-pass-currently-handle-load-sinking-for-invariant-loads/79643 We don't add tests for llvm.invariant.{start,end} though, as these are very difficult to support architecturally.
Configuration menu - View commit details
-
Copy full SHA for 7eaf92b - Browse repository at this point
Copy the full SHA 7eaf92bView commit details -
[AArch64][GlobalISel] Add some post-legalization cast combines. (#112…
…509) This helps clear up some of the legalization artefacts. Not all of the cast_combines are added (notably select combines) as they currently have questionable benefit in the test updates.
Configuration menu - View commit details
-
Copy full SHA for 2f792f6 - Browse repository at this point
Copy the full SHA 2f792f6View commit details -
[DAG] visitABS - use FoldConstantArithmetic to attempt to constant fold
Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us. Cleanup for #112682
Configuration menu - View commit details
-
Copy full SHA for c72992b - Browse repository at this point
Copy the full SHA c72992bView commit details -
[DAG] visitXROUND - use FoldConstantArithmetic to attempt to constant…
… fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us.
Configuration menu - View commit details
-
Copy full SHA for 7a43be1 - Browse repository at this point
Copy the full SHA 7a43be1View commit details -
[DAG] visitFP_ROUND - use FoldConstantArithmetic to attempt to consta…
…nt fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us.
Configuration menu - View commit details
-
Copy full SHA for 3a1df05 - Browse repository at this point
Copy the full SHA 3a1df05View commit details -
[DAG] visitFP_EXTEND - use FoldConstantArithmetic to attempt to const…
…ant fold Don't rely on isConstantFPBuildVectorOrConstantFP followed by getNode() will constant fold - FoldConstantArithmetic will do all of this for us.
Configuration menu - View commit details
-
Copy full SHA for 3ec1b1a - Browse repository at this point
Copy the full SHA 3ec1b1aView commit details -
[AArch64][SVE] Support lowering fixed-length BUILD_VECTORS to ZIPs (#…
…111698) This allows lowering fixed-length (non-constant) BUILD_VECTORS (<= 128-bit) to a chain of ZIP1 instructions when Neon is not available, rather than using the default lowering, which is to spill to the stack and reload. For example, ``` t5: v4f32 = BUILD_VECTOR(t0, t1, t2, t3) ``` Becomes: ``` zip1 z0.s, z0.s, z1.s // z0 = t0,t1,... zip1 z2.s, z2.s, z3.s // z2 = t2,t3,... zip1 z0.d, z0.d, z2.d // z0 = t0,t1,t2,t3,... ``` When values are already in FRPs, this generally seems to lead to a more compact output with less movement to/from the stack.
Configuration menu - View commit details
-
Copy full SHA for 5f7502b - Browse repository at this point
Copy the full SHA 5f7502bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7b65971 - Browse repository at this point
Copy the full SHA 7b65971View commit details -
[RISCV] Inline Assembly: RVC constraint and N modifier (#112561)
This change implements support for the `cr` and `cf` register constraints (which allocate a RVC GPR or RVC FPR respectively), and the `N` modifier (which prints the raw encoding of a register rather than the name). The intention behind these additions is to make it easier to use inline assembly when assembling raw instructions that are not supported by the compiler, for instance when experimenting with new instructions or when supporting proprietary extensions outside the toolchain. These implement part of my proposal in riscv-non-isa/riscv-c-api-doc#92 As part of the implementation, I felt there was not enough coverage of inline assembly and the "in X" floating-point extensions, so I have added more regression tests around these configurations.
Configuration menu - View commit details
-
Copy full SHA for 228f88f - Browse repository at this point
Copy the full SHA 228f88fView commit details -
[NFC] Fix implicit-fallthrough warnings (#112825)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warnings: clang/lib/Sema/SemaFunctionEffects.cpp:1531:5: error: this statement may fall through [-Werror=implicit-fallthrough=] 1531 | switch (DiffKind) { | ^~~~~~
Configuration menu - View commit details
-
Copy full SHA for 19aa4c8 - Browse repository at this point
Copy the full SHA 19aa4c8View commit details -
[X86] Add FMA constant folding test coverage
Shows we constant fold scalars but not vectors
Configuration menu - View commit details
-
Copy full SHA for 4e01690 - Browse repository at this point
Copy the full SHA 4e01690View commit details -
[llvm][TableGen] Count implicit defs as well as explicit ones in the …
…GlobalISel TableGen emitter (#112673) `NumDefs` only counts the number of registers in `(outs)`, not any implicit defs specified with `Defs = [...]` This causes patterns with physical register defs to fail to import here instead of later where implicit defs are rendered. Add on `ImplicitDefs.size()` to count both and create `DstExpDefs` to count only explicit defs, used later on.
Configuration menu - View commit details
-
Copy full SHA for 8f6d491 - Browse repository at this point
Copy the full SHA 8f6d491View commit details -
[DAG] isConstantIntBuildVectorOrConstantInt - peek through bitcasts (…
…#112710) Alter both isConstantIntBuildVectorOrConstantInt + isConstantFPBuildVectorOrConstantFP to return a bool instead of the underlying SDNode, and adjust usage to account for this. Update isConstantIntBuildVectorOrConstantInt to peek though bitcasts when attempting to find a constant, in particular this improves canonicalization of constants to the RHS on commutable instructions. X86 is the beneficiary here as it often bitcasts rematerializable 0/-1 vector constants as vXi32 and bitcasts to the requested type Minor cleanup that helps with #107423
Configuration menu - View commit details
-
Copy full SHA for a630771 - Browse repository at this point
Copy the full SHA a630771View commit details -
[flang][OpenMP][test] re-add complex atomic capture regression test (…
…#112736) This was reverted in llvm/llvm-project#110969 due to a failure on aarch64. Weirdly aarch64 (but apparently not x86?) has a spurious phi instruction. flang -fc1 -emit-llvm will run midle-end optimization passes. Presumably one of those is behaving differently on different targets. I have adapted the test to work correctly on aarch64. The difference is in the RUN lines and the atomic exit block.
Configuration menu - View commit details
-
Copy full SHA for 6ce4b6d - Browse repository at this point
Copy the full SHA 6ce4b6dView commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedOct 18, 2024 Configuration menu - View commit details
-
Copy full SHA for 2663912 - Browse repository at this point
Copy the full SHA 2663912View commit details -
Revert "[AArch64][SVE] Enable max vector bandwidth for SVE" (#112873)
Reverts llvm/llvm-project#109671 Reverting due to some performance regressions on neoverse-v1.
Configuration menu - View commit details
-
Copy full SHA for 091a235 - Browse repository at this point
Copy the full SHA 091a235View commit details -
Revert "[APInt] Enable APInt ctor assertion by default (#112670)"
This reverts commit ba1ee2b. Causes some buildbot failures on aarch64.
Configuration menu - View commit details
-
Copy full SHA for 9065b75 - Browse repository at this point
Copy the full SHA 9065b75View commit details -
[DAG] visitFMA/FMAD - use FoldConstantArithmetic to add missing vecto…
…r constant folding support
Configuration menu - View commit details
-
Copy full SHA for 5c37316 - Browse repository at this point
Copy the full SHA 5c37316View commit details -
Configuration menu - View commit details
-
Copy full SHA for c7d1163 - Browse repository at this point
Copy the full SHA c7d1163View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8e6abf5 - Browse repository at this point
Copy the full SHA 8e6abf5View commit details -
[clang][ASTImporter] Fix of unchecked Error object (#112688)
After commits 9c72a30 and 30a9cac error handling in function 'importTemplateParameterDefaultArgument' was not correct, probably related to (not) using std::move. A crash with unchecked Error result could happen when the import error path was taken. Here a test is added that reproduces this case and the problem is fixed.
Configuration menu - View commit details
-
Copy full SHA for 55cbbce - Browse repository at this point
Copy the full SHA 55cbbceView commit details -
Configuration menu - View commit details
-
Copy full SHA for e1330d9 - Browse repository at this point
Copy the full SHA e1330d9View commit details -
[libc][complex] add additonal header guards for CFP16 and CFP128 (#11…
…2879) Fixes build errors due to #112594
Configuration menu - View commit details
-
Copy full SHA for d508701 - Browse repository at this point
Copy the full SHA d508701View commit details -
[clang][bytecode] Check for memory leaks after destroying global scop…
…e (#112868) The global scope we create when evaluating expressions might free some of the dynamic memory allocations, so we can't check for memory leaks before destroying it.
Configuration menu - View commit details
-
Copy full SHA for 3eaf4a9 - Browse repository at this point
Copy the full SHA 3eaf4a9View commit details -
[clang] constexpr built-in abs function. (#112539)
According to [P0533R9](https://wg21.link/P0533R9), the C++ standard library functions corresponding to the C macros in `[c.math.abs]` are now `constexpr`. To implement this feature in libc++, we must make the built-in abs function `constexpr`. This patch adds the implementation of a `constexpr` abs function for the current constant evaluator and the new bytecode interpreter. It is important to note that in 2's complement systems, the absolute value of the most negative value is out of range. In gcc, it will result in an out-of-range error and will not be evaluated as constants. We follow the same approach here.
Configuration menu - View commit details
-
Copy full SHA for 332ac18 - Browse repository at this point
Copy the full SHA 332ac18View commit details -
[mlir][tensor] Add tests to invalid.mlir (nfc) (#112759)
Adds two test with invalid usage of `tensor.extract_slice` that were missing. Also moves one other test for `tensor.extract_slice`, so that all tests for this Op are clustered together. Note, this PR merely documents the current behaviour. No new functionality is added.
Configuration menu - View commit details
-
Copy full SHA for 1a871b2 - Browse repository at this point
Copy the full SHA 1a871b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4995d09 - Browse repository at this point
Copy the full SHA 4995d09View commit details -
[PassRegistry] Add complex deinterleaving pass to PassRegistry.def (#…
…112874) Allow for the complex deinterleaving pass to be invoked via `opt --passes=complex-deinterleaving`
Configuration menu - View commit details
-
Copy full SHA for 7be1dc0 - Browse repository at this point
Copy the full SHA 7be1dc0View commit details -
[flang][OpenMP] Add
Id
function toOmpClause
to return clause id,……… (#112712) … NFC This replaces the two instances of `GetClauseKindForParserClass` with a localized member function.
Configuration menu - View commit details
-
Copy full SHA for 852e477 - Browse repository at this point
Copy the full SHA 852e477View commit details -
[libc][math][c23] Add tanhf16 C23 math function (#106006)
Part of #95250.
Configuration menu - View commit details
-
Copy full SHA for fdd7c03 - Browse repository at this point
Copy the full SHA fdd7c03View commit details -
[libc++] Remove obsolete _LIBCPP_PREDEFINED_OBJC_ARC_ADDRESSOF (#112745)
I don't know if that macro was ever truly defined by Clang, however it's not anymore, so that is effectively dead code.
Configuration menu - View commit details
-
Copy full SHA for 3a30955 - Browse repository at this point
Copy the full SHA 3a30955View commit details -
[ASan][windows] Recognize movzx r11d, BYTE PTR [rdx] in interception_…
…win (#111638) The instruction is present in some library in the 24H2 update for Windows 11: ==8508==interception_win: unhandled instruction at 0x7ff83e193a40: 44 0f b6 1a 4c 8b d2 48 This could be generalized, but getting all the ModR/M byte combinations right is tricky. Many other classes of instructions handled in this file could use some generalization too.
Configuration menu - View commit details
-
Copy full SHA for ce4618a - Browse repository at this point
Copy the full SHA ce4618aView commit details -
[NFC][X86] Fix Werror=extra error due to enum (#112812)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warnings: llvm-project/llvm/lib/Target/X86/X86FastISel.cpp: In member function ‘virtual bool {anonymous}::X86FastISel::fastLowerCall(llvm::FastISel::CallLoweringInfo&)’: llvm-project/llvm/lib/Target/X86/X86FastISel.cpp:3547: error: enumerated and non-enumerated type in conditional expression [-Werror=extra] 3547 | MIB.addReg(Is64Bit ? X86::RIP : 0).addImm(1).addReg(0);
Configuration menu - View commit details
-
Copy full SHA for d7ae43e - Browse repository at this point
Copy the full SHA d7ae43eView commit details -
[NFC] Fix multi-character character constant warning (#112809)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warning: In file included from llvm-project/llvm/lib/BinaryFormat/Minidump.cpp:9: llvm-project/llvm/include/llvm/BinaryFormat/Minidump.h:250:37: error: multi-character character constant [-Werror=multichar] 250 | static const uint32_t LLDB_FLAG = 'LLDB';
Configuration menu - View commit details
-
Copy full SHA for c742a5d - Browse repository at this point
Copy the full SHA c742a5dView commit details -
[RISCV][TTI] Recognize CONCAT_VECTORS if a shufflevector mask is mult…
…iple insert subvector. (#111459) reference: llvm/llvm-project#110457
Configuration menu - View commit details
-
Copy full SHA for 12bcea3 - Browse repository at this point
Copy the full SHA 12bcea3View commit details -
[LVI] Infer non-zero from equality icmp (#112838)
This following pattern is common in loop headers: ``` %101 = sub nuw i64 %78, %98 %103 = icmp eq i64 %78, %98 br i1 %103, label %.thread.i.i, label %.preheader.preheader.i.i .preheader.preheader.i.i: %invariant.umin.i.i = call i64 @llvm.umin.i64(i64 %101, i64 9) %umax.i = call i64 @llvm.umax.i64(i64 %invariant.umin.i.i, i64 1) br label %.preheader.i.i .preheader.i.i: ... %116 = add nuw nsw i64 %.011.i.i, 1 %exitcond.not.i = icmp eq i64 %116, %umax.i br i1 %exitcond.not.i, label %.critedge.i.i, label %.preheader.i.i ``` As `%78` is not equal to `%98` in BB `.preheader.preheader.i.i`, we can prove `%101` is non-zero. Then we can simplify the loop exit condition. Addresses regression introduced by llvm/llvm-project#112742.
Configuration menu - View commit details
-
Copy full SHA for c89d731 - Browse repository at this point
Copy the full SHA c89d731View commit details -
[lldb][test] Add test for ASTImporter's name conflict resolution (#11…
…2566) This is a reduced test case from a crash we've observed in the past. The assertion that this test triggers is: ``` Assertion failed: ((Pos == ImportedDecls.end() || Pos->second == To) && "Try to import an already imported Decl"), function MapImported, file ASTImporter.cpp, line 10494. ``` In a non-asserts build we crash later on in the ASTImporter. The root cause is, as the assertion above points out, that we erroneously replace an existing `From->To` decl mapping with a `To` decl that isn't complete. Then we try to complete it but it has no definition and we dereference a nullptr. The reason this happens is basically what's been described in https://reviews.llvm.org/D67803?id=220956#1676588 The dylib contains a definition of `Service` which is different to the one in the main executable. When we start dumping the children of the variable we're printing, we start completing it's members, `ASTImport`ing fields in the process. When the ASTImporter realizes there's been a name conflict (i.e., a structural mismatch on the `Service` type) it would usually report back an error. However, LLDB uses `ODRHandlingType::Liberal`, which means we create a new decl for the ODR'd type instead of re-using the previously mapped decl. Eventually this leads us to crash. Ideally we'd be using `ODRHandlingType::Conservative` and warn/error, though LLDB relies on this in some cases (particularly for distinguishing template specializations, though maybe there's better a way to deal with those). We should really warn the user when this happens and not crash. To avoid the crash we'd need to know to not create a decl for the ODR violation, and instead re-use the definition we've previously seen. Though I'm not yet sure that's viable for all of LLDB's use-cases (where ODR violations might legimiately occur in a program, e.g., with opaque definitions, etc.).
Configuration menu - View commit details
-
Copy full SHA for 3bc765d - Browse repository at this point
Copy the full SHA 3bc765dView commit details -
[CodeGen] Fix return type of PHI_iterator::getIncomingValue. NFC.
This is supposed to match ValT aka Register.
Configuration menu - View commit details
-
Copy full SHA for b7bc1d0 - Browse repository at this point
Copy the full SHA b7bc1d0View commit details -
[Clang] Fix an assertion in expression recovery (#112888)
Explicit object member function calls are not modelled as member calls Fixes #112559
Configuration menu - View commit details
-
Copy full SHA for af90e7c - Browse repository at this point
Copy the full SHA af90e7cView commit details -
[libc][math] Add option to set a specific exponent for frexp with Inf…
…/NaN inputs. (#112387) In IEEE 754 and C standards, when calling `frexp` with Inf/Nan inputs, the exponent result is unspecified. In this case, FreeBSD libc and musl just passthrough `exp`, while glibc, FreeBSD libm set exp = 0, and MSVC set exp = -1. By default, LLVM libc will passthrough `exp` just as FreeBSD libc and musl, but we also allow users to explicitly choose the return exp value in this case for compatibility with other libc. Notice that, gcc did generate passthrough `exp` for `frexp(NaN/Inf, exp)`: https://godbolt.org/z/sM8fEej4E
Configuration menu - View commit details
-
Copy full SHA for b0dbd2c - Browse repository at this point
Copy the full SHA b0dbd2cView commit details -
[libc++] Re-add attribute macro to clang-format (#112746)
That macro was removed incorrectly from the clang-format file because it had a typo in its name. However, the macro with the right name is still being used in the library (sadly, in a single place).
Configuration menu - View commit details
-
Copy full SHA for 90bc60c - Browse repository at this point
Copy the full SHA 90bc60cView commit details -
[analyzer][Solver] Teach SymbolicRangeInferrer about commutativity (2…
…/2) (#112887) This patch should not introduce much overhead as it only does one more constraint map lookup, which is really quick. Depends on #112583
Configuration menu - View commit details
-
Copy full SHA for 67e8421 - Browse repository at this point
Copy the full SHA 67e8421View commit details -
[analyzer][Solver][NFC] Cleanup const-correctness inside range-based …
…solver (#112891)
Configuration menu - View commit details
-
Copy full SHA for 1b49ee7 - Browse repository at this point
Copy the full SHA 1b49ee7View commit details -
[libc] changed the return cast from static_cast<float16> to fputil::c…
…ast<float16> in exp10f16.cpp. (#112889) switch to fputil::cast to fix rounding with compiler-rt
Configuration menu - View commit details
-
Copy full SHA for 803220d - Browse repository at this point
Copy the full SHA 803220dView commit details -
[SystemZ] Rename SystemZ ATT Asm dialect to GNU Asm dialect (#112800)
The ATT assembler dialect on SystemZ seems to have been taken from the existing ATT/Intel code. However, on SystemZ, ATT does not hold any meaning. In reality, we are splitting the difference between GNU Asm syntax and HLASM Asm syntax, so it makes sense to rename ATT to GNU instead. Co-authored-by: Tony Tao <tonytao@ca.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 783901b - Browse repository at this point
Copy the full SHA 783901bView commit details -
[mlir][linalg] Fix idx comparison in the vectorizer (#112900)
Fixes loop comparison condition in the vectorizer. As that logic is used specifically for vectorising `tensor.extract`, I also added a test that violates the assumptions made inside `getTrailingNonUnitLoopDimIdx`, namely that Linalg loops are non-empty. Vectorizer pre-conditions will capture that much earlier making sure that `getTrailingNonUnitLoopDimIdx` is only run when all the assumptions are actually met. Thank you for pointing this out, @pfusik !
Configuration menu - View commit details
-
Copy full SHA for 0a3347d - Browse repository at this point
Copy the full SHA 0a3347dView commit details -
Merge from 'main' to 'sycl-web' (67 commits)
CONFLICT (content): Merge conflict in clang/include/clang/Serialization/ASTBitCodes.h CONFLICT (content): Merge conflict in clang/test/Modules/no-external-type-id.cppm
Configuration menu - View commit details
-
Copy full SHA for 58b4c80 - Browse repository at this point
Copy the full SHA 58b4c80View commit details -
[TableGen] Remove unused tokens FalseKW and TrueKW
These were introduced in https://reviews.llvm.org/D90635 but never used.
Configuration menu - View commit details
-
Copy full SHA for 68efaaa - Browse repository at this point
Copy the full SHA 68efaaaView commit details -
[Offload] Fix not copying the buffer identifier of offloading files
Summary: This caused an error when copying a file of the same name when multiple architectures needed the file. The buffer identifier which we use for the name in `-save-temps` mode would be empty and create in invalid filename. Copy this correctly now.
Configuration menu - View commit details
-
Copy full SHA for c27aae0 - Browse repository at this point
Copy the full SHA c27aae0View commit details -
[LLVM][TableGen] Change all
Init
pointers to const (#112705)This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089
Configuration menu - View commit details
-
Copy full SHA for 62e2c7f - Browse repository at this point
Copy the full SHA 62e2c7fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 922992a - Browse repository at this point
Copy the full SHA 922992aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 00d30bd - Browse repository at this point
Copy the full SHA 00d30bdView commit details -
Merge from 'sycl' to 'sycl-web' (3 commits)
iclsrc committedOct 18, 2024 Configuration menu - View commit details
-
Copy full SHA for e08259d - Browse repository at this point
Copy the full SHA e08259dView commit details -
[SandboxIR] Implement Operator (#112805)
This patch implements sandboxir::Operator mirroring llvm::Operator.
Configuration menu - View commit details
-
Copy full SHA for 54566ba - Browse repository at this point
Copy the full SHA 54566baView commit details -
Configuration menu - View commit details
-
Copy full SHA for dbe47c2 - Browse repository at this point
Copy the full SHA dbe47c2View commit details -
[libc++] __uglify non-conforming member typedef
base
(#112843)Currently, libc++'s `bitset`, `forward_list`, and `list` have non-conforming member typedef name `base`. The typedef is private, but can cause ambiguity in name lookup. Some other classes in libc++ that are either implementation details or not precisely specified by the standard also have member typdef `base`. I think this can still be conforming. Follows up #80706 and #111127.
Configuration menu - View commit details
-
Copy full SHA for 397707f - Browse repository at this point
Copy the full SHA 397707fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9f264e4 - Browse repository at this point
Copy the full SHA 9f264e4View commit details -
Configuration menu - View commit details
-
Copy full SHA for a99bf0f - Browse repository at this point
Copy the full SHA a99bf0fView commit details -
[llvm] prefer isa_and_nonnull over v && isa (#112541)
Use `isa_and_nonnull<T>(v)` instead of `v && isa<T>(v)`, where `v` is evaluated twice in the latter.
Configuration menu - View commit details
-
Copy full SHA for 721b796 - Browse repository at this point
Copy the full SHA 721b796View commit details -
[M68k] ARII atomic load/store (#108982)
Only ARI was supported, this PR adds ARII support for atomic loads/stores (also with zero displacement). Closes #107939
Configuration menu - View commit details
-
Copy full SHA for e13f1d1 - Browse repository at this point
Copy the full SHA e13f1d1View commit details -
[NFC][GOFF] Fix char overflow (#112826)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warnining: llvm/unittests/Object/GOFFObjectFileTest.cpp:511:17: error: overflow in conversion from ‘int’ to ‘char’ changes value from ‘240’ to ‘'\37777777760'’ [-Werror=overflow] 511 | GOFFData[1] = 0xF0;
Configuration menu - View commit details
-
Copy full SHA for 9d7b35d - Browse repository at this point
Copy the full SHA 9d7b35dView commit details -
[ARM] Use ARM::NoRegister in more places. NFC
Similar to #112507, this uses ARM::NoRegister in a few more places, as opposed to the constant 0.
Configuration menu - View commit details
-
Copy full SHA for 0f3ed9c - Browse repository at this point
Copy the full SHA 0f3ed9cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a91611 - Browse repository at this point
Copy the full SHA 3a91611View commit details -
[X86] andnot-patterns.ll - add non-BMI test coverage
Extra test coverage for #112547 to test cases where we don't create a ANDNOT instruction
Configuration menu - View commit details
-
Copy full SHA for 7da0a69 - Browse repository at this point
Copy the full SHA 7da0a69View commit details -
[NFC] Fix c++ style comment in c file (#112814)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warnings: llvm-project/clang/include/clang-c/Index.h:2983:3: error: C++ style comments are not allowed in ISO C90 [-Werror] 2983 | // HLSL Types
Configuration menu - View commit details
-
Copy full SHA for 8182f81 - Browse repository at this point
Copy the full SHA 8182f81View commit details -
[NFC] add DiagnosticsTestHelper decl (#112820)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warning llvm-project/clang/unittests/Basic/DiagnosticTest.cpp:19:6: error: ‘void clang::DiagnosticsTestHelper(clang::DiagnosticsEngine&)’ has not been declared within ‘clang’ [-Werror] 19 | void clang::DiagnosticsTestHelper(DiagnosticsEngine &diag) { | ^~~~~ In file included from llvm-project/clang/unittests/Basic/DiagnosticTest.cpp:9: llvm-project/clang/include/clang/Basic/Diagnostic.h:567:15: note: only here as a ‘friend’ 567 | friend void DiagnosticsTestHelper(DiagnosticsEngine &);
Configuration menu - View commit details
-
Copy full SHA for 6a623e8 - Browse repository at this point
Copy the full SHA 6a623e8View commit details -
[SandboxIR] Implement FPMathOperator (#112921)
This patch implements sandboxir::FPMathOperator mirroring llvm::FPMathOperator
Configuration menu - View commit details
-
Copy full SHA for a24a420 - Browse repository at this point
Copy the full SHA a24a420View commit details -
[NFC] Fix Werror=extra warning related to mismatched enum type (#112808)
This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Fix warnings: llvm-project/llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp: In member function ‘void llvm::AsmPrinter::emitJumpTableSizesSection(const llvm::MachineJumpTableInfo*, const llvm::Function&) const’: llvm-project/llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp:2852:31: error: enumerated and non-enumerated type in conditional expression [-Werror=extra] 2852 | int Flags = F.hasComdat() ? ELF::SHF_GROUP : 0; | ~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~
Configuration menu - View commit details
-
Copy full SHA for 6c60ead - Browse repository at this point
Copy the full SHA 6c60eadView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1fc0d3f - Browse repository at this point
Copy the full SHA 1fc0d3fView commit details -
[MemProf] Fix the option to disable memprof ICP (#112917)
The -enable-memprof-indirect-call-support meant to guard the recently added memprof ICP support was not used in enough places. Specifically, it was not checked in mayHaveMemprofSummary, which is called from the ThinLTO backend applyImports. This led to failures when checking the callsite records, as we incorrectly expected records for indirect calls. Fix the option to be checked in all necessary locations, and add testing.
Configuration menu - View commit details
-
Copy full SHA for 6264288 - Browse repository at this point
Copy the full SHA 6264288View commit details -
Fix build break in SemaHLSL.cpp on MSVC 2022: warning C4715: 'getReso…
…urceClass': not all control paths return a value (#112767) Moves the existing `llvm_unreachable` statement to the bottom of the function and changes the case statement to deliberately fall through to it. Build break was introduced by #111203 It was not caught by the builders as they use Visual Studio 2019, whereas this warning only appears in 2022. --------- Co-authored-by: Matheus Izvekov <mizvekov@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 9120ade - Browse repository at this point
Copy the full SHA 9120adeView commit details -
Configuration menu - View commit details
-
Copy full SHA for a01d7df - Browse repository at this point
Copy the full SHA a01d7dfView commit details -
[MemProf] Disable memprof ICP support by default (#112940)
A failure showed up after this was committed, rather than revert simply disable this new support to simplify investigation and further testing.
Configuration menu - View commit details
-
Copy full SHA for 5995e4b - Browse repository at this point
Copy the full SHA 5995e4bView commit details -
[lld-macho] Improve robustness of ObjC category merging (#112618)
This patch enhances the robustness of lld's Objective-C category merging. Currently, the category merger assumes it can fully parse and understand the format of all categories in the input, triggering an assert if any invalid category data is encountered. This will end up causing asserts in certain rare corner cases that are difficult to reproduce in small test cases. The proposed changes modify the behavior so that if invalid category data is detected, category merging is skipped for that specific class and all other categories sharing the same base class. This approach allows the linker to continue processing other categories without failing entirely due to a single problematic input. We also add a LIT test to where we corrupt category data and check that category merging for that class was skipped but the link was successful.
Configuration menu - View commit details
-
Copy full SHA for f9d3e98 - Browse repository at this point
Copy the full SHA f9d3e98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7e87c2a - Browse repository at this point
Copy the full SHA 7e87c2aView commit details -
[ADT] Make concat able to handle ranges with iterators that return by…
… value (such as zip) (#112783) If any iterator in the concatenation returns by value, the result must return by value otherwise it'll produce dangling references. (some context that may or may not be relevant to this part of the code may be in llvm/llvm-project@981ce8f ) An alternative to #112441
Configuration menu - View commit details
-
Copy full SHA for 266154a - Browse repository at this point
Copy the full SHA 266154aView commit details -
Full path names are used in several unittests instead of the binary n…
…ame. Fix up the testcase failures (#107974) Encountered several testcase failures when running `ninja check-all`. It was due to the full path name were shown in the error message instead of the binary name, and therefore causing the check string mismatch. The machine was running CentOS 9 with binfmt_misc setup that uses qemu-aarch64 (8.1.2). Built and ran the unittest as aarch64 host (through qemu user). Co-authored-by: h2h <h2h@meta.com>
Configuration menu - View commit details
-
Copy full SHA for 629a182 - Browse repository at this point
Copy the full SHA 629a182View commit details -
Configuration menu - View commit details
-
Copy full SHA for 170dab9 - Browse repository at this point
Copy the full SHA 170dab9View commit details -
[clang][RISCV] Extend intrinsic size check variable from 16 -> 32 bit…
…s. NFC (#111481) We currently have over 67000 intrinsics, uint16_t will overflow.
Configuration menu - View commit details
-
Copy full SHA for e3b22dc - Browse repository at this point
Copy the full SHA e3b22dcView commit details -
[nfc][lsan] Parametrize ScanForPointers with loader (#112803)
Use `DirectLoader` which is equivalent to existing behaviour of loading pointers directly from memory.
Configuration menu - View commit details
-
Copy full SHA for d60fdc1 - Browse repository at this point
Copy the full SHA d60fdc1View commit details -
[lld-macho] Fix category merging sed issue (#112955)
Fix 'sed' spacing to ensure compatibility with all platforms. Original failure: https://lab.llvm.org/buildbot/#/builders/190/builds/7903 ``` RUN: at line 33: sed -E '/^__OBJC_\$_CATEGORY_MyBaseClass_\$_Category01:/ { n; s/^[ \t]*\.quad[ \t]+l_OBJC_CLASS_NAME_$/\t.quad\tL_OBJC_IMAGE_INFO+3/ }' merge_cat_minimal.s > merge_cat_minimal_bad_name.s + sed -E '/^__OBJC_\$_CATEGORY_MyBaseClass_\$_Category01:/ { n; s/^[ \t]*\.quad[ \t]+l_OBJC_CLASS_NAME_$/\t.quad\tL_OBJC_IMAGE_INFO+3/ }' merge_cat_minimal.s sed: 1: "/^__OBJC_\$_CATEGORY_My ...": bad flag in substitute command: '}' ```
Configuration menu - View commit details
-
Copy full SHA for caa9e41 - Browse repository at this point
Copy the full SHA caa9e41View commit details -
[libc][math] Add test and fix atan2f crashing when flush-denorm-to-ze…
…ro (FTZ) and denorm-as-zero (DAZ) modes are set. (#112828)
Configuration menu - View commit details
-
Copy full SHA for 952dafb - Browse repository at this point
Copy the full SHA 952dafbView commit details -
[mlir] Add forall canonicalization to replace constant induction vars…
… (#112764) Adds a canonicalization pattern for scf.forall that replaces constant induction variables with a constant index. There is a similar canonicalization that completely removes constant induction variables from the loop, but that pattern does not apply on foralls with mappings, so this one is necessary for those cases. --------- Signed-off-by: Max Dawkins <max.dawkins@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 1ae2446 - Browse repository at this point
Copy the full SHA 1ae2446View commit details -
Configuration menu - View commit details
-
Copy full SHA for 53e85d4 - Browse repository at this point
Copy the full SHA 53e85d4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 203b972 - Browse repository at this point
Copy the full SHA 203b972View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9a4661c - Browse repository at this point
Copy the full SHA 9a4661cView commit details -
[SLP]Initial non-power-of-2 support (but still whole register) for re…
…ductions Enables initial non-power-of-2 support (but still requires number of elements, forming whole registers) for reductions. Enables extra vectorization for MultiSource/Benchmarks/7zip/7zip-benchmark, CINT2006/464.h264ref and CFP2017rate/526.blender_r (checked for SSE2) Reviewers: RKSimon Reviewed By: RKSimon Pull Request: llvm/llvm-project#112361
Configuration menu - View commit details
-
Copy full SHA for 7f2e937 - Browse repository at this point
Copy the full SHA 7f2e937View commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedOct 18, 2024 Configuration menu - View commit details
-
Copy full SHA for f428f71 - Browse repository at this point
Copy the full SHA f428f71View commit details
Commits on Nov 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e6f4ee4 - Browse repository at this point
Copy the full SHA e6f4ee4View commit details -
Align constant-loclist test to upstream (#2756)
The test has evolved upstream since it was forked into this repository and started failing now. Re-align the invocations and FileCheck patterns with the test in llvm-project. Original commit: KhronosGroup/SPIRV-LLVM-Translator@4baa42c3da51c0d
Configuration menu - View commit details
-
Copy full SHA for e75b386 - Browse repository at this point
Copy the full SHA e75b386View commit details -
Drop 'using namespace std' everywhere (#2755)
Most uses of std functionality already have explicit `std::` qualifications, so qualify the remaining unqualified uses and drop the `using namespace` directives. Original commit: KhronosGroup/SPIRV-LLVM-Translator@29b02642964a240
Configuration menu - View commit details
-
Copy full SHA for 3f8ee44 - Browse repository at this point
Copy the full SHA 3f8ee44View commit details -
Fix typo in warning message (#2758)
Original commit: KhronosGroup/SPIRV-LLVM-Translator@20eeca98ebef7c8
Configuration menu - View commit details
-
Copy full SHA for 02a0c48 - Browse repository at this point
Copy the full SHA 02a0c48View commit details -
Use move to prevent copy (#2761)
Signed-off-by: Marcos Maronas <marcos.maronas@intel.com> Original commit: KhronosGroup/SPIRV-LLVM-Translator@ee130659c16e2d3
Configuration menu - View commit details
-
Copy full SHA for f99f6b5 - Browse repository at this point
Copy the full SHA f99f6b5View commit details -
Remove redundant requirement spirv-as in tu-member-opaque.spvasm test (…
…#2757) * Remove redundant requirement spirv-as in tu-member-opaque.spvasm test * Correct the tu-member-opaque.spvasm test file extension Original commit: KhronosGroup/SPIRV-LLVM-Translator@34bc772821bd93e
Configuration menu - View commit details
-
Copy full SHA for cdd21b0 - Browse repository at this point
Copy the full SHA cdd21b0View commit details -
Add missing requirement spirv-dis in GroupAndSubgroupInstructions.spv…
…asm test (#2759) Original commit: KhronosGroup/SPIRV-LLVM-Translator@ee682a4869b84da
Configuration menu - View commit details
-
Copy full SHA for 77ca54d - Browse repository at this point
Copy the full SHA 77ca54dView commit details -
Fix SPIR-V Builtin OpGenericCastToPtr transformation to OCL (#2763)
OpGenericCastToPtrExplict is for dynamic cast and OpGenericCastToPtr is for static cast. OpGenericCastToPtrExplict is already transformed to to_{global|local|private} OCL builtins, but the handling for OpGenericCastToPtr is missing. Looks we can transform OpGenericCastToPtr to addrspacecast instruction directly in SPIRV to OCL transformation. Signed-off-by: Cui, Dele <dele.cui@intel.com> Original commit: KhronosGroup/SPIRV-LLVM-Translator@21e96548e242860
Configuration menu - View commit details
-
Copy full SHA for 8659507 - Browse repository at this point
Copy the full SHA 8659507View commit details -
Clarify optionality of spirv-val in test suite (#2765)
Original commit: KhronosGroup/SPIRV-LLVM-Translator@e8fce056867bb1b
Configuration menu - View commit details
-
Copy full SHA for 38c9811 - Browse repository at this point
Copy the full SHA 38c9811View commit details -
SPV_KHR_untyped_pointers - implement OpUntypedPrefetchKHR (#2752)
When the extension is enabled, we should replace prefetch OpenCL ExtInst with the OpUntypedPrefetchKHR instruction. Spec: https://htmlpreview.github.io/?https://github.com/KhronosGroup/SPIRV-Registry/blob/main/extensions/KHR/SPV_KHR_untyped_pointers.html Original commit: KhronosGroup/SPIRV-LLVM-Translator@c795db9a90b86f0
Configuration menu - View commit details
-
Copy full SHA for 9b50b27 - Browse repository at this point
Copy the full SHA 9b50b27View commit details -
Add fast math flag translation for OpenCL std lib (#2762)
Such possibility was added in SPIR-V 1.6. This patch also introduces limited translation of nofpclass LLVM parameter attribute. Signed-off-by: Sidorov, Dmitry <dmitry.sidorov@intel.com> Original commit: KhronosGroup/SPIRV-LLVM-Translator@ae8fa3825a699b2
Configuration menu - View commit details
-
Copy full SHA for bd2ac52 - Browse repository at this point
Copy the full SHA bd2ac52View commit details -
Fix SPIRVCopyMemory::validate() (#2770)
The asserts should be checking the `Target` member variable; `Id` is not used for this class. Only fix the wrong asserts for now; proper handling and testing of `OpCopyMemory` will be done in a followup commit. Contributes to KhronosGroup/SPIRV-LLVM-Translator#2769 Original commit: KhronosGroup/SPIRV-LLVM-Translator@9d2926d27b478f0
Configuration menu - View commit details
-
Copy full SHA for 1f8de71 - Browse repository at this point
Copy the full SHA 1f8de71View commit details -
Fix mangling for atomic builtins used with SPV_KHR_untyped_pointers (#…
…2771) This change allows to preserve the correct builtin mangling in reverse translation. All the existing tests for atomics (except atomic flag instructions which are not covered by the extension) were updated to verify we get the same mangling with and without extension enabled. Original commit: KhronosGroup/SPIRV-LLVM-Translator@566023769b3ab6a
Configuration menu - View commit details
-
Copy full SHA for b12df55 - Browse repository at this point
Copy the full SHA b12df55View commit details -
Do not run check-in-tree after pushes to llvm_release_* (#2793)
The check-in-tree tests take about 50 minutes to run compared to <5 minutes for check-out-of-tree. This causes considerable congestion on the runners when backports get merged to several llvm_release_* branches around the same time. Since the value of these post-push checks is relatively low, disable them; they will still run on pull requests. Original commit: KhronosGroup/SPIRV-LLVM-Translator@196bbc97bdde045
Configuration menu - View commit details
-
Copy full SHA for bd16a00 - Browse repository at this point
Copy the full SHA bd16a00View commit details -
SPIRVReader: Add OpCopyMemory support (#2779)
Add support for translating `OpCopyMemory` into `llvm.memcpy`. Fixes KhronosGroup/SPIRV-LLVM-Translator#2769 Original commit: KhronosGroup/SPIRV-LLVM-Translator@8dc0349c0860172
Configuration menu - View commit details
-
Copy full SHA for 157dae5 - Browse repository at this point
Copy the full SHA 157dae5View commit details -
Fix llvm.frexp intrinsic translation with SPV_KHR_untyped_pointers (#…
…2818) Original commit: KhronosGroup/SPIRV-LLVM-Translator@e3b9ba378349313
Configuration menu - View commit details
-
Copy full SHA for 7be620b - Browse repository at this point
Copy the full SHA 7be620bView commit details -
SPV_KHR_untyped_pointers - fix images translation (#2817)
This patch ensures that we are loading image type, not untyped pointer type when the extension is used. This way we also preserve correct mangling during the reverse translation. Original commit: KhronosGroup/SPIRV-LLVM-Translator@cd7985fb6d2589d
Configuration menu - View commit details
-
Copy full SHA for e9ebfa5 - Browse repository at this point
Copy the full SHA e9ebfa5View commit details -
SPV_KHR_untyped_pointers - fix enqueue marker (#2820)
This patch ensures that we are loading image type, not untyped pointer type when the extension is used. This way we also preserve correct mangling during the reverse translation. Original commit: KhronosGroup/SPIRV-LLVM-Translator@8e65158c3250e5c
Configuration menu - View commit details
-
Copy full SHA for 00fc482 - Browse repository at this point
Copy the full SHA 00fc482View commit details -
Test more cases with SPV_KHR_untyped_pointers (#2821)
Original commit: KhronosGroup/SPIRV-LLVM-Translator@caa1abcda8a91de
Configuration menu - View commit details
-
Copy full SHA for 2a0693c - Browse repository at this point
Copy the full SHA 2a0693cView commit details -
Translate integer dot product SPIR-V builtins to OCL builtins (#2794)
#1174 implements translating integer dot product OCL builtins to SPIR-V builtins. This pull request is to do the reverse translation. Original commit: KhronosGroup/SPIRV-LLVM-Translator@925255cb1982896
Configuration menu - View commit details
-
Copy full SHA for f717bd2 - Browse repository at this point
Copy the full SHA f717bd2View commit details -
Handle removed OpAtomicCompareExchangeWeak (#2824)
OpAtomicCompareExchangeWeak has been removed and #2665 added a validation for it. This pull request replaces OpAtomicCompareExchangeWeak with OpAtomicCompareExchange. Original commit: KhronosGroup/SPIRV-LLVM-Translator@eb64e4d795006d5
Configuration menu - View commit details
-
Copy full SHA for 0bbbeac - Browse repository at this point
Copy the full SHA 0bbbeacView commit details
Commits on Nov 8, 2024
-
Revert "Finally formalise our defacto line-ending policy"
This reverts commit dccebdd.
Configuration menu - View commit details
-
Copy full SHA for 0ba27fe - Browse repository at this point
Copy the full SHA 0ba27feView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c562c9 - Browse repository at this point
Copy the full SHA 5c562c9View commit details -
[clang] Sync with community to use UnqualPtrTy
Remove leftover differences during opaque pointer transition.
Configuration menu - View commit details
-
Copy full SHA for 9162b69 - Browse repository at this point
Copy the full SHA 9162b69View commit details -
[SYCL][Test] Update attributes in tests after 69a798a 095d49d
Configuration menu - View commit details
-
Copy full SHA for 2816519 - Browse repository at this point
Copy the full SHA 2816519View commit details
Commits on Nov 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1457e00 - Browse repository at this point
Copy the full SHA 1457e00View commit details -
[ORC] skip reoptimization tests on windows
Disable the test for now. https://github.com/intel/llvm/actions/runs/11758403932/job/32756817619?pr=16022
Configuration menu - View commit details
-
Copy full SHA for 2057848 - Browse repository at this point
Copy the full SHA 2057848View commit details