Show patches with: Submitter = liuhongt | State = Action Required | Archived = No | 97 patches
Patch | Series | rb/tb | S/W/F | Date | Submitter | Delegate | State
[x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. | [x86] Fix unrecognizable insn due to illegal immediate_operand (const_int 255) of QImode. | - - | --- | 2022-11-28 | liuhongt | | New
[V3,x86] Fix incorrect _mm_cvtsbh_ss. | [V3,x86] Fix incorrect _mm_cvtsbh_ss. | - - | --- | 2022-11-25 | liuhongt | | New
[v2,x86] Fix incorrect _mm_cvtsbh_ss. | [v2,x86] Fix incorrect _mm_cvtsbh_ss. | - - | --- | 2022-11-24 | liuhongt | | New
[x86] Fix incorrect implementation for mm_cvtsbh_ss. | [x86] Fix incorrect implementation for mm_cvtsbh_ss. | - - | --- | 2022-11-23 | liuhongt | | New
[x86] Some tidy up for RA related hooks. | [x86] Some tidy up for RA related hooks. | - - | --- | 2022-11-21 | liuhongt | | New
[x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. | [x86] define builtins for "shared" avxneconvert-avx512bf16vl builtins. | - - | --- | 2022-11-18 | liuhongt | | New
[2/2] Enable hwasan for x86-64. | Support HWASAN with Intel LAM | - - | --- | 2022-11-11 | liuhongt | | New
[1/2] Implement hwasan target_hook. | Support HWASAN with Intel LAM | - - | --- | 2022-11-11 | liuhongt | | New
[V2,x86] Fix incorrect digit constraint | [V2,x86] Fix incorrect digit constraint | - - | --- | 2022-10-31 | liuhongt | | New
[x86] Fix incorrect digit constraint | [x86] Fix incorrect digit constraint | - - | --- | 2022-10-27 | liuhongt | | New
Canonicalize vec_perm index to make the first index come from the first vector. | Canonicalize vec_perm index to make the first index come from the first vector. | - - | --- | 2022-10-18 | liuhongt | | New
[x86] Add define_insn_and_split to support general version of "kxnor". | [x86] Add define_insn_and_split to support general version of "kxnor". | - - | --- | 2022-10-11 | liuhongt | | New
Check nonlinear iv in vect_can_advance_ivs_p. | Check nonlinear iv in vect_can_advance_ivs_p. | - - | --- | 2022-09-29 | liuhongt | | New
[x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. | [x86] Support 2-instruction vector shuffle for V4SI/V4SF in ix86_expand_vec_perm_const_1. | - - | --- | 2022-09-23 | liuhongt | | New
Fix incorrect handle in vectorizable_induction for mixed induction type. | Fix incorrect handle in vectorizable_induction for mixed induction type. | - - | --- | 2022-09-20 | liuhongt | | New
[x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg | [x86] Don't optimize cmp mem, 0 to load mem, reg + test reg, reg | - - | --- | 2022-09-16 | liuhongt | | New
Modernize ix86_builtin_vectorized_function with corresponding expanders. | Modernize ix86_builtin_vectorized_function with corresponding expanders. | - - | --- | 2022-09-16 | liuhongt | | New
[V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. | [V2] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. | - - | --- | 2022-08-29 | liuhongt | | New
[RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. | [RFC:] Extend vectorizer to handle nonlinear induction for neg, mul/lshift/rshift with a constant. | - - | --- | 2022-08-04 | liuhongt | | New
Move pass_cse_sincos after vectorizer. | Move pass_cse_sincos after vectorizer. | - - | --- | 2022-07-20 | liuhongt | | New
[V2] Extend 16/32-bit vector bit_op patterns with (m, 0, i) alternative. | [V2] Extend 16/32-bit vector bit_op patterns with (m, 0, i) alternative. | - - | --- | 2022-07-19 | liuhongt | | New
[V2,RFC] Support vectorization for Complex type. | [V2,RFC] Support vectorization for Complex type. | - - | --- | 2022-07-18 | liuhongt | | New
Extend 16/32-bit vector bit_op patterns with (m, 0, i)(vertical) alternative. | Extend 16/32-bit vector bit_op patterns with (m, 0, i)(vertical) alternative. | - - | --- | 2022-07-18 | liuhongt | | New
Extend 64-bit vector bit_op patterns with ?r alternative | Extend 64-bit vector bit_op patterns with ?r alternative | - - | --- | 2022-07-14 | liuhongt | | New
[RFC] Support vectorization for Complex type. | [RFC] Support vectorization for Complex type. | - - | --- | 2022-07-11 | liuhongt | | New
Allocate general register(memory/immediate) for 16/32/64-bit vector bit_op patterns. | Allocate general register(memory/immediate) for 16/32/64-bit vector bit_op patterns. | - - | --- | 2022-07-11 | liuhongt | | New
Improve reg_or_subregno to return INVALID_REGNUM when the subreg of memory is processed. | Improve reg_or_subregno to return INVALID_REGNUM when the subreg of memory is processed. | - - | --- | 2022-06-23 | liuhongt | | New
[x86] Replace REGNO with reg_or_subregno in pre_reload splitter. | [x86] Replace REGNO with reg_or_subregno in pre_reload splitter. | - - | --- | 2022-06-20 | liuhongt | | New
Add optional __Bfloat16 support | Add optional __Bfloat16 support | - - | --- | 2022-06-10 | liuhongt | | New
Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. | Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. | - - | --- | 2022-06-07 | liuhongt | | New
Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. | Simplify (B * v + C) * D -> BD* v + CD when B, C, D are all INTEGER_CST. | - - | --- | 2022-06-02 | liuhongt | | New
Add a bit dislike for separate mem alternative when op is REG_P. | Add a bit dislike for separate mem alternative when op is REG_P. | - - | --- | 2022-05-25 | liuhongt | | New
[i386] recognize bzhi pattern when there's zero_extendsidi. | [i386] recognize bzhi pattern when there's zero_extendsidi. | - - | --- | 2022-05-17 | liuhongt | | New
Optimize vpermtiw/b to vpunpcklqdq for certain cases. | Optimize vpermtiw/b to vpunpcklqdq for certain cases. | - - | --- | 2022-05-13 | liuhongt | | New
[i386] Implement permutation with pslldq + psrldq + por when pshufb is not available. | [i386] Implement permutation with pslldq + psrldq + por when pshufb is not available. | - - | --- | 2022-05-09 | liuhongt | | New
[Middle-end] Enhance final_value_replacement_loop to handle bitwise induction. | [Middle-end] Enhance final_value_replacement_loop to handle bitwise induction. | - - | --- | 2022-05-09 | liuhongt | | New
[v2] Strip of a vector load which is only used partially. | [v2] Strip of a vector load which is only used partially. | - - | --- | 2022-05-09 | liuhongt | | New
Expand __builtin_memcmp_eq with ptest for OImode. | Expand __builtin_memcmp_eq with ptest for OImode. | - - | --- | 2022-05-07 | liuhongt | | New
Expand __builtin_memcmp_eq with ptest for OI/TImode. | Expand __builtin_memcmp_eq with ptest for OI/TImode. | - - | --- | 2022-05-05 | liuhongt | | New
Strip of a vector load which is only used partially. | Strip of a vector load which is only used partially. | - - | --- | 2022-05-05 | liuhongt | | New
Refine and/ior/xor/andn masked patterns for V*HFmode. | Refine and/ior/xor/andn masked patterns for V*HFmode. | - - | --- | 2022-04-06 | liuhongt | | New
Split vector load from parm_del to elemental loads to avoid STLF stalls. | Split vector load from parm_del to elemental loads to avoid STLF stalls. | - - | --- | 2022-04-01 | liuhongt | | New
Split vector load from parm_del to elemental loads to avoid STLF stalls. | Split vector load from parm_del to elemental loads to avoid STLF stalls. | - - | --- | 2022-03-31 | liuhongt | | New
[i386] Add extra cost for unsigned_load which may have stall forward issue. | [i386] Add extra cost for unsigned_load which may have stall forward issue. | - - | --- | 2022-03-16 | liuhongt | | New
[i386] Add extra cost for unsigned_load which may have stall forward issue. | [i386] Add extra cost for unsigned_load which may have stall forward issue. | - - | --- | 2022-03-16 | liuhongt | | New
[i386] Add extra cost for unsigned_load which may have stall forward issue. | [i386] Add extra cost for unsigned_load which may have stall forward issue. | - - | --- | 2022-03-15 | liuhongt | | New
[i386] Prevent vectorization for load from parm_decl at O2 to avoid STF issue. | [i386] Prevent vectorization for load from parm_decl at O2 to avoid STF issue. | - - | --- | 2022-03-04 | liuhongt | | New
[i386] Optimize v4si broadcast for noavx512vl. | [i386] Optimize v4si broadcast for noavx512vl. | - - | --- | 2022-03-04 | liuhongt | | New
[i386] Replace ix86_gen_scratch_sse_rtx with gen_reg_rtx. | [i386] Replace ix86_gen_scratch_sse_rtx with gen_reg_rtx. | - - | --- | 2022-03-01 | liuhongt | | New
[i386] Fix typo in <code>v1ti3. | [i386] Fix typo in <code>v1ti3. | - - | --- | 2022-02-23 | liuhongt | | New
Restrict the two sources of vect_recog_cond_expr_convert_pattern to be of the same type when conver… | Restrict the two sources of vect_recog_cond_expr_convert_pattern to be of the same type when conver… | - - | --- | 2022-02-16 | liuhongt | | New
[vect] Add vect_recog_cond_expr_convert_pattern. | [vect] Add vect_recog_cond_expr_convert_pattern. | - - | --- | 2022-02-10 | liuhongt | | New
[i386] ICE: QImode(not SImode) operand should be passed to gen_vec_initv16qiqi in ashlv16qi3. | [i386] ICE: QImode(not SImode) operand should be passed to gen_vec_initv16qiqi in ashlv16qi3. | - - | --- | 2022-02-09 | liuhongt | | New
[i386] ICE: QImode(not SImode) operand should be passed to gen_vec_initv16qiqi in ashlv16qi3. | [i386] ICE: QImode(not SImode) operand should be passed to gen_vec_initv16qiqi in ashlv16qi3. | - - | --- | 2022-02-09 | liuhongt | | New
[rtl/cprop_hardreg] Don't propagate for a more expensive reg-reg move. | [rtl/cprop_hardreg] Don't propagate for a more expensive reg-reg move. | - - | --- | 2022-01-25 | liuhongt | | New
[vect] Add vect_recog_cond_expr_convert_pattern. | [vect] Add vect_recog_cond_expr_convert_pattern. | - - | --- | 2022-01-24 | liuhongt | | New
Enhance vec_pack_trunc for integral mode mask. | Enhance vec_pack_trunc for integral mode mask. | - - | --- | 2022-01-18 | liuhongt | | New
[i386] Fix ICE of unrecognizable insn. [PR target/104001] | [i386] Fix ICE of unrecognizable insn. [PR target/104001] | - - | --- | 2022-01-13 | liuhongt | | New
[RTL/fwprop] Allow propagations from inner loop to outer loop. | [RTL/fwprop] Allow propagations from inner loop to outer loop. | - - | --- | 2022-01-07 | liuhongt | | New
[i386] Optimize V16HF vector insert to element 0 for AVX2. | [i386] Optimize V16HF vector insert to element 0 for AVX2. | - - | --- | 2022-01-06 | liuhongt | | New
[RTL/fwprop] Allow propagations from inner loop to outer loop. | [RTL/fwprop] Allow propagations from inner loop to outer loop. | - - | --- | 2022-01-06 | liuhongt | | New
[RTL/fwprop] Allow propagations from inner loop to outer loop. | [RTL/fwprop] Allow propagations from inner loop to outer loop. | - - | --- | 2022-01-05 | liuhongt | | New
Set __FLT_EVAL_METHOD__/__FLT_EVAL_METHOD_TS_18661_3__ to FLT_EVAL_METHOD_PROMOTE_TO_FLOAT when FLT… | Set __FLT_EVAL_METHOD__/__FLT_EVAL_METHOD_TS_18661_3__ to FLT_EVAL_METHOD_PROMOTE_TO_FLOAT when FLT… | - - | --- | 2021-12-27 | liuhongt | | New
[i386] Add option -mvect-compare-costs | [i386] Add option -mvect-compare-costs | - - | --- | 2021-12-22 | liuhongt | | New
[i386] Add define_insn_and_split for vpcmp{b, w, d, q} vpcmp{ph, ps, pd}. | [i386] Add define_insn_and_split for vpcmp{b, w, d, q} vpcmp{ph, ps, pd}. | - - | --- | 2021-12-21 | liuhongt | | New
[i386] Add option -mvect-compare-costs | [i386] Add option -mvect-compare-costs | - - | --- | 2021-12-16 | liuhongt | | New
[Gimple] Fix ICE. [PR103682] | [Gimple] Fix ICE. [PR103682] | - - | --- | 2021-12-14 | liuhongt | | New
[i386] Prefer INT_SSE_REGS for SSE_FLOAT_MODE_P in preferred_reload_class. | [i386] Prefer INT_SSE_REGS for SSE_FLOAT_MODE_P in preferred_reload_class. | - - | --- | 2021-12-03 | liuhongt | | New
[i386] Prefer INT_SSE_REGS for SSE_FLOAT_MODE_P in preferred_reload_class. | [i386] Prefer INT_SSE_REGS for SSE_FLOAT_MODE_P in preferred_reload_class. | - - | --- | 2021-12-02 | liuhongt | | New
[i386] Fix ICE in ix86_attr_length_immediate_default. | [i386] Fix ICE in ix86_attr_length_immediate_default. | - - | --- | 2021-11-30 | liuhongt | | New
Fix regression introduced by r12-5536. | Fix regression introduced by r12-5536. | - - | --- | 2021-11-29 | liuhongt | | New
Don't allow mask/sse/mmx mov in TLS code sequences. | Don't allow mask/sse/mmx mov in TLS code sequences. | - - | --- | 2021-11-19 | liuhongt | | New
Don't allow mask/sse/mmx mov in TLS code sequences. | Don't allow mask/sse/mmx mov in TLS code sequences. | - - | --- | 2021-11-18 | liuhongt | | New
Improve integer bit test on __atomic_fetch_[or|and]_* returns | Improve integer bit test on __atomic_fetch_[or|and]_* returns | - - | --- | 2021-11-10 | liuhongt | | New
[pass_if_conversion] Extend is_cond_scalar_reduction to handle bit_and/bit_xor/bit_ior. | [pass_if_conversion] Extend is_cond_scalar_reduction to handle bit_and/bit_xor/bit_ior. | - - | --- | 2021-11-09 | liuhongt | | New
[2/2,Gimple] Simplify (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b, c). | [1/2,Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a, b) | - - | --- | 2021-11-05 | liuhongt | | New
[1/2,Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a, b) | [1/2,Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a, b) | - - | --- | 2021-11-05 | liuhongt | | New
Add !flag_signaling_nans to simplifcation: (trunc)copysign((extend)a, (extend)b) to copysign (a, b). | Add !flag_signaling_nans to simplifcation: (trunc)copysign((extend)a, (extend)b) to copysign (a, b). | - - | --- | 2021-11-05 | liuhongt | | New
[1/2,Middle-end] Simplify (trunc)copysign((extend)a, (extend)b) to .COPYSIGN (a, b). | [1/2,Middle-end] Simplify (trunc)copysign((extend)a, (extend)b) to .COPYSIGN (a, b). | - - | --- | 2021-11-04 | liuhongt | | New
[v5] Improve integer bit test on __atomic_fetch_[or|and]_* returns | [v5] Improve integer bit test on __atomic_fetch_[or|and]_* returns | - - | --- | 2021-11-04 | liuhongt | | New
Adjust testcase for O2 vect. | Adjust testcase for O2 vect. | - - | --- | 2021-11-01 | liuhongt | | New
Adjust testcase for O2 vect. | Adjust testcase for O2 vect. | - - | --- | 2021-10-28 | liuhongt | | New
Canonicalize __atomic/sync_fetch_or/xor/and for constant mask. | Canonicalize __atomic/sync_fetch_or/xor/and for constant mask. | - - | --- | 2021-10-25 | liuhongt | | New
Canonicalize __atomic/sync_fetch_or/xor/and for constant mask. | Canonicalize __atomic/sync_fetch_or/xor/and for constant mask. | - - | --- | 2021-10-22 | liuhongt | | New
Adjust testcase for O2 vectorization. | Adjust testcase for O2 vectorization. | - - | --- | 2021-10-14 | liuhongt | | New
Adjust testcase for O2 vectorization enabling | Adjust testcase for O2 vectorization enabling | - - | --- | 2021-10-11 | liuhongt | | New
[GCC-12] Mention O2 vectorization enabling. | [GCC-12] Mention O2 vectorization enabling. | - - | --- | 2021-10-08 | liuhongt | | New
[i386] Support reduc_{plus,smax,smin,umax,min}_scal_v4hi. | [i386] Support reduc_{plus,smax,smin,umax,min}_scal_v4hi. | - - | --- | 2021-09-28 | liuhongt | | New
Enable auto-vectorization at O2 with very-cheap cost model. | Enable auto-vectorization at O2 with very-cheap cost model. | - - | --- | 2021-09-26 | liuhongt | | New
[GIMPLE] Simplify (_Float16) ceil ((double) x) to .CEIL (x) when available. | [GIMPLE] Simplify (_Float16) ceil ((double) x) to .CEIL (x) when available. | - - | --- | 2021-09-24 | liuhongt | | New
[GCC12] Mention Intel AVX512-FP16 and _Float16 support. | [GCC12] Mention Intel AVX512-FP16 and _Float16 support. | - - | --- | 2021-09-24 | liuhongt | | New
wwwdocs: [GCC12] Mention Intel AVX512-FP16. | wwwdocs: [GCC12] Mention Intel AVX512-FP16. | - - | --- | 2021-09-23 | liuhongt | | New
[i386] Fix ICE in pass_rpad. | [i386] Fix ICE in pass_rpad. | - - | --- | 2021-09-18 | liuhongt | | New
[AVX512FP16] Support embedded broadcast for AVX512FP16 instructions. | [AVX512FP16] Support embedded broadcast for AVX512FP16 instructions. | - - | --- | 2021-09-16 | liuhongt | | New
Check mask type when doing cond_op related gimple simplification. | Check mask type when doing cond_op related gimple simplification. | - - | --- | 2021-09-16 | liuhongt | | New
Enable auto-vectorization at O2 with very-cheap cost model. | Enable auto-vectorization at O2 with very-cheap cost model. | - - | --- | 2021-09-16 | liuhongt | | New
Optimize for V{8,16,32}HFmode vec_set/extract/init. | Optimize for V{8,16,32}HFmode vec_set/extract/init. | - - | --- | 2021-09-15 | liuhongt | | New