On Fri, Sep 25, 2020 at 4:05 PM Martin Liška <mli...@suse.cz> wrote: > > On 9/24/20 2:41 PM, Richard Biener wrote: > > On Wed, Sep 2, 2020 at 1:53 PM Martin Liška <mli...@suse.cz> wrote: > >> > >> On 9/1/20 4:50 PM, David Malcolm wrote: > >>> Hope this is constructive > >>> Dave > >> > >> Thank you David. All of them very very useful! > >> > >> There's updated version of the patch. > > Hey. > > What a juicy patch review! > > > > > I noticed several functions without a function-level comment. > > Yep, but several of them are documented in a class declaration. Anyway, I will > improve for the next time. > > > > > - cluster (tree case_label_expr, basic_block case_bb, profile_probability > > prob, > > - profile_probability subtree_prob); > > + inline cluster (tree case_label_expr, basic_block case_bb, > > + profile_probability prob, profile_probability > > subtree_prob); > > > > I thought we generally leave this to the compiler ... > > > > +@item -fconvert-if-to-switch > > +@opindex fconvert-if-to-switch > > +Perform conversion of an if cascade into a switch statement. > > +Do so if the switch can be later transformed using a jump table > > +or a bit test. The transformation can help to produce faster code for > > +the switch statement. This flag is enabled by default > > +at @option{-O2} and higher. > > > > this mentions we do this only when we later can convert the > > switch again but both passes (we still have two :/) have > > independent guards. > > Yes, we have the option for jump tables (-jump-tables), but we miss one for a > bit-test. > Moreover, as mentioned in the cover email, one can see it beneficial to > convert a if-chain > to switch as the expansion (without any BT and JT) can benefit from balanced > tree. > > > > > + /* For now, just wipe the dominator information. */ > > + free_dominance_info (CDI_DOMINATORS); > > > > could at least be conditional on the vop renaming condition... > > > > + if (!all_candidates.is_empty ()) > > + mark_virtual_operands_for_renaming (fun); > > Yep. > > > > > + if (bitmap_bit_p (*visited_bbs, bb->index)) > > + break; > > + bitmap_set_bit (*visited_bbs, bb->index); > > > > since you are using a bitmap and not a sbitmap (why?) > > you can combine those into > > New to me, thanks. > > > > > if (!bitmap_set_bit (*visited_bbs, bb->index)) > > break; > > > > + /* Current we support following patterns (situations): > > + > > + 1) if condition with equal operation: > > + > > ... > > > > did you see whether using > > > > register_edge_assert_for (lhs, true_edge, code, lhs, rhs, asserts); > > > > works equally well? It fills the 'asserts' vector with relations > > derived from 'lhs'. There's also > > vr_values::extract_range_for_var_from_comparison_expr > > to compute the case_range > > Good point! I must admit that my patch doesn't properly handle negative > conditions: > > if (argc != 11111) > { > if (argc == 1) > global = 222; > ... > } > > which can VRP correctly identify as anti-range: > int ~[11111, 11111] EQUIVALENCES: { argc_8(D) } (1 elements)$1 = void > > I have question about OR and AND conditions: > > <bb 2> : > _1 = aChar_8(D) == 1; > _2 = aChar_8(D) == 10; > _3 = _1 | _2; > if (_3 != 0) > goto <bb 6>; [INV] > else > goto <bb 3>; [INV] > > <bb 2> : > _1 = aChar_8(D) != 1; > _2 = aChar_8(D) != 10; > _3 = _1 & _2; > if (_3 != 0) > goto <bb 6>; [INV] > else > goto <bb 3>; [INV] > > Can I somehow get that from VRP (as I ask register_edge_assert_for only for > LHS > of a condition)?
Yes, you simply get all sorts of conditions that hold when a condition is true, not just those based on the SSA name you put in. But it occured to me that the use-case is somewhat different - for switch-conversion you want to know whether the test _exactly_ matches a range test, the VRP worker will not tell you that. For example if you had if (x && a > 3 && a < 7) then it will give you 'a in [4, 6]' and it might not give you 'x in [1, 1]' (for example if x is float). But that's required for correctness. So we're back to your custom crafted code unless we manage to somehow refactor the VRP condition analysis to handle both cases (should be possible I think, but have not looked too closely). Maybe the actual matching pieces can be shared at least. > > > > + /* If it's not the first condition, then we need a BB without > > + any statements. */ > > + if (!first) > > + { > > + unsigned stmt_count = 0; > > + for (gimple_stmt_iterator gsi = gsi_start_nondebug_bb (bb); > > + !gsi_end_p (gsi); gsi_next_nondebug (&gsi)) > > + ++stmt_count; > > + > > + if (stmt_count - visited_stmt_count != 0) > > + break; > > > > hmm, OK, this might be a bit iffy to get correct then, still it's a lot > > of pattern maching code that is there elsewhere already. > > ifcombine simply hoists any stmts without side-effects up the > > dominator tree and thus only requires BBs without side-effects > > (IIRC there's a predicate fn for that). > > Yes, I completely miss support for code hoisting (expect first BB where we > put gswitch). > If I'm correct hoisting should be possible where case destination should be a > new BB > that will contain original statements and then it will jump to a case > destination block. > > > > > + /* Prevent loosing information for a PHI node where 2 edges will > > + be folded into one. Note that we must do the same also for > > false_edge > > + (for last BB in a if-elseif chain). */ > > + if (!chain->record_phi_arguments (true_edge) > > + || !chain->record_phi_arguments (false_edge)) > > > > I don't really get this - looking at record_phi_arguments it seems > > we're requiring that all edges into the same PHI from inside the case > > (irrespective of from which case label) have the same value for the > > PHI arg? > > I guess so, I'll refresh the functionality. > > > > > + if (arg != *v) > > + return false; > > > > should use operand_equal_p at least, REAL_CSTs are for example > > not shared tree nodes. I'll also notice that if record_phi_arguments > > fails we still may have altered its hash-map even though the particular > > edge will not participate in the current chain, so it will affect other > > chains ending in the same BB. Overall this looks a bit too conservative > > (and random, based on visiting order). > > Oh, yes, it's not properly cleared once we bail out for a particular chain. > > > > > + expanded_location loc > > + = expand_location (gimple_location (chain->m_first_condition)); > > + if (dump_file) > > + { > > + fprintf (dump_file, "Condition chain (at %s:%d) with %d > > conditions " > > + "(%d BBs) transformed into a switch statement.\n", > > + loc.file, loc.line, total_case_values, > > + chain->m_entries.length ()); > > > > Use dump_printf_loc and you can pass a gimple * stmt as location. > > > > + /* Follow if-elseif-elseif chain. */ > > + bb = false_edge->dest; > > > > so that means the code doesn't handle a tree, right? But what > > makes us sure the chain doesn't continue on the true_edge instead, > > guess this degenerate tree isn't handled either. > > As mentioned earlier, I didn't consider VAR != CST type of conditions that > makes it more complicated. > > > > > I was thinking on whether doing the switch discovery in a reverse > > CFG walk, recording for each BB what case_range(s) it represents > > for a particular variable(s) so when visiting a dominator you > > can quickly figure what's the relevant children (true, false or both). > > Sounds promising. Note that right now we do not support overlapping cases > like: > > if (5 <= argc && argc <= 10) > foo (); > else if (6 <= argc && argc <= 100) > foo (); > > So I'm wondering if we can support 2 children? > > > It would also make the matching a BB-local operation where you'd > > do the case_label discovery based on the single-pred BBs gimple-cond. > > Can you please describe more how will the walk work? > > > > > + output = bit_test_cluster::find_bit_tests (filtered_clusters); > > + r = output.length () < filtered_clusters.length (); > > + if (r) > > + dump_clusters (&output, "BT can be built"); > > > > so as of the very above comment - this might be guarded with > > flag_tree_switch_conversion? > > flag_tree_switch_conversion isn't connected to the if-chain pass (yet). > > > > > As mentioned previously I would have liked to see if-to-switch > > integrated with switch-conversion, having separate passes is > > somewhat awkward (for example the redundant and possibly > > expensive find_bit_tests). > > Well, the CFG transformation for BT and JT is not trivial and I would like > to go in the first iteration through gswitch statements. > I have a massive speed up for the find_bit_tests/find_jump_tables. > > > > > + /* Move all statements from the BB to the BB with gswitch. */ > > + auto_vec<gimple *> stmts; > > + for (gimple_stmt_iterator gsi = gsi_start_bb (entry.m_bb); > > + !gsi_end_p (gsi); gsi_next (&gsi)) > > + { > > + gimple *stmt = gsi_stmt (gsi); > > + if (gimple_code (stmt) != GIMPLE_COND) > > + stmts.safe_push (stmt); > > + } > > + > > + for (unsigned i = 0; i < stmts.length (); i++) > > + { > > + gimple_stmt_iterator gsi_from = gsi_for_stmt (stmts[i]); > > + gsi_move_before (&gsi_from, &gsi); > > + } > > > > so you are already hoisting all stmts ... > > As mentioned, it's not supported right now. This moves all these kind of > "temp" statements: > > _1 = aChar_8(D) == 1; > _2 = aChar_8(D) == 10; > _3 = _1 | _2; Sure, but that _is_ hoisting of all stmts. > Martin > > > > > + make_edge (first_cond.m_bb, case_bb, 0); > > > > and if this doesn't create a new edge you need equivalent PHI > > args in the case_bb. To remove this restriction you "only" > > have to add a forwarder. Sth like > > > > edge e = make_edge (...); > > if (!e) > > { > > bb = create_basic_block (); > > make_edge (first_cond.m_bb, bb, 0); > > e = make_edge (bb, case_bb, 0); > > } > > fill PHI arg of 'e' from original value (no need to create the hash-map > > then) > > > > Richard. > > > > > >> Martin >