Blase: Add a tactic to generalize the widths of bitvectors #1756

ineol · 2025-10-08T20:33:56Z

It seems to be working on a small example.

It needs to special-case functions such as zeroExtend in order to recognize that some of their parameters are "width" parameters.

github-actions · 2025-10-08T20:56:38Z

bv_decide solved 0 theorems.
bitwuzla solved 0 theorems.
bv_decide found 0 counterexamples.
bitwuzla found 0 counterexamples.
bv_decide only failed on 0 problems.
bitwuzla only failed on 0 problems.
both bitwuzla and bv_decide failed on 0 problems.
In total, bitwuzla saw 0 problems.
In total, bv_decide saw 0 problems.
ran rg 'LeanSAT provided a counter' | wc -l, this file found 0, rg found 0, SUCCESS
ran rg 'Bitwuzla provided a counter' | wc -l, this file found 0, rg found 0, SUCCESS
ran rg 'LeanSAT proved' | wc -l, this file found 0, rg found 0, SUCCESS
ran rg 'Bitwuzla proved' | wc -l, this file found 0, rg found 0, SUCCESS
The InstCombine benchmark contains 4520 theorems in total.
Saved dataframe at: /home/runner/work/lean-mlir/lean-mlir/bv-evaluation/raw-data/InstCombine/instcombine_ceg_data.csv
all_files_solved_bitwuzla_times_stddev avg: nan | stddev: nan
all_files_solved_bv_decide_times_stddev avg: nan | stddev: nan
all_files_solved_bv_decide_rw_times_stddev avg: nan | stddev: nan
all_files_solved_bv_decide_bb_times_stddev avg: nan | stddev: nan
all_files_solved_bv_decide_sat_times_stddev avg: nan | stddev: nan
all_files_solved_bv_decide_lratt_times_stddev avg: nan | stddev: nan
all_files_solved_bv_decide_lratc_times_stddev avg: nan | stddev: nan
mean of percentage stddev/av: nan%

github-actions · 2025-10-08T22:32:22Z

Benchmark report identical to the 2025-10-08T20:56:38Z run: 0 problems seen by bv_decide or bitwuzla, all timing statistics nan.

ineol · 2025-10-09T09:40:42Z

@bollu it seems to be working now

github-actions · 2025-10-09T09:59:45Z

Benchmark report identical to the 2025-10-08T20:56:38Z run: 0 problems seen by bv_decide or bitwuzla, all timing statistics nan.

github-actions · 2025-10-09T10:02:36Z

Benchmark report identical to the 2025-10-08T20:56:38Z run: 0 problems seen by bv_decide or bitwuzla, all timing statistics nan.

bollu

While I agree that this is a viable approach, I had a different algorithm in mind:

At each function application f x1 ... xn, infer the type of the function as f : T1 -> T2 -> ... -> Tn -> O. Inspect each pair (xi, Ti), and if the type Ti is BitVec K, perform the generalization as you did. This avoids the need for a table, though I'm now not sure about the tradeoffs between having a table and not having one.

Anyway, LGTM! I wrote down some comments, but none of them are blockers to merging.

Thanks muchly ^_^
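
The table-free alternative described in this review could be sketched roughly as follows. This is only an illustration, not the code in the PR: `bitVecWidth?` and `argWidths` are hypothetical names, and the sketch assumes the application contains no loose bound variables, so that `inferType` can be called on each argument.

```lean
import Lean
open Lean Meta

/-- Hypothetical helper: if `e : BitVec w`, return the width expression `w`. -/
def bitVecWidth? (e : Expr) : MetaM (Option Expr) := do
  -- Unfold only reducible definitions so that type abbreviations are seen through.
  let ty ← whnfR (← inferType e)
  if ty.isAppOfArity ``BitVec 1 then
    return some ty.appArg!
  else
    return none

/-- Hypothetical helper: for an application `f x₁ … xₙ`, return for every
argument `xᵢ` the width of its type when that type is `BitVec w`. -/
def argWidths (e : Expr) : MetaM (Array (Option Expr)) :=
  e.getAppArgs.mapM bitVecWidth?
```

Each `some w` entry is a candidate width to generalize. Arguments whose type is not a `BitVec`, such as a plain `Nat`, yield `none`.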

bollu · 2025-10-09T10:11:08Z

Blase/Blase/MultiWidth/Tactic.lean

+  for (e', x) in s.mapping do
+    if ← isDefEq e e' then
+      return x


Suggested change

-  for (e', x) in s.mapping do
-    if ← isDefEq e e' then
-      return x
+  /-- TODO: Instead of using a HashMap, consider using a DiscrTree. -/
+  for (e', x) in s.mapping do
+    if ← isDefEq e e' then
+      return x

ineol · 2025-10-09

Hmm, is there a good way to recover the original expression from the keys of the tree?

bollu · 2025-10-09T10:14:03Z

Blase/Blase/MultiWidth/Tactic.lean

+      let arg ← if bv? i then State.add? arg else visit arg
+      pure <| .app res arg
+  | .forallE n e₁ e₂ info =>
+    pure <| .forallE n (← visit e₁) (← visit e₂) info


Why do you work with raw BVars instead of using combinators such as forallTelescope? I guess in this case it's OK, but I do wonder why you prefer this approach :)

ineol · 2025-10-09

Hmm, we want to recurse into e₁, so we'd need to first change the types in the telescope and then use forallTelescope, right?
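
One way to reconcile the two points in this thread is to rewrite each binder type first and only then introduce a local variable of the rewritten type, one binder at a time. The sketch below is an assumption about how that could look, not the PR's code: `visitForall` is a hypothetical name, and `visit` stands for whatever traversal rewrites a binder type or the final body.

```lean
import Lean
open Lean Meta

/-- Hypothetical variant of the `forallE` case that avoids raw bound variables:
rewrite the binder type with `visit`, introduce a free variable of the new type,
continue under the binder, and re-abstract at the end. -/
partial def visitForall (visit : Expr → MetaM Expr) (e : Expr) : MetaM Expr := do
  match e with
  | .forallE n d b bi =>
    -- The binder type is rewritten before the local variable is created.
    let d' ← visit d
    withLocalDecl n bi d' fun x => do
      -- Replace the loose bound variable by `x`, then recurse into the body.
      let b' ← visitForall visit (b.instantiate1 x)
      mkForallFVars #[x] b'
  | _ => visit e
```

Compared with rebuilding `.forallE` directly, this keeps every intermediate expression free of loose bound variables, at the cost of extending the local context once per binder.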

bollu · 2025-10-09T10:16:11Z

Blase/Blase/MultiWidth/Tactic.lean

+def genTable : Std.HashMap Name (Array Bool) := Id.run do
+  let mut table := .emptyWithCapacity 16
+  table := table.insert ``BitVec #[true]
+  table := table.insert ``BitVec.zeroExtend #[true, true, false]
+  table := table.insert ``BitVec.signExtend #[true, true, false]
+  table := table.insert ``BitVec.instAdd #[true]
+  table := table.insert ``BitVec.instSub #[true]
+  table := table.insert ``BitVec.instMul #[true]
+  table := table.insert ``BitVec.instDiv #[true]
+  table


Don't we want to always generalize a BV variable? My intuition is that instead of having a table, we check whether a value has type BitVec w, and if it does, we generalize it, with a possible exception for BitVec 1?

ineol · 2025-10-09

The problem is that we want to generalize 10 in x.signExtend 10, but only because we know that the first parameter of signExtend is a width and not an arbitrary Nat.

A more rigorous approach would be to analyze the type of signExtend, see that the variable appears as a parameter of BitVec, and recover the information in the table that way, but it seems complicated.

ineol · 2025-10-09

I just read your comment above, so we agree on the method :)

I think we can try this simple approach and see if it's sufficient for the evaluation.
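
The "more rigorous approach" mentioned in this thread, recovering the table entries from a declaration's signature, could be sketched as below. This is a hedged illustration rather than anything in the PR: `occursAsWidth` and `widthParams` are hypothetical names, and the check is purely syntactic (it looks for `BitVec x` with the parameter `x` as the direct argument, so a width such as `n + 1` would not be attributed to `n`).

```lean
import Lean
open Lean Meta

/-- Hypothetical helper: does `fvar` occur as the argument of some `BitVec _`
subterm of `e`? -/
def occursAsWidth (fvar : Expr) (e : Expr) : Bool :=
  (e.find? fun sub => sub.isAppOfArity ``BitVec 1 && sub.appArg! == fvar).isSome

/-- Hypothetical helper: for each parameter of `declName`, report whether it is
used as a `BitVec` width in the type of another parameter or in the result type. -/
def widthParams (declName : Name) : MetaM (Array Bool) := do
  let info ← getConstInfo declName
  forallTelescope info.type fun xs body => do
    -- Collect the result type and the type of every parameter.
    let mut types := #[body]
    for x in xs do
      types := types.push (← inferType x)
    return xs.map fun x => types.any (occursAsWidth x)

-- The result can be compared against the corresponding `genTable` entry.
#eval widthParams ``BitVec.signExtend
```

If this agreed with the hand-written entries, the table could be generated on demand instead of being maintained by hand. Whether that is worth the extra complexity is the tradeoff discussed above.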

github-actions · 2025-10-09T12:30:19Z

Benchmark report identical to the 2025-10-08T20:56:38Z run: 0 problems seen by bv_decide or bitwuzla, all timing statistics nan.

ineol requested a review from bollu October 8, 2025 20:33

ineol force-pushed the push-tnpkrqksnkvx branch from d003a14 to 9feffb1 Compare October 8, 2025 22:09

ineol force-pushed the push-tnpkrqksnkvx branch 2 times, most recently from e6f5a3d to 0cbe25c Compare October 9, 2025 09:40

ineol marked this pull request as ready for review October 9, 2025 09:40

bollu approved these changes Oct 9, 2025

Add a tactic to generalize the widths of bitvectors

867a36c

ineol force-pushed the push-tnpkrqksnkvx branch from 0cbe25c to 867a36c Compare October 9, 2025 12:08

ineol added this pull request to the merge queue Oct 9, 2025

Merged via the queue into main with commit e0bb242 Oct 9, 2025
20 of 21 checks passed
