Splice fixes for `chanmon_consistency` fuzz target by wpaulino · Pull Request #4655 · lightningdevkit/rust-lightning

wpaulino · 2026-06-01T22:17:34Z

While this target found several issues in our production code, there were also issues with the fuzz target itself, which this PR addresses. It fixes the following payloads from #4363:

00a6a61b1b1b211b211e1b1b1b1b211b211b211e1b1b261b1b211b1ba1
e8a3a3201ba31b201ba3a3201ba3a3201b201ba3a3201b040d80c113ff
a3cd00a2a1181021181118101810211811181000bb10181021ffff38
ffa1ffffa0414da373ff
a2cd00a2a118102118602700cda2a000a100733a3a6b73a3ffb4ffb7a1
ffa1ffacab211b11baff
88a4a6ffacadab21bcff

ldk-reviews-bot · 2026-06-01T22:17:37Z

👋 Thanks for assigning @TheBlueMatt as a reviewer!
I'll wait for their review and will help manage the review process.
Once they submit their review, I'll check if a second reviewer would be helpful.

ldk-claude-review-bot · 2026-06-01T22:30:10Z

+	for (idx, node) in nodes.iter().enumerate() {
+		if node.broadcaster.txn_broadcasted.borrow().is_empty() {
+			continue;
+		}
+
+		let pending_events = node.get_and_clear_pending_events();


The verification is one-directional: for each broadcast tx, you check a matching SpliceNegotiated event exists, but you don't verify the reverse — that every pending SpliceNegotiated event has a corresponding broadcast tx. A node with no broadcasts (skipped at line 2017) but with dangling SpliceNegotiated events passes silently.

Consider also checking nodes that have no broadcasts but do have pending SpliceNegotiated events, as that would indicate the splice tx was not broadcast despite an event being generated.

Suggested change

-	for (idx, node) in nodes.iter().enumerate() {
-		if node.broadcaster.txn_broadcasted.borrow().is_empty() {
-			continue;
-		}
-
-		let pending_events = node.get_and_clear_pending_events();
+	for (idx, node) in nodes.iter().enumerate() {
+		let pending_events = node.get_and_clear_pending_events();
+		if node.broadcaster.txn_broadcasted.borrow().is_empty() {
+			assert!(
+				!pending_events.iter().any(|e| matches!(e, events::Event::SpliceNegotiated { .. })),
+				"node {} has pending SpliceNegotiated event(s) but no broadcast tx",
+				idx,
+			);
+			continue;
+		}
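As a standalone illustration of the two-directional check this comment asks for, here is a minimal sketch. The `Txid` and `Event` types and the `check_splice_invariant` helper are hypothetical stand-ins, not the fuzz target's real types, and matching events to broadcasts by a `txid` field is an assumption made only for this example.

```rust
use std::collections::HashSet;

// Hypothetical stand-ins for the harness types; only the shape matters here.
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
struct Txid(u32);

#[derive(Debug)]
enum Event {
    // Assumption: the event can be keyed by the splice tx it refers to.
    SpliceNegotiated { txid: Txid },
    Other,
}

/// Returns Ok(()) only if the set of broadcast splice txs and the set of
/// SpliceNegotiated events agree in both directions.
fn check_splice_invariant(broadcast: &[Txid], events: &[Event]) -> Result<(), String> {
    let broadcast_ids: HashSet<&Txid> = broadcast.iter().collect();
    let event_ids: HashSet<&Txid> = events
        .iter()
        .filter_map(|e| match e {
            Event::SpliceNegotiated { txid } => Some(txid),
            Event::Other => None,
        })
        .collect();

    // Forward direction: every broadcast tx has a matching event.
    if let Some(txid) = broadcast_ids.difference(&event_ids).next() {
        return Err(format!("broadcast {:?} has no SpliceNegotiated event", txid));
    }
    // Reverse direction: every event has a matching broadcast tx.
    if let Some(txid) = event_ids.difference(&broadcast_ids).next() {
        return Err(format!("SpliceNegotiated for {:?} was never broadcast", txid));
    }
    Ok(())
}
```

Calling `check_splice_invariant(&[], &events)` with a `SpliceNegotiated` event in `events` returns an error, which is the case the one-directional loop lets through.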

ldk-claude-review-bot · 2026-06-01T22:30:42Z

The current diff contains only two hunks, both of which I examined. The error string "not expecting funding signatures" matches the messages in lightning/src/ln/channel.rs:2197 and :2243, confirming the err.contains(...) guard is correct.

No new issues found in this diff beyond my prior review comment. The two changes are correct:

The FundingTransactionReadyForSigning error handling appropriately tolerates stale signing events invalidated by a later tx_abort, and the matched error substring exists in the production code.
The MAX_SETTLE_ITERATIONS constant (100→256) and the typo fix ("may iterations" → "many iterations") are straightforward.
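The error-tolerance pattern described in the first point can be sketched in isolation as follows. `ApiError`, `SigningOutcome`, and `handle_signing_result` are hypothetical, simplified names for illustration and do not mirror LDK's actual definitions; only the substring "not expecting funding signatures" comes from the review above.

```rust
// Hypothetical, simplified error type; not LDK's real `APIError`.
#[derive(Debug, PartialEq)]
enum ApiError {
    ApiMisuseError { err: String },
    ChannelUnavailable { err: String },
}

/// What the harness does with one signing attempt (hypothetical).
#[derive(Debug, PartialEq)]
enum SigningOutcome {
    Signed,
    /// The signing event was stale, so the attempt is skipped.
    IgnoredStale,
}

const STALE_SIGNING_MARKER: &str = "not expecting funding signatures";

fn handle_signing_result(res: Result<(), ApiError>) -> Result<SigningOutcome, ApiError> {
    match res {
        Ok(()) => Ok(SigningOutcome::Signed),
        // Match guard: only this specific misuse error is tolerated.
        Err(ApiError::ApiMisuseError { ref err }) if err.contains(STALE_SIGNING_MARKER) => {
            Ok(SigningOutcome::IgnoredStale)
        },
        // Every other error still surfaces as a failure.
        Err(e) => Err(e),
    }
}
```

Keeping the guard on both the variant and the substring means an unrelated misuse error, or the same text on a different variant, is not silently swallowed.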

My previously posted comment on the one-directional broadcast/event verification in assert_test_invariants still stands but is not re-posted here.

ldk-reviews-bot · 2026-06-03T22:18:31Z

🔔 1st Reminder

Hey @TheBlueMatt! This PR has been waiting for your review.
Please take a look when you have a chance. If you're unable to review, please let us know so we can find another reviewer.

ldk-reviews-bot · 2026-06-06T00:01:18Z

🔔 2nd Reminder

Hey @TheBlueMatt! This PR has been waiting for your review.
Please take a look when you have a chance. If you're unable to review, please let us know so we can find another reviewer.

ldk-reviews-bot · 2026-06-08T00:02:11Z

🔔 3rd Reminder

Hey @TheBlueMatt! This PR has been waiting for your review.
Please take a look when you have a chance. If you're unable to review, please let us know so we can find another reviewer.

TheBlueMatt · 2026-06-08T12:01:59Z

 use std::sync::{Arc, Mutex};

 const MAX_FEE: u32 = 10_000;
+const MAX_SETTLE_ITERATIONS: usize = 256;


Codex's analysis on why it's needed:

• They are draining real protocol work in a very serialized harness loop. process_all_events() checks work in this order and restarts the loop after the first thing that makes progress:

1. manager persistence / monitor completions
2. node 0 messages
3. node 1 messages
4. node 2 messages
5. node 0 events
6. node 1 events
7. node 2 events
8. quiet pass

So one commitment round can take several loop iterations: deliver commitment_signed, complete monitor update, deliver revoke_and_ack, complete another monitor update, process PaymentClaimable, send update_fulfill_htlc, then another commitment_signed/revoke_and_ack round.

For payload 1, the long 4:102 loop is the final ff. It is settling:

- probe HTLCs left by the previous settle_all() channel-liveness check
- a0: A starts an A-B splice
- 41: B pays A
- 4d: C pays A via B
- a3: C starts a B-C splice
- 73: A pays C via B with MPP split over the B-C channels

In that one settle region, the normal log shows this wire work:

- 52 commitment_signed
- 48 revoke_and_ack
- 15 update_add_htlc
- 15 update_fulfill_htlc
- 8 tx_complete
- 5 tx_add_input
- 5 tx_add_output
- 4 tx_signatures
- 4 stfu
- 1 tx_init_rbf
- 1 tx_ack_rbf
- 1 splice_init
- 1 splice_ack

So it is mostly HTLC commitment handshakes. The splice/RBF messages are there too, but the big multiplier is: multiple channels, MPP parts, previous settle probe payments, and each add/fulfill needing the commitment dance.

For payload 2, the long 0:103 loop is the first explicit ff. Before it, the payload has already staged both A-B and B-C splices, partially delivered some splice messages manually, sent a tiny A->B payment, sent two MPP A->B->C payments, two B->C direct payments, and started another B-C splice. Its config byte also makes node B's monitors start as InProgress and makes A/C use deferred monitor writes.

That settle region delivers:

- 26 commitment_signed
- 18 revoke_and_ack
- 15 update_add_htlc
- 15 update_fulfill_htlc
- 16 tx_complete
- 10 tx_add_input
- 10 tx_add_output
- 8 tx_signatures
- 4 stfu
- 2 tx_init_rbf
- 2 tx_ack_rbf
- 2 splice_ack

The first visible HTLC burst is especially dense: A sends 7 update_add_htlcs to B in one commitment update, from the tiny direct A->B payment plus the two 3-part MPP first hops. B later sends 8 update_add_htlcs to C from the two direct B->C payments plus the two 3-part MPP second hops. Then C claims, fulfills go back C->B, then B fulfills back B->A, each with more commitment handshakes and monitor completions.

The last iterations are not extra messages. They are cleanup:

- payload 1: iteration 99 is still monitor/persistence progress, then 100 and 101 are the two quiet passes.
- payload 2: iteration 100 is still monitor/persistence progress, then 101 and 102 are the two quiet passes.

That is why the old 100 cap was wrong: it fired while the harness was still making legitimate forward progress, right before quiescence.
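To make the iteration accounting in the analysis above concrete, here is a toy model of a serialized settle loop. It is a sketch under stated assumptions, not the fuzz target's `process_all_events()`: the `SOURCES` table, the `settle` function, and the fixed per-source work counters are all hypothetical, and real work can enqueue further work, which this model ignores.

```rust
/// Priority order of work sources, mirroring the numbered list above
/// (the final "quiet pass" step is what happens when none has work).
const SOURCES: [&str; 7] = [
    "monitor completions",
    "node 0 messages",
    "node 1 messages",
    "node 2 messages",
    "node 0 events",
    "node 1 events",
    "node 2 events",
];

/// Each iteration handles one unit of work from the first source that has
/// any, then restarts the scan. The loop is settled after
/// `quiet_passes_needed` consecutive iterations with no work at all.
/// Returns the number of iterations used, or an error if the cap is hit.
fn settle(
    pending: &mut [usize; 7],
    max_iterations: usize,
    quiet_passes_needed: usize,
) -> Result<usize, String> {
    let mut quiet = 0;
    for iteration in 0..max_iterations {
        match pending.iter_mut().find(|n| **n > 0) {
            Some(n) => {
                // Progress was made: consume one unit and reset the quiet count.
                *n -= 1;
                quiet = 0;
            },
            None => {
                quiet += 1;
                if quiet >= quiet_passes_needed {
                    return Ok(iteration + 1);
                }
            },
        }
    }
    // Name the source that still had work, if any, when the cap was reached.
    let stuck = pending
        .iter()
        .position(|n| *n > 0)
        .map(|i| SOURCES[i])
        .unwrap_or("quiet pass");
    Err(format!("cap of {} iterations hit during: {}", max_iterations, stuck))
}
```

In this model, 100 units of pending work plus two quiet passes need 102 iterations, so a cap of 100 fails right before quiescence while a cap of 256 succeeds.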

wpaulino · 2026-06-09T18:07:50Z

I'm dropping 89bdf95 and baa5d0c as the corresponding failures are already addressed by #4657.

Now that the fuzz target supports canceling splice funding attempts, we may see failed signing attempts due to the cancellation.

LDK and the chanmon_consistency fuzz target have grown in complexity recently and thus require more iterations than previously assumed to fully settle the state of all active channels.

joostjager · 2026-06-11T17:29:52Z

+						Err(APIError::APIMisuseError { ref err })
+							if err.contains("not expecting funding signatures") =>
+						{
+							// A queued signing event can be invalidated by a later `tx_abort`


Stricter invariant might check if the tx_abort has indeed happened.

tx_abort can happen for other reasons as well. cancel_funding_contributed would be a better signal, but that's still not enough to identify this case, e.g., both nodes of the splice have to sign but one canceled, so the other node attempting to sign will fail even though they didn't cancel.
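The point that a per-node cancellation flag cannot identify this case can be shown with a toy model. Everything here (`SpliceNegotiation`, `Peer`, `strict_check_passes`) is hypothetical and heavily simplified; it only assumes that an aborted negotiation makes signing fail for both sides.

```rust
/// Toy model: one negotiation shared by two peers. All names are hypothetical.
#[derive(Debug, Default)]
struct SpliceNegotiation {
    aborted: bool,
}

#[derive(Debug, Default)]
struct Peer {
    /// What a per-node "did I cancel?" flag would record.
    canceled_locally: bool,
}

impl Peer {
    fn cancel(&mut self, negotiation: &mut SpliceNegotiation) {
        self.canceled_locally = true;
        // Stands in for the abort reaching the shared negotiation state.
        negotiation.aborted = true;
    }

    fn sign(&self, negotiation: &SpliceNegotiation) -> Result<(), &'static str> {
        if negotiation.aborted {
            Err("not expecting funding signatures")
        } else {
            Ok(())
        }
    }
}

/// A stricter check that tolerates a signing failure only if this peer
/// itself canceled.
fn strict_check_passes(peer: &Peer, sign_result: Result<(), &'static str>) -> bool {
    sign_result.is_ok() || peer.canceled_locally
}
```

If peer A cancels and peer B then tries to sign, B's attempt fails while B's own flag is still false, so the stricter check would reject a legitimate outcome.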

wpaulino requested a review from TheBlueMatt June 1, 2026 22:17

wpaulino self-assigned this Jun 1, 2026

wpaulino added this to the 0.3 milestone Jun 1, 2026

ldk-claude-review-bot reviewed Jun 1, 2026

TheBlueMatt reviewed Jun 8, 2026

joostjager reviewed Jun 8, 2026

Comment thread fuzz/src/chanmon_consistency.rs Outdated

joostjager reviewed Jun 8, 2026

Comment thread fuzz/src/chanmon_consistency.rs Outdated

wpaulino added 2 commits June 9, 2026 11:08

Ignore stale splice signing fuzz events

1379144

Raise iteration capacity in chanmon consistency when settling state

099bb09

wpaulino force-pushed the splice-chanmon-consistency-fixes branch from baa5d0c to 099bb09 Compare June 9, 2026 18:09

TheBlueMatt added the backport 0.3 label Jun 10, 2026

wpaulino requested a review from TheBlueMatt June 10, 2026 17:43

joostjager approved these changes Jun 11, 2026

wpaulino wants to merge 2 commits into lightningdevkit:main from wpaulino:splice-chanmon-consistency-fixes
