Optimize block management in decode phase #68

xiaohajiayou · 2025-07-04T07:40:07Z

In #71 #66 #65 #30 , there were questions about the timing of applying can_append and may_append for requesting new blocks. This PR will separate the logic for appending new blocks when the block is just filled, and the hash check when the block is not fully filled, in order to improve readability.
Key Changes:

Call check_and_update_hash before processing each sequence
Replace may_append with append for clarity
Simplify conditional logic for better readability

(Addresses: Decouple block management and hash computation)

xiaohajiayou · 2025-07-04T07:54:50Z

In #71 ,Combine the logic of block allocation and hash calculation into an atomic operation.
It seems that both methods are feasible, but there is a slight difference in how they improve readability:

Method 1: Pre-allocation Principle (len (seq) % self.block_size == 0)

This ensures that memory blocks are pre-allocated in exact multiples of the block size, making the logic relatively intuitive to understand. However, in practice, it still needs to enter may_append for runtime hash checks.

Method 2: On-demand Allocation Principle (len (seq) % self.block_size == 1)

Separate the responsibilities of block allocation and hash checking. Split can_append and may_append into can_append、append, and check_and_update_hash, thus keeping the block allocation logic decoupled from the hashing process.

CZWin32768 · 2025-08-05T06:46:54Z

Hi, thanks for your fix. I have a question: your impl of check_and_update_hash assert the last block hash always being -1. Is there situation that the previously preempted seq length equals block size so that the hash is not -1?

Optimize block management and export need_append func

dc416e4

xiaohajiayou force-pushed the main branch from f734848 to dc416e4 Compare July 4, 2025 11:30

xiaohajiayou mentioned this pull request Jul 7, 2025

fix: update can_append and may_append logic for block allocation #71

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize block management in decode phase #68

Optimize block management in decode phase #68

Uh oh!

xiaohajiayou commented Jul 4, 2025 •

edited

Loading

Uh oh!

xiaohajiayou commented Jul 4, 2025 •

edited

Loading

Uh oh!

CZWin32768 commented Aug 5, 2025

Uh oh!

Uh oh!

Optimize block management in decode phase #68

Are you sure you want to change the base?

Optimize block management in decode phase #68

Uh oh!

Conversation

xiaohajiayou commented Jul 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xiaohajiayou commented Jul 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Method 1: Pre-allocation Principle (len (seq) % self.block_size == 0)

Method 2: On-demand Allocation Principle (len (seq) % self.block_size == 1)

Uh oh!

CZWin32768 commented Aug 5, 2025

Uh oh!

Uh oh!

xiaohajiayou commented Jul 4, 2025 •

edited

Loading

xiaohajiayou commented Jul 4, 2025 •

edited

Loading