use new unique jobs implementation #38

bgentry · 2025-01-12T19:57:06Z

This moves the library to use the new unique jobs implementation from riverqueue/river#590 and migrates the sqlalchemy driver to use a unified insertion path, allowing bulk inserts to use unique jobs.

It's intended to be analogous to riverqueue/riverqueue-ruby#32.

Outstanding issues:

There's one failing test that I think is due to some mocking related issue and I haven't yet been able to figure it out. Maybe you'd be able to take a crack at that?
I think it also needs more comprehensive test coverage for the various unique options
by_args needs support for partial keys, and needs to use sorted json before hashing. Tests are essential for this.
There are a couple of type misalignment issues where I'm wanting to insert null values into the unique_key or unique_states fields, because sqlc generates code that doesn't allow None values in these input lists. I'm not seeing any clear way to resolve that.

This one's related to #38. It turns out that while trying to mock a context manager kind of works, it will do the wrong thing in edge cases like when an exception is thrown from inside a `with` block, silently swallowing it and causing a return that's completely wrong. There may be some way to fix the mock to make it do the right thing, but instead of getting fancier with these mocks that are already awful, instead repair the problem by defining a plain class that implements context manager and just use that.

brandur · 2025-01-25T22:53:15Z

There's one failing test that I think is due to some mocking related issue and I haven't yet been able to figure it out. Maybe you'd be able to take a crack at that?

This took an absurdly long time to figure out, but the problem was that the context manager mock to return an executor was swallowing a thrown AssertionError in this with block:

    def insert_many(self, args: List[JobArgs | InsertManyParams]) -> list[InsertResult]:

        with self.driver.executor() as exec:
            return self._insert_many_exec(exec, args)

Thereby causing insert_many to return None and then insert fail as it tried to index element [0].

I put in a fix in #39 which seems to work. Take a look at that, we can merge it, and then it should fix the tests over here.

Medium term I want to strip all these mocks out. They always seem like a good idea, but every time I use them I remember why they're so painful to work with.

I'm going to time out on this PR for the day since I already spent too much time on it, but if you get blocked on one of your other work streams, maybe see if you can start looking into your no. 2 and no. 3 checkboxes.

There are a couple of type misalignment issues where I'm wanting to insert null values into the unique_key or unique_states fields, because sqlc generates code that doesn't allow None values in these input lists. I'm not seeing any clear way to resolve that.

Didn't get a chance to look at this one, but it's possible it'll require an additional sql parameter like unique_states_is_null that gets based into the SQL query.

This one's related to #38. It turns out that while trying to mock a context manager kind of works, it will do the wrong thing in edge cases like when an exception is thrown from inside a `with` block, silently swallowing it and causing a return that's completely wrong. There may be some way to fix the mock to make it do the right thing, but instead of getting fancier with these mocks that are already awful, instead repair the problem by defining a plain class that implements context manager and just use that.

This moves the library to use the new unique jobs implementation from riverqueue/river#590 and migrates the sqlalchemy driver to use a unified insertion path, allowing bulk inserts to use unique jobs.

bgentry · 2025-01-29T16:15:44Z

@brandur nice, looks like #39 fixed the test issues. I rebased and now everything is passing from make test 🙌

brandur · 2025-02-15T20:14:50Z

@bgentry What's the next step with respect to this one? (i.e. Do you have what you need to keep going?) Just in light of Eric checking back in today, it probably makes sense to try and make sure there's no known bugs left in this package.

src/riverqueue/driver/riversqlalchemy/sql_alchemy_driver.py

brandur

Damn it, I wrote this days ago and forgot to complete the review.

brandur · 2025-01-25T21:45:11Z

src/riverqueue/insert_opts.py

    args and queues. If either args or queue is changed on a new job, it's
    allowed to be inserted as a new job.

+    TODO update description ⚠ ⚠️ ⚠


src/riverqueue/driver/riversqlalchemy/sql_alchemy_driver.py

bgentry · 2025-02-27T02:17:46Z

src/riverqueue/client.py

+            }
+
+        # Serialize with sorted keys and append to unique key:
+        sorted_args = json.dumps(args_to_include, sort_keys=True)


Something that just came to mind: are the JSON encodings between Go, Ruby, and Python identical on this front? Do they omit unnecessary spaces, trailing newlines, etc? Unique conflicts won't be detected properly if not.

Hmm, yeah that might be a problem. It looks like Ruby has the same behavior as Go with minimal spacing between keys:

> JSON.dump({'z': 'last', 'a': 1}.sort.to_h) => "{\"a\":1,\"z\":\"last\"}"

But Python puts spacing in by default:

>>> json.dumps({'z': 'last', 'a': 1}, sort_keys=True) '{"a": 1, "z": "last"}'

However, they've got a really convenient separators keyword to dumps that takes the whitespace out and makes it like Ruby/Go:

>>> json.dumps({'z': 'last', 'a': 1}, sort_keys=True, separators=(',', ':')) '{"a":1,"z":"last"}'

Would be worth adding that in here for sure!

Added, thanks for digging in to confirm!

brandur

Modulo one comment on use of separators from yesterday, .

bgentry requested a review from brandur January 12, 2025 19:57

brandur mentioned this pull request Jan 25, 2025

Don't try to mock context manager; use a simple class instead #39

Merged

bgentry force-pushed the bg-newer-unique-jobs branch from c65e79f to 3899985 Compare January 29, 2025 16:11

use new unique jobs implementation

31a2cd1

This moves the library to use the new unique jobs implementation from riverqueue/river#590 and migrates the sqlalchemy driver to use a unified insertion path, allowing bulk inserts to use unique jobs.

bgentry force-pushed the bg-newer-unique-jobs branch from 3899985 to 31a2cd1 Compare January 29, 2025 16:14

bgentry commented Feb 23, 2025

View reviewed changes

src/riverqueue/driver/riversqlalchemy/sql_alchemy_driver.py Outdated Show resolved Hide resolved

sort args before hashing, support partial arg extraction

b583535

brandur reviewed Feb 27, 2025

View reviewed changes

work around sqlc nullable array value type issue

ea8b94b

bgentry commented Feb 27, 2025

View reviewed changes

bgentry force-pushed the bg-newer-unique-jobs branch from 82f4348 to 92aed86 Compare February 27, 2025 02:32

documentation updates, changelog

273a641

bgentry force-pushed the bg-newer-unique-jobs branch from 92aed86 to 273a641 Compare February 27, 2025 02:35

bgentry marked this pull request as ready for review February 27, 2025 02:35

bgentry requested a review from brandur February 27, 2025 02:36

brandur approved these changes Feb 27, 2025

View reviewed changes

remove whitespace from unique key json component

dceb837

bgentry merged commit f27a3bb into master Feb 27, 2025
4 checks passed

bgentry deleted the bg-newer-unique-jobs branch February 27, 2025 15:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

use new unique jobs implementation #38

use new unique jobs implementation #38

Uh oh!

bgentry commented Jan 12, 2025 •

edited

Loading

Uh oh!

brandur commented Jan 25, 2025

Uh oh!

bgentry commented Jan 29, 2025

Uh oh!

brandur commented Feb 15, 2025

Uh oh!

Uh oh!

brandur left a comment

Uh oh!

brandur Jan 25, 2025

Uh oh!

Uh oh!

bgentry Feb 27, 2025

Uh oh!

brandur Feb 27, 2025 •

edited

Loading

Uh oh!

bgentry Feb 27, 2025

Uh oh!

brandur left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

use new unique jobs implementation #38

use new unique jobs implementation #38

Uh oh!

Conversation

bgentry commented Jan 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brandur commented Jan 25, 2025

Uh oh!

bgentry commented Jan 29, 2025

Uh oh!

brandur commented Feb 15, 2025

Uh oh!

Uh oh!

brandur left a comment

Choose a reason for hiding this comment

Uh oh!

brandur Jan 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

bgentry Feb 27, 2025

Choose a reason for hiding this comment

Uh oh!

brandur Feb 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bgentry Feb 27, 2025

Choose a reason for hiding this comment

Uh oh!

brandur left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bgentry commented Jan 12, 2025 •

edited

Loading

brandur Feb 27, 2025 •

edited

Loading