KAFKA-19624: Improving consistency of command-line arguments for consumer performance tests #20385

aheev · 2025-08-20T14:40:54Z

resolves https://issues.apache.org/jira/browse/KAFKA-19624

Reviewers: @brandboat, @AndrewJSchofield, @m1a2st

…umer performance tests

aheev · 2025-08-20T14:41:28Z

@AndrewJSchofield can you please review this?

brandboat · 2025-08-25T02:11:38Z

ping @aheev, could you please fix the conflicts?

# Conflicts: # tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java # tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java

aheev · 2025-08-25T03:07:42Z

ping @aheev, could you please fix the conflicts?

done

brandboat

Thanks for the patch, left some comments below.

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

tools/src/main/java/org/apache/kafka/tools/ShareConsumerPerformance.java

aheev · 2025-08-25T07:16:09Z

FAILED ❌ RestoreIntegrationTest > "shouldInvokeUserDefinedGlobalStateRestoreListener(boolean).useNewProtocol=false"

This test seems to be flaky. It succeeds on my machine and fails sometimes. Also the changes are not even related to the test

Just tested. Same issue on trunk too

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

m1a2st

Thanks for this patch, some comments left

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

AndrewJSchofield

Thanks for the PR. Please add some tests for the variation combinations of new/old options to make sure the rules in the KIP are correctly implemented.

tests/kafkatest/services/performance/consumer_performance.py

tests/kafkatest/services/performance/share_consumer_performance.py

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

tools/src/main/java/org/apache/kafka/tools/ShareConsumerPerformance.java

…nd `messages` traces from the tools

brandboat

Thanks for the update, some minor comments.

tools/src/main/java/org/apache/kafka/tools/ShareConsumerPerformance.java

tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java

tools/src/test/java/org/apache/kafka/tools/ShareConsumerPerformanceTest.java

brandboat

LGTM, thanks!

tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java

AndrewJSchofield

Apart from adding --command-property to both kafka-consumer-perf-test.sh and kafka-share-consumer-perf-test.sh, this looks good to me.

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

aheev · 2025-09-02T13:01:08Z

Overall, LGTM. Could you run the e2e test and share the results?

What do you mean by e2e test?

m1a2st · 2025-09-02T13:14:19Z

Since you have modified the e2e test file consumer_performance.py, you can validate it by running the following command to ensure the test still works:
TC_PATHS="tests/kafkatest/benchmarks/core/benchmark_test.py" bash tests/docker/run_tests.sh

FYI: https://github.com/apache/kafka/blob/trunk/tests/README.md

aheev · 2025-09-02T13:21:57Z

Since you have modified the e2e test file consumer_performance.py, you can validate it by running the following command to ensure the test still works: TC_PATHS="tests/kafkatest/benchmarks/core/benchmark_test.py" bash tests/docker/run_tests.sh

FYI: https://github.com/apache/kafka/blob/trunk/tests/README.md

I will run them after andrew's comments are resolved

AndrewJSchofield · 2025-09-03T10:05:39Z

@aheev Please resolve conflicts

AndrewJSchofield

Thanks for the PR. Once we have a green build, I'm ready to merge this.

aheev · 2025-09-03T12:12:25Z

Thanks for the PR. Once we have a green build, I'm ready to merge this.

Shouldn't we wait for this? I am trying to run the tests, but running into some issues

Since you have modified the e2e test file consumer_performance.py, you can validate it by running the following command to ensure the test still works:
TC_PATHS="tests/kafkatest/benchmarks/core/benchmark_test.py" bash tests/docker/run_tests.sh

# Conflicts: # tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

AndrewJSchofield · 2025-09-03T14:41:01Z

Thanks for the PR. Once we have a green build, I'm ready to merge this.

Shouldn't we wait for this? I am trying to run the tests, but running into some issues

Since you have modified the e2e test file consumer_performance.py, you can validate it by running the following command to ensure the test still works:
TC_PATHS="tests/kafkatest/benchmarks/core/benchmark_test.py" bash tests/docker/run_tests.sh

Yes, we should. I'll wait then.

aheev · 2025-09-04T16:57:31Z

After a lot of struggle I managed to run the test by using a different JDK(will file a ticket for this later). Here are the results @m1a2st . Three tests failed, all of which are related to producer-perf
benchmark_test.log

m1a2st · 2025-09-05T01:00:57Z

Here are the results @m1a2st . Three tests failed, all of which are related to producer-perf

Thanks, Could you file a ticket to trace this issue?

# Conflicts: # tools/src/main/java/org/apache/kafka/tools/ShareConsumerPerformance.java

aheev · 2025-09-05T06:53:22Z

Here are the results @m1a2st . Three tests failed, all of which are related to producer-perf

Thanks, Could you file a ticket to trace this issue?

https://issues.apache.org/jira/browse/KAFKA-19672

aheev · 2025-09-05T08:37:21Z

This test seems to be flaky and not related to the changes. It succeeded in a couple of runs in my local. @AndrewJSchofield should we trigger a CI re-run?

FAILED ❌ SmokeTestDriverIntegrationTest > "shouldWorkWithRebalance(boolean, boolean, boolean).stateUpdaterEnabled=false, processingThreadsEnabled=false, streamsProtocolEnabled=true"
FAILED ❌ SmokeTestDriverIntegrationTest > "shouldWorkWithRebalance(boolean, boolean, boolean).stateUpdaterEnabled=true, processingThreadsEnabled=false, streamsProtocolEnabled=true"
FAILED ❌ SmokeTestDriverIntegrationTest > "shouldWorkWithRebalance(boolean, boolean, boolean).stateUpdaterEnabled=true, processingThreadsEnabled=true, streamsProtocolEnabled=true"
Found 2 flaky test failures:
FLAKY ⚠️  SmokeTestDriverIntegrationTest > "shouldWorkWithRebalance(boolean, boolean, boolean).stateUpdaterEnabled=true, processingThreadsEnabled=true, streamsProtocolEnabled=false"
FLAKY ⚠️  SmokeTestDriverIntegrationTest > "shouldWorkWithRebalance(boolean, boolean, boolean).stateUpdaterEnabled=true, processingThreadsEnabled=false, streamsProtocolEnabled=false"

AndrewJSchofield · 2025-09-05T15:48:50Z

@aheev I would merge latest changes into this branch. I find the test fails on your branch, and passes on trunk. I can't see that this PR would cause the tests to fail, so I'd update the branch and let the CI run again.

tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java

tests/kafkatest/services/performance/consumer_performance.py

tests/kafkatest/services/performance/share_consumer_performance.py

…losed

aheev · 2025-09-07T16:04:36Z

@Yunyung ConsumerPerformanceService and ShareConsumerPerformanceService are used in benchmark_test.py(which uses DEV_BRANCH and test_performance_services.py(which uses LATEST_2_1 and DEV_BRANCH)

ShareConsumerPerformanceService runs only in test_performance_services.py runs on branches >= 4.1. Hence, we can ignore this
As for ConsumerPerformanceService, I have added changes, but, while running the test, ProducerPerformanceService, which is used to produce data in turn consumed by ConsumerPerformanceService, fails, thereby blocking the ConsumerPerformanceService to consume on LATEST_2_1. It fails on trunk too

[INFO:2025-09-07 09:01:58,715]: RunnerClient: kafkatest.sanity_checks.test_performance_services.PerformanceServiceTest.test_version.version=2.1.1.metadata_quorum=ISOLATED_KRAFT: FAIL: Exception('ProducerPerformanceService-0-125869363841104-worker-1: Traceback (most recent call last):\n  File "/usr/local/lib/python3.10/dist-packages/ducktape/services/background_thread.py", line 36, in _protected_worker\n    self._worker(idx, node)\n  File "/opt/kafka-dev/tests/kafkatest/services/performance/producer_performance.py", line 130, in _worker\n    wait_until(lambda: self.alive(node), timeout_sec=20, err_msg="ProducerPerformance failed to start")\n  File "/usr/local/lib/python3.10/dist-packages/ducktape/utils/util.py", line 58, in wait_until\n    raise TimeoutError(err_msg() if callable(err_msg) else err_msg) from last_exception\nducktape.errors.TimeoutError: ProducerPerformance failed to start\n')
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 351, in _do_run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 411, in run_test
    return self.test_context.function(self.test)
  File "/usr/local/lib/python3.10/dist-packages/ducktape/mark/_mark.py", line 438, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/opt/kafka-dev/tests/kafkatest/sanity_checks/test_performance_services.py", line 58, in test_version
    self.producer_perf.run()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/services/service.py", line 345, in run
    self.wait()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/services/background_thread.py", line 72, in wait
    self._propagate_exceptions()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/services/background_thread.py", line 103, in _propagate_exceptions
    raise Exception(self.errors)
Exception: ProducerPerformanceService-0-125869363841104-worker-1: Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/services/background_thread.py", line 36, in _protected_worker
    self._worker(idx, node)
  File "/opt/kafka-dev/tests/kafkatest/services/performance/producer_performance.py", line 130, in _worker
    wait_until(lambda: self.alive(node), timeout_sec=20, err_msg="ProducerPerformance failed to start")
  File "/usr/local/lib/python3.10/dist-packages/ducktape/utils/util.py", line 58, in wait_until
    raise TimeoutError(err_msg() if callable(err_msg) else err_msg) from last_exception
ducktape.errors.TimeoutError: ProducerPerformance failed to start

I think we need to fix our test suite first
@AndrewJSchofield what is the way forward here?

KAFKA-19487: Improving consistency of command-line arguments for cons…

931dfaf

…umer performance tests

github-actions bot added triage PRs from the community tools labels Aug 20, 2025

aheev changed the title ~~KAFKA-19487: Improving consistency of command-line arguments for consumer performance tests~~ KAFKA-19624: Improving consistency of command-line arguments for consumer performance tests Aug 20, 2025

AndrewJSchofield added ci-approved and removed triage PRs from the community labels Aug 22, 2025

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

17460c7

# Conflicts: # tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java # tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java

brandboat reviewed Aug 25, 2025

View reviewed changes

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java Show resolved Hide resolved

tools/src/main/java/org/apache/kafka/tools/ShareConsumerPerformance.java Show resolved Hide resolved

Add deprecation cycle for num-records

34651a9

aheev requested a review from brandboat August 25, 2025 06:27

brandboat reviewed Aug 25, 2025

View reviewed changes

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java Outdated Show resolved Hide resolved

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java Outdated Show resolved Hide resolved

m1a2st reviewed Aug 25, 2025

View reviewed changes

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java Show resolved Hide resolved

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java Outdated Show resolved Hide resolved

AndrewJSchofield self-requested a review August 25, 2025 14:42

AndrewJSchofield requested changes Aug 25, 2025

View reviewed changes

aheev added 4 commits August 26, 2025 16:32

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

529eb36

Add deprecation signatures to old options; Remove consumer config a…

d82ce4f

…nd `messages` traces from the tools

Add checks for deprecated options

c4cd51e

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

b8b2dc4

aheev requested review from brandboat, m1a2st and AndrewJSchofield August 26, 2025 20:01

brandboat reviewed Aug 27, 2025

View reviewed changes

aheev added 2 commits August 27, 2025 14:55

Rename tests to old config names

5c8fd2b

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

d252457

brandboat approved these changes Aug 27, 2025

View reviewed changes

tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java Outdated Show resolved Hide resolved

Rename consumer config files to unique prefix-suffixes

9d9a8c6

aheev added 2 commits August 28, 2025 21:16

add required check on bootstrap server opt

7020430

add required check on bootstrap server opt tests

a99f389

AndrewJSchofield requested changes Aug 28, 2025

View reviewed changes

aheev added 2 commits September 1, 2025 16:47

Add command-property

a6b1d7d

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

0f06d95

AndrewJSchofield requested changes Sep 1, 2025

View reviewed changes

tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java Outdated Show resolved Hide resolved

aheev added 2 commits September 2, 2025 21:52

fix command-property description

4ff0992

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

a36a868

aheev requested a review from AndrewJSchofield September 2, 2025 16:48

AndrewJSchofield approved these changes Sep 3, 2025

View reviewed changes

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

5af4ee4

# Conflicts: # tools/src/main/java/org/apache/kafka/tools/ConsumerPerformance.java

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

fc98942

# Conflicts: # tools/src/main/java/org/apache/kafka/tools/ShareConsumerPerformance.java

Yunyung reviewed Sep 5, 2025

View reviewed changes

tools/src/test/java/org/apache/kafka/tools/ConsumerPerformanceTest.java Show resolved Hide resolved

Yunyung reviewed Sep 5, 2025

View reviewed changes

tests/kafkatest/services/performance/consumer_performance.py Outdated Show resolved Hide resolved

tests/kafkatest/services/performance/share_consumer_performance.py Show resolved Hide resolved

aheev added 4 commits September 6, 2025 01:03

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

116f7cd

rename messages to num-records in testMetricsRetrievedBeforeConsumerC…

d9f8854

…losed

Merge branch 'trunk' of https://github.com/apache/kafka into KAFKA-19487

037a177

Add support for running consumer_performance.py on older kafka brokers

f624975

KAFKA-19624: Improving consistency of command-line arguments for consumer performance tests #20385

Are you sure you want to change the base?

KAFKA-19624: Improving consistency of command-line arguments for consumer performance tests #20385

Uh oh!

Conversation

aheev commented Aug 20, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aheev commented Aug 20, 2025

Uh oh!

brandboat commented Aug 25, 2025

Uh oh!

aheev commented Aug 25, 2025

Uh oh!

brandboat left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

aheev commented Aug 25, 2025

Uh oh!

Uh oh!

Uh oh!

m1a2st left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

AndrewJSchofield left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

brandboat left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

brandboat left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AndrewJSchofield left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aheev commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

m1a2st commented Sep 2, 2025

Uh oh!

aheev commented Sep 2, 2025

Uh oh!

AndrewJSchofield commented Sep 3, 2025

Uh oh!

AndrewJSchofield left a comment

Choose a reason for hiding this comment

Uh oh!

aheev commented Sep 3, 2025

Uh oh!

AndrewJSchofield commented Sep 3, 2025

Uh oh!

aheev commented Sep 4, 2025

Uh oh!

m1a2st commented Sep 5, 2025

Uh oh!

aheev commented Sep 5, 2025

Uh oh!

aheev commented Aug 20, 2025 •

edited by github-actions bot

Loading

aheev commented Sep 2, 2025 •

edited

Loading

aheev commented Sep 7, 2025 •

edited

Loading