Make SSH integration tests easier to debug #1046

adombeck · 2025-08-28T12:17:38Z

Changes for improved debuggability, test execution time, and maintainability, including:

Lots of improvements to the logs printed by the tests.
Avoid unnecessary rebuilds
Refactorings

UDENG-8137

codecov-commenter · 2025-09-01T14:17:22Z

Codecov Report

❌ Patch coverage is 67.03297% with 30 lines in your changes missing coverage. Please review.
✅ Project coverage is 87.74%. Comparing base (3d2b788) to head (70b6f72).
⚠️ Report is 6 commits behind head on main.

Files with missing lines	Patch %	Lines
internal/testlog/testlog.go	59.70%	27 Missing ⚠️
pam/internal/adapter/nativemodel.go	60.00%	2 Missing ⚠️
log/journal.go	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1046      +/-   ##
==========================================
- Coverage   88.04%   87.74%   -0.30%     
==========================================
  Files          85       87       +2     
  Lines        6039     6112      +73     
  Branches      111      111              
==========================================
+ Hits         5317     5363      +46     
- Misses        666      693      +27     
  Partials       56       56

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

To avoid having to pass it to each method call.

We never use more than one output, so let's simplify things a bit.

It's a bit hidden in there. Let's move it to the caller so we don't wonder in the caller whether the tape is saved as an artifact or not.

When running sshd in daemon mode, the child processes it spawns for handling ssh connections do not print logs to the log file, and their stderr is not connected, so the log messages are lost. By running sshd in the foreground via -D, and making it log to stderr via -e, we get both the main processes and all child processes logs printed to stderr.

The service name is used as the filename of the artifact, where "authd-cli" doesn't make it clear that this is a PAM service file.

... but only print debug1 messages to stderr, to avoid spamming the test output too much. This can be configured via the AUTHD_SSHD_STDERR_DEBUG_LEVEL environment variable.

We were building everything twice, because testSSHAuthenticate is called twice (once with sharedSSHD set to true and once set to false).

We were printing "Building PAM module" when we were building the sshd_preloader library.

Output written directly to stdout/stderr is sometimes assigned to the wrong test, even when using `go test -json`. Using `t.Log` instead should avoid the problem.

This makes our own TestWriter obsolete. Also has the effect that the output written via this is not prefixed with the file and line, which makes it a lot more readable.

And use it when running gcov, because its output is a bit verbose and unhelpful when there is no error.

It's building a shared library.

I've seen this test time out in the CI.

So that we're able to use them from other packages than the pam/integration-tests.

Writing both stdout and stderr to the same file causes a data race. Lets use our SyncBuffer instead.

It failed with that error in CI: ssh_test.go:491: Error Trace: /home/runner/work/authd/authd/pam/integration-tests/ssh_test.go:795 /home/runner/work/authd/authd/pam/integration-tests/ssh_test.go:659 /home/runner/work/authd/authd/pam/integration-tests/ssh_test.go:491 /home/runner/work/authd/authd/internal/testutils/testrun.go:83 Error: Received unexpected error: dial tcp :35523: connect: connection reset by peer Test: TestSSHAuthenticate/Authenticate_user_switching_auth_mode

The tool's spelling is VHS, not vhs, so let's use that.

The function name RunAuthd made it seem like it would block until authd has finished, which is not the case. StartAuthd makes it clear that the function returns once authd was started.

adombeck force-pushed the improve-ssh-tests branch from a9453fd to cda8f9b Compare August 28, 2025 12:17

adombeck mentioned this pull request Aug 29, 2025

Fix cargo always treating nss crate as dirty #1042

Merged

adombeck force-pushed the improve-ssh-tests branch 11 times, most recently from 7762549 to 28f1faa Compare September 1, 2025 13:35

adombeck force-pushed the improve-ssh-tests branch 16 times, most recently from f1d6e86 to a0718d1 Compare September 2, 2025 17:05

adombeck added 26 commits September 9, 2025 09:03

refactor: Use fileutils.CopyFile

cbce469

tests: Avoid VHS suggesting to upload GIF

5b21f43

refactor: Extract field tapeData.OutputDir

d8e83ac

To avoid having to pass it to each method call.

refactor: Replace tapeData.Outputs with tapeData.OutputFilename

92589d8

We never use more than one output, so let's simplify things a bit.

refactor: Move the artifact-saving code out of PrepareTape

72c3d43

It's a bit hidden in there. Let's move it to the caller so we don't wonder in the caller whether the tape is saved as an artifact or not.

tests: Improve log messages

bfaeddd

tests: Make the name of the pam service clearer

42dcedc

The service name is used as the filename of the artifact, where "authd-cli" doesn't make it clear that this is a PAM service file.

tests: Store sshd output with log level debug3 as test artifact

d9708b1

... but only print debug1 messages to stderr, to avoid spamming the test output too much. This can be configured via the AUTHD_SSHD_STDERR_DEBUG_LEVEL environment variable.

ssh tests: Only build test dependencies once when needed

4a917c6

We were building everything twice, because testSSHAuthenticate is called twice (once with sharedSSHD set to true and once set to false).

tests: Fix log message

6b8c7f6

We were printing "Building PAM module" when we were building the sshd_preloader library.

tests: Log output via t.Log

72ddb84

Output written directly to stdout/stderr is sometimes assigned to the wrong test, even when using `go test -json`. Using `t.Log` instead should avoid the problem.

tests: Make use of the new Output method of testing.T

a66ac20

This makes our own TestWriter obsolete. Also has the effect that the output written via this is not prefixed with the file and line, which makes it a lot more readable.

tests: Run gcov with logging

e8c2ea3

tests: Add log messages when locking/unlocking rust build dir

e6fc9de

tests: Support printing stdout/stderr on error in RunWithTiming

cc3fdd8

And use it when running gcov, because its output is a bit verbose and unhelpful when there is no error.

refactor: Extract testlog package

0e9cc85

refactor: Rename buildCModule -> buildSharedLibrary

e861d69

It's building a shared library.

tests: Avoid data race when reading/writing sshd output buffer

42bdd9c

refactor: Rename some variables for better clarity

2261218

tests: Bump time waiting for server to quit

8bf0e05

I've seen this test time out in the CI.

refactor: Move artifacts-related functions to testutils

dd0d442

So that we're able to use them from other packages than the pam/integration-tests.

tests: Avoid data race when writing authd output

21199dd

Writing both stdout and stderr to the same file causes a data race. Lets use our SyncBuffer instead.

Use correct capitalization for VHS

b579fbc

The tool's spelling is VHS, not vhs, so let's use that.

refactor: Rename RunAuthd -> StartAuthd

042e6c2

The function name RunAuthd made it seem like it would block until authd has finished, which is not the case. StartAuthd makes it clear that the function returns once authd was started.

adombeck force-pushed the improve-ssh-tests branch from 17b7462 to 042e6c2 Compare September 9, 2025 07:04

adombeck mentioned this pull request Sep 11, 2025

Test locking and unlocking a user after DB migration #1072

Draft

adombeck changed the title ~~Improve SSH tests~~ Improve SSH integration tests Sep 25, 2025

adombeck changed the title ~~Improve SSH integration tests~~ Make SSH integration tests easier to debug Sep 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make SSH integration tests easier to debug #1046

Make SSH integration tests easier to debug #1046

Uh oh!

adombeck commented Aug 28, 2025 •

edited

Loading

Uh oh!

codecov-commenter commented Sep 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

Make SSH integration tests easier to debug #1046

Are you sure you want to change the base?

Make SSH integration tests easier to debug #1046

Uh oh!

Conversation

adombeck commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Sep 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

adombeck commented Aug 28, 2025 •

edited

Loading

codecov-commenter commented Sep 1, 2025 •

edited

Loading