
Conversation

@fkiraly (Collaborator) commented Feb 22, 2025

This PR adds a systematic test suite and a check_estimator utility for pytorch-forecasting forecasters.

The interface checked is the current unified API across models. The API may change in the future, but this PR makes no changes to it.

Work in progress.

@fkiraly fkiraly added the enhancement New feature or request label Feb 22, 2025
```python
    return all_train_kwargs, train_kwargs_names


def _integration(
```
A contributor commented on this diff:

After looking at the _integration function in this line, I was wondering whether it could be modularized better by splitting it into smaller functions like prepare_data, setup_trainer, etc. This might help in debugging errors flagged by the tests. If required, these smaller functions can be nested under a larger function.
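A rough sketch of such a split, purely for illustration (the helper names prepare_data and setup_trainer do not exist in the package):

```python
# illustrative sketch only -- helper names (prepare_data, setup_trainer)
# are not existing utilities in pytorch-forecasting
from lightning.pytorch import Trainer
from pytorch_forecasting import TimeSeriesDataSet


def prepare_data(data_with_covariates, **dataset_kwargs):
    """Build train/val datasets and dataloaders from raw data."""
    training = TimeSeriesDataSet(data_with_covariates, **dataset_kwargs)
    validation = TimeSeriesDataSet.from_dataset(training, data_with_covariates)
    return (
        training.to_dataloader(train=True, batch_size=4),
        validation.to_dataloader(train=False, batch_size=4),
    )


def setup_trainer(tmp_path, **trainer_kwargs):
    """Configure a Trainer with fast, test-friendly defaults."""
    return Trainer(
        max_epochs=1,
        limit_train_batches=2,
        default_root_dir=tmp_path,
        **trainer_kwargs,
    )


def _integration(model_cls, data_with_covariates, tmp_path, **kwargs):
    """Fit one model end-to-end, delegating to the smaller helpers."""
    train_dl, val_dl = prepare_data(
        data_with_covariates, **kwargs.pop("dataset_kwargs", {})
    )
    trainer = setup_trainer(tmp_path, **kwargs.pop("trainer_kwargs", {}))
    model = model_cls.from_dataset(train_dl.dataset, **kwargs)
    trainer.fit(model, train_dataloaders=train_dl, val_dataloaders=val_dl)
    return trainer
```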

@fkiraly (Collaborator, Author) replied:

Agreed! I was thinking the same thing.

My suggestion would be to sequence this, though, since there are multiple copy-paste variations across files.

So:

  1. first, refactor the current _integration variants into a single file and loop, parametrizing the variable parts
  2. then, refactor into modular parts

Not sure if this is the best way - if we get stuck, we may want to try the other way round, since 2 might make 1 easier.
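A minimal sketch of what step 1 could look like, with a hypothetical scenario table holding only the variable parts (assumes the shared _integration from this PR and the data_with_covariates fixture from conftest):

```python
import pytest

import pytorch_forecasting

# hypothetical scenario table -- only the variable parts of the
# copy-paste variants are listed per model
MODEL_SCENARIOS = [
    ("TemporalFusionTransformer", {"hidden_size": 8}),
    ("DeepAR", {"hidden_size": 8}),
]


@pytest.mark.parametrize("model_name, model_kwargs", MODEL_SCENARIOS)
def test_model_integration(model_name, model_kwargs, data_with_covariates, tmp_path):
    """One shared loop replacing the per-model _integration copies."""
    model_cls = getattr(pytorch_forecasting, model_name)
    _integration(model_cls, data_with_covariates, tmp_path, **model_kwargs)
```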

```python
result = []
ROOT = str(Path(__file__).parent.parent)  # package root directory


def _coerce_to_str(obj):
```
A member commented:

Should we add these "coerce" functions to utils._coerce, just to make them usable across the whole module?

@fkiraly (Collaborator, Author) replied:

Hm, I thought about it and would say "no", because it assumes that the presence of get_tag means we have a BaseObject descendant instance.

utils._coerce makes no specific assumptions about inheritance.

I wonder how we could resolve this - maybe we move the get_tag logic out?
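One possible resolution, sketched with illustrative names: keep a generic coercion with no inheritance assumptions (which could then live in utils._coerce), plus a thin tag-aware wrapper on top:

```python
def _coerce_to_str(obj):
    """Generic coercion; makes no assumptions about the object's base class."""
    if isinstance(obj, str):
        return obj
    if isinstance(obj, (list, tuple)):
        return [_coerce_to_str(x) for x in obj]
    return type(obj).__name__


def _coerce_tagged_to_str(obj):
    """Tag-aware variant: resolve skbase BaseObject descendants via get_tag."""
    if hasattr(obj, "get_tag"):  # duck-typed check, avoids an isinstance import
        obj = obj.get_tag("object_type", raise_error=False) or obj
    return _coerce_to_str(obj)
```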

@phoeenniixx (Member) commented May 19, 2025

I have a few comments:

  • What will be the difference between class _BaseObject and _BasePtForecaster? I mean, can _BasePtForecaster not inherit directly from _SkbaseBaseObject? Why do we need _BaseObject?
  • The func _integration is mainly for v1; I think we should write a similar func for v2 that also checks the metadata transfer, etc. The basis can be the tests written for TFT in [ENH] EXPERIMENTAL: TFT model based on the new data pipeline #1812 (see test_model_with_datamodule_integration).
  • I assume the current fixture generator and _BasePtForecaster are already compatible with both versions, right? Or am I missing something?
  • Do we need to add tests for D1-D2 integration as well, as suggested by @PranavBhatP (some of it can be seen in [ENH] EXPERIMENTAL PR: D1 and D2 layer for v2 refactor #1811)?
  • To make this compatible with v2, we just need to make some changes to data_scenarios and _conftest; other than that, everything is independent of the version, right?

I think this PR can be the basis of the tests for v2 that I'll be working on.

@phoeenniixx (Member) commented:

One more doubt: should we add some initialisation tests for the models as well, or are the integration tests enough?

@fkiraly (Collaborator, Author) commented May 20, 2025

> What will be the difference between class _BaseObject and _BasePtForecaster? I mean, can _BasePtForecaster not inherit directly from _SkbaseBaseObject? Why do we need _BaseObject?

Good question - I thought we may later also attach metadata to layers, in which case the layers would also inherit from _BaseObject, forming the lowest common base object of everything in pytorch-forecasting. Which, I thought, would be nice to have inside the package, from an architectural perspective ("single point of dependency").

> The func _integration is mainly for v1; I think we should write a similar func for v2 that also checks the metadata transfer, etc. The basis can be the tests written for TFT in #1812 (see test_model_with_datamodule_integration).

Agreed, although I think there should also be smaller unit tests. Plus, we need to abstract out the variable parts, possibly in more "test parameter getter" methods in the metadata class.

> I assume the current fixture generator and _BasePtForecaster are already compatible with both versions, right? Or am I missing something?

The fixture generators are generic and could be used (by inheritance) for any base class defined by the object_type tag. They retrieve all classes with the specified object_type tag and run them through the tests, in a loop.
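For concreteness, a sketch of the retrieval step, assuming all_objects from skbase.lookup and an illustrative tag value:

```python
from skbase.lookup import all_objects


def _all_forecasters():
    """Retrieve all classes in the package carrying the given object_type tag."""
    return all_objects(
        package_name="pytorch_forecasting",
        filter_tags={"object_type": "forecaster_pytorch"},  # illustrative value
        return_names=False,
    )
```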

> Do we need to add tests for D1-D2 integration as well, as suggested by @PranavBhatP (some of it can be seen in #1811)?

I think that is a good idea. For v2, I would advise adding a get_datamodule function, so that D2 and the model can be tested together. For this, it would be good to have at least one D2 instance which is not encoder-decoder, to see whether the loop works properly.
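A sketch of what the suggested get_datamodule hook could look like; the data module class name and import path are assumptions, not the final API:

```python
class _TFTv2Metadata:  # hypothetical metadata class for one v2 model
    """Metadata record pairing a v2 model with its D2 data module."""

    @classmethod
    def get_datamodule(cls, data, **kwargs):
        """Return a D2 data module instance suitable for this model."""
        # class name and location assumed, following the v2 data pipeline PRs
        from pytorch_forecasting.data import EncoderDecoderTimeSeriesDataModule

        return EncoderDecoderTimeSeriesDataModule(data, **kwargs)
```

The test loop could then call metadata_cls.get_datamodule(data) and hand the result to trainer.fit(model, datamodule=dm), regardless of which D2 flavour the model needs.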

> To make this compatible with v2, we just need to make some changes to data_scenarios and _conftest; other than that, everything is independent of the version, right?

More things need to change:

  • the tests should work with v2, of course, with all the changes that entails
  • you will also need to take into account potentially varying D2, see above

> I think this PR can be the basis of the tests for v2 that I'll be working on.

Yes, I would advise branching off in a way that does not overwrite the v1 tests - which we will also need.

One way to filter could be to use an additional object_type tag value, "forecaster_pytorch_v2".
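As a sketch, a v2 metadata class would then declare the new tag value (class name hypothetical, import path assumed):

```python
# import path assumed for illustration; adjust to the actual location
from pytorch_forecasting.models.base._base_object import _BasePtForecaster


class _MyV2ForecasterPkg(_BasePtForecaster):  # hypothetical metadata class
    """Selected separately from the v1 forecasters via its object_type tag."""

    _tags = {"object_type": "forecaster_pytorch_v2"}
```

The fixture generator can then pass filter_tags={"object_type": "forecaster_pytorch_v2"} to collect v2 forecasters only, leaving the v1 test loop untouched.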

@fkiraly (Collaborator, Author) commented May 20, 2025

> Should we add some initialisation tests for the models as well, or are the integration tests enough?

Yes, I think there should be init tests and unit tests as well. This PR just aimed at refactoring all the current tests - that is perhaps a good start, and we may like to complete it. For the v2 tests, though, which will be written from scratch, I would strongly advise adding init and unit tests!
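For illustration, an init test could be as small as the following, using the create_test_instance convention from skbase (object_class is assumed to be supplied by the fixture generator):

```python
def test_init(object_class):
    """Check that the model can be constructed from its test parameters."""
    model = object_class.create_test_instance()
    assert isinstance(model, object_class)
    # cheap extra sanity check: get_params / set_params round-trip
    params = model.get_params()
    assert model.set_params(**params) is model
```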

@fkiraly fkiraly marked this pull request as ready for review May 27, 2025 20:41
@fkiraly fkiraly merged commit aa3bc9d into main May 28, 2025
34 of 51 checks passed
@github-project-automation github-project-automation bot moved this from PR in progress to Done in May - Sep 2025 mentee projects May 28, 2025
fkiraly pushed a commit that referenced this pull request May 29, 2025
### Description

This PR fixes #1807 and stacks upon PR #1780. It builds upon PR #1814 (closed due to a complex commit history).