You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Updated adventure works for 1.3.0
* Adapted various notebooks to 1.3.0
* Adapted yet more notebooks to 1.3.0
* Updated the air pollution notebook to 1.3.0
* Updated four more notebooks to 1.3.0
* Removed the propositionalization notebooks
Co-authored-by: Patrick Urbanke <[email protected]>
In particular, we have benchmarked getML's _FastProp_ (short for fast propositionalization) against other implementations of the propositionalization algorithm.
As we can see, _FastProp_ is true to its name: It achieves similar or slightly better performance than _featuretools_ or _tsfresh_, but generates features between 11x to 65x faster than these implementations.
93
-
94
-
If you want to reproduce these results, please refer to the following notebooks:
|[Air pollution][airpollutionnb_prop]|~65x faster than featuretools, ~33x faster than tsfresh | The predictive accuracy can be significantly improved by using RelMT instead of propositionalization approaches, please refer to [this notebook][airpollutionnb]. |
99
-
|[Dodgers][dodgersnb_prop]|~42x faster than featuretools, ~75x faster than tsfresh | The predictive accuracy can be significantly improved by using the mapping preprocessor and/or more advanced feature learning algorithms, please refer to [this notebook][dodgersnb]. |
100
-
|[Interstate94][interstate94nb_prop]|~55x faster than featuretools ||
101
-
|[Occupancy][occupancynb_prop]|~87x faster than featuretools, ~41x faster than tsfresh ||
102
-
|[Robot][robotnb_prop]|~162x faster than featuretools, ~77x faster than tsfresh ||
103
-
104
-
These results are very hardware-dependent and may be different on your machine. However, we have no doubt that you will find that getML's _FastProp_ is significantly faster than _featuretools_ and _tsfresh_ while consuming considerably less memory.
105
-
106
84
### Relational Dataset Repository
107
85
108
86
Some benchmarks are also featured on the [Relational Dataset Repository](https://relational.fit.cvut.cz/):
@@ -139,10 +117,5 @@ Some benchmarks are also featured on the [Relational Dataset Repository](https:/
0 commit comments