In particular, we have benchmarked getML's _FastProp_ (short for fast propositionalization) against other implementations of the propositionalization algorithm.
As we can see, _FastProp_ is true to its name: it achieves similar or slightly better predictive performance than _featuretools_ or _tsfresh_, but generates features 11x to 65x faster than these implementations.
If you want to reproduce these results, please refer to the following notebooks:

| Notebook | Results | Remarks |
|----------|---------|---------|
|[Air pollution][airpollutionnb_prop]|~65x faster than featuretools, ~33x faster than tsfresh | The predictive accuracy can be significantly improved by using RelMT instead of propositionalization approaches; please refer to [this notebook][airpollutionnb]. |
|[Dodgers][dodgersnb_prop]|~42x faster than featuretools, ~75x faster than tsfresh | The predictive accuracy can be significantly improved by using the mapping preprocessor and/or more advanced feature learning algorithms; please refer to [this notebook][dodgersnb]. |
|[Interstate94][interstate94nb_prop]|~55x faster than featuretools ||
|[Occupancy][occupancynb_prop]|~87x faster than featuretools, ~41x faster than tsfresh ||
|[Robot][robotnb_prop]|~162x faster than featuretools, ~77x faster than tsfresh ||

These results are highly hardware-dependent and may differ on your machine. Even so, we expect that you will find getML's _FastProp_ to be significantly faster than _featuretools_ and _tsfresh_ while consuming considerably less memory.
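Because the figures depend on hardware, it can help to measure the ratio on your own machine. The following is a minimal sketch of how a "~Nx faster" number can be derived from wall-clock timings. It is not taken from the benchmark notebooks: `time_call`, `speedup`, `slow_pipeline` and `fast_pipeline` are hypothetical names, and the two pipelines are placeholders for whatever feature-generation calls you want to compare.

```python
import time
from typing import Callable


def time_call(fn: Callable[[], object], repeats: int = 3) -> float:
    """Return the best wall-clock time in seconds over `repeats` runs of `fn`."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - start)
    return best


def speedup(baseline_seconds: float, candidate_seconds: float) -> float:
    """Return how many times faster the candidate is than the baseline."""
    if candidate_seconds <= 0:
        raise ValueError("candidate_seconds must be positive")
    return baseline_seconds / candidate_seconds


# Hypothetical stand-ins for two feature-generation pipelines.
# Replace them with the calls you actually want to compare.
def slow_pipeline() -> int:
    return sum(i * i for i in range(200_000))


def fast_pipeline() -> int:
    return sum(i * i for i in range(2_000))


baseline = time_call(slow_pipeline)
candidate = time_call(fast_pipeline)
print(f"~{speedup(baseline, candidate):.0f}x faster")
```

Taking the best of several runs reduces noise from other processes. The printed ratio will vary between machines and runs, which is why the numbers in the table are given as approximations.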
### Relational Dataset Repository
Some benchmarks are also featured on the [Relational Dataset Repository](https://relational.fit.cvut.cz/):