1 min readMar 8, 2020
Hi! This experiment used the original partitions (only preprocessing described) and only 20 epochs for the early stopping (very short). However, the objective is just that, just a demonstration, but you can use more, of course. For the final experiment, in the Puigcerver’s paper, he uses 80 epochs instead.