Hello all, I have done quite a bit of research and read the paper numerous times, but I haven’t been able to reach a clear enough understanding to answer the questions below. Thank you in advance for any help and clarity you can offer!

I have a few questions, as follows:

[Question 1] I understand the LMU to function similarly to a standard RNN when applied to time series data. Does a sliding window work with the LMU for event-based data?
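
To make the question concrete, here is a minimal sketch of the kind of sliding window I have in mind. The function name `sliding_windows` and its `window` and `stride` parameters are hypothetical, the events are assumed to be already binned onto a regular time grid, and nothing in it is specific to the LMU.

```python
def sliding_windows(series, window, stride=1):
    """Split one sequence into overlapping fixed-length windows."""
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    # Each window starts `stride` steps after the previous one;
    # a trailing partial window is dropped.
    return [series[start:start + window]
            for start in range(0, len(series) - window + 1, stride)]


# Toy event stream (assumed): 1 marks an event in that time bin, 0 marks none.
events = [0, 1, 0, 0, 1, 1, 0, 0]
windows = sliding_windows(events, window=4, stride=2)
```

Each entry of `windows` would then be fed to the network as one input sequence.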

[Question 2] Does unbalanced data have an effect on the LMU?

[2a] Is small-batch training compatible with the LMU?

[Question 3] Should the “size” of the LMU be determined by the total amount of training data or by the size of each subset of data (i.e., each individual time series)?

Thank you very much in advance, and if you have any questions or need clarification, please let me know.