Might be a good idea to review/merge this PR together with #1172 (@bwintermann). |
|
I indeed ran into the same issue and opened the PR mentioned by @fpjentzsch because of it. Did you measure how much faster this is than the previous approach? I would be quite happy if we didn't have to load some C code to get fast execution speed. |
|
I tried this on a MNv1 with FLOAT32 output and got this error because you expect a 5-dimensional output shape:
After adjusting the code from 5 to 3 dimensions, it worked for this case and sped up the output data unpacking by ~3000x. |
Faster data unpacking (Different to #1291)
Faster implementation of finn.util.data_packing.packed_bytearray_to_finnpy. It uses bit shifts instead of the previous approach and is necessary for real-time processing on PYNQ.
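For readers unfamiliar with the idea, here is a minimal NumPy sketch of bit-shift-based unpacking. It is not the code from this PR and does not reproduce the signature or options of packed_bytearray_to_finnpy. The function name `unpack_with_shifts`, its parameters, and the assumed layout (elements packed least-significant bit first along the last axis) are hypothetical and chosen only for illustration.

```python
import numpy as np


def unpack_with_shifts(packed, bitwidth, n_elems, signed=False):
    """Unpack fixed-width integers from a uint8 array (hypothetical helper).

    Assumed layout (an illustration, not FINN's actual format): elements are
    packed back to back along the last axis, least-significant bit first.
    """
    packed = np.asarray(packed, dtype=np.uint8)
    # Expand every byte into its 8 bits, LSB first, along the last axis.
    bits = np.unpackbits(packed, axis=-1, bitorder="little")
    # Drop padding bits and group the rest into (n_elems, bitwidth).
    bits = bits[..., : n_elems * bitwidth]
    bits = bits.reshape(*packed.shape[:-1], n_elems, bitwidth).astype(np.int64)
    # Shift each bit to its position and OR them together, fully vectorized.
    shifts = np.arange(bitwidth, dtype=np.int64)
    values = np.bitwise_or.reduce(bits << shifts, axis=-1)
    if signed:
        # Two's-complement sign extension for the given bitwidth.
        sign_bit = np.int64(1) << (bitwidth - 1)
        values = (values ^ sign_bit) - sign_bit
    return values


# One byte holding four 2-bit values, and a batch of two such rows.
single = unpack_with_shifts([0xE4], bitwidth=2, n_elems=4)
batch = unpack_with_shifts([[0xE4], [0x1B]], bitwidth=2, n_elems=4)
signed_vals = unpack_with_shifts([0xE4], bitwidth=2, n_elems=4, signed=True)
```

Because the sketch only operates on the last axis and keeps all leading axes, it does not depend on the array having a particular number of dimensions, which is the kind of assumption that caused the 5-dimensional versus 3-dimensional shape problem reported in the comments above.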