Accelerating Deep Learning Inference in Constrained Embedded Devices Using Hardware Loops and a Dot Product Unit | IEEE Journals & Magazine | IEEE Xplore