Efficient deep learning inference at the edge