Platform-specific model compression for deep neural networks with joint methods