EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization

W Zeng, T Xu, M Li, R Wang - arXiv preprint arXiv:2404.09404, 2024 - arxiv.org
Private convolutional neural network (CNN) inference based on secure two-party
computation (2PC) suffers from high communication and latency overhead, especially from …