EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization
Private convolutional neural network (CNN) inference based on secure two-party
computation (2PC) suffers from high communication and latency overhead, especially from …
computation (2PC) suffers from high communication and latency overhead, especially from …