BasisNet: Two-stage Model Synthesis for Efficient Inference

Zhang, Mingda; Chu, Chun-Te; Zhmoginov, Andrey; Howard, Andrew; Jou, Brendan; Zhu, Yukun; Zhang, Li; Hwa, Rebecca; Kovashka, Adriana

Computer Science > Computer Vision and Pattern Recognition

arXiv:2105.03014 (cs)

[Submitted on 7 May 2021]

Title:BasisNet: Two-stage Model Synthesis for Efficient Inference

Authors:Mingda Zhang, Chun-Te Chu, Andrey Zhmoginov, Andrew Howard, Brendan Jou, Yukun Zhu, Li Zhang, Rebecca Hwa, Adriana Kovashka

View PDF

Abstract:In this work, we present BasisNet which combines recent advancements in efficient neural network architectures, conditional computation, and early termination in a simple new form. Our approach incorporates a lightweight model to preview the input and generate input-dependent combination coefficients, which later controls the synthesis of a more accurate specialist model to make final prediction. The two-stage model synthesis strategy can be applied to any network architectures and both stages are jointly trained. We also show that proper training recipes are critical for increasing generalizability for such high capacity neural networks. On ImageNet classification benchmark, our BasisNet with MobileNets as backbone demonstrated clear advantage on accuracy-efficiency trade-off over several strong baselines. Specifically, BasisNet-MobileNetV3 obtained 80.3% top-1 accuracy with only 290M Multiply-Add operations, halving the computational cost of previous state-of-the-art without sacrificing accuracy. With early termination, the average cost can be further reduced to 198M MAdds while maintaining accuracy of 80.0% on ImageNet.

Comments:	To appear, 4th Workshop on Efficient Deep Learning for Computer Vision (ECV2021), CVPR2021 Workshop
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2105.03014 [cs.CV]
	(or arXiv:2105.03014v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2105.03014

Submission history

From: Mingda Zhang [view email]
[v1] Fri, 7 May 2021 00:21:56 UTC (25,549 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:BasisNet: Two-stage Model Synthesis for Efficient Inference

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:BasisNet: Two-stage Model Synthesis for Efficient Inference

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators