skip to main content
10.1145/3384419.3431159acmconferencesArticle/Chapter ViewAbstractPublication PagessensysConference Proceedingsconference-collections
research-article

ApproxDet: content and contention-aware approximate object detection for mobiles

Published:16 November 2020Publication History

ABSTRACT

Advanced video analytic systems, including scene classification and object detection, have seen widespread success in various domains such as smart cities and autonomous systems. With an evolution of heterogeneous client devices, there is incentive to move these heavy video analytics workloads from the cloud to mobile devices for low latency and real-time processing and to preserve user privacy. However, most video analytic systems are heavyweight and are trained offline with some pre-defined latency or accuracy requirements. This makes them unable to adapt at runtime in the face of three types of dynamism --- the input video characteristics change, the amount of compute resources available on the node changes due to co-located applications, and the user's latency-accuracy requirements change. In this paper we introduce ApproxDet, an adaptive video object detection framework for mobile devices to meet accuracy-latency requirements in the face of changing content and resource contention scenarios. To achieve this, we introduce a multi-branch object detection kernel, which incorporates a data-driven modeling approach on the performance metrics, and a latency SLA-driven scheduler to pick the best execution branch at runtime. We evaluate ApproxDet on a large benchmark video dataset and compare quantitatively to AdaScale and YOLOv3. We find that ApproxDet is able to adapt to a wide variety of contention and content characteristics and outshines all baselines, e.g., it achieves 52% lower latency and 11.1% higher accuracy over YOLOv3. Our software is open-sourced at https://github.com/purdue-dcsl/ApproxDet.

References

  1. Jason Ansel, Yee Lok Wong, Cy Chan, Marek Olszewski, Alan Edelman, and Saman Amarasinghe. 2011. Language and compiler support for auto-tuning variable-accuracy algorithms. In International Symposium on Code Generation and Optimization (CGO 2011). IEEE, 85--96.Google ScholarGoogle ScholarCross RefCross Ref
  2. Kittipat Apicharttrisorn, Xukan Ran, Jiasi Chen, Srikanth V Krishnamurthy, and Amit K Roy-Chowdhury. 2019. Frugal following: Power thrifty object detection and tracking for mobile augmented reality. In Proceedings of the Conference on Embedded Networked Sensor Systems (SenSys). 96--109.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Saurabh Bagchi, Vaneet Aggarwal, Somali Chaterji, Fred Douglis, Aly El Gamal, Jiawei Han, Brian J Henz, Henry Hoffmann, Suman Jana, Milind Kulkarni, et al. 2020. Vision Paper: Grand Challenges in Resilience: Autonomous System Resilience through Design and Runtime Measures. IEEE Open Journal of the Computer Society 1 (2020), 155--172.Google ScholarGoogle ScholarCross RefCross Ref
  4. Ross Bulat. 2020. React Native: Background Task Management in iOS. https://medium.com/@rossbulat/react-native-background-task-management-in-ios-d0f05ae53cc5Google ScholarGoogle Scholar
  5. Somali Chaterji, Nathan DeLay, John Evans, Nathan Mosier, Bernard Engel, Dennis Buckmaster, and Ranveer Chandra. 2020. Artificial Intelligence for Digital Agriculture at Scale: Techniques, Policies, and Challenges. arXiv preprint arXiv:2001.09786 (2020).Google ScholarGoogle Scholar
  6. Somali Chaterji, Parinaz Naghizadeh, Muhammad Ashraful Alam, Saurabh Bagchi, Mung Chiang, David Corman, Brian Henz, Suman Jana, Na Li, Shaoshuai Mou, et al. 2019. Resilient Cyberphysical Systems and their Application Drivers: A Technology Roadmap. arXiv preprint arXiv:2001.00090 (2019).Google ScholarGoogle Scholar
  7. Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W Sheaffer, Sang-Ha Lee, and Kevin Skadron. 2009. Rodinia: A benchmark suite for heterogeneous computing. In 2009 IEEE International Symposium on Workload Characterization (IISWC). Ieee, 44--54.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Tiffany Yu-Han Chen, Lenin Ravindranath, Shuo Deng, Paramvir Bahl, and Hari Balakrishnan. 2015. Glimpse: Continuous, real-time object recognition on mobile devices. In Proceedings of the ACM Conference on Embedded Networked Sensor Systems (SenSys). 155--168.Google ScholarGoogle Scholar
  9. Ting-Wu Chin, Ruizhou Ding, and Diana Marculescu. 2019. AdaScale: Towards real-time video object detection using adaptive scaling. In Proceedings of the Conference on Machine Learning and Systems (SysML).Google ScholarGoogle Scholar
  10. The Pokemon Company. 2020. PokÃl'mon GO | Augmented Reality Mobile Game. https://pokemongolive.com/en/Google ScholarGoogle Scholar
  11. NVIDIA Corporation. 2018. Jetson TX2 Module. Retrieved May 5, 2020 from https://developer.nvidia.com/embedded/buy/jetson-tx2Google ScholarGoogle Scholar
  12. Jifeng Dai, Yi Li, Kaiming He, and Jian Sun. 2016. R-FCN: Object detection via region-based fully convolutional networks. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS). 379--387.Google ScholarGoogle Scholar
  13. Christina Delimitrou and Christos Kozyrakis. 2013. ibench: Quantifying interference for datacenter applications. In 2013 IEEE International Symposium on Workload Characterization (IISWC). IEEE, 23--33.Google ScholarGoogle ScholarCross RefCross Ref
  14. Android Developer. 2019. Guide to background processing: Android. https://developer.android.com/guide/backgroundGoogle ScholarGoogle Scholar
  15. Apple Developer. 2019. Services provided by an app that require it to run in the background. https://developer.apple.com/documentation/bundleresources/information_property_list/uibackgroundmodesGoogle ScholarGoogle Scholar
  16. Yufei Ding, Jason Ansel, Kalyan Veeramachaneni, Xipeng Shen, Una-May OâĂŹReilly, and Saman Amarasinghe. 2015. Autotuning algorithmic choice for input sensitivity. ACM SIGPLAN Notices 50, 6 (2015), 379--390.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. 2019. Neural Architecture Search: A Survey. Journal of Machine Learning Research 20, 55 (2019), 1--21.Google ScholarGoogle Scholar
  18. Biyi Fang, Xiao Zeng, and Mi Zhang. 2018. NestDNN: Resource-aware multi-tenant on-device deep learning for continuous mobile vision. In Proceedings of the Annual International Conference on Mobile Computing and Networking (MobiCom). 115--127.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Gunnar Farnebäck. 2003. Two-frame motion estimation based on polynomial expansion. In Proceedings of Scandinavian Conference on Image Analysis. 363--370.Google ScholarGoogle ScholarCross RefCross Ref
  20. Christoph Feichtenhofer, Axel Pinz, and Andrew Zisserman. 2017. Detect to track and track to detect. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 3038--3046.Google ScholarGoogle ScholarCross RefCross Ref
  21. Sadjad Fouladi, Riad S Wahby, Brennan Shacklett, Karthikeyan Balasubramaniam, William Zeng, Rahul Bhalerao, Anirudh Sivaraman, George Porter, and Keith Winstein. 2017. Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads.. In Proceedings of the Symposium on Networked Systems Design and Implementation (NSDI). 363--376.Google ScholarGoogle Scholar
  22. Asish Ghoshal, Ananth Grama, Saurabh Bagchi, and Somali Chaterji. 2015. An ensemble svm model for the accurate prediction of non-canonical microrna targets. In Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics. 403--412.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Brad Glasbergen, Michael Abebe, Khuzaima Daudjee, and Amit Levi. 2020. Sentinel: universal analysis and insight for data systems. Proceedings of the VLDB Endowment 13, 12 (2020), 2720--2733.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Sudipto Guha, Nina Mishra, Gourav Roy, and Okke Schrijvers. 2016. Robust random cut forest based anomaly detection on streams. In Proceedings of the International Conference on Machine Learning (ICML). 2712--2721.Google ScholarGoogle Scholar
  25. Song Han, Huizi Mao, and William J Dally. 2016. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. (2016), 1--13.Google ScholarGoogle Scholar
  26. Seungyeop Han, Haichen Shen, Matthai Philipose, Sharad Agarwal, Alec Wolman, and Arvind Krishnamurthy. 2016. MCDNN: An approximation-based execution framework for deep stream processing under resource constraints. In Proceedings of the Annual International Conference on Mobile Systems, Applications, and Services (MobiSys). 123--136.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. F. Henriques, R. Caseiro, P. Martins, and J. Batista. 2014. High-Speed Tracking with Kernelized Correlation Filters. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 3 (2014), 583--596.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, et al. 2019. Searching for MobileNetV3. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 1314--1324.Google ScholarGoogle ScholarCross RefCross Ref
  29. Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).Google ScholarGoogle Scholar
  30. Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B Gibbons, and Onur Mutlu. 2018. Focus: Querying large video datasets with low latency and low cost. In 13th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 18). 269--286.Google ScholarGoogle Scholar
  31. Hengyuan Hu, Rui Peng, Yu-Wing Tai, and Chi-Keung Tang. 2016. Network trimming: A data-driven neuron pruning approach towards efficient deep architectures. arXiv preprint arXiv:1607.03250 (2016).Google ScholarGoogle Scholar
  32. Gao Huang, Danlu Chen, Tianhong Li, Felix Wu, Laurens van der Maaten, and Kilian Q Weinberger. 2018. Multi-scale dense networks for resource efficient image classification. In Proceedings of International Conference on Learning Representations (ICLR).Google ScholarGoogle Scholar
  33. Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. 2016. Binarized neural networks. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS). 4107--4115.Google ScholarGoogle Scholar
  34. Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. 2017. Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations. Journal of Machine Learning Research 18 (2017), 187--1.Google ScholarGoogle Scholar
  35. Forrest N Iandola, Song Han, Matthew W Moskewicz, Khalid Ashraf, William J Dally, and Kurt Keutzer. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size. In Proceedings of International Conference on Learning Representations (ICLR). 1--13.Google ScholarGoogle Scholar
  36. Angela H Jiang, Daniel L-K Wong, Christopher Canel, Lilia Tang, Ishan Misra, Michael Kaminsky, Michael A Kozuch, Padmanabhan Pillai, David G Andersen, and Gregory R Ganger. 2018. Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing. In Proceedings of USENIX Annual Technical Conference (USENIX ATC). 29--42.Google ScholarGoogle Scholar
  37. Junchen Jiang, Ganesh Ananthanarayanan, Peter Bodik, Siddhartha Sen, and Ion Stoica. 2018. Chameleon: scalable adaptation of video analytics. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM). 253--266.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Zdenek Kalal, Krystian Mikolajczyk, and Jiri Matas. 2010. Forward-Backward error: Automatic detection of tracking failures. In Proceedings of IEEE International Conference on Pattern Recognition (CVPR). 2756--2759.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Kai Kang, Hongsheng Li, Junjie Yan, Xingyu Zeng, Bin Yang, Tong Xiao, Cong Zhang, Zhe Wang, Ruohui Wang, Xiaogang Wang, et al. 2017. T-CNN: Tubelets with convolutional neural networks for object detection from videos. IEEE Transactions on Circuits and Systems for Video Technology 28, 10 (2017), 2896--2907.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Michael A Laurenzano, Parker Hill, Mehrzad Samadi, Scott Mahlke, Jason Mars, and Lingjia Tang. 2016. Input responsiveness: using canary inputs to dynamically steer approximation. In Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI). 161--176.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Chi Li, Shu Wang, Henry Hoffmann, and Shan Lu. 2020. Statically inferring performance properties of software configurations. In Proceedings of the Fifteenth European Conference on Computer Systems. 1--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, and Hans Peter Graf. 2017. Pruning filters for efficient ConvNets. In Proceedings of International Conference on Learning Representations (ICLR). 1--13.Google ScholarGoogle Scholar
  43. Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 2980--2988.Google ScholarGoogle ScholarCross RefCross Ref
  44. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision (ECCV). 740--755.Google ScholarGoogle ScholarCross RefCross Ref
  45. Luyang Liu, Hongyu Li, and Marco Gruteser. 2019. Edge assisted real-time object detection for mobile augmented reality. (2019), 1--16.Google ScholarGoogle Scholar
  46. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. SSD: Single shot multibox detector. In Proceedings of the European conference on Computer Vision (ECCV), Vol. 9907. 21--37.Google ScholarGoogle ScholarCross RefCross Ref
  47. Alan Lukežič, Tom'aš Voj'iř, Luka Čehovin Zajc, Jiř'i Matas, and Matej Kristan. 2018. Discriminative Correlation Filter Tracker with Channel and Spatial Reliability. International Journal of Computer Vision 126 (2018), 671--688.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Jian-Hao Luo, Hao Zhang, Hong-Yu Zhou, Chen-Wei Xie, Jianxin Wu, and Weiyao Lin. 2018. ThiNet: pruning CNN filters for a thinner net. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 10 (2018), 2525--2538.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Ashraf Mahgoub, Alexander Michaelson Medoff, Rakesh Kumar, Subrata Mitra, Ana Klimovic, Somali Chaterji, and Saurabh Bagchi. 2020. {OPTIMUSCLOUD}: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud. In 2020 {USENIX} Annual Technical Conference ({USENIX}{ATC} 20). 189--203.Google ScholarGoogle Scholar
  50. Ashraf Mahgoub, Paul Wood, Sachandhan Ganesh, Subrata Mitra, Wolfgang Gerlach, Travis Harrison, Folker Meyer, Ananth Grama, Saurabh Bagchi, and Somali Chaterji. 2017. Rafiki: a middleware for parameter tuning of nosql datastores for dynamic metagenomics workloads. In Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference. 28--40.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Ashraf Mahgoub, Paul Wood, Alexander Medoff, Subrata Mitra, Folker Meyer, Somali Chaterji, and Saurabh Bagchi. 2019. {SOPHIA}: Online reconfiguration of clustered nosql databases for time-varying workloads. In 2019 {USENIX} Annual Technical Conference ({USENIX}{ATC} 19). 223--240.Google ScholarGoogle Scholar
  52. Amiya K Maji, Subrata Mitra, Bowen Zhou, Saurabh Bagchi, and Akshat Verma. 2014. Mitigating interference in cloud services by middleware reconfiguration. In Proceedings of the 15th International Middleware Conference. 277--288.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Jason Mars, Lingjia Tang, Robert Hundt, Kevin Skadron, and Mary Lou Soffa. 2011. Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In Proceedings of the 44th annual IEEE/ACM International Symposium on Microarchitecture. 248--259.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. John D. McCalpin. 1991--2007. STREAM: Sustainable Memory Bandwidth in High Performance Computers. Technical Report. University of Virginia, Charlottesville, Virginia. http://www.cs.virginia.edu/stream/Google ScholarGoogle Scholar
  55. John D. McCalpin. 1995. Memory Bandwidth and Machine Balance in Current High Performance Computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter (Dec. 1995), 19--25.Google ScholarGoogle Scholar
  56. Chulhong Min, Alessandro Montanari, Akhil Mathur, and Fahim Kawsar. 2019. A closer look at quality-aware runtime assessment of sensing models in multi-device environments. In Proceedings of the 17th Conference on Embedded Networked Sensor Systems. 271--284.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Subrata Mitra, Manish K Gupta, Sasa Misailovic, and Saurabh Bagchi. 2017. Phase-aware optimization in approximate computing. In 2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO). IEEE, 185--196.Google ScholarGoogle ScholarCross RefCross Ref
  58. Rajesh Krishna Panta, Saurabh Bagchi, and Samuel P Midkiff. 2011. Efficient incremental code update for sensor networks. ACM Transactions on Sensor Networks (TOSN) 7, 4 (2011), 1--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, and Ali Farhadi. 2016. XNOR-Net: ImageNet classification using binary convolutional neural networks. In Proceedings of the European Conference on Computer Vision (ECCV), Vol. 9908. 525--542.Google ScholarGoogle ScholarCross RefCross Ref
  60. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 779--788.Google ScholarGoogle ScholarCross RefCross Ref
  61. Joseph Redmon and Ali Farhadi. 2018. YOLOv3: An incremental improvement. arXiv preprint arXiv:1804.02767 (2018).Google ScholarGoogle Scholar
  62. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS). 91--99.Google ScholarGoogle Scholar
  63. Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision 115, 3 (2015), 211--252.Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. 2018. MobileNetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4510--4520.Google ScholarGoogle ScholarCross RefCross Ref
  65. Xiaoyong Shen, Aaron Hertzmann, Jiaya Jia, Sylvain Paris, Brian Price, Eli Shechtman, and Ian Sachs. 2016. Automatic portrait segmentation for image stylization. In Computer Graphics Forum, Vol. 35. Wiley Online Library, 93--102.Google ScholarGoogle Scholar
  66. Hui Shuai, Qingshan Liu, Kaihua Zhang, Jing Yang, and Jiankang Deng. 2018. Cascaded Regional Spatio-Temporal Feature-Routing Networks for Video Object Detection. IEEE Access 6 (2018), 3096--3106.Google ScholarGoogle ScholarCross RefCross Ref
  67. Mingxing Tan, Bo Chen, Ruoming Pang, Vijay Vasudevan, Mark Sandler, Andrew Howard, and Quoc V Le. 2019. MNasNet: Platform-aware neural architecture search for mobile. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2820--2828.Google ScholarGoogle ScholarCross RefCross Ref
  68. Mingxing Tan and Quoc V Le. 2019. EfficientNet: Rethinking model scaling for convolutional neural networks. In Proceedings of the International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  69. Mingxing Tan, Ruoming Pang, and Quoc V Le. 2020. EfficientDet: Scalable and efficient object detection. (2020), 10781--10790.Google ScholarGoogle Scholar
  70. Matthew Tancreti, Mohammad Sajjad Hossain, Saurabh Bagchi, and Vijay Raghunathan. 2011. Aveksha: A hardware-software approach for non-intrusive tracing and profiling of wireless embedded systems. In Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems. 288--301.Google ScholarGoogle ScholarDigital LibraryDigital Library
  71. Surat Teerapittayanon, Bradley McDanel, and HT Kung. 2016. BranchyNet: Fast inference via early exiting from deep neural networks. In Proceedings of IEEE International Conference on Pattern Recognition (ICPR). 2464--2469.Google ScholarGoogle ScholarCross RefCross Ref
  72. Andreas Veit and Serge Belongie. 2019. Convolutional networks with adaptive inference graphs. International Journal of Computer Vision 128 (2019), 730--741.Google ScholarGoogle ScholarCross RefCross Ref
  73. Robert J Wang, Xiang Li, and Charles X Ling. 2018. PELEE: A real-time object detection system on mobile devices. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS). 1963--1972.Google ScholarGoogle Scholar
  74. Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, and Kurt Keutzer. 2019. FBNet: Hardware-aware efficient convnet design via differentiable neural architecture search. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 10734--10742.Google ScholarGoogle ScholarCross RefCross Ref
  75. Zuxuan Wu, Tushar Nagarajan, Abhishek Kumar, Steven Rennie, Larry S Davis, Kristen Grauman, and Rogerio Feris. 2018. BlockDrop: Dynamic inference paths in residual networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 8817--8826.Google ScholarGoogle ScholarCross RefCross Ref
  76. Ran Xu, Jinkyu Koo, Rakesh Kumar, Peter Bai, Subrata Mitra, Sasa Misailovic, and Saurabh Bagchi. 2018. VideoChef: efficient approximation for streaming video processing pipelines. In Proceedings of the USENIX Annual Technical Conference (USENIX ATC). 43--56.Google ScholarGoogle Scholar
  77. Ran Xu, Rakesh Kumar, Pengcheng Wang, Peter Bai, Ganga Meghanath, Somali Chaterji, Subrata Mitra, and Saurabh Bagchi. 2020. ApproxNet: Content and Contention-Aware Video Analytics System for Embedded Clients. arXiv:1909.02068 [cs.CV]Google ScholarGoogle Scholar
  78. Ran Xu, Subrata Mitra, Jason Rahman, Peter Bai, Bowen Zhou, Greg Bronevetsky, and Saurabh Bagchi. 2018. Pythia: Improving datacenter utilization via precise contention prediction for multiple co-located workloads. In Proceedings of the International Middleware Conference (Middleware). 146--160.Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. Le Yang, Yizeng Han, Xi Chen, Shiji Song, Jifeng Dai, and Gao Huang. 2020. Resolution Adaptive Networks for Efficient Inference. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2369--2378.Google ScholarGoogle ScholarCross RefCross Ref
  80. Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, and Michael J Freedman. 2017. Live Video Analytics at Scale with Approximation and Delay-Tolerance.. In Proceedings of the Symposium on Networked Systems Design and Implementation (NSDI), Vol. 9. 377--392.Google ScholarGoogle Scholar
  81. Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, and Jian Sun. 2018. ShuffleNet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6848--6856.Google ScholarGoogle ScholarCross RefCross Ref
  82. Zhong-Qiu Zhao, Peng Zheng, Shou-tao Xu, and Xindong Wu. 2019. Object detection with deep learning: A review. IEEE Transactions on Neural Networks and Learning Systems 30, 11 (2019), 3212--3232.Google ScholarGoogle ScholarCross RefCross Ref
  83. Xizhou Zhu, Jifeng Dai, Lu Yuan, and Yichen Wei. 2018. Towards high performance video object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 7210--7218.Google ScholarGoogle ScholarCross RefCross Ref
  84. Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, and Yichen Wei. 2017. Flow-guided feature aggregation for video object detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 408--417.Google ScholarGoogle ScholarCross RefCross Ref
  85. Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, and Yichen Wei. 2017. Deep feature flow for video recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2349--2358.Google ScholarGoogle ScholarCross RefCross Ref
  86. Shlomo Zilberstein. 1996. Using anytime algorithms in intelligent systems. AI magazine 17, 3 (1996), 73--73.Google ScholarGoogle Scholar

Index Terms

  1. ApproxDet: content and contention-aware approximate object detection for mobiles

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SenSys '20: Proceedings of the 18th Conference on Embedded Networked Sensor Systems
            November 2020
            852 pages
            ISBN:9781450375900
            DOI:10.1145/3384419

            Copyright © 2020 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 16 November 2020

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate174of867submissions,20%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader