References
1.Akerele, J. I., Uzoka, A., Ojukwu, P. U. & Olamijuwon, O. J. Data management solutions for real-time analytics in retail cloud environments. Eng. Sci. Technol. Int. J. 5 (11), 3180–3192 (2024).
2.Bay, H., Ess, A., Tuytelaars, T. & Van Gool, L. Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110 (3), 346–359 (2008).
3.Bolles, R. C. & Fischler, M. A. A RANSAC-based approach to model fitting and its application to finding cylinders in range data. IJCAI 1981, 637–643 (1981).
4.Campello, R. J., Moulavi, D. & Sander, J. Density-based clustering based on hierarchical density estimates. Pacific-Asia conference on knowledge discovery and data mining,160–172 (2013).
5.Carion, N. et al. End-to-end object detection with transformers. Proc of European Conference on Computer Vision (ECCV), 213–229 (2020).
6.Corsten, D. & Gruen, T. Desperately seeking shelf availability: an examination of the extent, the causes, and the efforts to address retail out-of‐stocks. Int. J. Retail Distrib. Manag. 31 (12), 605–617 (2003).
7.Deng, J. et al. Imagenet: A large-scale hierarchical image database. Proc IEEE CVPR 2009, 248–255 (2009).
8.Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner,T., … Houlsby, N. (2020) An image is worth 16x16 words: Transformers for image recognition at scale. arXiv:2010.11929.
9.Ghosh, R. Product identification in retail stores by combining faster r-cnn and recurrent neural network. Multimed Tools Appl. 83 (3), 7135–7158 (2024).
10.He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. Proc IEEE CVPR 2016, 770–778 (2016).
11.Hu, B., Zhou, N., Zhou, Q., Wang, X. & Liu, W. DiffNet: a learning to compare deep network for product recognition. IEEE Access. 8, 19336–19344 (2020).
12.Jocher, G., Chaurasia, A. & Qiu, J. Ultralytics YOLOv8, version 8.0. 0 (Ballenger Creek, MD, 2023). USA. https://github.com/ultralytics/ultralytics
13.Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks 25 (Advances in NIPS, 2012).
14.Laitala, J. & Ruotsalainen, L. Computer vision based planogram compliance evaluation. Appl. Sci. 13 (18), 10145 (2023).
15.Lin, T. Y., Goyal, P., Girshick, R., He, K. & Dollár, P. Focal loss for dense object detection. Proc IEEE ICCV 2017, 2980–2988 (2017).
16.Lin, T. Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., … Zitnick,C. L. (2014). Microsoft coco: Common objects in context. Proc Computer vision–ECCV 2014, 740–755.
17.Liu, S., Qi, L., Qin, H., Shi, J. & Jia, J. Path aggregation network for instance segmentation. Proc IEEE CVPR 2018, 8759–8768 (2018).
18.Lowe, D. G. Object recognition from local scale-invariant features. Proc IEEE ICCV 1999(2), 1150–1157 (1999).
A
19.Mohammed, A. M. M. A visually based approach to optimizing retail facility designs and shelve layouts. Facilities 42 (1/2), 83–104 (2024).
20.Needleman, S. B. & Wunsch, C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48 (3), 443–453 (1970).
21.NVIDIA. Develop and Deploy the Next Era of Physical AI Applications and Services. url: (2025). https://www.nvidia.com/en-us/omniverse/
22.Pearson, K. LIII. On lines and planes of closest fit to systems of points in space. Lond. Edinb. Dubl Phil Mag. 2 (11), 559–572 (1901).
23.Pietrini, R., Paolanti, M., Mancini, A., Frontoni, E. & Zingaretti, P. Shelf Management: A deep learning-based system for shelf visual monitoring. Expert Syst. Appl. 255, 124635 (2024).
24.Raghu, M., Zhang, C., Kleinberg, J. & Bengio, S. Transfusion: Understanding transfer learning for medical imaging. Adv. Neural Inf. Process. Syst., 32. (2019).
25.Ray, A., Kumar, N., Shaw, A. & Mukherjee, D. P. U-pc: Unsupervised planogram compliance. Proc of European Conference on Computer Vision (ECCV), 586–600 (2018).
26.Redmon, J. & Farhadi, A. Yolov3: An incremental improvement. arXiv:1804.02767 (2018).
27.Rezatofighi, H. et al. Generalized intersection over union: A metric and a loss for bounding box regression. Proc IEEE CVPR 2019, 658–666 (2019).
28.Saqlain, M., Rubab, S., Khan, M. M., Ali, N. & Ali, S. Hybrid Approach for Shelf Monitoring and Planogram Compliance (Hyb-SMPC) in Retails Using Deep Learning and Computer Vision. Math. Probl. Eng., 2022(1), 4916818. (2022).
29.Santra, B. & Mukherjee, D. P. A comprehensive survey on computer vision based approaches for automatic identification of products in retail store. Image Vis. Comput. 86, 45–63 (2019).
30.Saran, A., Hassan, E. & Maurya, A. K. Robust visual analysis for planogram compliance problem. IEEE IAPR MVA 2015, 576–579 (2015).
A
31.Sigurdsson, V., Saevarsson, H. & Foxall, G. Brand placement and consumer choice: an in-store experiment. Appl. Behav. Anal. 42 (3), 741–745 (2009).
32.Tan, M. & Le, Q. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning. PMLR, 6105–6114. (2019).
33.Thakoor, K. A., Marat, S., Nasiatka, P. J., McIntosh, B. P., Sahin, F. E., Tanguay,A. R., … Itti, L. (2013). Attention biased speeded up robust features (ab-surf): A neurally-inspired object recognition algorithm for a wearable aid for the visually-impaired.IEEE ICMEW 2013, 1–6.
34.Tonioni, A. & Di Stefano, L. Product recognition in store shelves as a sub-graph isomorphism problem. Proc ICIAP 2017, 682–693 (2017).
35.Varghese, R. & Sambath, M. Yolov8: A novel object detection algorithm with enhanced performance and robustness. Proc IEEE ADICS 2024, 1–6 (2024).
36.Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., … Polosukhin,I. (2017). Attention is all you need. Adv. Neural Inf. Process. Syst., 30.
37.Wang, C. Y., Bochkovskiy, A. & Liao, H. Y. M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proc IEEE CVPR 2023, 7464–7475 (2023).
38.Wang, C. Y. et al. CSPNet: A new backbone that can enhance learning capability of CNN. Proc IEEE CVPR 2023, 7464–7475, 390–391 (2020).
39.Wang, Y., Yao, Q., Kwok, J. T. & Ni, L. M. Generalizing from a few examples: A survey on few-shot learning. ACM Comput. Surveys. 53 (3), 1–34 (2020).
40.Wei, Y., Tran, S., Xu, S., Kang, B. & Springer, M. Deep learning for retail product recognition: Challenges and techniques. Comput. Intell. Neurosci., 2020(1), 8875910 (2020).
41.Yücel, M. E. & Ünsalan, C. Planogram compliance control via object detection, sequence alignment, and focused iterative search. Multimed Tools Appl. 83 (8), 24815–24839 (2024).
42.Yücel, M. E., Topaloğlu, S. & Ünsalan, C. Embedded planogram compliance control system. J. Real-Time Image Process. 21 (4), 145 (2024).
A
43.Zhong, Z., Zheng, L., Kang, G., Li, S. & Yang, Y. Random erasing data augmentation. Proc. AAAI Conf. Artif. Intell. 34 (7), 13001–13008 (2020).
44.Zhou, D. et al. Understanding the robustness in vision transformers. Int. Conf. Mach. Learn. ICML 2022, 27378–27394 (2022).
45.Zhu, X. et al. Deformable detr: Deformable transformers for end-to-end object detection. (2020). arXiv:2010.04159.
46.Li, L., Cherouat, A., Snoussi, H. & Wang, T. Grasping With Occlusion-Aware Ally Method in Complex Scenes. IEEE Trans. Autom. Sci. Eng. 22, 5944–5954 (2025).
The full. pseudocode for the virtual shelf algorithm is placed here.