AnnoVate: Revolutionizing Data Annotation with Automated Labeling Technique




This research introduces AnnoVate, an innovative web application designed to automate the labor-intensive task of object annotation for computer vision applications. Focused on image annotation, the study addresses the escalating demand for data refinement and labeling in the field of artificial intelligence (AI). Leveraging the power of YOLOv8 (You Only Look Once), a high-performance object detection algorithm, AnnoVate minimizes human intervention while achieving an impressive 85% overall accuracy in object detection. The methodology integrates active learning, allowing labelers to selectively prioritize uncertain data during the labeling process. An iterative training approach continuously refines the model, creating a self-improving loop that enhances accuracy over successive loops. The system's flexibility enables users to export labeled datasets for their preferred AI model architectures. AnnoVate not only overcomes the limitations of traditional labeling methods but also establishes a collaborative human-machine interaction paradigm, setting the stage for further advancements in computer vision.


M. Desmond, M. Muller, Z. Ashktorab, C. Dugan, E. Duesterwald, K. Brimijoin, C. Finegan-Dollak et al., "Increasing the speed and accuracy of data labeling through an ai assisted interface," in 26th International Conference on Intelligent User Interfaces, 2021, pp. 392-401.

M. Desmond, E. Duesterwald, K. Brimijoin, M. Brachman, and Q. Pan, "Semi-automated data labeling," in NeurIPS 2020 Competition and Demonstration Track, PMLR, 2021, pp. 156-169.

M. Knaeble, M. Nadj, and A. Maedche, "Oracle or Teacher? A Systematic Overview of Research on Interactive Labeling for Machine Learning," Wirtschaftsinformatik (Zentrale Tracks), 2020, pp. 2-16.

C. Schreiner, H. Zhang, C. Guerrero, K. Torkkola, and K. Zhang, "A semi-automatic data annotation tool for driving simulator data reduction," in Driving Simulation Conference, North America, 2007, p. 9.

T. Fredriksson, J. Bosch, and H. H. Olsson, "Machine Learning Models for Automatic Labeling: A Systematic Literature Review," in ICSOFT, 2020, pp. 552-561.

B. C. Benato, J. F. Gomes, A. C. Telea, and A. X. Falcao, "Semi-automatic data annotation guided by feature space projection," Pattern Recognition, vol. 109, 2021, p. 107612.

R. Pandey, H. Purohit, C. Castillo, and V. L. Shalin, "Modeling and mitigating human annotation errors to design efficient stream processing systems with human-in-the-loop machine learning," International Journal of Human-Computer Studies, vol. 160, 2022, p. 102772.

K. Woodward, E. Kanjo, A. Oikonomou, and A. Chamberlain, "LabelSens: enabling real-time sensor data labelling at the point of collection using an artificial intelligence-based approach," Personal and Ubiquitous Computing, vol. 24, no. 5, 2020, pp. 709-722.

M. Bouchakwa, Y. Ayadi, and I. Amous, "A review on visual content-based and users’ tags-based image annotation: methods and techniques," Multimedia Tools and Applications, vol. 79, no. 29, 2020, pp. 21679-21741.

A. Zhdanovskaya, D. Baidakova, and D. Ustalov, "Data labeling for machine learning engineers: project-based curriculum and data-centric competitions," in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 13, 2023, pp. 15886-15893.

M. M. Adnan, M. S. M. Rahim, K. Al-Jawaheri, M. H. Ali, S. R. Waheed, and A. H. Radie, "A survey and analysis on image annotation," in 2020 3rd International Conference on Engineering Technology and its Applications (IICETA), IEEE, 2020, pp. 203-208.

E. Bernasconi, M. Ceriani, D. D. Pierro, S. Ferilli, and D. Redavid, "Linked data interfaces: a survey," Information, vol. 14, no. 9, 2023, p. 483.

N. L. Rane, "Multidisciplinary collaboration: key players in successful implementation of ChatGPT and similar generative artificial intelligence in manufacturing, finance, retail, transportation, and construction industry," 2023.

A. Pliatsios, K. Kotis, and C. Goumopoulos, "A systematic review on semantic interoperability in the IoE-enabled smart cities," Internet of Things, vol. 22, 2023, p. 100754.

J. C. Weng, Putting Intellectual Robots to Work: Implementing Generative AI Tools in Project Management. NYU SPS Applied Analytics Laboratory, 2023.

C. G. Thirtle, V. W. Ruttan, and F. M. Scherer, "The Adoption and Diffusion of Innovations 88," in Role Of Demand And Supply In T. Routledge, 2022, pp. 77-129.

K. Bayoudh, R. Knani, F. Hamdaoui, and A. Mtibaa, "A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets," The Visual Computer, vol. 38, no. 8, 2022, pp. 2939-2970.

C. Sager, C. Janiesch, and P. Zschech, "A survey of image labelling for computer vision applications," Journal of Business Analytics, vol. 4, no. 2, 2021, pp. 91-110.

K. Bayoudh, R. Knani, F. Hamdaoui, and A. Mtibaa, "A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets," The Visual Computer, vol. 38, no. 8, 2022, pp. 2939-2970.

Y. Himeur et al., "AI-big data analytics for building automation and management systems: a survey, actual challenges and future perspectives," Artificial Intelligence Review, vol. 56, no. 6, 2023, pp. 4929-5021.




How to Cite

Qazi, F., Muhammad Naseem, Aslam, S., Zainab Attaria, Muhammad Ali Jan, & Syed Salman Junaid. (2024). AnnoVate: Revolutionizing Data Annotation with Automated Labeling Technique. VFAST Transactions on Software Engineering, 12(2), 24–30.