{"id":182181,"date":"2026-02-18T10:26:56","date_gmt":"2026-02-18T09:26:56","guid":{"rendered":"https:\/\/liora.io\/en\/?p=182181"},"modified":"2026-07-25T16:25:14","modified_gmt":"2026-07-25T15:25:14","slug":"you-only-look-once-yolo-what-is-it","status":"publish","type":"post","link":"https:\/\/liora.io\/en\/you-only-look-once-yolo-what-is-it","title":{"rendered":"You Only Look Once (YOLO): What is it?"},"content":{"rendered":"<strong>You Only Look Once or YOLO is an algorithm capable of detecting objects at first glance, performing detection and classification simultaneously. Find out all you need to know about this revolutionary approach to AI and Computer Vision!<\/strong>\n\n<!-- \/wp:post-content -->\n\n<!-- wp:paragraph -->\n\n<strong>Object detection<\/strong> is one of the mainstays of computer vision. The ability to detect and classify objects in images and videos is essential for many modern technologies. These <strong>include autonomous vehicles<\/strong>, surveillance systems, robotics and even generative artificial intelligence. After years of research aimed at improving the performance and efficiency of detection systems, it was in 2016 that an innovative algorithm introduced by Joseph Redmon changed everything: <strong>YOLO, You Only Look Once.<\/strong>\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:heading -->\n<h2 id=\"h-back-to-the-origins-of-object-detection-systems\" class=\"wp-block-heading\">Back to the origins of object detection systems<\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n\nObject detection is a fundamental task of <a href=\"https:\/\/liora.io\/en\/mushroom-recognition\">Computer Vision<\/a>. It is the foundation of a multitude of recent innovations, such as facial recognition, augmented reality, automated surveillance and even autonomous driving. In the past, the most widely used techniques were mainly based on traditional multi-step approaches. These methods were effective, but had major drawbacks in terms of processing speed. 
For example, Region-Based <a href=\"https:\/\/liora.io\/en\/convolutional-neural-network-everything-you-need-to-know\">Convolutional Neural Networks<\/a> (or R-CNN) began by proposing regions of interest (RoIs) from the image. This was achieved mainly by using segmentation or contour detection algorithms.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nEach RoI was then resized for input to a classifier such as a convolutional neural network (CNN). The aim was to determine whether an object was located in a region. At the time, this technique was a significant advance, but it was very slow because of the large number of candidate regions to be evaluated. To overcome these limitations, <strong>Fast R-CNN<\/strong> was born. It enabled RoIs to be classified directly from a shared <strong>CNN feature map, rather than using separate classifiers.<\/strong>\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nDespite the speed gain over <strong>R-CNN<\/strong>, the selection process remained slow and complex. This is why the Faster R-CNN approach was created, in which the regions of interest are proposed by a network. This development further automated and accelerated the generation of RoIs, but remained too slow for many uses because of the multiple sequential steps required for object detection.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nIn addition to a lack of speed, these different methods had several shortcomings, such as <a href=\"https:\/\/liora.io\/en\/u-net-computer-visions-neural-network\">computational complexity<\/a> and difficulty of scaling. This severely limited their applicability in scenarios requiring real-time detection. Their effectiveness also depended on the quality of the proposed regions of interest, which could lead to detection errors if important regions were missed. For all these reasons, it was high time for a new technique to shake things up in this field. 
And that&#8217;s precisely what has happened with YOLO.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"id\":203263,\"scale\":\"cover\",\"sizeSlug\":\"full\",\"linkDestination\":\"none\",\"align\":\"center\"} -->\n\n<!-- \/wp:image -->\n\n<!-- wp:buttons {\"className\":\"is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\",\"layout\":{\"type\":\"flex\",\"justifyContent\":\"center\"}} -->\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\"><!-- wp:button -->\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/liora.io\/en\/courses\/data-ai\/data-scientist\">Training in artificial intelligence<\/a><\/div>\n<!-- \/wp:button -->\n\n<\/div>\n<!-- \/wp:buttons -->\n\n<!-- wp:heading -->\n<h2 id=\"h-what-is-you-only-look-once\" class=\"wp-block-heading\">What is You Only Look Once?<\/h2>\n<!-- wp:image {\"id\":207145,\"sizeSlug\":\"large\"} -->\n<figure class=\"wp-block-image size-large\" style=\"margin-top: var(--wp--preset--spacing--columns);margin-bottom: var(--wp--preset--spacing--columns)\"><img decoding=\"async\" class=\"wp-image-207145\" src=\"https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-1024x572.jpg\" alt=\"object-detector-simple-grid-projections.jpg\" width=\"1024\" height=\"572\" srcset=\"https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-1024x572.jpg 1024w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-300x167.jpg 300w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-768x429.jpg 768w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-1536x857.jpg 1536w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-2048x1143.jpg 2048w, 
https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-440x246.jpg 440w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-785x438.jpg 785w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-210x117.jpg 210w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2026\/02\/object-detector-simple-grid-projections-115x64.jpg 115w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<!-- \/wp:image -->\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n\nIf <strong>You Only Look Once<\/strong> marked a decisive turning point, it was because of its innovative approach. By simultaneously detecting and classifying objects in a single pass through a convolutional neural network, it combined <strong>real-time speed and accuracy.<\/strong> Its unified, single-pass architecture made it much faster than the multi-step methods that came before it. Indeed, its structure is fundamentally different from traditional techniques. Instead of proposing regions of interest in a first step, it divides the input image into a grid of cells, each of which is responsible for predicting the coordinates of the bounding boxes for the detected objects, as well as their probabilities of belonging to different classes.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nIn simple terms, each cell predicts a set of <strong>bounding boxes<\/strong> and the corresponding confidence scores for each class, based directly on features extracted by the <strong>CNN.<\/strong> This eliminates the need to go through the image several times. This use of a convolutional neural network to extract features from the input image is at the <strong>heart of YOLO.<\/strong> The CNN consists of several convolutional and pooling layers, which capture useful patterns and features at different spatial scales. 
This enables relevant object representations to be learned automatically and convolution operations to be performed efficiently, greatly reducing computational costs.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nAnother key feature, introduced in later versions of YOLO, is the use of &#8220;anchor boxes&#8221; or &#8220;anchors&#8221;, which are predefined bounding boxes of different sizes and shapes that serve as a reference for predictions. Each <strong>grid cell is associated<\/strong> with a certain number of anchors, helping <strong>YOLO<\/strong> to generalize detections over different types of objects and scales. This improves detection accuracy.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:heading -->\n<h2 id=\"h-what-are-the-advantages\" class=\"wp-block-heading\">What are the advantages?<\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n\nClearly, <strong>You Only Look Once<\/strong> offers several significant improvements over previous object detection methods. This is what has enabled it to become one of the most widely used <a href=\"https:\/\/liora.io\/en\/boosting-business-with-3-essential-machine-learning-algorithms\">algorithms<\/a> for computer vision. Its main strength is its ability to detect objects in real time while reducing the total number of calculations required. Resource utilization is optimized, since shared features are calculated only once.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\n<strong>YOLO<\/strong> also stands out for its performance and accuracy. Its single-pass approach enables it to generalize to objects of various shapes, making it robust across a wide variety of scenarios. It also copes well with high-resolution <a href=\"https:\/\/liora.io\/en\/image-processing-fundamental-principles-and-practical-uses\">image processing<\/a>, since its efficient architecture enables it to handle larger image sizes while remaining fast. 
This is a real asset for applications such as aerial or satellite detection.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"id\":203266,\"scale\":\"cover\",\"sizeSlug\":\"full\",\"linkDestination\":\"none\",\"align\":\"center\"} -->\n\n<!-- \/wp:image -->\n\n<!-- wp:buttons {\"className\":\"is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\",\"layout\":{\"type\":\"flex\",\"justifyContent\":\"center\"}} -->\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\"><!-- wp:button -->\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/liora.io\/en\/courses\/data-ai\/machine-learning-engineer\">Become an AI engineer<\/a><\/div>\n<!-- \/wp:button -->\n\n<\/div>\n<!-- \/wp:buttons -->\n\n<!-- wp:heading -->\n<h2 id=\"h-what-s-it-for-what-are-the-applications\" class=\"wp-block-heading\">What&#8217;s it for? What are the applications?<\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n\nThanks to its flexibility and efficiency, <strong>You Only Look Once<\/strong> has found its way into a wide variety of fields and applications. In autonomous vehicles, it is used to identify and track pedestrians, other vehicles, road signs and any potential obstacles on the road in real time. This enables control systems to react quickly to changes and keep driving safe.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nFor video surveillance, <strong>YOLO<\/strong> can detect suspicious activity, intruders or abandoned objects even in crowds. It can also help locate lost people or animals in hard-to-reach places, and support security checks at airports, railway stations and other infrastructure. Its activity recognition capability can also be useful for detecting and tracking the movements of individuals in video sequences. 
This is a relevant use case for the surveillance of large spaces such as stadiums and shopping malls, or during major sporting events.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nAnother application example is traffic management. The algorithm can be used for automatic recognition of license plates and faces. In the field of medicine, <strong>You Only Look Once<\/strong> is used to detect anomalies or specific objects in medical images such as X-rays, MRIs or scans. It can help diagnose diseases early by speeding up the <a href=\"https:\/\/liora.io\/en\/pillow-how-to-process-images-with-python\">image analysis<\/a> process. This can be vital in medical emergencies.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:heading -->\n<h2 id=\"h-conclusion-yolo-one-of-the-driving-forces-behind-the-new-wave-of-ai\" class=\"wp-block-heading\">Conclusion: YOLO, one of the driving forces behind the new wave of AI<\/h2>\n<!-- \/wp:heading -->\n\n<!-- wp:paragraph -->\n\nBy bringing object detection into real time, You Only Look Once has opened up a myriad of new possibilities for computer vision. It is one of the innovations that have ushered <strong>AI into a new era.<\/strong> To learn how to handle the best AI algorithms, you can choose Liora. Our various Data Science training courses include modules dedicated to AI and its various branches. In particular, you&#8217;ll learn about Machine Learning techniques, reinforcement learning, neural networks and tools like Keras, TensorFlow and PyTorch.\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:paragraph -->\n\nThrough our courses, you can acquire all the skills required to become a Data Analyst, Data Scientist, Data Engineer, Data Product Manager or Machine Learning Engineer. 
At the end of the course, you can obtain a state-approved &#8220;Project Manager in Artificial Intelligence&#8221; certification, a diploma from Mines ParisTech PSL Executive \u00c9ducation, and a certificate from our cloud partners AWS and Microsoft Azure. Discover Liora!\n\n<!-- \/wp:paragraph -->\n\n<!-- wp:image {\"id\":203269,\"scale\":\"cover\",\"sizeSlug\":\"full\",\"linkDestination\":\"none\",\"align\":\"center\"} -->\n<figure class=\"wp-block-image aligncenter size-full\"><img decoding=\"async\" width=\"814\" height=\"500\" class=\"wp-image-203269\" style=\"object-fit: cover\" src=\"https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3.jpg\" alt=\"\" srcset=\"https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3.jpg 814w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-300x184.jpg 300w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-768x472.jpg 768w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-440x270.jpg 440w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-716x440.jpg 716w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-785x482.jpg 785w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-210x129.jpg 210w, https:\/\/liora.io\/app\/uploads\/sites\/9\/2024\/03\/yolo3-104x64.jpg 104w\" sizes=\"(max-width: 814px) 100vw, 814px\" \/><\/figure>\n<!-- \/wp:image -->\n\n<!-- wp:buttons {\"className\":\"is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\",\"layout\":{\"type\":\"flex\",\"justifyContent\":\"center\"}} -->\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\"><!-- wp:button -->\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/liora.io\/en\/courses\/data-ai\/machine-learning-engineer\">Discover our AI training<\/a><\/div>\n<!-- \/wp:button -->\n\n<\/div>\n<!-- \/wp:buttons -->\n\n<!-- wp:html -->\n<script 
type=\"application\/ld+json\"><br \/>\n{<br \/>\n  \"@context\": \"https:\/\/schema.org\",<br \/>\n  \"@type\": \"FAQPage\",<br \/>\n  \"mainEntity\": [<br \/>\n    {<br \/>\n      \"@type\": \"Question\",<br \/>\n      \"name\": \"What is You Only Look Once (YOLO)?\",<br \/>\n      \"acceptedAnswer\": {<br \/>\n        \"@type\": \"Answer\",<br \/>\n        \"text\": \"You Only Look Once (YOLO) is an object detection algorithm that identifies and localizes multiple objects in images or video by using a single neural network pass to simultaneously predict bounding boxes and class probabilities, enabling real\u2011time detection.\"<br \/>\n      }<br \/>\n    },<br \/>\n    {<br \/>\n      \"@type\": \"Question\",<br \/>\n      \"name\": \"How does YOLO differ from traditional object detection methods?\",<br \/>\n      \"acceptedAnswer\": {<br \/>\n        \"@type\": \"Answer\",<br \/>\n        \"text\": \"Unlike older detection techniques that require multiple stages of region proposal and classification, YOLO treats object detection as a single regression problem and processes the entire image with one forward pass through a convolutional neural network for faster performance.\"<br \/>\n      }<br \/>\n    },<br \/>\n    {<br \/>\n      \"@type\": \"Question\",<br \/>\n      \"name\": \"Why is YOLO useful for real\u2011time applications?\",<br \/>\n      \"acceptedAnswer\": {<br \/>\n        \"@type\": \"Answer\",<br \/>\n        \"text\": \"YOLO\u2019s unified architecture and single\u2011pass design drastically reduce the number of computations required, allowing it to detect objects in real time \u2014 often at high frame rates \u2014 making it suitable for video and live applications.\"<br \/>\n      }<br \/>\n    },<br \/>\n    {<br \/>\n      \"@type\": \"Question\",<br \/>\n      \"name\": \"What are typical applications of YOLO?\",<br \/>\n      
\"acceptedAnswer\": {<br \/>\n        \"@type\": \"Answer\",<br \/>\n        \"text\": \"YOLO is widely used in fields like autonomous driving to detect pedestrians and obstacles, video surveillance, robotics, medical image analysis, and other scenarios requiring rapid and reliable object detection.\"<br \/>\n      }<br \/>\n    },<br \/>\n    {<br \/>\n      \"@type\": \"Question\",<br \/>\n      \"name\": \"Who developed the YOLO algorithm?\",<br \/>\n      \"acceptedAnswer\": {<br \/>\n        \"@type\": \"Answer\",<br \/>\n        \"text\": \"YOLO was originally introduced by Joseph Redmon and his collaborators as a real\u2011time object detection system that brought a major shift in how detection and classification are integrated in computer vision models.\"<br \/>\n      }<br \/>\n    }<br \/>\n  ]<br \/>\n}<br \/>\n<\/script>\n\n<!-- \/wp:html -->","protected":false},"excerpt":{"rendered":"<p>You Only Look Once or YOLO is an algorithm capable of detecting objects at first glance, performing detection and classification simultaneously. Find out all you need to know about this revolutionary approach to AI and Computer Vision! Object detection is one of the mainstays of computer vision. 
The ability to detect and classify objects in [&hellip;]<\/p>\n","protected":false},"author":55,"featured_media":207147,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"editor_notices":[],"footnotes":""},"categories":[2433],"class_list":["post-182181","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-ai"],"acf":[],"_links":{"self":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts\/182181","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/users\/55"}],"replies":[{"embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/comments?post=182181"}],"version-history":[{"count":5,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts\/182181\/revisions"}],"predecessor-version":[{"id":208763,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts\/182181\/revisions\/208763"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/media\/207147"}],"wp:attachment":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/media?parent=182181"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/categories?post=182181"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}