{"id":183139,"date":"2024-03-18T02:16:00","date_gmt":"2024-03-18T01:16:00","guid":{"rendered":"https:\/\/liora.io\/en\/?p=183139"},"modified":"2026-02-06T08:23:19","modified_gmt":"2026-02-06T07:23:19","slug":"sarsa-how-does-machine-learning-work","status":"publish","type":"post","link":"https:\/\/liora.io\/en\/sarsa-how-does-machine-learning-work","title":{"rendered":"SARSA: How does Machine Learning work?"},"content":{"rendered":"<style>\n.elementor-heading-title{padding:0;margin:0;line-height:1}.elementor-widget-heading .elementor-heading-title[class*=elementor-size-]>a{color:inherit;font-size:inherit;line-height:inherit}.elementor-widget-heading .elementor-heading-title.elementor-size-small{font-size:15px}.elementor-widget-heading .elementor-heading-title.elementor-size-medium{font-size:19px}.elementor-widget-heading .elementor-heading-title.elementor-size-large{font-size:29px}.elementor-widget-heading .elementor-heading-title.elementor-size-xl{font-size:39px}.elementor-widget-heading .elementor-heading-title.elementor-size-xxl{font-size:59px}<\/style><h2>Reinforcement learning is, along with supervised and unsupervised learning, one of the three major machine learning techniques.<\/h2>\t\t\n\t\t<p>This family of algorithms has attracted a lot of attention in recent years, thanks to products from <a href=\"https:\/\/liora.io\/en\/unveiling-the-future-a-comprehensive-guide-to-the-open-ai-api\">OpenAI<\/a> such as <strong>OpenAI Five<\/strong>, an AI that beat a team of professional players at the video game Dota 2, and the well-known <a href=\"https:\/\/liora.io\/en\/chatgpt-how-does-this-nlp-algorithm-work\">ChatGPT<\/a>, which is fine-tuned with reinforcement learning from human feedback (RLHF).<\/p>\t\t\n\t\t\t\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\"><div class=\"wp-block-button \"><a class=\"wp-block-button__link wp-element-button \" href=\"\/en\/courses\/data-ai\/data-scientist\">Learn 
all about reinforcement learning<\/a><\/div><\/div>\n\n\t\t\t<h3>What is reinforcement learning?<\/h3>\t\t\n\t\t<p>Reinforcement learning is a branch of <a href=\"https:\/\/liora.io\/en\/machine-learning-engineer-bootcamp-why-is-it-interesting\">machine learning<\/a> in which an agent (a virtual entity such as a robot or a program) interacts with an environment and must learn to choose the actions that maximize its cumulative numerical reward.<\/p><p>Related articles:<\/p><table dir=\"ltr\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\" data-sheets-root=\"1\"><colgroup><col width=\"656\"><\/colgroup><tbody><tr><td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Image Processing&quot;}\" data-sheets-hyperlink=\"https:\/\/liora.io\/en\/image-processing-fundamental-principles-and-practical-uses\"><a href=\"https:\/\/liora.io\/en\/image-processing-fundamental-principles-and-practical-uses\" target=\"_blank\" rel=\"noopener\">Image Processing<\/a><\/td><\/tr><tr><td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Deep Learning - All you need to know&quot;}\" data-sheets-hyperlink=\"https:\/\/liora.io\/en\/all-about-deep-learning\"><a href=\"https:\/\/liora.io\/en\/all-about-deep-learning\" target=\"_blank\" rel=\"noopener\">Deep Learning &#8211; All you need to know<\/a><\/td><\/tr><tr><td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Mushroom Recognition&quot;}\" data-sheets-hyperlink=\"https:\/\/liora.io\/en\/mushroom-recognition\"><a href=\"https:\/\/liora.io\/en\/mushroom-recognition\" target=\"_blank\" rel=\"noopener\">Mushroom Recognition<\/a><\/td><\/tr><tr><td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Tensor Flow - Google's ML&quot;}\" data-sheets-hyperlink=\"https:\/\/liora.io\/en\/tensor-flow-all-about-googles-machine-learning-framework\"><a href=\"https:\/\/liora.io\/en\/tensor-flow-all-about-googles-machine-learning-framework\" target=\"_blank\" rel=\"noopener\">Tensor Flow &#8211; Google&#8217;s 
ML<\/a><\/td><\/tr><tr><td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Dive into ML&quot;}\" data-sheets-hyperlink=\"https:\/\/liora.io\/en\/unlock-your-future-dive-into-machine-learning-engineer-training\"><a href=\"https:\/\/liora.io\/en\/unlock-your-future-dive-into-machine-learning-engineer-training\" target=\"_blank\" rel=\"noopener\">Dive into ML<\/a><\/td><\/tr><\/tbody><\/table>\t\t\n\t\t\t<h3>What is the SARSA algorithm?<\/h3>\t\t\n\t\t<p><strong>SARSA<\/strong> is a reinforcement learning algorithm whose name stands for State-Action-Reward-State-Action: the sequence (current state, action taken, reward received, next state, next action) that each update uses. It relies on a table of action values (the Q-table, where Q stands for the quality of an action), which assigns to each state-action pair an estimate of the expected cumulative reward.<\/p>\t\t\n\t\t\t<h3>Conclusion<\/h3>\t\t\n\t\t<p>In summary, SARSA is a reinforcement learning algorithm that teaches an agent which decisions to make in an environment by means of an iteratively updated Q-table. 
It balances exploration and exploitation while interacting with the environment, and is used in fields such as video games, robotic decision-making, and path planning.<\/p>\t\t\n\t\t\t<style>\n.elementor-widget-image{text-align:center}.elementor-widget-image a{display:inline-block}.elementor-widget-image a img[src$=\".svg\"]{width:48px}.elementor-widget-image img{vertical-align:middle;display:inline-block}<\/style>\t\t\t\t\t\t\t\t\t<figure>\n\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/liora.io\/app\/uploads\/2023\/11\/SARSA-2.jpg\" title=\"\" alt=\"\" loading=\"lazy\">\t\t\t\t\t\t\t\t\t\t\t<figcaption><\/figcaption>\n\t\t\t\t\t\t\t\t\t\t<\/figure>\n\t\t<p>If you&#8217;d like to learn more about this field, take a look at our Data Scientist training course.<\/p>\t\t\n\t\t\t\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex is-content-justification-center\"><div class=\"wp-block-button \"><a class=\"wp-block-button__link wp-element-button \" href=\"\/en\/courses\/data-ai\/data-scientist\">Discover our Data Scientist training<\/a><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Reinforcement learning is, along with supervised and unsupervised learning, one of the three major machine learning techniques. 
This family of algorithms has been creating a lot of buzz in recent years, with innovative products from the OpenAI company such as OpenAI Five, an AI that managed to beat a team of professional players on the [&hellip;]<\/p>\n","protected":false},"author":76,"featured_media":183141,"comment_status":"open","ping_status":"open","sticky":false,"template":"elementor_theme","format":"standard","meta":{"_acf_changed":false,"editor_notices":[],"footnotes":""},"categories":[2433],"class_list":["post-183139","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-ai"],"acf":[],"_links":{"self":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts\/183139","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/users\/76"}],"replies":[{"embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/comments?post=183139"}],"version-history":[{"count":1,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts\/183139\/revisions"}],"predecessor-version":[{"id":205976,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/posts\/183139\/revisions\/205976"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/media\/183141"}],"wp:attachment":[{"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/media?parent=183139"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/liora.io\/en\/wp-json\/wp\/v2\/categories?post=183139"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}