1 00:00:01,920 --> 00:00:04,680 great so I think we can start since the 2 00:00:04,680 --> 00:00:07,859 meeting is recorded So if everyone uh 3 00:00:07,859 --> 00:00:11,160 jump jumps in later can can watch the 4 00:00:11,160 --> 00:00:12,420 recording 5 00:00:12,420 --> 00:00:15,780 so hi everyone and welcome to these 6 00:00:15,780 --> 00:00:18,000 um Cloud skill challenge study session 7 00:00:18,000 --> 00:00:20,880 around a create classification models 8 00:00:20,880 --> 00:00:24,000 with Azure machine learning designer 9 00:00:24,000 --> 00:00:27,240 so today I'm thrilled to be here with 10 00:00:27,240 --> 00:00:29,820 John uh John do my introduce briefly 11 00:00:29,820 --> 00:00:31,619 yourself 12 00:00:31,619 --> 00:00:34,160 uh thank you Carlotta hello everyone 13 00:00:34,160 --> 00:00:38,160 Welcome to our Workshop today I hope 14 00:00:38,160 --> 00:00:40,559 that you are all excited for it I am 15 00:00:40,559 --> 00:00:43,140 John Aziz a gold Microsoft learn student 16 00:00:43,140 --> 00:00:47,460 ambassador and I will be here with uh 17 00:00:47,460 --> 00:00:50,760 Carlota to like do the Practical part 18 00:00:50,760 --> 00:00:53,820 about this module of the cloud skills 19 00:00:53,820 --> 00:00:57,000 challenge thank you for having me 20 00:00:57,000 --> 00:00:59,219 perfect thanks John so for those who 21 00:00:59,219 --> 00:01:03,440 don't know me I'm kellota 22 00:01:03,440 --> 00:01:06,479 based in Italy and focus it on AI 23 00:01:06,479 --> 00:01:08,760 machine learning Technologies and about 24 00:01:08,760 --> 00:01:11,900 the use in education 25 00:01:12,060 --> 00:01:13,200 um so 26 00:01:13,200 --> 00:01:15,000 um these Cloud skill challenge study 27 00:01:15,000 --> 00:01:17,580 session is based on a learn module a 28 00:01:17,580 --> 00:01:21,540 dedicated learn module I sent to you uh 29 00:01:21,540 --> 00:01:23,939 the link to this module uh in the chat 30 00:01:23,939 --> 00:01:25,619 in a way that you can follow along the 31 00:01:25,619 --> 00:01:28,680 model if you want or just have a look at 32 00:01:28,680 --> 00:01:33,000 the module later at your own pace 33 00:01:33,000 --> 00:01:33,780 um 34 00:01:33,780 --> 00:01:37,020 so before starting I would also like to 35 00:01:37,020 --> 00:01:40,619 remember to remember you uh the code of 36 00:01:40,619 --> 00:01:43,439 conduct and guidelines of our student 37 00:01:43,439 --> 00:01:47,640 Masters community so please during this 38 00:01:47,640 --> 00:01:51,000 meeting be respectful and inclusive and 39 00:01:51,000 --> 00:01:53,579 be friendly open and with coming and 40 00:01:53,579 --> 00:01:56,159 respectful of other each other 41 00:01:56,159 --> 00:01:57,720 differences 42 00:01:57,720 --> 00:02:01,200 if you want to learn more about the code 43 00:02:01,200 --> 00:02:03,540 of content you can use this link in the 44 00:02:03,540 --> 00:02:08,880 deck again.ms slash s-a-c-o-c 45 00:02:09,660 --> 00:02:12,420 and now we are 46 00:02:12,420 --> 00:02:15,420 um we are ready to to start our session 47 00:02:15,420 --> 00:02:18,959 so as we mentioned it we are going to 48 00:02:18,959 --> 00:02:21,780 focus on classification models and Azure 49 00:02:21,780 --> 00:02:24,900 ml uh today so first of all we are going 50 00:02:24,900 --> 00:02:28,920 to um identify uh the kind of 51 00:02:28,920 --> 00:02:31,080 um of scenarios in which you should 52 00:02:31,080 --> 00:02:33,900 choose to use a classification model 53 00:02:33,900 --> 00:02:36,660 we're going to introduce Azure machine 54 00:02:36,660 --> 00:02:39,060 learning and Azure machine designer 55 00:02:39,060 --> 00:02:41,879 we're going to understand uh which are 56 00:02:41,879 --> 00:02:43,680 the steps to follow to create a 57 00:02:43,680 --> 00:02:46,200 classification model in Azure mesh 58 00:02:46,200 --> 00:02:48,599 learning and then John will 59 00:02:48,599 --> 00:02:49,500 um 60 00:02:49,500 --> 00:02:52,379 lead an amazing demo about training and 61 00:02:52,379 --> 00:02:54,300 Publishing a classification model in 62 00:02:54,300 --> 00:02:57,000 Azure ml designer 63 00:02:57,000 --> 00:02:59,819 so let's start from the beginning let's 64 00:02:59,819 --> 00:03:02,640 start from identifying classification 65 00:03:02,640 --> 00:03:05,220 machine learning scenarios 66 00:03:05,220 --> 00:03:07,640 so first of all what is classification 67 00:03:07,640 --> 00:03:09,959 classification is a form of machine 68 00:03:09,959 --> 00:03:12,120 learning that is used to predict which 69 00:03:12,120 --> 00:03:15,599 category or class an item belongs to for 70 00:03:15,599 --> 00:03:17,340 example we might want to develop a 71 00:03:17,340 --> 00:03:19,800 classifier able to identify if an 72 00:03:19,800 --> 00:03:22,200 Incoming Email should be filtered or not 73 00:03:22,200 --> 00:03:25,080 according to the style the center the 74 00:03:25,080 --> 00:03:28,140 length of the email Etc in this case the 75 00:03:28,140 --> 00:03:30,060 characteristics of the email are the 76 00:03:30,060 --> 00:03:31,080 features 77 00:03:31,080 --> 00:03:34,200 and the label is a classification of 78 00:03:34,200 --> 00:03:38,099 either a zero or one representing a Spam 79 00:03:38,099 --> 00:03:40,860 or non-spam for the including email so 80 00:03:40,860 --> 00:03:42,360 this is an example of a binary 81 00:03:42,360 --> 00:03:44,400 classifier if you want to assign 82 00:03:44,400 --> 00:03:46,260 multiple categories to the incoming 83 00:03:46,260 --> 00:03:48,959 email like work letters love letters 84 00:03:48,959 --> 00:03:52,080 complaints or other categories in this 85 00:03:52,080 --> 00:03:54,000 case a binary classifier is not longer 86 00:03:54,000 --> 00:03:55,739 enough and we should develop a 87 00:03:55,739 --> 00:03:58,319 multi-class classifier so classification 88 00:03:58,319 --> 00:04:00,599 is an example of what is called 89 00:04:00,599 --> 00:04:02,519 supervised machine learning 90 00:04:02,519 --> 00:04:05,280 in which you train a model using data 91 00:04:05,280 --> 00:04:07,080 that includes both the features and 92 00:04:07,080 --> 00:04:08,879 known values for label 93 00:04:08,879 --> 00:04:11,099 so that the model learns to fit the 94 00:04:11,099 --> 00:04:13,560 feature combinations to the label then 95 00:04:13,560 --> 00:04:15,420 after training has been completed you 96 00:04:15,420 --> 00:04:17,040 can use the train model to predict 97 00:04:17,040 --> 00:04:19,500 labels for new items for for which the 98 00:04:19,500 --> 00:04:22,320 label is unknown 99 00:04:22,320 --> 00:04:25,440 but let's see some examples of scenarios 100 00:04:25,440 --> 00:04:27,120 for classification machine learning 101 00:04:27,120 --> 00:04:29,160 models so we already mentioned an 102 00:04:29,160 --> 00:04:31,020 example of a solution in which we would 103 00:04:31,020 --> 00:04:33,660 need a classifier but let's explore 104 00:04:33,660 --> 00:04:35,699 other scenarios for classification in 105 00:04:35,699 --> 00:04:37,979 other Industries for example you can use 106 00:04:37,979 --> 00:04:40,380 a classification model for a health 107 00:04:40,380 --> 00:04:43,680 clinic scenario and use clinical data to 108 00:04:43,680 --> 00:04:45,720 predict whether patient will become sick 109 00:04:45,720 --> 00:04:47,060 or not 110 00:04:47,060 --> 00:04:49,680 uh you can use 111 00:04:49,680 --> 00:04:51,740 um 112 00:05:03,780 --> 00:05:07,860 oh sorry so when I became muted it's a 113 00:05:07,860 --> 00:05:11,940 long time or you can use you can use uh 114 00:05:11,940 --> 00:05:13,560 some models for classification for 115 00:05:13,560 --> 00:05:16,919 example you can use you're saying this 116 00:05:16,919 --> 00:05:21,199 uh so I I was I was 117 00:05:21,660 --> 00:05:24,180 this one like you you have been muted 118 00:05:24,180 --> 00:05:27,060 for uh one second okay okay perfect 119 00:05:27,060 --> 00:05:30,419 perfect uh yeah I was talking sorry for 120 00:05:30,419 --> 00:05:34,560 that so I was talking about the possible 121 00:05:34,560 --> 00:05:37,320 you can use a classification model like 122 00:05:37,320 --> 00:05:39,660 have Clinic scenario Financial scenario 123 00:05:39,660 --> 00:05:41,699 or other third one is business type of 124 00:05:41,699 --> 00:05:44,100 scenario you can use characteristics or 125 00:05:44,100 --> 00:05:45,900 small business to predict if a new 126 00:05:45,900 --> 00:05:47,880 Venture will will succeed or not for 127 00:05:47,880 --> 00:05:49,560 example and these are all types of 128 00:05:49,560 --> 00:05:52,160 binary classification 129 00:05:52,160 --> 00:05:55,199 uh but today we are also going to talk 130 00:05:55,199 --> 00:05:57,240 about Azure machine learning so let's 131 00:05:57,240 --> 00:05:58,139 see 132 00:05:58,139 --> 00:06:00,660 um what is azure Mash learning so 133 00:06:00,660 --> 00:06:02,160 training and deploying an effective 134 00:06:02,160 --> 00:06:04,199 machine learning model involves a lot of 135 00:06:04,199 --> 00:06:06,539 work much of it time consuming and 136 00:06:06,539 --> 00:06:08,880 resource intensive so Azure machine 137 00:06:08,880 --> 00:06:11,039 learning is a cloud-based service that 138 00:06:11,039 --> 00:06:12,780 helps simplify some of the tasks it 139 00:06:12,780 --> 00:06:15,720 takes to prepare data train a model and 140 00:06:15,720 --> 00:06:18,060 also deploy it as a predictive service 141 00:06:18,060 --> 00:06:20,220 so it helps that the scientists increase 142 00:06:20,220 --> 00:06:22,380 their efficiency by automating many of 143 00:06:22,380 --> 00:06:24,660 the time consuming tasks Associated to 144 00:06:24,660 --> 00:06:27,539 creating and training a model 145 00:06:27,539 --> 00:06:29,520 and it enables them also to use 146 00:06:29,520 --> 00:06:31,740 cloud-based compute resources that scale 147 00:06:31,740 --> 00:06:33,720 effectively to handle large volumes of 148 00:06:33,720 --> 00:06:36,300 data while incurring costs only when 149 00:06:36,300 --> 00:06:38,699 actually used 150 00:06:38,699 --> 00:06:41,220 to use Azure machine learning you 151 00:06:41,220 --> 00:06:43,199 fasting fast you need to create a 152 00:06:43,199 --> 00:06:44,940 workspace resource in your Azure 153 00:06:44,940 --> 00:06:47,520 subscription and you can then use these 154 00:06:47,520 --> 00:06:50,220 workspace to manage data compute 155 00:06:50,220 --> 00:06:52,440 resources code models and other 156 00:06:52,440 --> 00:06:55,139 artifacts after you have created an 157 00:06:55,139 --> 00:06:56,819 Azure machine learning workspace you can 158 00:06:56,819 --> 00:06:58,560 develop Solutions with the Azure machine 159 00:06:58,560 --> 00:07:00,840 learning service either with developer 160 00:07:00,840 --> 00:07:02,580 tools or the Azure machine Learning 161 00:07:02,580 --> 00:07:04,380 Studio web portal 162 00:07:04,380 --> 00:07:06,360 in particular International Learning 163 00:07:06,360 --> 00:07:07,800 Studio is a web portal for machine 164 00:07:07,800 --> 00:07:09,720 learning Solutions in Azure and it 165 00:07:09,720 --> 00:07:11,639 includes a wide range of features and 166 00:07:11,639 --> 00:07:13,800 capabilities that help data scientists 167 00:07:13,800 --> 00:07:16,259 prepare data train models publish 168 00:07:16,259 --> 00:07:18,479 Predictive Services and monitor also 169 00:07:18,479 --> 00:07:19,680 their usage 170 00:07:19,680 --> 00:07:22,139 so to begin using the web portal you 171 00:07:22,139 --> 00:07:23,880 need to assign the workspace you created 172 00:07:23,880 --> 00:07:26,819 in the Azure portal to the Azure machine 173 00:07:26,819 --> 00:07:29,419 Learning Studio 174 00:07:29,520 --> 00:07:31,800 as its core Azure Mash learning is a 175 00:07:31,800 --> 00:07:33,720 service for training and managing 176 00:07:33,720 --> 00:07:36,000 machine learning models for which you 177 00:07:36,000 --> 00:07:38,220 need compute resources on which to run 178 00:07:38,220 --> 00:07:39,919 the training process 179 00:07:39,919 --> 00:07:44,280 compute targets are um one of the main 180 00:07:44,280 --> 00:07:46,740 basic concept of azure Mash learning 181 00:07:46,740 --> 00:07:48,780 they are cloud-based resources on which 182 00:07:48,780 --> 00:07:50,639 you can run model training and AD 183 00:07:50,639 --> 00:07:53,220 exploration processes 184 00:07:53,220 --> 00:07:54,780 so initial machine Learning Studio you 185 00:07:54,780 --> 00:07:56,759 can manage the compute targets for your 186 00:07:56,759 --> 00:07:58,740 data science activities and there are 187 00:07:58,740 --> 00:08:03,240 four kinds of of compute targets you can 188 00:08:03,240 --> 00:08:05,940 create we have the computer instances 189 00:08:05,940 --> 00:08:09,539 which are vital machine set up for 190 00:08:09,539 --> 00:08:10,979 running machine learning code during 191 00:08:10,979 --> 00:08:13,319 development so they are not designed for 192 00:08:13,319 --> 00:08:14,460 production 193 00:08:14,460 --> 00:08:17,099 then we have compute clusters which are 194 00:08:17,099 --> 00:08:19,800 a set of virtual machines that can scale 195 00:08:19,800 --> 00:08:22,199 up automatically based on traffic 196 00:08:22,199 --> 00:08:24,599 we have inference clusters which are 197 00:08:24,599 --> 00:08:26,699 similar to compute clusters but they are 198 00:08:26,699 --> 00:08:29,340 designed for deployment so they are a 199 00:08:29,340 --> 00:08:31,979 deployment targets for Predictive 200 00:08:31,979 --> 00:08:35,820 Services that use train models 201 00:08:35,820 --> 00:08:38,339 and finally we have attached compute 202 00:08:38,339 --> 00:08:41,339 which are any compute Target that you 203 00:08:41,339 --> 00:08:44,159 manage yourself outside of azramel like 204 00:08:44,159 --> 00:08:46,560 for example virtual machines or Azure 205 00:08:46,560 --> 00:08:49,700 databricks clusters 206 00:08:49,980 --> 00:08:52,800 so we talked about Azure machine 207 00:08:52,800 --> 00:08:54,300 learning but we also mentioned it 208 00:08:54,300 --> 00:08:55,500 mentioned Azure machine learning 209 00:08:55,500 --> 00:08:57,540 designer what is azure machine learning 210 00:08:57,540 --> 00:09:00,120 designer so initial machine Learning 211 00:09:00,120 --> 00:09:02,880 Studio there are several ways to author 212 00:09:02,880 --> 00:09:04,560 classification machine learning models 213 00:09:04,560 --> 00:09:08,100 one way is to use a visual interface and 214 00:09:08,100 --> 00:09:10,260 this visual interface is called designer 215 00:09:10,260 --> 00:09:13,140 and you can use it to train test and 216 00:09:13,140 --> 00:09:15,540 also deploy machine learning models and 217 00:09:15,540 --> 00:09:17,940 the drag and drop interface makes use of 218 00:09:17,940 --> 00:09:20,279 clearly defined inputs and outputs that 219 00:09:20,279 --> 00:09:22,680 can be shared reused and also Version 220 00:09:22,680 --> 00:09:23,880 Control 221 00:09:23,880 --> 00:09:25,920 and using the designer you can identify 222 00:09:25,920 --> 00:09:28,080 the building blocks or components needed 223 00:09:28,080 --> 00:09:30,839 for your model place and connect them on 224 00:09:30,839 --> 00:09:33,120 your canvas and run a machine learning 225 00:09:33,120 --> 00:09:35,300 job 226 00:09:35,399 --> 00:09:36,779 so 227 00:09:36,779 --> 00:09:39,120 um each designer project so each project 228 00:09:39,120 --> 00:09:42,360 in the designer is known as a pipeline 229 00:09:42,360 --> 00:09:45,600 and in the design we have a left panel 230 00:09:45,600 --> 00:09:48,360 for navigation and a canvas on your 231 00:09:48,360 --> 00:09:50,640 right hand side in which you build your 232 00:09:50,640 --> 00:09:53,940 pipeline visually so pipelines let you 233 00:09:53,940 --> 00:09:56,100 organize manage and reuse complex 234 00:09:56,100 --> 00:09:58,260 machine learning workflows across 235 00:09:58,260 --> 00:10:00,480 projects and users 236 00:10:00,480 --> 00:10:03,000 a pipeline starts with the data set from 237 00:10:03,000 --> 00:10:04,140 which you want to train the model 238 00:10:04,140 --> 00:10:05,880 because all begins with data when 239 00:10:05,880 --> 00:10:07,380 talking about data science and machine 240 00:10:07,380 --> 00:10:09,540 learning and each time you run a 241 00:10:09,540 --> 00:10:10,980 pipeline the configuration of the 242 00:10:10,980 --> 00:10:12,959 pipeline and its results are stored in 243 00:10:12,959 --> 00:10:17,339 your workspace as a pipeline job 244 00:10:17,339 --> 00:10:21,959 so the second main concept of azure 245 00:10:21,959 --> 00:10:25,080 machine learning is a component so going 246 00:10:25,080 --> 00:10:28,440 hierarchically from the pipeline we can 247 00:10:28,440 --> 00:10:30,540 say that each building block of a 248 00:10:30,540 --> 00:10:33,920 pipeline is called a component 249 00:10:33,920 --> 00:10:36,959 learning component encapsulate one step 250 00:10:36,959 --> 00:10:39,420 in a machine learning pipeline so it's a 251 00:10:39,420 --> 00:10:41,640 reusable piece of code with inputs and 252 00:10:41,640 --> 00:10:44,100 outputs something very similar to a 253 00:10:44,100 --> 00:10:46,500 function in any programming language 254 00:10:46,500 --> 00:10:48,899 and in a pipeline project you can access 255 00:10:48,899 --> 00:10:51,480 data assets and components from the left 256 00:10:51,480 --> 00:10:52,700 panels 257 00:10:52,700 --> 00:10:56,279 asset Library tab as you can see 258 00:10:56,279 --> 00:11:00,200 um here in the screenshot in the deck 259 00:11:00,300 --> 00:11:03,360 so you can create data assets on using 260 00:11:03,360 --> 00:11:08,339 an adoc page called Data Page and a data 261 00:11:08,339 --> 00:11:11,160 set is a reference to a data source 262 00:11:11,160 --> 00:11:12,480 location 263 00:11:12,480 --> 00:11:15,720 so this data source location could be a 264 00:11:15,720 --> 00:11:18,779 local file a data store a web file or 265 00:11:18,779 --> 00:11:21,660 even an age group open that set 266 00:11:21,660 --> 00:11:23,880 and these data assets will appear along 267 00:11:23,880 --> 00:11:26,459 with standard sample data set in the 268 00:11:26,459 --> 00:11:30,019 designers asset Library 269 00:11:30,839 --> 00:11:31,560 um 270 00:11:31,560 --> 00:11:36,959 and another basic concept of azure ml is 271 00:11:36,959 --> 00:11:38,880 azure machine learning jobs 272 00:11:38,880 --> 00:11:43,519 so basically when you submit a pipeline 273 00:11:43,519 --> 00:11:47,040 you create a job which will run all the 274 00:11:47,040 --> 00:11:49,920 steps in your pipeline so a job execute 275 00:11:49,920 --> 00:11:52,800 a task against a specified compute 276 00:11:52,800 --> 00:11:53,760 Target 277 00:11:53,760 --> 00:11:56,640 jobs enable systematic tracking for your 278 00:11:56,640 --> 00:11:58,560 machine learning experimentation in 279 00:11:58,560 --> 00:11:59,880 Azure ml 280 00:11:59,880 --> 00:12:02,399 and once a job is created azramel 281 00:12:02,399 --> 00:12:05,459 maintains a run record uh for for the 282 00:12:05,459 --> 00:12:07,640 job 283 00:12:08,399 --> 00:12:12,180 um but let's move to the classification 284 00:12:12,180 --> 00:12:14,040 steps so 285 00:12:14,040 --> 00:12:17,160 um let's introduce uh how to create a 286 00:12:17,160 --> 00:12:21,360 classification model in Azure ml but you 287 00:12:21,360 --> 00:12:23,640 will see it in more details in a 288 00:12:23,640 --> 00:12:26,339 handsome demo that John will will guide 289 00:12:26,339 --> 00:12:29,459 through in a few minutes 290 00:12:29,459 --> 00:12:32,220 so you can think of the steps to train 291 00:12:32,220 --> 00:12:33,720 and evaluate a classification machine 292 00:12:33,720 --> 00:12:36,660 learning model as four main steps so 293 00:12:36,660 --> 00:12:38,459 first of all you need to prepare your 294 00:12:38,459 --> 00:12:41,100 data so you need to identify the 295 00:12:41,100 --> 00:12:43,139 features and the label in your data set 296 00:12:43,139 --> 00:12:46,139 you need to pre-process so you need to 297 00:12:46,139 --> 00:12:48,839 clean and transform the data as needed 298 00:12:48,839 --> 00:12:51,120 then the second step of course is 299 00:12:51,120 --> 00:12:52,740 training the model 300 00:12:52,740 --> 00:12:54,600 and for training the model you need to 301 00:12:54,600 --> 00:12:57,060 split the data into two groups a 302 00:12:57,060 --> 00:12:59,519 training and a validation set 303 00:12:59,519 --> 00:13:01,320 then you train a machine learning model 304 00:13:01,320 --> 00:13:03,540 using the training data set and you test 305 00:13:03,540 --> 00:13:05,040 the machine learning model for 306 00:13:05,040 --> 00:13:07,019 performance using the validation data 307 00:13:07,019 --> 00:13:08,100 set 308 00:13:08,100 --> 00:13:12,180 the third step is performance evaluation 309 00:13:12,180 --> 00:13:14,519 um which means comparing how close the 310 00:13:14,519 --> 00:13:16,139 model's predictions are to the known 311 00:13:16,139 --> 00:13:20,519 labels and these lead us to compute some 312 00:13:20,519 --> 00:13:23,279 evaluation performance metrics 313 00:13:23,279 --> 00:13:25,740 and then finally 314 00:13:25,740 --> 00:13:29,880 um so these three steps are not 315 00:13:29,880 --> 00:13:33,000 um not performed uh every time in a 316 00:13:33,000 --> 00:13:35,459 linear manner it's more an iterative 317 00:13:35,459 --> 00:13:39,420 process but once you obtain you achieve 318 00:13:39,420 --> 00:13:42,959 a a performance with which you are 319 00:13:42,959 --> 00:13:45,779 satisfied so you are ready to let's say 320 00:13:45,779 --> 00:13:48,660 go into production and you can deploy 321 00:13:48,660 --> 00:13:51,920 your train model as a predictive service 322 00:13:51,920 --> 00:13:55,980 into a real-time uh to a real-time 323 00:13:55,980 --> 00:13:58,019 endpoint and to do so you need to 324 00:13:58,019 --> 00:14:00,240 convert the training pipeline into a 325 00:14:00,240 --> 00:14:02,820 real-time inference Pipeline and then 326 00:14:02,820 --> 00:14:04,260 you can deploy the model as an 327 00:14:04,260 --> 00:14:06,779 application on a server or device so 328 00:14:06,779 --> 00:14:11,420 that others can consume this model 329 00:14:11,459 --> 00:14:14,279 so let's start with the first step which 330 00:14:14,279 --> 00:14:17,700 is prepaid data reward data can contain 331 00:14:17,700 --> 00:14:19,920 many different issues that can affect 332 00:14:19,920 --> 00:14:22,320 the utility of the data and our 333 00:14:22,320 --> 00:14:24,959 interpretation of the results so also 334 00:14:24,959 --> 00:14:26,579 the machine learning model that you 335 00:14:26,579 --> 00:14:29,399 train using this data for example real 336 00:14:29,399 --> 00:14:31,440 world data can be affected by a bed 337 00:14:31,440 --> 00:14:34,079 recording or a bad measurement and it 338 00:14:34,079 --> 00:14:36,480 can also contain missing values for some 339 00:14:36,480 --> 00:14:38,880 parameters and Azure machine learning 340 00:14:38,880 --> 00:14:40,860 designer has several pre-built 341 00:14:40,860 --> 00:14:43,019 components that can be used to prepaid 342 00:14:43,019 --> 00:14:46,079 data for training these components 343 00:14:46,079 --> 00:14:48,300 enable you to clean data normalize 344 00:14:48,300 --> 00:14:52,940 features join tables and and more 345 00:14:53,000 --> 00:14:57,120 let's come to uh training so to train a 346 00:14:57,120 --> 00:14:59,220 classification model you need a data set 347 00:14:59,220 --> 00:15:02,160 that includes historical features so the 348 00:15:02,160 --> 00:15:03,899 characteristics of the entity for which 349 00:15:03,899 --> 00:15:06,899 one to make a prediction and known label 350 00:15:06,899 --> 00:15:09,779 values the label is the class indicator 351 00:15:09,779 --> 00:15:11,820 we want to train a model to predict it 352 00:15:11,820 --> 00:15:13,920 and it's common practice to train a 353 00:15:13,920 --> 00:15:16,199 model using a subset of the data while 354 00:15:16,199 --> 00:15:18,300 holding back some data with which to 355 00:15:18,300 --> 00:15:20,760 test the train model and this enables 356 00:15:20,760 --> 00:15:22,440 you to compare the labels that the model 357 00:15:22,440 --> 00:15:25,380 predicts with the actual known labels in 358 00:15:25,380 --> 00:15:27,420 the original data set 359 00:15:27,420 --> 00:15:29,880 this operation can be performed in the 360 00:15:29,880 --> 00:15:32,100 designer using the split data component 361 00:15:32,100 --> 00:15:34,740 as shown by the screenshot here in the 362 00:15:34,740 --> 00:15:36,660 in the deck 363 00:15:36,660 --> 00:15:39,540 there's also another component that you 364 00:15:39,540 --> 00:15:40,980 should use which is the score model 365 00:15:40,980 --> 00:15:43,139 component to generate the predicted 366 00:15:43,139 --> 00:15:45,360 class label value using the validation 367 00:15:45,360 --> 00:15:48,060 data as input so once you connect all 368 00:15:48,060 --> 00:15:49,800 these components 369 00:15:49,800 --> 00:15:52,440 um the component specifying the the 370 00:15:52,440 --> 00:15:54,959 model we are going to use the split data 371 00:15:54,959 --> 00:15:57,060 component the trained model component 372 00:15:57,060 --> 00:16:00,300 and the score model component you want 373 00:16:00,300 --> 00:16:02,639 to run an a new experiment in the 374 00:16:02,639 --> 00:16:05,760 initial map which will use the data set 375 00:16:05,760 --> 00:16:09,600 on the canvas to train and score a model 376 00:16:09,600 --> 00:16:12,000 after training a model it is important 377 00:16:12,000 --> 00:16:14,639 we say to evaluate its performance to 378 00:16:14,639 --> 00:16:17,060 understand how bad how how good sorry 379 00:16:17,060 --> 00:16:20,760 our model is performing 380 00:16:20,760 --> 00:16:22,680 and there are many performance metrics 381 00:16:22,680 --> 00:16:24,600 and methodologies for evaluating how 382 00:16:24,600 --> 00:16:27,000 well a model makes predictions the 383 00:16:27,000 --> 00:16:29,160 component to use to perform evaluation 384 00:16:29,160 --> 00:16:32,220 in Azure ml designer is called as 385 00:16:32,220 --> 00:16:35,060 intuitive as it is evaluate model 386 00:16:35,060 --> 00:16:38,339 once the job of training and evaluation 387 00:16:38,339 --> 00:16:40,740 of the model is completed you can review 388 00:16:40,740 --> 00:16:42,959 evaluation metrics on the completed job 389 00:16:42,959 --> 00:16:45,860 Page by right clicking on the component 390 00:16:45,860 --> 00:16:48,480 in the evaluation results you can also 391 00:16:48,480 --> 00:16:51,000 find the so-called confusion Matrix that 392 00:16:51,000 --> 00:16:53,399 you can see here in the right side of of 393 00:16:53,399 --> 00:16:55,079 this deck 394 00:16:55,079 --> 00:16:57,420 a confusion Matrix shows cases where 395 00:16:57,420 --> 00:16:59,220 both the predicted and actual values 396 00:16:59,220 --> 00:17:01,980 were one uh the so-called true positives 397 00:17:01,980 --> 00:17:04,500 at the top left and also cases where 398 00:17:04,500 --> 00:17:06,600 both the predicted and the actual values 399 00:17:06,600 --> 00:17:08,459 were zero the so-called true negatives 400 00:17:08,459 --> 00:17:10,919 at the bottom right while the other 401 00:17:10,919 --> 00:17:13,679 cells show cases where the predicting 402 00:17:13,679 --> 00:17:15,380 and actual values differ 403 00:17:15,380 --> 00:17:17,939 called false positive and false 404 00:17:17,939 --> 00:17:19,919 negatives and this is an example of a 405 00:17:19,919 --> 00:17:23,579 confusion Matrix for a binary classifier 406 00:17:23,579 --> 00:17:25,559 why for a multi-class classification 407 00:17:25,559 --> 00:17:28,079 model the same approach is used to 408 00:17:28,079 --> 00:17:30,120 tabulate each possible combination of 409 00:17:30,120 --> 00:17:32,940 actual and predictive value counts so 410 00:17:32,940 --> 00:17:34,740 for example a model with three possible 411 00:17:34,740 --> 00:17:37,559 classes would result in three times 412 00:17:37,559 --> 00:17:39,120 three Matrix 413 00:17:39,120 --> 00:17:41,880 the confusion Matrix is also useful for 414 00:17:41,880 --> 00:17:43,860 the metrics that can be derived from it 415 00:17:43,860 --> 00:17:48,260 like accuracy recall or precision 416 00:17:49,320 --> 00:17:52,080 um we we say that the last step is 417 00:17:52,080 --> 00:17:55,620 deploying the train model to a real-time 418 00:17:55,620 --> 00:17:59,280 endpoint as a predictive service and in 419 00:17:59,280 --> 00:18:00,900 order to automate your model into 420 00:18:00,900 --> 00:18:02,760 service that makes continuous 421 00:18:02,760 --> 00:18:04,980 predictions you need first of all to 422 00:18:04,980 --> 00:18:08,039 create any and and then deploy an 423 00:18:08,039 --> 00:18:10,080 inference pipeline the process of 424 00:18:10,080 --> 00:18:11,940 converting the training pipeline into a 425 00:18:11,940 --> 00:18:13,980 real-time inference pipeline removes 426 00:18:13,980 --> 00:18:16,260 training components and adds web service 427 00:18:16,260 --> 00:18:18,960 inputs and outputs to handle requests 428 00:18:18,960 --> 00:18:21,240 and the inference pipeline performs they 429 00:18:21,240 --> 00:18:22,679 seem that the transformation as the 430 00:18:22,679 --> 00:18:26,160 first pipeline but for new data then it 431 00:18:26,160 --> 00:18:28,679 uses the train model to infer or predict 432 00:18:28,679 --> 00:18:32,539 label values based on its feature 433 00:18:32,820 --> 00:18:36,120 um so I think I've talked a lot for now 434 00:18:36,120 --> 00:18:40,380 I would like to let John show us 435 00:18:40,380 --> 00:18:44,340 something in practice uh with uh with 436 00:18:44,340 --> 00:18:47,280 the Hands-On demo so please John go 437 00:18:47,280 --> 00:18:49,860 ahead sharing your screen and guide us 438 00:18:49,860 --> 00:18:52,380 through this demo of creating a 439 00:18:52,380 --> 00:18:53,760 classification with the Azure machine 440 00:18:53,760 --> 00:18:55,860 learning designer 441 00:18:55,860 --> 00:18:58,919 uh thank you so much Carlotta for this 442 00:18:58,919 --> 00:19:01,380 interesting explanation of the Azure ml 443 00:19:01,380 --> 00:19:04,740 designer and now 444 00:19:04,740 --> 00:19:07,500 um I'm going to start with you in the 445 00:19:07,500 --> 00:19:10,200 Practical demo part so uh if you want to 446 00:19:10,200 --> 00:19:13,320 follow along go to the link that Carlota 447 00:19:13,320 --> 00:19:18,380 sent in the chat so like you can do 448 00:19:18,380 --> 00:19:21,840 the demo or the Practical part with me 449 00:19:21,840 --> 00:19:25,260 I'm just going to share my screen 450 00:19:25,260 --> 00:19:27,140 and 451 00:19:27,140 --> 00:19:31,559 go here so uh 452 00:19:31,559 --> 00:19:34,320 where am I right now I'm inside the 453 00:19:34,320 --> 00:19:36,960 Microsoft learn documentation this is 454 00:19:36,960 --> 00:19:40,260 the exercise part of this module and we 455 00:19:40,260 --> 00:19:43,080 will start by setting two things which 456 00:19:43,080 --> 00:19:45,299 are a prequisite for us to work inside 457 00:19:45,299 --> 00:19:49,919 this module which are the users group 458 00:19:49,919 --> 00:19:52,400 and the Azure machine learning workspace 459 00:19:52,400 --> 00:19:55,620 and something extra which is the compute 460 00:19:55,620 --> 00:19:59,760 cluster that calendar Target about so I 461 00:19:59,760 --> 00:20:02,100 just want to make sure that you all have 462 00:20:02,100 --> 00:20:05,660 a resource Group created inside your 463 00:20:05,660 --> 00:20:08,039 auditor inside your Microsoft Azure 464 00:20:08,039 --> 00:20:11,100 platform so this is my research group 465 00:20:11,100 --> 00:20:14,640 inside this is this Resource Group I 466 00:20:14,640 --> 00:20:17,299 have created an Azure machine learning 467 00:20:17,299 --> 00:20:21,539 workspace so I'm just going to access 468 00:20:21,539 --> 00:20:24,000 the workspace that I have created 469 00:20:24,000 --> 00:20:27,000 already from this link I am going to 470 00:20:27,000 --> 00:20:30,240 open it which is the studio web URL and 471 00:20:30,240 --> 00:20:33,000 I will follow the steps so what is this 472 00:20:33,000 --> 00:20:35,760 this is your machine learning workspace 473 00:20:35,760 --> 00:20:38,220 or machine Learning Studio you can do a 474 00:20:38,220 --> 00:20:40,080 lot of things here but we are going to 475 00:20:40,080 --> 00:20:42,419 focus mainly on the designer and the 476 00:20:42,419 --> 00:20:46,080 data and the compute so another 477 00:20:46,080 --> 00:20:49,140 prequisite here as Carlotta told you and 478 00:20:49,140 --> 00:20:51,480 we need some resources to power up the 479 00:20:51,480 --> 00:20:54,299 the classification the processes that 480 00:20:54,299 --> 00:20:55,140 will happen 481 00:20:55,140 --> 00:20:58,080 so we have created this Computing 482 00:20:58,080 --> 00:20:59,100 cluster 483 00:20:59,100 --> 00:21:02,880 and we have like Set uh some presets for 484 00:21:02,880 --> 00:21:04,140 it so 485 00:21:04,140 --> 00:21:07,080 where can you find this preset you go 486 00:21:07,080 --> 00:21:10,200 here under the create compute you'll 487 00:21:10,200 --> 00:21:13,220 find everything that you need to do so 488 00:21:13,220 --> 00:21:16,740 the size is the Standard ds11 Version 2 489 00:21:16,740 --> 00:21:19,799 and it's a CPU not GPU because we don't 490 00:21:19,799 --> 00:21:22,500 know the GPU and we don't need a GPU and 491 00:21:22,500 --> 00:21:25,799 a like it is ready for us to use 492 00:21:25,799 --> 00:21:30,900 the next thing which we will look into 493 00:21:30,900 --> 00:21:33,600 is the designer how can you access the 494 00:21:33,600 --> 00:21:35,100 designer 495 00:21:35,100 --> 00:21:37,679 you can either click on this icon or 496 00:21:37,679 --> 00:21:40,020 click on the navigation menu and click 497 00:21:40,020 --> 00:21:42,299 on the designer for me 498 00:21:42,299 --> 00:21:42,900 um 499 00:21:42,900 --> 00:21:45,780 now I am inside my designer 500 00:21:45,780 --> 00:21:47,640 what we are going to do now is the 501 00:21:47,640 --> 00:21:50,280 pipeline that Carlotta told you about 502 00:21:50,280 --> 00:21:54,360 and from where can I know these steps if 503 00:21:54,360 --> 00:21:57,120 you follow along in the learn module you 504 00:21:57,120 --> 00:21:58,740 will find everything that I'm doing 505 00:21:58,740 --> 00:22:02,340 right now in details uh with screenshots 506 00:22:02,340 --> 00:22:05,820 of course so I'm going to create a new 507 00:22:05,820 --> 00:22:09,120 pipeline and I can do so by clicking on 508 00:22:09,120 --> 00:22:10,980 this plus button 509 00:22:10,980 --> 00:22:13,740 it's going to redirect me to the 510 00:22:13,740 --> 00:22:17,100 designer authoring the pipeline uh where 511 00:22:17,100 --> 00:22:19,500 I can drag and drop data and components 512 00:22:19,500 --> 00:22:21,780 that the Carlota told you the difference 513 00:22:21,780 --> 00:22:22,980 between 514 00:22:22,980 --> 00:22:26,340 and here I am going to do some changes 515 00:22:26,340 --> 00:22:29,100 to the settings I am going to connect 516 00:22:29,100 --> 00:22:31,860 this with my compute cluster that I 517 00:22:31,860 --> 00:22:35,120 created previously so I can utilize it 518 00:22:35,120 --> 00:22:38,100 from here I'm going to choose this 519 00:22:38,100 --> 00:22:40,380 compute cluster demo that I have showed 520 00:22:40,380 --> 00:22:42,600 you before in the Clusters here 521 00:22:42,600 --> 00:22:45,900 and I am going to change the name to 522 00:22:45,900 --> 00:22:47,820 something more meaningful instead of 523 00:22:47,820 --> 00:22:50,580 byline and the date of today I'm going 524 00:22:50,580 --> 00:22:53,760 to name it diabetes 525 00:22:53,760 --> 00:22:56,120 and 526 00:22:56,120 --> 00:23:00,020 let's just check this training 527 00:23:00,020 --> 00:23:05,100 let's say training 0.1 or okay 528 00:23:05,100 --> 00:23:09,360 and I am going to close this tab and in 529 00:23:09,360 --> 00:23:12,000 order to have a bigger place to work 530 00:23:12,000 --> 00:23:14,700 inside because this is where we will 531 00:23:14,700 --> 00:23:17,220 work where everything will happen so I 532 00:23:17,220 --> 00:23:19,559 will click on close from here 533 00:23:19,559 --> 00:23:23,460 and I will go to the data and I will 534 00:23:23,460 --> 00:23:25,620 create a new data set 535 00:23:25,620 --> 00:23:27,900 how can I create a new data set there is 536 00:23:27,900 --> 00:23:29,880 multiple options here you can find from 537 00:23:29,880 --> 00:23:31,799 local files from data store from web 538 00:23:31,799 --> 00:23:34,020 files from open data set but I'm going 539 00:23:34,020 --> 00:23:36,539 to choose from web files as this is the 540 00:23:36,539 --> 00:23:40,280 way we're going to create our data 541 00:23:40,280 --> 00:23:43,380 from here the information of my data set 542 00:23:43,380 --> 00:23:47,340 I'm going to get them from the Microsoft 543 00:23:47,340 --> 00:23:50,820 learn module so if we go to the step 544 00:23:50,820 --> 00:23:52,860 that says create a data set 545 00:23:52,860 --> 00:23:55,020 under it it illustrates that you can 546 00:23:55,020 --> 00:23:57,720 access the data from inside the asset 547 00:23:57,720 --> 00:23:59,760 library and inside your asset liability 548 00:23:59,760 --> 00:24:01,679 you'll find the data and find the 549 00:24:01,679 --> 00:24:05,539 component and I'm going to select 550 00:24:05,539 --> 00:24:09,000 this link because this is where my data 551 00:24:09,000 --> 00:24:12,000 is stored if you open this link you will 552 00:24:12,000 --> 00:24:14,820 find this is this is a CSV file I think 553 00:24:14,820 --> 00:24:17,400 yeah and you can like all the data are 554 00:24:17,400 --> 00:24:18,360 here 555 00:24:18,360 --> 00:24:21,419 all right now let's get back 556 00:24:21,419 --> 00:24:21,540 um 557 00:24:21,540 --> 00:24:24,770 [Music] 558 00:24:26,880 --> 00:24:28,200 and you are going to do something 559 00:24:28,200 --> 00:24:29,880 meaningful but because I have already 560 00:24:29,880 --> 00:24:31,820 created it before twice so I'm gonna 561 00:24:31,820 --> 00:24:34,980 like add a number to the name 562 00:24:34,980 --> 00:24:37,559 uh the data set is tabular and there is 563 00:24:37,559 --> 00:24:39,360 the file but this is a table so we're 564 00:24:39,360 --> 00:24:40,760 going to choose the table 565 00:24:40,760 --> 00:24:42,240 [Music] 566 00:24:42,240 --> 00:24:43,740 for data set time 567 00:24:43,740 --> 00:24:46,260 now we will click on next that's gonna 568 00:24:46,260 --> 00:24:51,179 review or uh display for you the content 569 00:24:51,179 --> 00:24:54,020 of this file that you have 570 00:24:54,020 --> 00:24:57,419 like imported to this workspace 571 00:24:57,419 --> 00:25:01,559 and for these settings these are like 572 00:25:01,559 --> 00:25:03,720 related to our filed format 573 00:25:03,720 --> 00:25:08,280 so this is a delimited file and it's not 574 00:25:08,280 --> 00:25:11,400 plain text it's not a Json the delimiter 575 00:25:11,400 --> 00:25:14,159 is comma as like we have seen that they 576 00:25:14,159 --> 00:25:16,640 those 577 00:25:26,700 --> 00:25:29,039 so I'm choosing 578 00:25:29,039 --> 00:25:32,900 errors because the only the first five 579 00:25:33,510 --> 00:25:34,880 [Music] 580 00:25:34,880 --> 00:25:38,159 for example okay uh if you have any 581 00:25:38,159 --> 00:25:39,960 doubts if you have any problems please 582 00:25:39,960 --> 00:25:42,960 don't hesitate to all right through me 583 00:25:42,960 --> 00:25:45,020 in the chat 584 00:25:45,020 --> 00:25:48,480 and like what what is blocking you and 585 00:25:48,480 --> 00:25:50,940 me and Carlota will try to help you and 586 00:25:50,940 --> 00:25:53,220 like whenever possible 587 00:25:53,220 --> 00:25:55,799 and now this is the new preview for my 588 00:25:55,799 --> 00:25:57,840 data set I can see that I have an ID I 589 00:25:57,840 --> 00:25:59,700 have patient ID I have pregnancies I 590 00:25:59,700 --> 00:26:02,220 have the age of the people 591 00:26:02,220 --> 00:26:05,720 have the body mass together I think 592 00:26:05,720 --> 00:26:08,460 and they have diabetical or not as a 593 00:26:08,460 --> 00:26:10,679 zero and one zero indicates a negative 594 00:26:10,679 --> 00:26:14,159 the person doesn't have diabetes and one 595 00:26:14,159 --> 00:26:16,080 indicates a positive that this person 596 00:26:16,080 --> 00:26:18,299 has diabetes okay 597 00:26:18,299 --> 00:26:20,520 now I'm going to click on next here I am 598 00:26:20,520 --> 00:26:23,400 defining my schema all the data types 599 00:26:23,400 --> 00:26:25,380 inside my columns the column names which 600 00:26:25,380 --> 00:26:28,760 columns to include which to exclude and 601 00:26:28,760 --> 00:26:31,500 here we will include everything except 602 00:26:31,500 --> 00:26:35,580 the path of the bath color and we are 603 00:26:35,580 --> 00:26:37,860 going to review the data types of each 604 00:26:37,860 --> 00:26:40,440 column so let's review this first one 605 00:26:40,440 --> 00:26:43,320 this is numbers numbers then it's the 606 00:26:43,320 --> 00:26:45,779 integer and this is 607 00:26:45,779 --> 00:26:48,679 um like decimal 608 00:26:48,679 --> 00:26:50,900 dotted 609 00:26:50,900 --> 00:26:53,580 decimal number so we are going to choose 610 00:26:53,580 --> 00:26:55,020 this data type 611 00:26:55,020 --> 00:26:57,200 and for this one 612 00:26:57,200 --> 00:27:01,200 it says diabetic and it's a zero under 613 00:27:01,200 --> 00:27:02,460 one and we are going to make it as 614 00:27:02,460 --> 00:27:04,460 integerables 615 00:27:04,460 --> 00:27:07,980 now we are going to click on next and 616 00:27:07,980 --> 00:27:10,080 move to reviewing everything this is 617 00:27:10,080 --> 00:27:11,279 everything that we have defined together 618 00:27:11,279 --> 00:27:13,500 I will click on create 619 00:27:13,500 --> 00:27:15,179 and 620 00:27:15,179 --> 00:27:17,940 now the first step has ended we have 621 00:27:17,940 --> 00:27:19,919 gotten our data ready 622 00:27:19,919 --> 00:27:22,440 now what now we're going to utilize the 623 00:27:22,440 --> 00:27:24,360 designer 624 00:27:24,360 --> 00:27:26,820 um Power we're going to drag and drop 625 00:27:26,820 --> 00:27:29,820 our data set to create the pipeline 626 00:27:29,820 --> 00:27:33,179 so I have like click on it and drag it 627 00:27:33,179 --> 00:27:35,640 to this space it's gonna appear to you 628 00:27:35,640 --> 00:27:39,659 and we can inspect it by right click and 629 00:27:39,659 --> 00:27:42,179 choose preview data 630 00:27:42,179 --> 00:27:46,200 to see what we have created together 631 00:27:46,200 --> 00:27:48,900 from here you can see everything that we 632 00:27:48,900 --> 00:27:50,700 have like seen previously but in more 633 00:27:50,700 --> 00:27:53,100 details and we are just going to close 634 00:27:53,100 --> 00:27:56,580 this now what now we are gonna do the 635 00:27:56,580 --> 00:28:00,799 processing that Carlota like mentioned 636 00:28:00,799 --> 00:28:03,659 these are some instructions about the 637 00:28:03,659 --> 00:28:05,460 data about how you can loot them how you 638 00:28:05,460 --> 00:28:07,140 can open them but we are going to move 639 00:28:07,140 --> 00:28:09,720 to the transformation or the processing 640 00:28:09,720 --> 00:28:13,500 so as Carlotta told you like any data 641 00:28:13,500 --> 00:28:15,480 for us to work on we have to do some 642 00:28:15,480 --> 00:28:17,299 processing to it 643 00:28:17,299 --> 00:28:20,159 to make it easy easier for the model to 644 00:28:20,159 --> 00:28:23,279 be trained and easier to work with so uh 645 00:28:23,279 --> 00:28:25,860 we're gonna do the normalization and 646 00:28:25,860 --> 00:28:29,159 normalization meaning is uh 647 00:28:29,159 --> 00:28:33,539 to scale our data either down or up but 648 00:28:33,539 --> 00:28:35,400 we're going to scale them down 649 00:28:35,400 --> 00:28:38,820 and like we are going to decrease and 650 00:28:38,820 --> 00:28:40,799 relatively decrease 651 00:28:40,799 --> 00:28:44,640 the the values all the values to work 652 00:28:44,640 --> 00:28:48,120 with lower numbers and if we are working 653 00:28:48,120 --> 00:28:49,559 with larger numbers it's going to take 654 00:28:49,559 --> 00:28:52,500 more time if we're working with smaller 655 00:28:52,500 --> 00:28:54,779 numbers it's going to take less time to 656 00:28:54,779 --> 00:28:59,159 calculate them and that's it so 657 00:28:59,159 --> 00:29:02,159 where can I find the normalized data I 658 00:29:02,159 --> 00:29:04,260 can find it inside my component 659 00:29:04,260 --> 00:29:06,720 so I will choose the component and 660 00:29:06,720 --> 00:29:09,659 search for normalized data 661 00:29:09,659 --> 00:29:12,360 I will drag and drop it as usual and I 662 00:29:12,360 --> 00:29:14,820 will connect between these two things 663 00:29:14,820 --> 00:29:18,360 by clicking on this spot this like 664 00:29:18,360 --> 00:29:20,159 Circle and 665 00:29:20,159 --> 00:29:23,159 drag and drop until the next circuit 666 00:29:23,159 --> 00:29:24,899 now we are going to Define our 667 00:29:24,899 --> 00:29:27,419 normalization method 668 00:29:27,419 --> 00:29:31,080 so I'm going to double click on the 669 00:29:31,080 --> 00:29:32,640 normalized data 670 00:29:32,640 --> 00:29:34,860 it's going to open the settings for the 671 00:29:34,860 --> 00:29:36,480 normalization 672 00:29:36,480 --> 00:29:38,820 as better transformation method which is 673 00:29:38,820 --> 00:29:40,500 a mathematical way 674 00:29:40,500 --> 00:29:42,299 that is going to scale our data 675 00:29:42,299 --> 00:29:44,520 according to 676 00:29:44,520 --> 00:29:47,760 we're going to choose min max and for 677 00:29:47,760 --> 00:29:51,539 this one we are going to choose use 0 678 00:29:51,539 --> 00:29:53,100 for constant column we are going to 679 00:29:53,100 --> 00:29:54,480 choose true 680 00:29:54,480 --> 00:29:56,880 and we are going to Define which columns 681 00:29:56,880 --> 00:29:58,860 to normalize so we are not going to 682 00:29:58,860 --> 00:30:01,080 normalize the whole data set we are 683 00:30:01,080 --> 00:30:02,760 going to choose a subset from the data 684 00:30:02,760 --> 00:30:04,559 set to normalize so we're going to 685 00:30:04,559 --> 00:30:07,020 choose everything except for the patient 686 00:30:07,020 --> 00:30:09,000 ID and the diabetic because the patient 687 00:30:09,000 --> 00:30:10,919 ID is a number but it's a categorical 688 00:30:10,919 --> 00:30:13,740 data it describes a vision it's not a 689 00:30:13,740 --> 00:30:17,460 number that I can sum I can say patient 690 00:30:17,460 --> 00:30:20,159 ID number one plus patient ID number two 691 00:30:20,159 --> 00:30:21,720 no this is a patient and another 692 00:30:21,720 --> 00:30:23,399 location it's not a number that I can do 693 00:30:23,399 --> 00:30:25,740 mathematical operations on so I'm not 694 00:30:25,740 --> 00:30:28,200 going to choose it so we will choose 695 00:30:28,200 --> 00:30:30,539 everything as I said except for the 696 00:30:30,539 --> 00:30:33,480 diabetic and the patient might I will 697 00:30:33,480 --> 00:30:34,860 click on Save 698 00:30:34,860 --> 00:30:37,740 and it's not showing me a warning again 699 00:30:37,740 --> 00:30:39,480 everything is good 700 00:30:39,480 --> 00:30:41,880 now I can click on submit 701 00:30:41,880 --> 00:30:46,799 and review my normalization output 702 00:30:46,799 --> 00:30:48,240 um 703 00:30:48,240 --> 00:30:51,659 so uh if you click on submit here 704 00:30:51,659 --> 00:30:54,659 and you will like choose create new and 705 00:30:54,659 --> 00:30:56,460 set the name that is mentioned here 706 00:30:56,460 --> 00:30:59,899 inside the notebook so it it tells you 707 00:30:59,899 --> 00:31:03,419 to like create a job and name it name 708 00:31:03,419 --> 00:31:05,460 the experiment Ms learn diabetes 709 00:31:05,460 --> 00:31:06,720 training because you will continue 710 00:31:06,720 --> 00:31:10,440 working on and building component later 711 00:31:10,440 --> 00:31:13,020 I have it already like created I am the 712 00:31:13,020 --> 00:31:16,919 like we can review it together so uh let 713 00:31:16,919 --> 00:31:19,860 me just open this in another tab I think 714 00:31:19,860 --> 00:31:21,000 I have it 715 00:31:21,000 --> 00:31:23,659 here 716 00:31:25,679 --> 00:31:28,220 okay 717 00:31:30,720 --> 00:31:34,740 so these are all the jobs that I have 718 00:31:34,740 --> 00:31:37,340 read them 719 00:31:37,860 --> 00:31:39,899 all the jobs there let's do this over 720 00:31:39,899 --> 00:31:42,059 these are all the jobs that I have 721 00:31:42,059 --> 00:31:43,679 submitted previously 722 00:31:43,679 --> 00:31:45,840 and I think this one is the 723 00:31:45,840 --> 00:31:48,360 normalization job so let's see the 724 00:31:48,360 --> 00:31:50,100 output of it 725 00:31:50,100 --> 00:31:54,120 as you can see it says uh check mark yes 726 00:31:54,120 --> 00:31:56,640 which means that it worked and we can 727 00:31:56,640 --> 00:31:59,399 preview it how can I do that right click 728 00:31:59,399 --> 00:32:02,539 on it choose preview data 729 00:32:02,539 --> 00:32:06,659 and as you can see all the data are 730 00:32:06,659 --> 00:32:08,399 scaled down 731 00:32:08,399 --> 00:32:10,980 so everything is between zero 732 00:32:10,980 --> 00:32:15,860 and uh one I think 733 00:32:15,860 --> 00:32:18,899 so like everything is good for us now we 734 00:32:18,899 --> 00:32:21,840 can move forward to the next step 735 00:32:21,840 --> 00:32:27,799 which is to create the whole pipeline so 736 00:32:27,799 --> 00:32:30,840 uh Carlota told you that 737 00:32:30,840 --> 00:32:33,179 we're going to use a classification 738 00:32:33,179 --> 00:32:37,260 model to create this data set so uh let 739 00:32:37,260 --> 00:32:40,620 me just drag and drop everything 740 00:32:40,620 --> 00:32:45,380 to get runtime and we're doing 741 00:32:45,799 --> 00:32:50,419 about about everything by 742 00:32:51,419 --> 00:32:52,919 so 743 00:32:52,919 --> 00:32:57,380 as a result we are going to explain 744 00:32:59,760 --> 00:33:03,600 yeah so I'm going to give this split 745 00:33:03,600 --> 00:33:06,240 data I'm going to take the 746 00:33:06,240 --> 00:33:08,880 transformation data to split data and 747 00:33:08,880 --> 00:33:10,380 connect it like that 748 00:33:10,380 --> 00:33:12,299 I'm going to get a three model 749 00:33:12,299 --> 00:33:15,240 components because I want to train my 750 00:33:15,240 --> 00:33:16,679 model 751 00:33:16,679 --> 00:33:19,740 and I'm going to put it right here 752 00:33:19,740 --> 00:33:21,740 okay 753 00:33:21,740 --> 00:33:24,419 like let's just move it down there okay 754 00:33:24,419 --> 00:33:27,059 and we are going to use a classification 755 00:33:27,059 --> 00:33:28,620 model 756 00:33:28,620 --> 00:33:31,880 a two class 757 00:33:32,240 --> 00:33:35,399 logistic regression model 758 00:33:35,399 --> 00:33:38,159 so I'm going to give this algorithm to 759 00:33:38,159 --> 00:33:41,480 enable my model to work 760 00:33:41,820 --> 00:33:45,960 this is the untrained model this is 761 00:33:45,960 --> 00:33:48,059 here 762 00:33:48,059 --> 00:33:51,120 the left the left 763 00:33:51,120 --> 00:33:52,860 the left like Circle I'm going to 764 00:33:52,860 --> 00:33:54,899 connect it to the data set and the right 765 00:33:54,899 --> 00:33:56,940 one we are going to connect it to 766 00:33:56,940 --> 00:33:59,700 evaluate model 767 00:33:59,700 --> 00:34:02,640 evaluate model so let's search for 768 00:34:02,640 --> 00:34:05,220 evaluate model here 769 00:34:05,220 --> 00:34:07,440 so because we want to do what we want to 770 00:34:07,440 --> 00:34:10,800 evaluate our model and see how it it has 771 00:34:10,800 --> 00:34:14,580 been doing it is it good is it bad 772 00:34:14,580 --> 00:34:18,200 um sorry like 773 00:34:19,980 --> 00:34:22,820 this is 774 00:34:23,460 --> 00:34:25,560 down there 775 00:34:25,560 --> 00:34:28,139 after the school 776 00:34:28,139 --> 00:34:31,320 so we have to get the score model first 777 00:34:31,320 --> 00:34:33,960 so let's get it 778 00:34:33,960 --> 00:34:36,119 and this will take the trained model and 779 00:34:36,119 --> 00:34:37,260 the data set 780 00:34:37,260 --> 00:34:39,419 to score our model and see if it's 781 00:34:39,419 --> 00:34:42,179 performing good or bad 782 00:34:42,179 --> 00:34:44,899 and 783 00:34:45,240 --> 00:34:47,159 um 784 00:34:47,159 --> 00:34:49,080 after that like we have finished 785 00:34:49,080 --> 00:34:51,060 everything now we are going to do the 786 00:34:51,060 --> 00:34:52,139 what 787 00:34:52,139 --> 00:34:54,359 the presets for everything 788 00:34:54,359 --> 00:34:56,820 as a starter we will be splitting our 789 00:34:56,820 --> 00:34:58,920 data so 790 00:34:58,920 --> 00:35:01,140 how are we going to do this according to 791 00:35:01,140 --> 00:35:03,780 what to the split rules so I'm going to 792 00:35:03,780 --> 00:35:05,940 double click on and choose split rows 793 00:35:05,940 --> 00:35:09,420 and the percentage is 794 00:35:09,420 --> 00:35:12,780 70 percent for the and 30 percent of the 795 00:35:12,780 --> 00:35:14,820 data for 796 00:35:14,820 --> 00:35:18,420 the valuation or for the scoring okay 797 00:35:18,420 --> 00:35:20,880 I'm going to make it a randomization so 798 00:35:20,880 --> 00:35:22,980 I'm going to split data randomly and the 799 00:35:22,980 --> 00:35:26,060 seat is uh 800 00:35:26,060 --> 00:35:29,339 132 23 I think yeah 801 00:35:29,339 --> 00:35:32,520 and I think that's it 802 00:35:32,520 --> 00:35:35,040 the split says why this holes and that's 803 00:35:35,040 --> 00:35:36,240 good 804 00:35:36,240 --> 00:35:39,540 now for the next one which is the train 805 00:35:39,540 --> 00:35:42,000 model we are going to connect it as 806 00:35:42,000 --> 00:35:43,500 mentioned here 807 00:35:43,500 --> 00:35:48,660 and like we have done that and then why 808 00:35:48,660 --> 00:35:50,700 am I having here like let's double click 809 00:35:50,700 --> 00:35:54,660 on it yeah it has like it needs the 810 00:35:54,660 --> 00:35:57,180 label column that I am trying to predict 811 00:35:57,180 --> 00:35:58,680 so from here I'm going to choose 812 00:35:58,680 --> 00:36:01,380 diabetic I'm going to save 813 00:36:01,380 --> 00:36:05,180 I'm going to close this one 814 00:36:05,520 --> 00:36:07,380 so it says here 815 00:36:07,380 --> 00:36:10,619 the diabetic label the model it will 816 00:36:10,619 --> 00:36:12,300 predict the zero and one because this is 817 00:36:12,300 --> 00:36:14,700 a binary classification algorithm so 818 00:36:14,700 --> 00:36:16,260 it's going to predict either this or 819 00:36:16,260 --> 00:36:17,520 that 820 00:36:17,520 --> 00:36:18,900 and 821 00:36:18,900 --> 00:36:20,160 um 822 00:36:20,160 --> 00:36:23,880 I think that's everything to run the the 823 00:36:23,880 --> 00:36:25,859 pipeline 824 00:36:25,859 --> 00:36:29,040 so everything is done everything is good 825 00:36:29,040 --> 00:36:31,200 for this one we're just gonna leave it 826 00:36:31,200 --> 00:36:34,140 like for now because this is the next 827 00:36:34,140 --> 00:36:36,480 step 828 00:36:36,480 --> 00:36:39,839 um this will like be put instead of the 829 00:36:39,839 --> 00:36:43,520 score model but then it's 830 00:36:44,099 --> 00:36:46,920 delete it for now 831 00:36:46,920 --> 00:36:49,500 okay 832 00:36:49,500 --> 00:36:52,920 now we have to submit the job in order 833 00:36:52,920 --> 00:36:55,680 to see the output of it so I can click 834 00:36:55,680 --> 00:36:59,280 on submit and choose the previous job 835 00:36:59,280 --> 00:37:01,200 which is the one that I have showed you 836 00:37:01,200 --> 00:37:02,460 before 837 00:37:02,460 --> 00:37:05,460 and then let's review its output 838 00:37:05,460 --> 00:37:06,960 together here 839 00:37:06,960 --> 00:37:09,960 so if I go to the jobs 840 00:37:09,960 --> 00:37:15,119 if I go to Ms learn maybe it is training 841 00:37:15,119 --> 00:37:18,180 I think it's the one that lasted the 842 00:37:18,180 --> 00:37:20,640 longest this one here 843 00:37:20,640 --> 00:37:23,700 so here I can see 844 00:37:23,700 --> 00:37:27,079 the job output what happened inside 845 00:37:27,079 --> 00:37:30,420 the model as you can see 846 00:37:30,420 --> 00:37:33,839 so the normalization we have like seen 847 00:37:33,839 --> 00:37:36,540 before the split data I can preview it 848 00:37:36,540 --> 00:37:39,359 the result one or the result two as it 849 00:37:39,359 --> 00:37:41,760 splits the data to 70 here and three 850 00:37:41,760 --> 00:37:44,339 thirty percent here 851 00:37:44,339 --> 00:37:46,859 um I can see the score model which is 852 00:37:46,859 --> 00:37:49,140 like something that we need 853 00:37:49,140 --> 00:37:51,920 to review 854 00:37:52,380 --> 00:37:56,820 um inside the scroll model uh like from 855 00:37:56,820 --> 00:37:57,960 here 856 00:37:57,960 --> 00:38:00,960 we can see that 857 00:38:00,960 --> 00:38:04,460 let's get back here 858 00:38:05,940 --> 00:38:08,220 like this is the data that the model has 859 00:38:08,220 --> 00:38:11,579 been scored and this is a scoring output 860 00:38:11,579 --> 00:38:15,300 so it says code label true and if he is 861 00:38:15,300 --> 00:38:18,300 not diabetic so this is 862 00:38:18,300 --> 00:38:19,200 um 863 00:38:19,200 --> 00:38:21,839 around prediction let's say 864 00:38:21,839 --> 00:38:23,880 for this one it's true and true and this 865 00:38:23,880 --> 00:38:26,880 is like a good like what do you say 866 00:38:26,880 --> 00:38:29,460 prediction and the probabilities of this 867 00:38:29,460 --> 00:38:30,420 score 868 00:38:30,420 --> 00:38:33,119 which means the certainty of our model 869 00:38:33,119 --> 00:38:36,960 of that this is really true it's 80 for 870 00:38:36,960 --> 00:38:38,780 this one is 75 871 00:38:38,780 --> 00:38:42,599 so these are some cool metrics that we 872 00:38:42,599 --> 00:38:45,359 can review to understand how our model 873 00:38:45,359 --> 00:38:47,700 is performing it's performing good for 874 00:38:47,700 --> 00:38:48,540 now 875 00:38:48,540 --> 00:38:53,180 let's check our evaluation model 876 00:38:53,180 --> 00:38:56,700 so this is the extra one that I told you 877 00:38:56,700 --> 00:38:59,579 about instead of the like 878 00:38:59,579 --> 00:39:01,800 score model only we are going to add 879 00:39:01,800 --> 00:39:04,260 what evaluate model 880 00:39:04,260 --> 00:39:06,900 after it so here 881 00:39:06,900 --> 00:39:09,420 we're going to go to our asset library 882 00:39:09,420 --> 00:39:12,180 and we are going to choose the evaluate 883 00:39:12,180 --> 00:39:14,940 model 884 00:39:14,940 --> 00:39:17,760 and we are going to put it here and we 885 00:39:17,760 --> 00:39:20,220 are going to connect it and we are going 886 00:39:20,220 --> 00:39:23,099 to submit the job using the same name of 887 00:39:23,099 --> 00:39:25,140 the job that we used previously 888 00:39:25,140 --> 00:39:29,520 let's review it uh also so after it 889 00:39:29,520 --> 00:39:33,300 finishes you will find it here so I have 890 00:39:33,300 --> 00:39:35,280 already done it before this is how I'm 891 00:39:35,280 --> 00:39:37,380 able to see the output 892 00:39:37,380 --> 00:39:40,320 so let's see 893 00:39:40,320 --> 00:39:43,280 what what is the output of this 894 00:39:43,280 --> 00:39:45,660 evaluation process 895 00:39:45,660 --> 00:39:49,800 here it mentioned to you that there are 896 00:39:49,800 --> 00:39:51,480 some metrics 897 00:39:51,480 --> 00:39:54,839 like the confusion Matrix which Carlotta 898 00:39:54,839 --> 00:39:57,060 told you about there is the accuracy the 899 00:39:57,060 --> 00:39:59,760 Precision the recall and F1 School 900 00:39:59,760 --> 00:40:02,339 every Matrix gives us some insight about 901 00:40:02,339 --> 00:40:04,920 our model it helps us to understand it 902 00:40:04,920 --> 00:40:08,579 more more and 903 00:40:08,579 --> 00:40:10,560 like understand if it's overfitting if 904 00:40:10,560 --> 00:40:12,240 it's good if it's bad and really really 905 00:40:12,240 --> 00:40:16,339 like understand how it's working 906 00:40:17,060 --> 00:40:20,400 now I'm just waiting for the job to load 907 00:40:20,400 --> 00:40:22,920 until it loads 908 00:40:22,920 --> 00:40:23,640 um 909 00:40:23,640 --> 00:40:26,040 we can continue to 910 00:40:26,040 --> 00:40:28,740 to work on our 911 00:40:28,740 --> 00:40:31,800 model so I will go to my designer I'm 912 00:40:31,800 --> 00:40:34,740 just going to confirm this 913 00:40:34,740 --> 00:40:38,280 and I'm going to continue working on it 914 00:40:38,280 --> 00:40:39,780 from 915 00:40:39,780 --> 00:40:42,119 where we have stopped where have we 916 00:40:42,119 --> 00:40:43,560 stopped 917 00:40:43,560 --> 00:40:46,440 we have stopped on the evaluate model so 918 00:40:46,440 --> 00:40:48,960 I'm going to choose this one 919 00:40:48,960 --> 00:40:53,420 and it says here 920 00:40:54,180 --> 00:40:56,940 select experiment create inference 921 00:40:56,940 --> 00:40:58,200 pipeline so 922 00:40:58,200 --> 00:41:01,079 I am going to go to the jobs 923 00:41:01,079 --> 00:41:04,680 I'm going to select my experiment 924 00:41:04,680 --> 00:41:06,660 I hope this works 925 00:41:06,660 --> 00:41:09,720 okay salute finally now we have our 926 00:41:09,720 --> 00:41:12,180 evaluate model output 927 00:41:12,180 --> 00:41:15,480 let's previews evaluation results 928 00:41:15,480 --> 00:41:18,660 and uh 929 00:41:18,660 --> 00:41:22,220 cool come on 930 00:41:25,500 --> 00:41:28,020 finally now we can create our inference 931 00:41:28,020 --> 00:41:31,020 pipeline so 932 00:41:31,020 --> 00:41:34,200 I think it says that 933 00:41:34,200 --> 00:41:35,280 um 934 00:41:35,280 --> 00:41:38,160 select the experiment then select Ms 935 00:41:38,160 --> 00:41:39,359 learn so 936 00:41:39,359 --> 00:41:43,320 I am just going to select it 937 00:41:43,320 --> 00:41:48,300 and finally now we can the ROC curve we 938 00:41:48,300 --> 00:41:51,000 can see it that the true positive rate 939 00:41:51,000 --> 00:41:53,760 and the force was integrate the false 940 00:41:53,760 --> 00:41:56,660 positive rate is increasing with time 941 00:41:56,660 --> 00:42:01,020 and also the true positive rate true 942 00:42:01,020 --> 00:42:03,540 positive is something that it predicted 943 00:42:03,540 --> 00:42:06,960 that it is uh positive it has diabetes 944 00:42:06,960 --> 00:42:09,480 and it's really a it's really true it 945 00:42:09,480 --> 00:42:12,599 the person really has diabetes okay and 946 00:42:12,599 --> 00:42:14,760 for the false positive it predicted that 947 00:42:14,760 --> 00:42:17,579 someone has diabetes and someone doesn't 948 00:42:17,579 --> 00:42:20,960 has it this is what true position and 949 00:42:20,960 --> 00:42:24,960 false positive means this is The Recoil 950 00:42:24,960 --> 00:42:28,020 curve so we can like review the metrics 951 00:42:28,020 --> 00:42:32,160 of our model this is the lift curve I 952 00:42:32,160 --> 00:42:36,000 can change the threshold of my confusion 953 00:42:36,000 --> 00:42:37,740 Matrix here 954 00:42:37,740 --> 00:42:39,119 and this could look don't want to add 955 00:42:39,119 --> 00:42:43,920 anything about the the the graphs and 956 00:42:43,920 --> 00:42:47,000 you can do so 957 00:42:50,460 --> 00:42:51,000 um 958 00:42:51,000 --> 00:42:54,720 yeah so just wanted to if you go yeah I 959 00:42:54,720 --> 00:42:57,119 just wanted to comment comment for the 960 00:42:57,119 --> 00:43:00,480 RSC curve uh that actually from this 961 00:43:00,480 --> 00:43:03,900 graph the metric which uh usually we're 962 00:43:03,900 --> 00:43:06,960 going to compute is the end area under 963 00:43:06,960 --> 00:43:09,900 under the curve and this coefficient or 964 00:43:09,900 --> 00:43:12,240 metric 965 00:43:12,240 --> 00:43:15,060 um it's a confusion 966 00:43:15,060 --> 00:43:18,420 um is a value that could span from from 967 00:43:18,420 --> 00:43:22,920 zero to one and the the highest is 968 00:43:22,920 --> 00:43:23,480 um 969 00:43:23,480 --> 00:43:26,700 this the highest is the the score so the 970 00:43:26,700 --> 00:43:29,220 the closest one 971 00:43:29,220 --> 00:43:32,760 um so the the highest is the amount of 972 00:43:32,760 --> 00:43:35,280 area under this curve 973 00:43:35,280 --> 00:43:40,500 um the the the highest performance uh we 974 00:43:40,500 --> 00:43:43,319 we've got from from our model and 975 00:43:43,319 --> 00:43:46,440 another thing is what John is 976 00:43:46,440 --> 00:43:49,680 um playing with so this threshold for 977 00:43:49,680 --> 00:43:51,380 the logistic 978 00:43:51,380 --> 00:43:55,920 regression is the threshold used by the 979 00:43:55,920 --> 00:43:57,180 model 980 00:43:57,180 --> 00:43:58,740 um to 981 00:43:58,740 --> 00:43:59,520 um 982 00:43:59,520 --> 00:44:02,940 to predict uh if the category is zero or 983 00:44:02,940 --> 00:44:05,220 one so if the probability the 984 00:44:05,220 --> 00:44:08,599 probability score is above the threshold 985 00:44:08,599 --> 00:44:11,579 then the category will be predicted as 986 00:44:11,579 --> 00:44:15,359 one while if the the probability is 987 00:44:15,359 --> 00:44:17,460 below the threshold in this case for 988 00:44:17,460 --> 00:44:21,300 example 0.5 the category is predicted as 989 00:44:21,300 --> 00:44:23,579 as zero so that's why it's very 990 00:44:23,579 --> 00:44:26,099 important to um to choose the the 991 00:44:26,099 --> 00:44:27,839 threshold because the performance really 992 00:44:27,839 --> 00:44:29,520 can vary 993 00:44:29,520 --> 00:44:30,560 um 994 00:44:30,560 --> 00:44:34,380 with this threshold value 995 00:44:34,380 --> 00:44:41,099 uh thank you uh so much uh kellota and 996 00:44:41,400 --> 00:44:44,400 as I mentioned now we are going to like 997 00:44:44,400 --> 00:44:46,560 create our inference pipeline so we are 998 00:44:46,560 --> 00:44:48,540 going to select the latest one which I 999 00:44:48,540 --> 00:44:50,819 already have it opened here this is the 1000 00:44:50,819 --> 00:44:52,859 one that we were reviewing together this 1001 00:44:52,859 --> 00:44:55,500 is where we have stopped and we're going 1002 00:44:55,500 --> 00:44:57,599 to create an inference pipeline we are 1003 00:44:57,599 --> 00:44:59,520 going to choose a real-time inference 1004 00:44:59,520 --> 00:45:02,520 pipeline okay 1005 00:45:02,520 --> 00:45:05,160 um from where I can find this here as it 1006 00:45:05,160 --> 00:45:08,099 says real-time inference pipeline 1007 00:45:08,099 --> 00:45:10,680 so it's gonna add some things to my 1008 00:45:10,680 --> 00:45:12,420 workspace it's going to add the web 1009 00:45:12,420 --> 00:45:13,980 service inboard it's going to have the 1010 00:45:13,980 --> 00:45:15,780 web service output because we will be 1011 00:45:15,780 --> 00:45:18,180 creating it as a web service to access 1012 00:45:18,180 --> 00:45:19,740 it from the internet 1013 00:45:19,740 --> 00:45:21,900 uh what are we going to do we're going 1014 00:45:21,900 --> 00:45:24,720 to remove this diabetes data okay 1015 00:45:24,720 --> 00:45:27,540 and we are going to get a component 1016 00:45:27,540 --> 00:45:29,359 called Web 1017 00:45:29,359 --> 00:45:33,180 input and what's up let me check 1018 00:45:33,180 --> 00:45:35,940 it's enter data manually 1019 00:45:35,940 --> 00:45:38,400 we have we already have the with input 1020 00:45:38,400 --> 00:45:39,540 present 1021 00:45:39,540 --> 00:45:42,119 so we are going to get the entire data 1022 00:45:42,119 --> 00:45:43,200 manually 1023 00:45:43,200 --> 00:45:45,420 and we're going to collect it to connect 1024 00:45:45,420 --> 00:45:49,560 it as it was connected before like that 1025 00:45:49,560 --> 00:45:53,040 and also I am not going to directly take 1026 00:45:53,040 --> 00:45:55,260 the web service sorry escort model to 1027 00:45:55,260 --> 00:45:57,839 the web service output like that 1028 00:45:57,839 --> 00:46:00,240 I'm going to delete this 1029 00:46:00,240 --> 00:46:03,960 and I'm going to execute a python script 1030 00:46:03,960 --> 00:46:05,880 before 1031 00:46:05,880 --> 00:46:09,500 I display my result 1032 00:46:10,680 --> 00:46:12,060 so 1033 00:46:12,060 --> 00:46:17,480 this will be connected like okay but 1034 00:46:19,260 --> 00:46:20,400 so 1035 00:46:20,400 --> 00:46:23,599 the other way around 1036 00:46:23,599 --> 00:46:27,660 and from here I am going to connect this 1037 00:46:27,660 --> 00:46:30,960 with that and there is some data uh that 1038 00:46:30,960 --> 00:46:33,480 we will be getting from the node or from 1039 00:46:33,480 --> 00:46:37,680 the the examination here and this is the 1040 00:46:37,680 --> 00:46:40,740 data that will be entered like to our 1041 00:46:40,740 --> 00:46:44,400 website manually okay this is instead of 1042 00:46:44,400 --> 00:46:47,460 the data that we have been getting from 1043 00:46:47,460 --> 00:46:49,740 our data set that we created so I'm just 1044 00:46:49,740 --> 00:46:51,960 going to double click on it and choose 1045 00:46:51,960 --> 00:46:55,579 CSV and I will choose it has headers 1046 00:46:55,579 --> 00:47:00,839 and I will take or copy this content and 1047 00:47:00,839 --> 00:47:02,819 put it there okay 1048 00:47:02,819 --> 00:47:05,700 so let's do it 1049 00:47:05,700 --> 00:47:07,920 I think I have to click on edit code now 1050 00:47:07,920 --> 00:47:10,680 I can click on Save and I can close it 1051 00:47:10,680 --> 00:47:13,079 another thing which is the python script 1052 00:47:13,079 --> 00:47:16,700 that we will be executing 1053 00:47:17,099 --> 00:47:19,380 um yeah we are going to remove this also 1054 00:47:19,380 --> 00:47:21,140 we don't need the evaluate model anymore 1055 00:47:21,140 --> 00:47:24,319 so we are going to remove 1056 00:47:24,319 --> 00:47:28,579 script that I will be executing okay 1057 00:47:28,579 --> 00:47:32,599 I can find it here 1058 00:47:33,540 --> 00:47:34,619 um 1059 00:47:34,619 --> 00:47:35,760 yeah 1060 00:47:35,760 --> 00:47:38,640 this is the python script that we will 1061 00:47:38,640 --> 00:47:41,520 execute and it says to you that this 1062 00:47:41,520 --> 00:47:43,619 code selects only the patient's ID 1063 00:47:43,619 --> 00:47:45,000 that's correct label the school 1064 00:47:45,000 --> 00:47:47,700 probability and return returns them to 1065 00:47:47,700 --> 00:47:49,980 the web service output so we don't want 1066 00:47:49,980 --> 00:47:51,960 to return all the columns as we have 1067 00:47:51,960 --> 00:47:53,339 seen previously 1068 00:47:53,339 --> 00:47:55,560 uh the determines everything 1069 00:47:55,560 --> 00:47:56,940 so 1070 00:47:56,940 --> 00:47:59,040 we want to return certain stuff the 1071 00:47:59,040 --> 00:48:02,940 stuff that we will use inside our 1072 00:48:02,940 --> 00:48:05,640 endpoint so I'm just going to select 1073 00:48:05,640 --> 00:48:07,980 everything and delete it and 1074 00:48:07,980 --> 00:48:11,060 paste the code that I have gotten from 1075 00:48:11,060 --> 00:48:14,280 the uh 1076 00:48:14,280 --> 00:48:16,500 the Microsoft learn Docs 1077 00:48:16,500 --> 00:48:19,079 now I can click on Save and I can close 1078 00:48:19,079 --> 00:48:20,280 this 1079 00:48:20,280 --> 00:48:21,960 let me check something I don't think 1080 00:48:21,960 --> 00:48:25,020 it's saved it's saved but the display is 1081 00:48:25,020 --> 00:48:26,160 wrong okay 1082 00:48:26,160 --> 00:48:30,300 and now I think everything is good to go 1083 00:48:30,300 --> 00:48:32,640 I'm just gonna double check everything 1084 00:48:32,640 --> 00:48:36,359 so uh yeah we are gonna change the name 1085 00:48:36,359 --> 00:48:38,640 of this uh 1086 00:48:38,640 --> 00:48:40,800 Pipeline and we are gonna call it 1087 00:48:40,800 --> 00:48:42,780 predict 1088 00:48:42,780 --> 00:48:46,319 diabetes okay 1089 00:48:46,319 --> 00:48:50,339 now let's close it and 1090 00:48:50,339 --> 00:48:57,119 I think that we are good to go so 1091 00:48:57,119 --> 00:48:59,300 um 1092 00:48:59,720 --> 00:49:04,460 okay I think everything is good for us 1093 00:49:06,420 --> 00:49:08,339 I just want to make sure of something is 1094 00:49:08,339 --> 00:49:12,420 the data is correct the data is uh yeah 1095 00:49:12,420 --> 00:49:13,560 it's correct 1096 00:49:13,560 --> 00:49:16,319 okay now I can run the pipeline let's 1097 00:49:16,319 --> 00:49:17,640 submit 1098 00:49:17,640 --> 00:49:21,000 select an existing Pipeline and we're 1099 00:49:21,000 --> 00:49:22,740 going to choose the MS layer and 1100 00:49:22,740 --> 00:49:24,599 diabetes training which is the pipeline 1101 00:49:24,599 --> 00:49:27,060 that we have been working on 1102 00:49:27,060 --> 00:49:31,619 from the beginning of this module 1103 00:49:31,680 --> 00:49:33,839 I don't think that this is going to take 1104 00:49:33,839 --> 00:49:36,060 much time so we have submitted the job 1105 00:49:36,060 --> 00:49:37,319 and it's running 1106 00:49:37,319 --> 00:49:40,140 until the job ends we are going to set 1107 00:49:40,140 --> 00:49:41,720 everything 1108 00:49:41,720 --> 00:49:45,599 and for deploying a service 1109 00:49:45,599 --> 00:49:49,560 in order to deploy a service okay 1110 00:49:49,560 --> 00:49:50,520 um 1111 00:49:50,520 --> 00:49:54,000 I have to have the job ready so 1112 00:49:54,000 --> 00:49:56,040 until it's ready or you can deploy it so 1113 00:49:56,040 --> 00:49:58,319 let's go to the job the job details from 1114 00:49:58,319 --> 00:50:01,319 here okay 1115 00:50:01,319 --> 00:50:05,119 and until it finishes 1116 00:50:05,119 --> 00:50:07,260 Carlotta do you think that we can have 1117 00:50:07,260 --> 00:50:09,240 the questions and then we can get back 1118 00:50:09,240 --> 00:50:12,859 to the job I'm deploying it 1119 00:50:13,700 --> 00:50:17,579 yeah yeah yeah so yeah yeah guys if you 1120 00:50:17,579 --> 00:50:18,980 have any questions 1121 00:50:18,980 --> 00:50:24,119 uh on on what you just uh just saw here 1122 00:50:24,119 --> 00:50:26,940 or into introductions feel free this is 1123 00:50:26,940 --> 00:50:30,300 a good moment we can uh we can discuss 1124 00:50:30,300 --> 00:50:33,900 now while we wait for this job to to 1125 00:50:33,900 --> 00:50:36,260 finish 1126 00:50:36,300 --> 00:50:38,760 uh and the 1127 00:50:38,760 --> 00:50:40,220 can can 1128 00:50:40,220 --> 00:50:45,000 we have the energy check one or like 1129 00:50:45,000 --> 00:50:47,700 what do you think uh yeah we can also go 1130 00:50:47,700 --> 00:50:49,680 to the knowledge check 1131 00:50:49,680 --> 00:50:50,940 um 1132 00:50:50,940 --> 00:50:56,339 yeah okay so let me share my screen 1133 00:50:56,339 --> 00:50:58,980 yeah so if you have not any questions 1134 00:50:58,980 --> 00:51:01,619 for us we can maybe propose some 1135 00:51:01,619 --> 00:51:05,339 questions to to you that you can 1136 00:51:05,339 --> 00:51:06,240 um 1137 00:51:06,240 --> 00:51:09,660 uh to check our knowledge so far and you 1138 00:51:09,660 --> 00:51:12,900 can uh maybe answer to these questions 1139 00:51:12,900 --> 00:51:15,420 uh via chat 1140 00:51:15,420 --> 00:51:18,300 um so we have do you see my screen can 1141 00:51:18,300 --> 00:51:19,859 you see my screen 1142 00:51:19,859 --> 00:51:22,020 yes 1143 00:51:22,020 --> 00:51:25,440 um so John I think I will read this 1144 00:51:25,440 --> 00:51:29,040 question loud and ask it to you okay so 1145 00:51:29,040 --> 00:51:32,040 are you ready to transfer 1146 00:51:32,040 --> 00:51:33,660 yes I am 1147 00:51:33,660 --> 00:51:35,460 so 1148 00:51:35,460 --> 00:51:37,260 um you're using Azure machine learning 1149 00:51:37,260 --> 00:51:39,780 designer to create a training pipeline 1150 00:51:39,780 --> 00:51:42,540 for a binary classification model so 1151 00:51:42,540 --> 00:51:45,300 what what we were doing in our demo 1152 00:51:45,300 --> 00:51:48,059 right and you have added a data set 1153 00:51:48,059 --> 00:51:51,660 containing features and labels uh a true 1154 00:51:51,660 --> 00:51:54,359 class decision Forest module so we used 1155 00:51:54,359 --> 00:51:56,819 a logistic regression model our 1156 00:51:56,819 --> 00:51:59,099 um in our example here we're using A2 1157 00:51:59,099 --> 00:52:01,260 class decision force model 1158 00:52:01,260 --> 00:52:04,500 and of course a trained model model you 1159 00:52:04,500 --> 00:52:07,200 plan now to use score model and evaluate 1160 00:52:07,200 --> 00:52:09,480 model modules to test the train model 1161 00:52:09,480 --> 00:52:11,640 with the subset of the data set that 1162 00:52:11,640 --> 00:52:13,500 wasn't used for training 1163 00:52:13,500 --> 00:52:15,960 but what are we missing so what's 1164 00:52:15,960 --> 00:52:18,780 another model you should add and we have 1165 00:52:18,780 --> 00:52:21,660 three options we have join data we have 1166 00:52:21,660 --> 00:52:25,200 split data or we have select columns in 1167 00:52:25,200 --> 00:52:26,819 in that set 1168 00:52:26,819 --> 00:52:28,260 so 1169 00:52:28,260 --> 00:52:32,040 um while John thinks about the answer uh 1170 00:52:32,040 --> 00:52:33,839 go ahead and 1171 00:52:33,839 --> 00:52:34,800 um 1172 00:52:34,800 --> 00:52:37,800 answer yourself so give us your your 1173 00:52:37,800 --> 00:52:39,540 guess 1174 00:52:39,540 --> 00:52:41,940 put in the chat or just come off mute 1175 00:52:41,940 --> 00:52:44,900 and announcer 1176 00:52:46,740 --> 00:52:48,960 a b yes 1177 00:52:48,960 --> 00:52:50,579 yeah what do you think is the correct 1178 00:52:50,579 --> 00:52:53,579 answer for this one I need something to 1179 00:52:53,579 --> 00:52:56,579 uh like I have to score my model and I 1180 00:52:56,579 --> 00:53:00,359 have to evaluate it so I I like I need 1181 00:53:00,359 --> 00:53:03,119 something to enable me to do these two 1182 00:53:03,119 --> 00:53:05,359 things 1183 00:53:06,660 --> 00:53:09,119 I think it's something you showed us in 1184 00:53:09,119 --> 00:53:12,980 in your pipeline right John 1185 00:53:13,260 --> 00:53:16,819 of course I did 1186 00:53:23,460 --> 00:53:28,020 uh we have no guests yeah 1187 00:53:28,020 --> 00:53:32,280 can someone like someone want to guess 1188 00:53:32,280 --> 00:53:35,579 uh we have a b yeah 1189 00:53:35,579 --> 00:53:38,760 uh maybe 1190 00:53:38,760 --> 00:53:43,260 so uh in order to do this in order to do 1191 00:53:43,260 --> 00:53:46,200 this I mentioned the 1192 00:53:46,200 --> 00:53:49,380 the module that is going to help me to 1193 00:53:49,380 --> 00:53:53,819 to divide my data into two things 70 for 1194 00:53:53,819 --> 00:53:56,220 the training and thirty percent for the 1195 00:53:56,220 --> 00:53:59,339 evaluation so what did I use I used 1196 00:53:59,339 --> 00:54:01,859 split data because this is what is going 1197 00:54:01,859 --> 00:54:05,280 to split my data randomly into training 1198 00:54:05,280 --> 00:54:08,579 data and validation data so the correct 1199 00:54:08,579 --> 00:54:12,240 answer is B and good job eek thank you 1200 00:54:12,240 --> 00:54:13,980 for participating 1201 00:54:13,980 --> 00:54:17,400 next question please 1202 00:54:17,400 --> 00:54:19,339 yes 1203 00:54:19,339 --> 00:54:22,559 answer so thanks John 1204 00:54:22,559 --> 00:54:26,040 uh for uh explaining us the the correct 1205 00:54:26,040 --> 00:54:26,940 one 1206 00:54:26,940 --> 00:54:30,420 and we want to go with question two 1207 00:54:30,420 --> 00:54:33,180 yeah so uh I'm going to ask you now 1208 00:54:33,180 --> 00:54:35,880 karnata you use Azure machine learning 1209 00:54:35,880 --> 00:54:38,280 designer to create a training pipeline 1210 00:54:38,280 --> 00:54:40,500 for your classification model 1211 00:54:40,500 --> 00:54:44,099 what must you do before you deploy this 1212 00:54:44,099 --> 00:54:45,960 model as a service you have to do 1213 00:54:45,960 --> 00:54:47,579 something before you deploy it what do 1214 00:54:47,579 --> 00:54:49,740 you think is the correct answer 1215 00:54:49,740 --> 00:54:52,740 is it a b or c 1216 00:54:52,740 --> 00:54:55,020 share your thoughts without in touch 1217 00:54:55,020 --> 00:54:58,380 with us in the chat and 1218 00:54:58,380 --> 00:55:00,180 um and I'm also going to give you some 1219 00:55:00,180 --> 00:55:02,940 like minutes to think of it before I 1220 00:55:02,940 --> 00:55:06,020 like tell you about 1221 00:55:06,599 --> 00:55:09,000 yeah so let me go through the possible 1222 00:55:09,000 --> 00:55:12,359 answers right so we have a uh create an 1223 00:55:12,359 --> 00:55:14,940 inference pipeline from the training 1224 00:55:14,940 --> 00:55:16,020 pipeline 1225 00:55:16,020 --> 00:55:19,260 uh B we have ADD and evaluate model 1226 00:55:19,260 --> 00:55:22,380 module to the training Pipeline and then 1227 00:55:22,380 --> 00:55:25,079 three we have uh clone the training 1228 00:55:25,079 --> 00:55:29,480 Pipeline with a different name 1229 00:55:29,520 --> 00:55:31,559 so what do you think is the correct 1230 00:55:31,559 --> 00:55:33,960 answer a b or c 1231 00:55:33,960 --> 00:55:36,660 uh also this time I think it's something 1232 00:55:36,660 --> 00:55:39,300 we mentioned both in the decks and in 1233 00:55:39,300 --> 00:55:41,960 the demo right 1234 00:55:42,599 --> 00:55:44,819 yes it is 1235 00:55:44,819 --> 00:55:48,720 it's something that I have done like two 1236 00:55:48,720 --> 00:55:51,800 like five minutes ago 1237 00:55:51,800 --> 00:55:57,200 it's real time real time what's 1238 00:55:58,020 --> 00:55:58,760 um 1239 00:55:58,760 --> 00:56:02,040 yeah so think about you need to deploy 1240 00:56:02,040 --> 00:56:05,460 uh the model as a service so uh if I'm 1241 00:56:05,460 --> 00:56:07,980 going to deploy model 1242 00:56:07,980 --> 00:56:10,380 um I cannot like evaluate the model 1243 00:56:10,380 --> 00:56:12,839 after deploying it right because I 1244 00:56:12,839 --> 00:56:14,940 cannot go into production if I'm not 1245 00:56:14,940 --> 00:56:17,579 sure I'm not satisfied over my model and 1246 00:56:17,579 --> 00:56:19,500 I'm not sure that my model is performing 1247 00:56:19,500 --> 00:56:20,280 well 1248 00:56:20,280 --> 00:56:23,460 so that's why I would go with 1249 00:56:23,460 --> 00:56:24,319 um 1250 00:56:24,319 --> 00:56:30,480 I would like exclude B from from my from 1251 00:56:30,480 --> 00:56:31,520 my answer 1252 00:56:31,520 --> 00:56:33,599 uh while 1253 00:56:33,599 --> 00:56:36,960 um thinking about C uh I don't see you I 1254 00:56:36,960 --> 00:56:39,480 didn't see you John cloning uh the 1255 00:56:39,480 --> 00:56:41,420 training Pipeline with a different name 1256 00:56:41,420 --> 00:56:44,640 uh so I I don't think this is the the 1257 00:56:44,640 --> 00:56:46,920 right answer 1258 00:56:46,920 --> 00:56:49,619 um while I've seen you creating an 1259 00:56:49,619 --> 00:56:52,859 inference pipeline uh yeah from the 1260 00:56:52,859 --> 00:56:55,020 training Pipeline and you just converted 1261 00:56:55,020 --> 00:56:59,280 it using uh a one-click button right 1262 00:56:59,280 --> 00:57:03,300 yeah that's correct so uh this is the 1263 00:57:03,300 --> 00:57:04,280 right answer 1264 00:57:04,280 --> 00:57:07,460 uh good job so I created an inference 1265 00:57:07,460 --> 00:57:11,280 real-time Pipeline and it has done it 1266 00:57:11,280 --> 00:57:13,440 like it finished it finished the job is 1267 00:57:13,440 --> 00:57:18,000 finished so uh we can now deploy 1268 00:57:18,000 --> 00:57:19,400 ment 1269 00:57:19,400 --> 00:57:21,500 yeah 1270 00:57:21,500 --> 00:57:25,339 exactly like on time 1271 00:57:25,380 --> 00:57:27,839 I like it finished like two seconds 1272 00:57:27,839 --> 00:57:30,859 three three four seconds ago 1273 00:57:30,859 --> 00:57:33,119 so uh 1274 00:57:33,119 --> 00:57:36,480 until like um 1275 00:57:36,480 --> 00:57:39,839 this is my job review so 1276 00:57:39,839 --> 00:57:43,260 uh like this is the job details that I 1277 00:57:43,260 --> 00:57:45,540 have already submitted it's just opening 1278 00:57:45,540 --> 00:57:48,119 and once it opens 1279 00:57:48,119 --> 00:57:50,180 um 1280 00:57:50,400 --> 00:57:52,740 like I don't know why it's so heavy 1281 00:57:52,740 --> 00:57:56,780 today it's not like that usually 1282 00:57:58,740 --> 00:58:01,020 yeah it's probably because you are also 1283 00:58:01,020 --> 00:58:06,000 showing your your screen on teams 1284 00:58:06,000 --> 00:58:08,160 okay so that's the bandwidth of your 1285 00:58:08,160 --> 00:58:10,740 connection is exactly do something here 1286 00:58:10,740 --> 00:58:13,740 because yeah finally 1287 00:58:13,740 --> 00:58:16,440 I can switch to my mobile internet if it 1288 00:58:16,440 --> 00:58:18,599 did it again so I will click on deploy 1289 00:58:18,599 --> 00:58:20,700 it's that simple I'll just click on 1290 00:58:20,700 --> 00:58:23,040 deploy and 1291 00:58:23,040 --> 00:58:25,619 I am going to deploy a new real-time 1292 00:58:25,619 --> 00:58:27,960 endpoint 1293 00:58:27,960 --> 00:58:30,300 so what I'm going to name it I'm 1294 00:58:30,300 --> 00:58:31,740 description and the compute type 1295 00:58:31,740 --> 00:58:33,720 everything is already mentioned for me 1296 00:58:33,720 --> 00:58:36,240 here so I'm just gonna copy and paste it 1297 00:58:36,240 --> 00:58:38,940 because we like we have we are running 1298 00:58:38,940 --> 00:58:41,280 out of time 1299 00:58:41,280 --> 00:58:45,680 so it's all Azure container instance 1300 00:58:45,680 --> 00:58:48,720 which is a containerization service also 1301 00:58:48,720 --> 00:58:51,059 both are for containerization but this 1302 00:58:51,059 --> 00:58:52,440 gives you something and this gives you 1303 00:58:52,440 --> 00:58:54,960 something else for the advanced options 1304 00:58:54,960 --> 00:58:57,420 it doesn't say for us to do anything so 1305 00:58:57,420 --> 00:59:00,420 we are just gonna click on deploy 1306 00:59:00,420 --> 00:59:05,220 and now we can test our endpoint from 1307 00:59:05,220 --> 00:59:07,859 the endpoints that we can find here so 1308 00:59:07,859 --> 00:59:11,460 it's in progress if I go here 1309 00:59:11,460 --> 00:59:13,799 under the assets I can find something 1310 00:59:13,799 --> 00:59:16,680 called endpoints and I can find the 1311 00:59:16,680 --> 00:59:18,599 real-time ones and the batch endpoints 1312 00:59:18,599 --> 00:59:22,020 and we have created a real-time endpoint 1313 00:59:22,020 --> 00:59:25,260 so we are going to find it under this uh 1314 00:59:25,260 --> 00:59:29,760 title so if I like click on it I should 1315 00:59:29,760 --> 00:59:32,640 be able to test it once it's ready 1316 00:59:32,640 --> 00:59:37,200 it's still like loading but this is the 1317 00:59:37,200 --> 00:59:40,980 input and this is the output that we 1318 00:59:40,980 --> 00:59:45,200 will get back so if I click on test and 1319 00:59:45,200 --> 00:59:49,920 from here I will input some data to the 1320 00:59:49,920 --> 00:59:50,900 endpoint 1321 00:59:50,900 --> 00:59:54,599 which are the patient information The 1322 00:59:54,599 --> 00:59:57,119 Columns that we have already seen in our 1323 00:59:57,119 --> 01:00:00,380 data set the patient ID the pregnancies 1324 01:00:00,380 --> 01:00:03,960 and of course of course I'm not gonna 1325 01:00:03,960 --> 01:00:05,940 enter the label that I'm trying to 1326 01:00:05,940 --> 01:00:08,099 predict so I'm not going to give him if 1327 01:00:08,099 --> 01:00:10,680 the patient is diabetic or not this end 1328 01:00:10,680 --> 01:00:13,200 point is to tell me this is the end 1329 01:00:13,200 --> 01:00:15,599 point or the URL is going to give me 1330 01:00:15,599 --> 01:00:17,640 back this information whether someone 1331 01:00:17,640 --> 01:00:22,680 has diabetes or he doesn't so if I input 1332 01:00:22,680 --> 01:00:24,780 these this data I'm just going to copy 1333 01:00:24,780 --> 01:00:27,780 it and go to my endpoint and click on 1334 01:00:27,780 --> 01:00:30,180 test I'm gonna give the result pack 1335 01:00:30,180 --> 01:00:32,359 which are the three columns that we have 1336 01:00:32,359 --> 01:00:35,520 defined inside our python script the 1337 01:00:35,520 --> 01:00:37,859 patient ID the diabetic prediction and 1338 01:00:37,859 --> 01:00:41,040 the probability the certainty of whether 1339 01:00:41,040 --> 01:00:45,720 someone is diabetic or not based on the 1340 01:00:45,720 --> 01:00:50,660 uh based on the prediction so that's it 1341 01:00:50,660 --> 01:00:54,359 and like uh I think that this is really 1342 01:00:54,359 --> 01:00:56,819 simple step to do you can do it on your 1343 01:00:56,819 --> 01:00:58,380 own you can test it 1344 01:00:58,380 --> 01:01:01,140 and I think that I have finished so 1345 01:01:01,140 --> 01:01:03,020 thank you 1346 01:01:03,020 --> 01:01:06,599 uh yes we are running out of time I I 1347 01:01:06,599 --> 01:01:09,780 just wanted to uh thank you John for for 1348 01:01:09,780 --> 01:01:12,299 this demo for going through all these 1349 01:01:12,299 --> 01:01:14,099 steps to 1350 01:01:14,099 --> 01:01:16,740 um create train a classification model 1351 01:01:16,740 --> 01:01:19,680 and also deploy it as a predictive 1352 01:01:19,680 --> 01:01:23,040 service and I encourage you all to go 1353 01:01:23,040 --> 01:01:25,079 back to the learn module 1354 01:01:25,079 --> 01:01:28,260 um and uh like depend all these topics 1355 01:01:28,260 --> 01:01:31,760 at your at your own pace and also maybe 1356 01:01:31,760 --> 01:01:34,799 uh do this demo on your own on your 1357 01:01:34,799 --> 01:01:37,140 subscription on your Azure for student 1358 01:01:37,140 --> 01:01:39,359 subscription 1359 01:01:39,359 --> 01:01:43,200 um and I would also like to recall that 1360 01:01:43,200 --> 01:01:46,260 this is part of a series of study 1361 01:01:46,260 --> 01:01:49,500 sessions of cloud skill challenge study 1362 01:01:49,500 --> 01:01:51,059 sessions 1363 01:01:51,059 --> 01:01:54,059 um so you will have more in the in the 1364 01:01:54,059 --> 01:01:57,540 in the following days and this is for 1365 01:01:57,540 --> 01:02:00,480 you to prepare let's say to to help you 1366 01:02:00,480 --> 01:02:04,880 in taking the a cloud skills challenge 1367 01:02:04,880 --> 01:02:07,040 which collect 1368 01:02:07,040 --> 01:02:10,799 a very interesting learn module that you 1369 01:02:10,799 --> 01:02:14,540 can use to scale up on various topics 1370 01:02:14,540 --> 01:02:18,359 and some of them are focused on AI and 1371 01:02:18,359 --> 01:02:20,819 ml so if you are interested in these 1372 01:02:20,819 --> 01:02:23,099 topics you can select these these learn 1373 01:02:23,099 --> 01:02:24,780 modules 1374 01:02:24,780 --> 01:02:27,660 um so let me also copy 1375 01:02:27,660 --> 01:02:29,819 um the link the short link to the 1376 01:02:29,819 --> 01:02:32,700 challenge in the chat uh remember that 1377 01:02:32,700 --> 01:02:34,980 you have time until the 13th of 1378 01:02:34,980 --> 01:02:37,980 September to take the challenge and also 1379 01:02:37,980 --> 01:02:40,440 remember that in October on the 7th of 1380 01:02:40,440 --> 01:02:43,020 October you have the you can join the 1381 01:02:43,020 --> 01:02:46,619 student the the student developer Summit 1382 01:02:46,619 --> 01:02:50,640 which is uh which will be a virtual or 1383 01:02:50,640 --> 01:02:53,220 in for some for some cases and hybrid 1384 01:02:53,220 --> 01:02:56,040 event so stay tuned because you will 1385 01:02:56,040 --> 01:02:58,559 have some surprises in the following 1386 01:02:58,559 --> 01:03:01,260 days and if you want to learn more about 1387 01:03:01,260 --> 01:03:03,480 this event you can check the Microsoft 1388 01:03:03,480 --> 01:03:08,099 Imaging cap Twitter page and stay tuned 1389 01:03:08,099 --> 01:03:11,460 so thank you everyone for uh for joining 1390 01:03:11,460 --> 01:03:13,079 this session today and thank you very 1391 01:03:13,079 --> 01:03:16,500 much Sean for co-hosting with with this 1392 01:03:16,500 --> 01:03:20,359 session with me it was a pleasure 1393 01:03:21,839 --> 01:03:24,119 thank you so much Carlotta for having me 1394 01:03:24,119 --> 01:03:26,579 with you today and thank you like for 1395 01:03:26,579 --> 01:03:28,079 giving me this opportunity to be with 1396 01:03:28,079 --> 01:03:30,180 you here 1397 01:03:30,180 --> 01:03:33,480 great I hope that uh yeah I hope that we 1398 01:03:33,480 --> 01:03:36,480 work again in the future sure I I hope 1399 01:03:36,480 --> 01:03:38,160 so as well 1400 01:03:38,160 --> 01:03:40,760 um so 1401 01:03:44,099 --> 01:03:46,500 bye bye speak to you soon 1402 01:03:46,500 --> 01:03:48,920 bye