1 00:00:00,170 --> 00:00:02,530 ‫The first pillar in the well-architected framework, 2 00:00:02,530 --> 00:00:05,060 ‫is going to be operational excellence, 3 00:00:05,060 --> 00:00:08,070 ‫so operational excellence includes the ability to run 4 00:00:08,070 --> 00:00:10,390 ‫and monitor systems to deliver business values 5 00:00:10,390 --> 00:00:12,340 ‫and to continuously improve 6 00:00:12,340 --> 00:00:14,350 ‫supporting processes and procedures. 7 00:00:14,350 --> 00:00:16,620 ‫It's about having operations that are great. 8 00:00:16,620 --> 00:00:18,610 ‫There are design principles that you need to know about, 9 00:00:18,610 --> 00:00:20,290 ‫and questions that you ask yourself. 10 00:00:20,290 --> 00:00:21,240 ‫First of all, 11 00:00:21,240 --> 00:00:23,160 ‫all your operations should be code, 12 00:00:23,160 --> 00:00:25,380 ‫infrastructure as code is going to be the cornerstone 13 00:00:25,380 --> 00:00:26,820 ‫of operational excellence, 14 00:00:26,820 --> 00:00:30,470 ‫and remember infrastructure as code is CloudFormation. 15 00:00:30,470 --> 00:00:31,710 ‫You need to annotate documentation, 16 00:00:31,710 --> 00:00:33,410 ‫so when you create documentation, 17 00:00:33,410 --> 00:00:34,570 ‫make sure it's annotated, 18 00:00:34,570 --> 00:00:35,403 ‫make sure it's commented, 19 00:00:35,403 --> 00:00:36,380 ‫make sure it's clean 20 00:00:36,380 --> 00:00:39,460 ‫and generated after every build of your application. 21 00:00:39,460 --> 00:00:42,990 ‫You need to make frequent and small reversible changes, 22 00:00:42,990 --> 00:00:45,120 ‫so that, in case of failures, you can reverse it. 23 00:00:45,120 --> 00:00:47,150 ‫If you make huge changes every three months, 24 00:00:47,150 --> 00:00:48,970 ‫this is not going to go well. 25 00:00:48,970 --> 00:00:51,080 ‫You need to refine operations procedures frequently, 26 00:00:51,080 --> 00:00:54,040 ‫so as the team gets more familiar with the operations, 27 00:00:54,040 --> 00:00:57,440 ‫refine it, automate more, and make improvements. 28 00:00:57,440 --> 00:01:00,570 ‫Finally, anticipate failure, that will always happen, 29 00:01:00,570 --> 00:01:03,240 ‫and you need to learn from all these failures, okay? 30 00:01:03,240 --> 00:01:05,110 ‫Failures do happen and they're great, 31 00:01:05,110 --> 00:01:06,750 ‫failures is when you actually learn, 32 00:01:06,750 --> 00:01:08,870 ‫when I fail, I learn, so you should do the same, 33 00:01:08,870 --> 00:01:12,670 ‫and this is the design principles of operational excellence. 34 00:01:12,670 --> 00:01:14,820 ‫So now, I'd like to just transcribe it 35 00:01:14,820 --> 00:01:16,660 ‫in terms of AWS Services, 36 00:01:16,660 --> 00:01:19,170 ‫and the first one is going to be Prepare. 37 00:01:19,170 --> 00:01:21,570 ‫So, Prepare is going to be how do we basically 38 00:01:21,570 --> 00:01:24,300 ‫prepare everything to have operational excellence. 39 00:01:24,300 --> 00:01:25,340 ‫So you should use run books, 40 00:01:25,340 --> 00:01:27,610 ‫you should have good infrastructure standards, 41 00:01:27,610 --> 00:01:29,650 ‫you should do run deployments, 42 00:01:29,650 --> 00:01:31,140 ‫and like, mock deployments, 43 00:01:31,140 --> 00:01:33,990 ‫and for this, we could use AWS CloudFormation, 44 00:01:33,990 --> 00:01:35,930 ‫basically to prepare everything 45 00:01:35,930 --> 00:01:37,910 ‫as a nice infrastructure as code, 46 00:01:37,910 --> 00:01:40,040 ‫and then Config would be really nice as well, 47 00:01:40,040 --> 00:01:42,604 ‫to be using, we haven't seen Config in-depth in this course, 48 00:01:42,604 --> 00:01:44,730 ‫but Config could be used to evaluate 49 00:01:44,730 --> 00:01:47,560 ‫the compliance of your CloudFormation templates. 50 00:01:47,560 --> 00:01:49,090 ‫Then, Operate, you need to operate 51 00:01:49,090 --> 00:01:51,710 ‫and basically automate as much as you can, 52 00:01:51,710 --> 00:01:53,080 ‫you should release fast, 53 00:01:53,080 --> 00:01:55,110 ‫you should avoid manual processes. 54 00:01:55,110 --> 00:01:58,380 ‫So, again, CloudFormation would be a great tool for this. 55 00:01:58,380 --> 00:01:59,500 ‫Config as well. 56 00:01:59,500 --> 00:02:01,110 ‫CloudTrail to make sure to track 57 00:02:01,110 --> 00:02:02,620 ‫all the API codes that are done, 58 00:02:02,620 --> 00:02:04,410 ‫and make sure nothing is done on purpose, 59 00:02:04,410 --> 00:02:06,450 ‫or nothing is changed manually. 60 00:02:06,450 --> 00:02:08,580 ‫CloudWatch to basically monitor 61 00:02:08,580 --> 00:02:10,590 ‫the performance over time of your stack, 62 00:02:10,590 --> 00:02:12,420 ‫and make sure that if you need to operate it, 63 00:02:12,420 --> 00:02:13,800 ‫you know what operations to do. 64 00:02:13,800 --> 00:02:14,890 ‫And finally X-Ray, 65 00:02:14,890 --> 00:02:16,830 ‫it's not something we've seen in-depth in this course 66 00:02:16,830 --> 00:02:19,191 ‫but X-Ray will be able to trace API calls, 67 00:02:19,191 --> 00:02:21,330 ‫to trace HTP requests 68 00:02:21,330 --> 00:02:22,850 ‫and to make sure that they're working correctly, 69 00:02:22,850 --> 00:02:24,350 ‫and if they're not working correctly, 70 00:02:24,350 --> 00:02:27,640 ‫it will point us to where the incorrect stuff happened. 71 00:02:27,640 --> 00:02:30,690 ‫Finally, you need to evolve your infrastructure over time. 72 00:02:30,690 --> 00:02:31,523 ‫So how do we do this? 73 00:02:31,523 --> 00:02:33,330 ‫Well, again, CloudFormation as you understand it, 74 00:02:33,330 --> 00:02:35,280 ‫CloudFormation is a centerpiece 75 00:02:35,280 --> 00:02:37,740 ‫of this whole operational excellence pillar, 76 00:02:37,740 --> 00:02:39,910 ‫but also you need to be able to evolve over time, 77 00:02:39,910 --> 00:02:43,017 ‫so all this CICD tools basically CodeBuild, 78 00:02:43,017 --> 00:02:46,330 ‫CodeCommit, CodeDeploy, CodePipeline, 79 00:02:46,330 --> 00:02:48,850 ‫allow you to basically iterate quickly, 80 00:02:48,850 --> 00:02:50,690 ‫deploy quickly, deploy often, 81 00:02:50,690 --> 00:02:52,080 ‫deploy small changes, 82 00:02:52,080 --> 00:02:53,646 ‫and all these things will contribute 83 00:02:53,646 --> 00:02:55,410 ‫to operational excellence. 84 00:02:55,410 --> 00:02:56,680 ‫Now remember, just going into the exam, 85 00:02:56,680 --> 00:02:59,190 ‫you don't need to understand or remember any of that. 86 00:02:59,190 --> 00:03:01,260 ‫I'm just trying to educate you, 87 00:03:01,260 --> 00:03:04,140 ‫to all the AWS Services, and how they can help 88 00:03:04,140 --> 00:03:06,260 ‫with this operational excellence pillar. 89 00:03:06,260 --> 00:03:07,870 ‫This is just educative, okay? 90 00:03:07,870 --> 00:03:09,610 ‫Have a read through the pillar, by the way, 91 00:03:09,610 --> 00:03:11,027 ‫on the white paper if you want to, 92 00:03:11,027 --> 00:03:13,190 ‫but here I'm just trying to give you a small rundown. 93 00:03:13,190 --> 00:03:14,890 ‫You don't need to remember all these services, 94 00:03:14,890 --> 00:03:18,480 ‫and there's not, this service corresponds exactly to that, 95 00:03:18,480 --> 00:03:20,210 ‫operational pillar, or whatever, 96 00:03:20,210 --> 00:03:22,920 ‫but it's just to give you ideas of how all these services 97 00:03:22,920 --> 00:03:25,950 ‫can be used as a synergy to basically 98 00:03:25,950 --> 00:03:27,640 ‫drive operational excellence. 99 00:03:27,640 --> 00:03:29,030 ‫Alright, so that's it for this lecture, 100 00:03:29,030 --> 00:03:30,580 ‫I will see you in the next one.