1
00:00:00,710 --> 00:00:01,960
‫Okay, so now let's have a look

2
00:00:01,960 --> 00:00:03,573
‫at automatic scaling for your ASG.

3
00:00:03,573 --> 00:00:05,860
‫As you can see we have three categories,

4
00:00:05,860 --> 00:00:08,920
‫dynamic scaling policies, predictive scaling policies,

5
00:00:08,920 --> 00:00:10,400
‫and scheduled actions.

6
00:00:10,400 --> 00:00:11,430
‫So let's start with the simplest.

7
00:00:11,430 --> 00:00:12,340
‫So schedule actions

8
00:00:12,340 --> 00:00:16,290
‫is when you want to schedule a scaling action in the future,

9
00:00:16,290 --> 00:00:17,610
‫so you can create one

10
00:00:17,610 --> 00:00:20,310
‫and then you say what you want the desired capacity

11
00:00:20,310 --> 00:00:22,300
‫or the min or the max to be,

12
00:00:22,300 --> 00:00:25,060
‫and then is it once every week, every hour,

13
00:00:25,060 --> 00:00:27,500
‫is it's based on a specific schedule and so on?

14
00:00:27,500 --> 00:00:29,830
‫And then a start time and an end time

15
00:00:29,830 --> 00:00:31,769
‫if you wanted it to be done.

16
00:00:31,769 --> 00:00:34,890
‫So this is pretty cool and this is allowing you to schedule

17
00:00:34,890 --> 00:00:36,890
‫based on events that you can predict

18
00:00:36,890 --> 00:00:38,050
‫and that you know in advance.

19
00:00:38,050 --> 00:00:38,890
‫Because you know, for example

20
00:00:38,890 --> 00:00:41,683
‫that you're going to run a big promotion next Saturday.

21
00:00:42,840 --> 00:00:45,150
‫Next, predictive scaling policies

22
00:00:45,150 --> 00:00:47,380
‫is where it's going to be machine learning driven.

23
00:00:47,380 --> 00:00:49,830
‫So you need to scale it based on a forecast.

24
00:00:49,830 --> 00:00:53,290
‫So you need to have a look at the actual policy

25
00:00:53,290 --> 00:00:55,420
‫and the actual scaling based on the past

26
00:00:55,420 --> 00:00:58,040
‫and then it will have a look at a metric to look at.

27
00:00:58,040 --> 00:01:01,250
‫For example, CPU utilization or network in, network out,

28
00:01:01,250 --> 00:01:03,300
‫the application load balancer request counts

29
00:01:03,300 --> 00:01:05,000
‫or a custom metric that you want.

30
00:01:05,000 --> 00:01:06,143
‫And then you want to, for example,

31
00:01:06,143 --> 00:01:10,600
‫to have a target of 50% of CPU utilization for instance,

32
00:01:10,600 --> 00:01:12,980
‫and then you can set up some additional settings,

33
00:01:12,980 --> 00:01:14,450
‫but based on this,

34
00:01:14,450 --> 00:01:16,960
‫and based on the actual scheduled utilization

35
00:01:16,960 --> 00:01:18,810
‫for the past week, for example,

36
00:01:18,810 --> 00:01:21,370
‫then a forecast is going to be created

37
00:01:21,370 --> 00:01:24,860
‫and your ASG is going to be scaling based on that forecast.

38
00:01:24,860 --> 00:01:26,370
‫So it's not something I can demonstrate with you

39
00:01:26,370 --> 00:01:28,270
‫because I would need to enable this

40
00:01:28,270 --> 00:01:29,590
‫for a very long time a week

41
00:01:29,590 --> 00:01:31,760
‫and then make sure I have some usage on my application.

42
00:01:31,760 --> 00:01:33,960
‫But at least you see that predictive scaling policy

43
00:01:33,960 --> 00:01:35,040
‫is quite simple to set up,

44
00:01:35,040 --> 00:01:37,160
‫you just specify the metric you want

45
00:01:37,160 --> 00:01:38,530
‫and the target utilization,

46
00:01:38,530 --> 00:01:40,410
‫and then some machine learning will be applied

47
00:01:40,410 --> 00:01:42,230
‫for the scaling to happen.

48
00:01:42,230 --> 00:01:44,880
‫So the one policy I can demonstrate to you

49
00:01:44,880 --> 00:01:46,820
‫is the dynamic scaling policy.

50
00:01:46,820 --> 00:01:49,630
‫So let's go ahead and create a dynamic scaling policy.

51
00:01:49,630 --> 00:01:51,130
‫So here we have three options.

52
00:01:51,130 --> 00:01:54,650
‫We have target tracking, step scaling and simple scaling.

53
00:01:54,650 --> 00:01:56,720
‫So let's have a look at simple scaling first.

54
00:01:56,720 --> 00:02:00,400
‫So here we have to specify a name than a CloudWatch alarm,

55
00:02:00,400 --> 00:02:02,450
‫which is an alarm that can scale capacity

56
00:02:02,450 --> 00:02:03,650
‫whenever it is being triggered.

57
00:02:03,650 --> 00:02:06,160
‫So we need to create an alarm beforehand,

58
00:02:06,160 --> 00:02:07,840
‫but we'll see this later on.

59
00:02:07,840 --> 00:02:10,140
‫And so in case the alarm is being triggered

60
00:02:10,140 --> 00:02:14,030
‫and what happens, maybe you want to add two capacity units,

61
00:02:14,030 --> 00:02:18,270
‫or maybe you want to add 10% of your group.

62
00:02:18,270 --> 00:02:20,040
‫And then add capacity units

63
00:02:20,040 --> 00:02:23,610
‫in increments of at least two capacity units.

64
00:02:23,610 --> 00:02:25,330
‫So this is a simple scaling policy.

65
00:02:25,330 --> 00:02:27,810
‫So we can have a scaling policy that goes out

66
00:02:27,810 --> 00:02:29,220
‫by adding instances,

67
00:02:29,220 --> 00:02:32,420
‫or that goes in by removing instances here.

68
00:02:32,420 --> 00:02:34,663
‫Or you can have a set to action as well.

69
00:02:35,930 --> 00:02:37,420
‫And next we have step scaling.

70
00:02:37,420 --> 00:02:40,430
‫So this is allowing you to set up multiple an alarm

71
00:02:40,430 --> 00:02:42,950
‫and based on the alarm value to have different steps

72
00:02:42,950 --> 00:02:45,400
‫to, for example, if the alarm is very, very high

73
00:02:45,400 --> 00:02:48,160
‫in terms of the value, then add 10 capacity units.

74
00:02:48,160 --> 00:02:50,370
‫But if it's high, but not too high, add one.

75
00:02:50,370 --> 00:02:52,060
‫This is the idea behind step scaling,

76
00:02:52,060 --> 00:02:54,970
‫but we'll set up a target tracking scaling policy

77
00:02:54,970 --> 00:02:58,100
‫because this is going to create a CloudWatch alarms for us.

78
00:02:58,100 --> 00:03:00,220
‫So here's the name target tracking policy

79
00:03:00,220 --> 00:03:02,930
‫and it will track the average CPU utilization

80
00:03:02,930 --> 00:03:06,180
‫of a target value of 40,

81
00:03:06,180 --> 00:03:09,260
‫and then I will go ahead and create my policy.

82
00:03:09,260 --> 00:03:11,050
‫So now what we're saying is that, hey,

83
00:03:11,050 --> 00:03:13,090
‫the goal of this ASG

84
00:03:13,090 --> 00:03:16,450
‫is to maintain the CPU utilization to 40.

85
00:03:16,450 --> 00:03:18,430
‫And in case, then you go over,

86
00:03:18,430 --> 00:03:20,220
‫then please add capacity units.

87
00:03:20,220 --> 00:03:23,550
‫So to see this in action, we need to change a few things.

88
00:03:23,550 --> 00:03:25,750
‫So right now the main end of the desired

89
00:03:25,750 --> 00:03:29,030
‫is one which is good, but let's set the max capacity

90
00:03:29,030 --> 00:03:31,490
‫to be three or two whatever happens.

91
00:03:31,490 --> 00:03:33,490
‫The idea is that you want to give it a maximum

92
00:03:33,490 --> 00:03:35,620
‫that is greater than the minimum

93
00:03:35,620 --> 00:03:38,300
‫so that the capacity can go from one to two

94
00:03:38,300 --> 00:03:39,670
‫and then to three.

95
00:03:39,670 --> 00:03:42,970
‫And so the idea now is that we want the CPU utilization

96
00:03:42,970 --> 00:03:46,540
‫of my autoscaling group to be at a target value of 40.

97
00:03:46,540 --> 00:03:49,400
‫So if you have a look right now at the CPU utilization

98
00:03:49,400 --> 00:03:51,080
‫is going to be zero, obviously.

99
00:03:51,080 --> 00:03:52,520
‫So let's have a look at EC2

100
00:03:53,580 --> 00:03:55,280
‫is going to be close to zero because,

101
00:03:55,280 --> 00:03:57,710
‫well, my EC2 instance is not doing anything.

102
00:03:57,710 --> 00:04:01,150
‫So what I want to do is to go to my EC2 instance,

103
00:04:01,150 --> 00:04:04,770
‫and I'm going to stress it to make the CPU utilization

104
00:04:04,770 --> 00:04:07,070
‫skyrocket to 100%.

105
00:04:07,070 --> 00:04:09,400
‫So I'm going to connect to my EC2 instance,

106
00:04:09,400 --> 00:04:12,850
‫using EC2 instance connect, and then connect to it

107
00:04:12,850 --> 00:04:17,850
‫and then I'll Google install stress Amazon Linux 2.

108
00:04:19,800 --> 00:04:22,590
‫Cause there's a few commands and here is the commands,

109
00:04:22,590 --> 00:04:24,973
‫so I'll copy the first command in here,

110
00:04:27,950 --> 00:04:31,573
‫and then I'll copy the second command to install stress.

111
00:04:35,710 --> 00:04:36,543
‫Here we go.

112
00:04:36,543 --> 00:04:37,900
‫So stress is installed.

113
00:04:37,900 --> 00:04:42,700
‫And then I just run the command stress -C 4

114
00:04:42,700 --> 00:04:45,880
‫and this is going to make the CPU go to 100%

115
00:04:45,880 --> 00:04:50,160
‫by leveraging four CPU units by making four VCPUs

116
00:04:50,160 --> 00:04:50,993
‫being used at a time.

117
00:04:50,993 --> 00:04:53,940
‫So this should make my CPU go to 100%

118
00:04:53,940 --> 00:04:55,800
‫and so what's going to happen is that

119
00:04:55,800 --> 00:04:59,010
‫in my monitoring of my ASG in here,

120
00:04:59,010 --> 00:05:01,670
‫what I wanted to happen is to go see the CPU tradition

121
00:05:01,670 --> 00:05:04,060
‫to go to something very, very, very high.

122
00:05:04,060 --> 00:05:08,410
‫And then in activities, I want to see a scaling action.

123
00:05:08,410 --> 00:05:11,110
‫And so that means that I will go from one instance

124
00:05:11,110 --> 00:05:12,760
‫to two instances.

125
00:05:12,760 --> 00:05:15,430
‫So what I'm going to do is just pause the video until,

126
00:05:15,430 --> 00:05:17,040
‫well, enough metrics are being captured

127
00:05:17,040 --> 00:05:19,910
‫and until we can see that the CPU utilization

128
00:05:19,910 --> 00:05:21,330
‫is at a very high value

129
00:05:21,330 --> 00:05:24,183
‫and then we'll see how the target tracking policy works.

130
00:05:25,440 --> 00:05:29,490
‫So now I went into activity and under Activity history,

131
00:05:29,490 --> 00:05:32,390
‫it says that's a alarm has been triggered

132
00:05:32,390 --> 00:05:34,270
‫and due to the target driving policy,

133
00:05:34,270 --> 00:05:37,870
‫then the capacity went from one instance to two instances.

134
00:05:37,870 --> 00:05:39,750
‫So if I go into instance management,

135
00:05:39,750 --> 00:05:41,800
‫as you can see, now I have two EC2 instances

136
00:05:41,800 --> 00:05:43,080
‫due to the scaling.

137
00:05:43,080 --> 00:05:44,870
‫And if I go into monitoring

138
00:05:44,870 --> 00:05:47,100
‫and look at the EC2 level monitoring,

139
00:05:47,100 --> 00:05:50,110
‫as we can see the CPU utilization went to a very high value

140
00:05:50,110 --> 00:05:51,690
‫and then four scaling happens.

141
00:05:51,690 --> 00:05:53,400
‫So how do we know that the scaling happens?

142
00:05:53,400 --> 00:05:55,900
‫So if you're going to Automatic scaling, as you can see,

143
00:05:55,900 --> 00:05:59,380
‫there is a target tracking policy right here.

144
00:05:59,380 --> 00:06:01,160
‫And what I want to show you is the backend.

145
00:06:01,160 --> 00:06:06,160
‫So if we go into the CloudWatch service in here

146
00:06:07,000 --> 00:06:08,820
‫and go into CloudWatch,

147
00:06:08,820 --> 00:06:12,460
‫at the left hand side, I want to go into Alarms.

148
00:06:12,460 --> 00:06:14,830
‫So we need to go into alarms.

149
00:06:14,830 --> 00:06:18,200
‫As you can see, two alarms are created directly

150
00:06:18,200 --> 00:06:20,910
‫by the target tracking policy.

151
00:06:20,910 --> 00:06:24,900
‫And one of them is called AlarmHigh, which is to scale out,

152
00:06:24,900 --> 00:06:26,620
‫so add instances.

153
00:06:26,620 --> 00:06:28,450
‫And one of them is called AlarmLow,

154
00:06:28,450 --> 00:06:31,650
‫which is to scale in so less instances.

155
00:06:31,650 --> 00:06:34,770
‫And so this one is looking at if the CPU utilization

156
00:06:34,770 --> 00:06:37,870
‫is more than 40% for three to the points

157
00:06:37,870 --> 00:06:40,810
‫within three minutes, then go into the alarm states.

158
00:06:40,810 --> 00:06:42,860
‫And this one is looking at if CPU utilization

159
00:06:42,860 --> 00:06:46,773
‫is less than 20 eights for 15 data points, then scale in.

160
00:06:47,640 --> 00:06:48,500
‫So this is the idea.

161
00:06:48,500 --> 00:06:49,830
‫So this one was in alarm

162
00:06:49,830 --> 00:06:54,800
‫and so due to the metric itself, going into the alarm state,

163
00:06:54,800 --> 00:06:56,050
‫by having the CPU utilization

164
00:06:56,050 --> 00:06:58,090
‫going over that limit right here,

165
00:06:58,090 --> 00:06:59,560
‫well, it got triggered

166
00:06:59,560 --> 00:07:02,730
‫and this made a scaling activity happen

167
00:07:02,730 --> 00:07:04,170
‫from my autoscaling group,

168
00:07:04,170 --> 00:07:07,680
‫which in turn made a new instance being in service.

169
00:07:07,680 --> 00:07:11,900
‫And so now if I go here and I stop this command

170
00:07:11,900 --> 00:07:13,940
‫and I'm not even sure that I can stop it,

171
00:07:13,940 --> 00:07:18,610
‫but if I stop this command or let me just finish it

172
00:07:18,610 --> 00:07:19,920
‫and to stop this command,

173
00:07:19,920 --> 00:07:24,600
‫I can probably go to my EC2 instances and reboot it.

174
00:07:24,600 --> 00:07:28,403
‫So I'm going to have a look at this one, I believe,

175
00:07:29,750 --> 00:07:31,760
‫and I'm going to reboot it.

176
00:07:31,760 --> 00:07:33,510
‫So reboot this instance,

177
00:07:33,510 --> 00:07:36,293
‫and I'm going to also reboot that one just in case.

178
00:07:39,720 --> 00:07:43,640
‫So this should make my CPU utilization go back to zero

179
00:07:43,640 --> 00:07:47,430
‫and this would trigger a scale in action within 15 minutes.

180
00:07:47,430 --> 00:07:51,380
‫So I'm going to yet again, pause the video in my ASG

181
00:07:51,380 --> 00:07:54,870
‫and see if by any chance within the activity,

182
00:07:54,870 --> 00:07:56,620
‫I see a scale in action happening.

183
00:07:56,620 --> 00:07:58,340
‫So I will pause right now

184
00:07:59,360 --> 00:08:01,990
‫and actually just because I wasn't quick enough

185
00:08:01,990 --> 00:08:03,810
‫because the CPU still was high,

186
00:08:03,810 --> 00:08:06,450
‫then the desired capacity went from two to three.

187
00:08:06,450 --> 00:08:09,070
‫So as we can see yet another instance has been added.

188
00:08:09,070 --> 00:08:11,130
‫So this really shows the power of autoscaling group

189
00:08:11,130 --> 00:08:12,750
‫and now if I go into instance managements,

190
00:08:12,750 --> 00:08:15,280
‫we'll have three EC2 instances,

191
00:08:15,280 --> 00:08:17,310
‫and I'm going to have to wait a little while

192
00:08:17,310 --> 00:08:19,690
‫until they're being terminated.

193
00:08:19,690 --> 00:08:22,670
‫So now let's go back into activity and as you can see,

194
00:08:22,670 --> 00:08:24,520
‫more activities have been going on.

195
00:08:24,520 --> 00:08:26,410
‫So some instances have been terminated.

196
00:08:26,410 --> 00:08:31,000
‫So because the alarm went from the lower CPU

197
00:08:31,000 --> 00:08:34,650
‫and then the capacity was set from three to two

198
00:08:34,650 --> 00:08:37,090
‫and then one more time from two to one.

199
00:08:37,090 --> 00:08:38,900
‫And so if you go into your instance management,

200
00:08:38,900 --> 00:08:41,060
‫as you can see, one instance was already terminated

201
00:08:41,060 --> 00:08:44,380
‫and the other one is in the terminating phase,

202
00:08:44,380 --> 00:08:48,160
‫as that means that's your target tracking policy is working.

203
00:08:48,160 --> 00:08:51,350
‫And you can see this by going into this alarm right here.

204
00:08:51,350 --> 00:08:53,550
‫And as you can see, the CPU utilization went up

205
00:08:53,550 --> 00:08:54,430
‫and then it went down

206
00:08:54,430 --> 00:08:59,070
‫and then as soon it's past this 28% threshold,

207
00:08:59,070 --> 00:09:00,570
‫then it went to do the alarm state,

208
00:09:00,570 --> 00:09:04,240
‫which means that your ASG we'll start removing instances.

209
00:09:04,240 --> 00:09:06,870
‫So that really shows the power of target tracking policies.

210
00:09:06,870 --> 00:09:10,660
‫When you're done, please make sure to delete this policy

211
00:09:10,660 --> 00:09:12,550
‫and you'll be good to go for the cleanup.

212
00:09:12,550 --> 00:09:14,850
‫That's it, I will see you in the next lecture.