1 00:00:00,190 --> 00:00:02,330 ‫So now let's talk about Amazon Athena. 2 00:00:02,330 --> 00:00:04,632 ‫Amazon Athena is a serverless query service 3 00:00:04,632 --> 00:00:07,490 ‫to perform analytics against your objects 4 00:00:07,490 --> 00:00:09,180 ‫stored in Amazon S3. 5 00:00:09,180 --> 00:00:11,590 ‫So the idea is that you would use the SQL query language to 6 00:00:11,590 --> 00:00:13,850 ‫create these files, but you don't need to load them. 7 00:00:13,850 --> 00:00:17,530 ‫They just need to be in S3 and Athena will do the rest. 8 00:00:17,530 --> 00:00:19,280 ‫So these files can be formatted in different ways, 9 00:00:19,280 --> 00:00:22,980 ‫such as CSV, JSON, ORC, Avro, and Parquet 10 00:00:22,980 --> 00:00:25,890 ‫and the Athena is built on the Presto engine, 11 00:00:25,890 --> 00:00:27,110 ‫if you must know. 12 00:00:27,110 --> 00:00:27,943 ‫Now, how does it work? 13 00:00:27,943 --> 00:00:29,811 ‫So users will load it into Amazon S3, 14 00:00:29,811 --> 00:00:31,880 ‫and then Amazon Athena will be used 15 00:00:31,880 --> 00:00:33,690 ‫to query and analyze the data. 16 00:00:33,690 --> 00:00:35,160 ‫Very, very simple. 17 00:00:35,160 --> 00:00:36,690 ‫And then if you wanted to, 18 00:00:36,690 --> 00:00:38,890 ‫you could have some reporting on top of Athena, 19 00:00:38,890 --> 00:00:41,470 ‫such as using Amazon QuickSight. 20 00:00:41,470 --> 00:00:43,457 ‫Now the pricing for Athena is around 21 00:00:43,457 --> 00:00:45,552 ‫$5 per terabyte of data scanned. 22 00:00:45,552 --> 00:00:47,730 ‫And if you use compressed 23 00:00:47,730 --> 00:00:50,289 ‫or data stored in a columnar fashion, 24 00:00:50,289 --> 00:00:52,010 ‫then you're going to have cost savings 25 00:00:52,010 --> 00:00:54,910 ‫because there is less scan of the data being made. 26 00:00:54,910 --> 00:00:57,270 ‫So the use cases of Athena are multiple, 27 00:00:57,270 --> 00:00:59,150 ‫but anytime you see Business intelligence, 28 00:00:59,150 --> 00:01:00,480 ‫analytics, or reporting, 29 00:01:00,480 --> 00:01:03,710 ‫or to analyze Flow Logs in VPC or ELB Logs, 30 00:01:03,710 --> 00:01:05,280 ‫or CloudTrail logs, or platform logs, 31 00:01:05,280 --> 00:01:07,520 ‫all these kinds of logs in AWS, 32 00:01:07,520 --> 00:01:10,070 ‫then Athena's going to be a really, really good option. 33 00:01:10,070 --> 00:01:11,600 ‫So from an exam perspective, 34 00:01:11,600 --> 00:01:14,280 ‫whenever you see serverless analyze data in S3 35 00:01:14,280 --> 00:01:17,710 ‫use SQL, then think Amazon Athena. 36 00:01:17,710 --> 00:01:18,543 ‫That's it. 37 00:01:18,543 --> 00:01:19,376 ‫I hope you liked it. 38 00:01:19,376 --> 00:01:20,960 ‫And I will see you in the next lecture.