[SAA + SAP] 19. Database & Redshift

 

  • ETL: Extract data from S3 or RDS
  • Transform data in Glue
  • Load data into Redshift, or another data warehouse
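The three ETL steps above can be sketched in plain Python. This is only an illustration of the extract, transform, load pattern: the CSV text, the column names, and the in-memory list standing in for a warehouse table are all hypothetical, and a real pipeline would run the transform step as a Glue job.

```python
import csv
import io

# Hypothetical raw CSV standing in for an object extracted from S3 or RDS.
RAW_CSV = "order_id,amount\n1,10.50\n2,\n3,4.25\n"


def extract(raw_text):
    """Extract: parse the raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Transform: drop rows with an empty amount and cast fields to numbers."""
    return [
        {"order_id": int(r["order_id"]), "amount": float(r["amount"])}
        for r in rows
        if r["amount"]
    ]


def load(rows, warehouse):
    """Load: append the cleaned rows to the target table (a plain list here)."""
    warehouse.extend(rows)
    return len(rows)


warehouse_table = []  # stand-in for a Redshift table
loaded = load(transform(extract(RAW_CSV)), warehouse_table)
print(loaded, warehouse_table)
```

Keeping each step as its own function mirrors how the stages are separated in a managed pipeline, so one stage can be swapped without touching the others.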

  • Crawl the data

  • Search on any field
  • Supports partial matches

 

  • Data warehouse
  • High performance
  • Can write SQL

  • Data copies can go through Enhanced VPC Routing
  • Redshift is provisioned, so it's worth it when you have sustained usage (use Athena instead if the queries are sporadic)

 

 

  • Redshift has no "Multi-AZ"
  • So for DR, you need to use snapshots
  • Cross-Region snapshot copy is available
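As a rough sketch of the snapshot-based DR setup, the helper below builds the arguments for Redshift's EnableSnapshotCopy API, which copies snapshots to another region. The helper name, the cluster identifier, and the regions are hypothetical; the actual call (shown only as a comment) assumes boto3 is installed and credentials are configured.

```python
def snapshot_copy_request(cluster_id, destination_region, retention_days=7):
    """Build keyword arguments for the Redshift EnableSnapshotCopy API."""
    if retention_days < 1:
        raise ValueError("retention_days must be at least 1")
    return {
        "ClusterIdentifier": cluster_id,           # source cluster
        "DestinationRegion": destination_region,   # region receiving the copies
        "RetentionPeriod": retention_days,         # days to keep copied automated snapshots
    }


# Hypothetical cluster name and destination region.
request = snapshot_copy_request("analytics-cluster", "us-west-2")

# With boto3 available, the request could be sent like this:
#   import boto3
#   boto3.client("redshift", region_name="us-east-1").enable_snapshot_copy(**request)
print(request)
```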

 

  • Data sources can be
    • Kinesis Data Firehose
    • S3
    • EC2 (suitable for batch operation)
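For the S3 source, loading is usually done with Redshift's COPY command. The sketch below only assembles the SQL text; the table name, bucket path, and IAM role ARN are placeholders, and the inputs are assumed to be trusted since they are interpolated directly into the statement.

```python
def build_copy_statement(table, s3_uri, iam_role_arn, file_format="CSV"):
    """Return a Redshift COPY statement that loads files from S3 into a table."""
    return (
        f"COPY {table}\n"
        f"FROM '{s3_uri}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"FORMAT AS {file_format};"
    )


# Placeholder table, bucket, and role; replace with real values before running.
copy_sql = build_copy_statement(
    table="sales",
    s3_uri="s3://example-bucket/sales/",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleRedshiftRole",
)
print(copy_sql)
```

The resulting statement would be executed on the cluster through any SQL client connected to Redshift.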

  • Run SQL queries against data in S3
  • Requires a Redshift cluster
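The two bullets above appear to describe Redshift Spectrum, where the cluster queries S3 data through an external schema. A minimal sketch of the SQL involved follows, assuming the table is already defined in a Glue Data Catalog database; the schema, database, table, and role names are hypothetical.

```python
def spectrum_statements(schema, catalog_db, iam_role_arn, table):
    """Return (create_schema_sql, query_sql) for querying S3 data via an external schema."""
    create_schema = (
        f"CREATE EXTERNAL SCHEMA {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{catalog_db}'\n"
        f"IAM_ROLE '{iam_role_arn}';"
    )
    # External tables are referenced as schema.table, like local tables.
    query = f"SELECT COUNT(*) FROM {schema}.{table};"
    return create_schema, query


# Hypothetical names; the external table itself lives in the data catalog.
create_sql, query_sql = spectrum_statements(
    schema="spectrum",
    catalog_db="example_catalog_db",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleSpectrumRole",
    table="clickstream",
)
print(create_sql)
print(query_sql)
```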

 

posted @ 2021-08-04 16:09  Zhentiw