SlideShare a Scribd company logo
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Notices
All information, guidance and materials (collectively, "Information") provided to you in connection with the Program are for
informational purposes only. You are solely responsible for making your own independent assessment of the Information and your
use of AWS's products or services. Neither this document nor any other Information provided to you creates any warranties (express or
implied), representations, contractual commitments, conditions or assurances from AWS, its affiliates, suppliers or licensors. Neither
this document nor any other information provided to you are part of, nor do they modify, any agreements between you and AWS. All
information in this document will be shared with only the Customer and the AWS Team.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Herman Mak, Solutions Architect
10 April 2019
Best Practices
Using the Well Architected Framework
Kwunhok Chan, Solutions Architect
Clifford Duke, Solutions Architect
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Agenda
• Introduction
• Well Architected Pillar: Security
• Well Architected Pillar: Performance Efficiency
• Well Architected Pillar: Reliability
• Break
• Well Architected Pillar: Cost Optimization
• Well Architected Pillar: Operational Excellence
• Breakout Discussions
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Workshop Format
After presentation of all pillars, we will work in groups to
fill in worksheet then discuss any questions
For a One-on-One review with a Solutions Architect book
your time before you leave
…. and please rate the workshop!
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What to Expect from the Session
Learn about
• The AWS Well-Architected Framework (5 pillars)
• General design principles
• Best practices
• Value proposition
Understand the business impact of your design decisions
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Introduction
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Am I doing it right?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
“Are you Well-Architected?”
• Show you how to build strategies & best practices for
architecting your applications in the cloud
• Give you questions that allow you to measure your
architecture against best practices
• Help identify how to address any shortcomings
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Reference Architectures
https://aws.amazon.com/architecture https://aws.amazon.com/whitepapers
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Trusted Advisor
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Framework
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Purpose of the AWS Well-Architected Framework
• Increases awareness of architectural best practices
• Addresses foundational areas that are often neglected
• Consistent approach to evaluating architectures
• Composed of
• Pillars
• Design principles
• Questions – starting point to get you thinking
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Pillars of Well-Architected Framework
Security Reliability Performance
Efficiency
Cost
Optimization
Operational
Excellence
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Design Principles of Well-Architected Framework
Stop guessing
capacity needs
Test systems at
scale
Data-driven
architectures
Automate to enable
experimentation
Allow for
evolution
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Tool
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Tool
Pillar Area
Question Text
Question Context
Your Answers
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Tool
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Tool
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Tool
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Well-Architected Tool
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Worksheet
• Work through the questions
• Use the questions as a prompt
• CURRENT STATE – what is being done now?
• FUTURE STATE – what do you think they should be doing?
• Not an absolute right or wrong – use case specific
• It’s a guide
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Herman Mak, Solutions Architect
2019/04/10
Well Architected Framework
Security
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The security pillar focuses on protecting
information & systems. Key topics include
confidentiality and integrity of data, identifying
and managing who can do what with privilege
management, protecting systems, and establishing
controls to detect security events.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Resources:
Website:
https://aws.amazon.com/architecture/well-architected/
Whitepaper
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Implement a Strong Identity Foundation
Enable Traceability
Apply Security at All Layers
Automate security best practices
Design Principal:
Protect data in transit and at rest
Prepare for security events
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and access management
Detective controls
Infrastructure protection
Data protection
Definitions:
Incident response
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and Access Management
• Protecting AWS credentials
•How do you protect your Root Account?
•How do you assign your IAM Users, Groups?
•How about the API/CLI access?
•Did you create Appropriate Policies?
• Fine-grained authorization
•Least privilege practice
•IAM Role for resources
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and Access Management
• Protecting AWS credentials
• MFA for Root Account
• MFA for critical API Call
• IAM
• IAM instance profiles for EC2 instances
• AWS STS
• Fine-grained authorization
• IAM, IAM Role
• AWS Organizations
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and access management
Detective controls
Infrastructure protection
Data protection
Definitions:
Incident response
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Detective Controls
• Capture and Analyze Logs:
• Track all the activities and resource changes on your cloud env
• Keep the log, centralized, and prepare for analyze
• Analyze those logs from: compute, storage, applications
• Integrate Auditing Controls with Notification and Workflow:
• Discover potential events of interest
• Triggered from changes in infrastructure
• Better to have stronger build process before production
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Detective Controls
• Capture and Analyze Logs:
• Capture: CloudTrial, AWS Config
• Store: CloudWatch Logs, S3, Glacier
• Analyze: Elasticsearch Service, EMR, Athena
• Integrate Auditing Controls with Notification and Workflow:
• CloudWatch, CloudWatch Events
• AWS Config Rules
• CloudWatch API & AWS SDKs
• Inspector
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and access management
Detective controls
Infrastructure protection
Data protection
Definitions:
Incident response
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Infrastructure Protection
• Protecting network and host-level boundaries:
• How you provide isolation and boundaries for resource?
• How to design your network topology?
• How to protect your network access?
• What will you do, if you have hybrid cloud?
• System(OS) security configuration and maintenance
• How to manage your system security configuration?
• How to make sure your system is secure?
• Do you applied Least-privilege approach, in your system?
• Enforcing service-level protection
• Who can access service-endpoints? And how?
• Who can access the resource of this service? And how?
• Did you prevent all the possible data leak?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Infrastructure Protection
• Protecting network and host-level boundaries:
• VPC
• Security Group
• NetACL
• DirectConnect
• System security configuration and maintenance
• VPC Security Group
• Amazon Inspector
• Systems Manager
(Run Command, State Manager, Inventory, Parameter Store, Patch Manager)
• Enforcing service-level protection
• IAM
• AWS KMS allows you to set policies on the individual key
• Amazon S3 allows you to set bucket policies for each S3 bucket.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and access management
Detective controls
Infrastructure protection
Data protection
Definitions:
Incident response
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Data Protection
• Data Classification
• Categorize organizational data based on levels of sensitivity
• Manage appropriate data classification system with different requirement level
• Public data? Internal data? HIPPA PHI?
• Encryption/tokenization
• Protect your content against unauthorized user and unnecessary explosure.
• Protecting data at rest
• Protecting data in transit
• Data backup/replication/recovery
• Against the deletion or destruction of data
• Ensure continued business operations
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Data Protection
• Data Classification
• Resource Tag
• Detect abnormal access à Amazon Macie
• AWS KMS
• Encryption/tokenization
• AWS KMS
• AWS CloudHSM
• Amazon DynamoDB
• Protecting data at rest
• AWS KMS
• AWS S3, EBS, Glacier all support KMS
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Data Protection
• Protecting data in transit
• AWS Certificate Manager (ACM)
• ELB Classic Load Balancers/ Application Load Balancers
• Amazon CloudFront
• AWS Shield
• AWS WAF???
• Data backup/replication/recovery
• Amazon S3
• Amazon S3 Cross-Region Replication
• Amazon S3 Lifecycle polices and versioning
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Identity and access management
Detective controls
Infrastructure protection
Data protection
Definitions:
Incident response
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Incident Response
• Clean Room
• Maintain situational awareness of incident
• Describe your resource with Tag. (Whom to response?)
• How to get access for the right people during an incident?
• How to clean a same environment to investigate incident/reproduce defect?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Incident Response
• Clean Room
• IAM should be used to grant appropriate authorization to incident response teams
• AWS CloudFormation
• EC2 APIs
• AWS Step Functions
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Q: How are you protecting access to
and use of the AWS root account
credentials??
No MFA on Root
Root Actively Used
MFA and Minimal Use of Root
No Use of Root
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Q: How are you classifying your
data?
No Data Classification Schema
Data Not Classified
Using Data Classification Schema
All Data Treated as Sensitive
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Herman Mak, Solutions Architect
2019/04/10
Well Architected Framework
Performance Efficiency
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The performance efficiency pillar focuses on using
IT and computing resources efficiently. Key topics
include selecting the right resource types and sizes
based on workload requirements, monitoring
performance, and making informed decisions to
maintain efficiency as business needs evolve.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Website:
https://aws.amazon.com/architecture/well-architected/
Whitepaper
Resources:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Democratize advanced technologies
Go global in minutes
Use server-less architectures
Experiment more often
Design Principal:
Mechanical sympathy
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Selection
Definitions:
Monitoring
Review
Trade-offs
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Selection
• Compute
• Instances
• Containers
• Functions
• Elasticity
• Storage
• Database
• OLTP
• NoSQL
• OLAP
• Data Indexing and searching
• Network
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Selection
• Compute
• EC2 à GPU? FPGA? Burstable? HPC?
• ECS, EKS, Fargate
• Lambda + APIGateway
• Storage
• EBS, EFS, EC2 instance store, Glacier
• Database
• RDS
• DynamoDB, DAX
• Redshift, S3, Athena
• ES
• Network
• Route53, VPC, CloudFront, DirectConnect
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Selection
Definitions:
Monitoring
Review
Trade-offs
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Review
• Infrastructure as code
• Deployment pipeline
• Well-defined metrics
• Performance test automatically
• Load generation
• Performance visibility
• visualization
à CodeDeploy, CloudFormation, CloudWatch
Benchmarking
Load Testing
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Selection
Definitions:
Monitoring
Review
Trade-offs
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Monitoring
• Active
• You setup and collect in every environment
• Passive
• Collected from outside of your system
• Understand user experience performance
• Geographically performance variability
• The impact of API use
• Phases
Generation àAggregation àReal-time processing and alarming àStorage àAnalytics
à CloudWatch, S3, EMR
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Selection
Definitions:
Monitoring
Review
Trade-offs
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Trade-offs
• Caching
• Application Level
• Database Level
• Geographic Level
• Partitioning or Sharding
• Compression
• Buffering
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Q: How did you select your storage
solution?
Considered Characteristics
Options no Explored
Considered Configuration Options
Have not Considered Configuration Options to Improve Performance
Considered Access Patterns
Have not Considered Access Patterns to Improve Performance
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Herman Mak, Solutions Architect
2019/04/10
Well Architected Framework
Reliability
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The reliability pillar focuses on the ability to
prevent, and quickly recover from failures to meet
business and customer demand. Key topics include
foundational elements around setup, cross project
requirements, recovery planning, and how we
handle change.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Website:
https://aws.amazon.com/architecture/well-architected/
Whitepaper
Resources:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to calculate Availability? How many 9s?
Background
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to calculate Availability?
Background
With hard dependency?
With redundant components?
Cost?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Foundation - Networking
Definitions:
Understand Availability Needs
Application Design for Availability
Operational Consideration for Availability
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Foundation - Networking
• Allow IP address space for more than one VPC per Region.
• Consider cross-account connections. For example, each line of business
might have a unique account and VPCs. These accounts should be able to
connect back to shared services.
• Within a VPC, allow space for multiple subnets that span multiple AZ.
• Always leave unused CIDR block space within a VPC.
• How are you going to be resilient to failures in your topology?
• What happens if you misconfigure something and remove connectivity?
• Will you be able to handle an unexpected increase in traffic/use of your
services?
• Will you be able to absorb an attempted DoS attack?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Foundation - Networking
• Key Services for Network Topology
• Amazon VPC
• AWS Direct Connect
• Amazon EC2
• Amazon route53
• Elastic Load Balancing
• AWS Shield
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Foundation - Networking
Definitions:
Understand Availability Needs
Application Design for Availability
Operational Consideration for Availability
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Application Design for Availability
• Fault Isolation Zones
• Multiple independent component in parallel
• Multi-AZ
• Redundant components
• Micro-service architecture
• Recovery Oriented Computing
• Distributed systems best practices
• Throttling
• Retry with exponential fallback
• Fail fast
• Use of idempotency tokens à assume an action must occur exactly once
• Constant work
• Circuit breaker
• Bi-modal behavior and static stability
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Detective Controls
• Capture and Analyze Logs:
• Capture: CloudTrial, AWS Config
• Store: CloudWatch Logs, S3, Glacier
• Analyze: Elasticsearch Service, EMR, Athena
• Integrate Auditing Controls with Notification and Workflow:
• CloudWatch, CloudWatch Events
• AWS Config Rules
• CloudWatch API & AWS SDKs
• Inspector
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Foundation - Networking
Definitions:
Understand Availability Needs
Application Design for Availability
Operational Consideration for Availability
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operational Consideration for Availability
• Automate Deployments to Eliminate Impact
• Canary deployment
• Blue-Green deployment
• Feature toggles
• Failure isolation zone deployments
• Testing
• Monitoring and Alarming
Generation àAggregation àReal-time processing and alarming àStorage àAnalytics
• Operational Readiness Reviews
• Auditing
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operational Consideration for Availability
• Automate Deployments to Eliminate Impact
• AWS Code Deploy
• Testing
• Monitoring and Alarming
• Amazon Cloudwatch
• AWS X-Ray
• Amazon S3
• Amazon EMR
• Operational Readiness Reviews
• Auditing
• Amazon Cloudwatch Logs
• AWS Config
• AWS CloudTrail
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Q: How are you managing AWS
Service Limits for your Account(s)?
Unaware
Aware but not Tracking
Monitor and Manage Limits
Aware of Fixed Service Limits
Sufficient Buffer in Service Limits to Accommodate for Failover
Service Limits are Considered
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Clifford Duke, Solutions Architect
2019/04/10
Well Architected Framework
Cost Optimization
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Optimization focuses on avoiding un-needed
costs. Key topics include understanding and
controlling where money is being spent, selecting
the most appropriate and right number of
resource types, analyzing spend over time, and
scaling to meet business needs without
overspending.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Resources:
Website:
https://aws.amazon.com/architecture/well-architected/
Whitepaper
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Adopt Consumption Model
Measure Overall Efficiency
Stop spending on On-Premise-DC
Analyze and Attribute Expenditure
Design Principal:
Adopt Managed Service
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
Match Supply and Demand
Expenditure Awareness
Optimizing Over Time
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
Match Supply and Demand
Expenditure Awareness
Optimizing Over Time
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
• Appropriate Provisioning
• Over-estimated? Under-estimated?
• Right Sizing
• Instance Type? Instance Family?
Database Type? Storage Type?
• Purchase Options
• On Demand? Reserved Instance? Spot
Instance?
• All Upfront? No Upfront? Partial
Upfront?
• GEO Selection
• Centralized? Distributed?
• Managed Services
• Self managed cost? Human resource
cost?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
• Appropriate Provisioning
• Based on Cloudwatch and historical data
• Right Sizing
• Cloudwatch, Logs, Trusted Advisor
• Monitoring reflect end-users’ experience
• Purchase Options
• Cost Explorer
• Hybrid combination: On Demand + Reserved Instance + Spot Instance
• GEO Selection
• S3, CloudFront
• Managed Services
• RDS, Dynamodb…
• SES, SQS, SNS…
• Lambda, APIGW…
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
Match Supply and Demand
Expenditure Awareness
Optimizing Over Time
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Match supply and demand
• Demand-Based:
• How quickly you need to provision
• Understand the size of margin between supply and demand
• Buffer-Based:
• Components run at different rates over time
• Time-Based:
• Align the resource capacity to predictable timeframe in real-world
• Office hours, Weekdays, Campaign, Super Hot Selling Holiday…
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Match supply and demand
• Demand-Based:
• Auto-Scaling, Pre-built AMI
• Load-Balacing
• CloudWatch, CloudWatch Alert/Event/Trigger
• Buffer-Based:
• SQS
• Kinesis
• Spot Instances
• Lambda
• Time-Based:
• Auto-Scaling
• CloudFormation
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
Match Supply and Demand
Expenditure Awareness
Optimizing Over Time
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Expenditure Awareness
• Stakeholders:
• CFO/Financial Controllers
• BU Owners
• Tech Lead
• 3rd Parties
• Visibility and Controls:
• I want to estimate & forecast billing
• I want to receive Alert if exceed threshold
• I want to analyze spending/usage
• I want to know RI Utilization/Coverage
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Expenditure Awareness
• Cost Attribution
• Account Structure (Multi-Account) varies by business reasons
• Set specific limit to particular workloads (Sub Account? IAM user?)
• Specific reservation for certain workloads (Central DB?)
• Tagging
• Entity Lifecycle Tracking
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Expenditure Awareness
• Stakeholders/Visibility and Controls:
• Cost Explorer
• Billing Alarm, CloudWatch Alarm, SNS
• Cost Attribution
• Consolidated Billing *Quota Discount
• IAM
• Tagging
• Resource Tag
• Entity Lifecycle Tracking
• CloudTrail
• Config
• IAM
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost Effective
Match Supply and Demand
Expenditure Awareness
Optimizing Over Time
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Optimizing Over Time
• Measure, Monitor, and Improve:
• Establish a Cost Optimization Function
• Establish Goals and Metrics
• Gather Insight and Perform Analysis
• Report and Validate
• Stay Ever Green
• Stay up-to-date with AWS
• Check with managed services
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Optimizing Over Time
• Measure, Monitor, and Improve:
• TAM (Technical Account Manager)
• Utilization, RI Coverage
• Trusted Advisor
• Stay Ever Green
• Trusted Advisor
• AWS Blog
• What’s New At AWS
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Q: How do you make sure your
capacity matches but does not
substantially exceed what you need?
Provision for Peak
Demand-based Approach
Buffered-Based Approach
Time-Based Approach
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Clifford Duke, Solutions Architect
2019/04/10
Well Architected Framework
Operational Excellence
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The operational excellence pillar focuses on
running and monitoring systems to deliver
business value, and continually improving
processes and procedures. Key topics include
managing and automating changes, responding to
events, and defining standards to successfully
manage daily operations.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Website:
https://aws.amazon.com/architecture/well-architected/
Whitepaper
Resources:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Perform operations as code
Annotated documentation
Make frequent, small, reversible changes
Refine operations procedures frequently
Design Principal:
Anticipate failure
Learn from all operational failure
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Prepare
Operate
Evolve
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Prepare
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Prepare
• Operational priorities
• Understand your workload
• Role
• Business goal
• External regulatory and compliance requirement
• Design for operations
• How it will be deployed, updated, and operated?
• Defect reduction and fixes(quick and safe)
• Log, monitor, metrics
• Operational readiness
• Checklists
• Runbooks
• process
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Prepare
• Operational priorities
• AWS Support : Business Support/Enterprise Support
• Trusted Advisor
• Cloud Compliance : (https://aws.amazon.com/compliance/programs/)
• Well Architected Framework
• Design for operations
• CloudWatch
• CloudFormation
• Developer Tools
• AWS X-Ray
• Operational readiness
• Lambda
• AWS Config
• AWS Systems Manager
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Prepare
Operate
Evolve
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operate
• Understanding operational Health
• Monitoring
• What metrics?
• Which level?
• Only number or useful insights?
• Responding to Events
• Planned events, unplanned events….
• How to response?
• RCA
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operate
• Understanding operational Health
• CloudWatch, CloudWatch Logs
• ES
• Personal Health Dashboard
• Service Health Dashboard
• Responding to Events
• AWS Lambda
• CloudWatch, CloudWatch Events
• SNS
• Auto Scaling
• Systems Manager
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Prepare
Operate
Evolve
Definitions:
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Evolve
• Learning from Experience
• Analyze your operations over time
• Review, and validate insights
• Sharing learning
• Use code methodologies.
• Sharing resource with other accounts
• Integrate with 3rd party tools
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Evolve
• Learning from Experience
• ES
• QuickSight
• Athena
• S3
• CloudWatch
• Sharing learning
• IAM
• SNS
• CodeCommit
• Lambda
• CloudFormation
• AMIs
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Q: What best practices for cloud
operations are you using?
Operational Checklist to Evaluate Readiness
Not Evaluating Readiness
Proactive Plan for Events
No Active Plan for Events
Security Checklist to Evaluate Security Readiness
Not Evaluating Security Readiness
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
https://bit.ly/2Ia7W0k
Worksheet
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank You!

More Related Content

HK-AWS-Well-Architected-Workshop

  • 1. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Notices All information, guidance and materials (collectively, "Information") provided to you in connection with the Program are for informational purposes only. You are solely responsible for making your own independent assessment of the Information and your use of AWS's products or services. Neither this document nor any other Information provided to you creates any warranties (express or implied), representations, contractual commitments, conditions or assurances from AWS, its affiliates, suppliers or licensors. Neither this document nor any other information provided to you are part of, nor do they modify, any agreements between you and AWS. All information in this document will be shared with only the Customer and the AWS Team.
  • 2. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Herman Mak, Solutions Architect 10 April 2019 Best Practices Using the Well Architected Framework Kwunhok Chan, Solutions Architect Clifford Duke, Solutions Architect
  • 3. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda • Introduction • Well Architected Pillar: Security • Well Architected Pillar: Performance Efficiency • Well Architected Pillar: Reliability • Break • Well Architected Pillar: Cost Optimization • Well Architected Pillar: Operational Excellence • Breakout Discussions
  • 4. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Workshop Format After presentation of all pillars, we will work in groups to fill in worksheet then discuss any questions For a One-on-One review with a Solutions Architect book your time before you leave …. and please rate the workshop!
  • 5. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What to Expect from the Session Learn about • The AWS Well-Architected Framework (5 pillars) • General design principles • Best practices • Value proposition Understand the business impact of your design decisions
  • 6. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Introduction
  • 7. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Am I doing it right?
  • 8. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. “Are you Well-Architected?” • Show you how to build strategies & best practices for architecting your applications in the cloud • Give you questions that allow you to measure your architecture against best practices • Help identify how to address any shortcomings
  • 9. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Reference Architectures https://aws.amazon.com/architecture https://aws.amazon.com/whitepapers
  • 10. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Trusted Advisor
  • 11. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Framework
  • 12. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Purpose of the AWS Well-Architected Framework • Increases awareness of architectural best practices • Addresses foundational areas that are often neglected • Consistent approach to evaluating architectures • Composed of • Pillars • Design principles • Questions – starting point to get you thinking
  • 13. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Pillars of Well-Architected Framework Security Reliability Performance Efficiency Cost Optimization Operational Excellence
  • 14. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Design Principles of Well-Architected Framework Stop guessing capacity needs Test systems at scale Data-driven architectures Automate to enable experimentation Allow for evolution
  • 15. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Tool
  • 16. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Tool Pillar Area Question Text Question Context Your Answers
  • 17. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Tool
  • 18. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Tool
  • 19. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Tool
  • 20. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Well-Architected Tool
  • 21. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Worksheet • Work through the questions • Use the questions as a prompt • CURRENT STATE – what is being done now? • FUTURE STATE – what do you think they should be doing? • Not an absolute right or wrong – use case specific • It’s a guide
  • 22. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Herman Mak, Solutions Architect 2019/04/10 Well Architected Framework Security
  • 23. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. The security pillar focuses on protecting information & systems. Key topics include confidentiality and integrity of data, identifying and managing who can do what with privilege management, protecting systems, and establishing controls to detect security events.
  • 24. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Resources: Website: https://aws.amazon.com/architecture/well-architected/ Whitepaper
  • 25. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Implement a Strong Identity Foundation Enable Traceability Apply Security at All Layers Automate security best practices Design Principal: Protect data in transit and at rest Prepare for security events
  • 26. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and access management Detective controls Infrastructure protection Data protection Definitions: Incident response
  • 27. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and Access Management • Protecting AWS credentials •How do you protect your Root Account? •How do you assign your IAM Users, Groups? •How about the API/CLI access? •Did you create Appropriate Policies? • Fine-grained authorization •Least privilege practice •IAM Role for resources
  • 28. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and Access Management • Protecting AWS credentials • MFA for Root Account • MFA for critical API Call • IAM • IAM instance profiles for EC2 instances • AWS STS • Fine-grained authorization • IAM, IAM Role • AWS Organizations
  • 29. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and access management Detective controls Infrastructure protection Data protection Definitions: Incident response
  • 30. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Detective Controls • Capture and Analyze Logs: • Track all the activities and resource changes on your cloud env • Keep the log, centralized, and prepare for analyze • Analyze those logs from: compute, storage, applications • Integrate Auditing Controls with Notification and Workflow: • Discover potential events of interest • Triggered from changes in infrastructure • Better to have stronger build process before production
  • 31. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Detective Controls • Capture and Analyze Logs: • Capture: CloudTrial, AWS Config • Store: CloudWatch Logs, S3, Glacier • Analyze: Elasticsearch Service, EMR, Athena • Integrate Auditing Controls with Notification and Workflow: • CloudWatch, CloudWatch Events • AWS Config Rules • CloudWatch API & AWS SDKs • Inspector
  • 32. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and access management Detective controls Infrastructure protection Data protection Definitions: Incident response
  • 33. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Infrastructure Protection • Protecting network and host-level boundaries: • How you provide isolation and boundaries for resource? • How to design your network topology? • How to protect your network access? • What will you do, if you have hybrid cloud? • System(OS) security configuration and maintenance • How to manage your system security configuration? • How to make sure your system is secure? • Do you applied Least-privilege approach, in your system? • Enforcing service-level protection • Who can access service-endpoints? And how? • Who can access the resource of this service? And how? • Did you prevent all the possible data leak?
  • 34. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Infrastructure Protection • Protecting network and host-level boundaries: • VPC • Security Group • NetACL • DirectConnect • System security configuration and maintenance • VPC Security Group • Amazon Inspector • Systems Manager (Run Command, State Manager, Inventory, Parameter Store, Patch Manager) • Enforcing service-level protection • IAM • AWS KMS allows you to set policies on the individual key • Amazon S3 allows you to set bucket policies for each S3 bucket.
  • 35. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and access management Detective controls Infrastructure protection Data protection Definitions: Incident response
  • 36. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Data Protection • Data Classification • Categorize organizational data based on levels of sensitivity • Manage appropriate data classification system with different requirement level • Public data? Internal data? HIPPA PHI? • Encryption/tokenization • Protect your content against unauthorized user and unnecessary explosure. • Protecting data at rest • Protecting data in transit • Data backup/replication/recovery • Against the deletion or destruction of data • Ensure continued business operations
  • 37. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Data Protection • Data Classification • Resource Tag • Detect abnormal access à Amazon Macie • AWS KMS • Encryption/tokenization • AWS KMS • AWS CloudHSM • Amazon DynamoDB • Protecting data at rest • AWS KMS • AWS S3, EBS, Glacier all support KMS
  • 38. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Data Protection • Protecting data in transit • AWS Certificate Manager (ACM) • ELB Classic Load Balancers/ Application Load Balancers • Amazon CloudFront • AWS Shield • AWS WAF??? • Data backup/replication/recovery • Amazon S3 • Amazon S3 Cross-Region Replication • Amazon S3 Lifecycle polices and versioning
  • 39. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Identity and access management Detective controls Infrastructure protection Data protection Definitions: Incident response
  • 40. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Incident Response • Clean Room • Maintain situational awareness of incident • Describe your resource with Tag. (Whom to response?) • How to get access for the right people during an incident? • How to clean a same environment to investigate incident/reproduce defect?
  • 41. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Incident Response • Clean Room • IAM should be used to grant appropriate authorization to incident response teams • AWS CloudFormation • EC2 APIs • AWS Step Functions
  • 42. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Q: How are you protecting access to and use of the AWS root account credentials?? No MFA on Root Root Actively Used MFA and Minimal Use of Root No Use of Root
  • 43. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Q: How are you classifying your data? No Data Classification Schema Data Not Classified Using Data Classification Schema All Data Treated as Sensitive
  • 44. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Herman Mak, Solutions Architect 2019/04/10 Well Architected Framework Performance Efficiency
  • 45. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. The performance efficiency pillar focuses on using IT and computing resources efficiently. Key topics include selecting the right resource types and sizes based on workload requirements, monitoring performance, and making informed decisions to maintain efficiency as business needs evolve.
  • 46. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Website: https://aws.amazon.com/architecture/well-architected/ Whitepaper Resources:
  • 47. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Democratize advanced technologies Go global in minutes Use server-less architectures Experiment more often Design Principal: Mechanical sympathy
  • 48. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Selection Definitions: Monitoring Review Trade-offs
  • 49. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Selection • Compute • Instances • Containers • Functions • Elasticity • Storage • Database • OLTP • NoSQL • OLAP • Data Indexing and searching • Network
  • 50. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Selection • Compute • EC2 à GPU? FPGA? Burstable? HPC? • ECS, EKS, Fargate • Lambda + APIGateway • Storage • EBS, EFS, EC2 instance store, Glacier • Database • RDS • DynamoDB, DAX • Redshift, S3, Athena • ES • Network • Route53, VPC, CloudFront, DirectConnect
  • 51. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Selection Definitions: Monitoring Review Trade-offs
  • 52. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Review • Infrastructure as code • Deployment pipeline • Well-defined metrics • Performance test automatically • Load generation • Performance visibility • visualization à CodeDeploy, CloudFormation, CloudWatch Benchmarking Load Testing
  • 53. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Selection Definitions: Monitoring Review Trade-offs
  • 54. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Monitoring • Active • You setup and collect in every environment • Passive • Collected from outside of your system • Understand user experience performance • Geographically performance variability • The impact of API use • Phases Generation àAggregation àReal-time processing and alarming àStorage àAnalytics à CloudWatch, S3, EMR
  • 55. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Selection Definitions: Monitoring Review Trade-offs
  • 56. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Trade-offs • Caching • Application Level • Database Level • Geographic Level • Partitioning or Sharding • Compression • Buffering
  • 57. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Q: How did you select your storage solution? Considered Characteristics Options no Explored Considered Configuration Options Have not Considered Configuration Options to Improve Performance Considered Access Patterns Have not Considered Access Patterns to Improve Performance
  • 58. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Herman Mak, Solutions Architect 2019/04/10 Well Architected Framework Reliability
  • 59. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. The reliability pillar focuses on the ability to prevent, and quickly recover from failures to meet business and customer demand. Key topics include foundational elements around setup, cross project requirements, recovery planning, and how we handle change.
  • 60. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Website: https://aws.amazon.com/architecture/well-architected/ Whitepaper Resources:
  • 61. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to calculate Availability? How many 9s? Background
  • 62. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to calculate Availability? Background With hard dependency? With redundant components? Cost?
  • 63. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Foundation - Networking Definitions: Understand Availability Needs Application Design for Availability Operational Consideration for Availability
  • 64. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Foundation - Networking • Allow IP address space for more than one VPC per Region. • Consider cross-account connections. For example, each line of business might have a unique account and VPCs. These accounts should be able to connect back to shared services. • Within a VPC, allow space for multiple subnets that span multiple AZ. • Always leave unused CIDR block space within a VPC. • How are you going to be resilient to failures in your topology? • What happens if you misconfigure something and remove connectivity? • Will you be able to handle an unexpected increase in traffic/use of your services? • Will you be able to absorb an attempted DoS attack?
  • 65. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Foundation - Networking • Key Services for Network Topology • Amazon VPC • AWS Direct Connect • Amazon EC2 • Amazon route53 • Elastic Load Balancing • AWS Shield
  • 66. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Foundation - Networking Definitions: Understand Availability Needs Application Design for Availability Operational Consideration for Availability
  • 67. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Application Design for Availability • Fault Isolation Zones • Multiple independent component in parallel • Multi-AZ • Redundant components • Micro-service architecture • Recovery Oriented Computing • Distributed systems best practices • Throttling • Retry with exponential fallback • Fail fast • Use of idempotency tokens à assume an action must occur exactly once • Constant work • Circuit breaker • Bi-modal behavior and static stability
  • 68. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Detective Controls • Capture and Analyze Logs: • Capture: CloudTrial, AWS Config • Store: CloudWatch Logs, S3, Glacier • Analyze: Elasticsearch Service, EMR, Athena • Integrate Auditing Controls with Notification and Workflow: • CloudWatch, CloudWatch Events • AWS Config Rules • CloudWatch API & AWS SDKs • Inspector
  • 69. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Foundation - Networking Definitions: Understand Availability Needs Application Design for Availability Operational Consideration for Availability
  • 70. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operational Consideration for Availability • Automate Deployments to Eliminate Impact • Canary deployment • Blue-Green deployment • Feature toggles • Failure isolation zone deployments • Testing • Monitoring and Alarming Generation àAggregation àReal-time processing and alarming àStorage àAnalytics • Operational Readiness Reviews • Auditing
  • 71. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operational Consideration for Availability • Automate Deployments to Eliminate Impact • AWS Code Deploy • Testing • Monitoring and Alarming • Amazon Cloudwatch • AWS X-Ray • Amazon S3 • Amazon EMR • Operational Readiness Reviews • Auditing • Amazon Cloudwatch Logs • AWS Config • AWS CloudTrail
  • 72. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Q: How are you managing AWS Service Limits for your Account(s)? Unaware Aware but not Tracking Monitor and Manage Limits Aware of Fixed Service Limits Sufficient Buffer in Service Limits to Accommodate for Failover Service Limits are Considered
  • 73. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Clifford Duke, Solutions Architect 2019/04/10 Well Architected Framework Cost Optimization
  • 74. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Optimization focuses on avoiding un-needed costs. Key topics include understanding and controlling where money is being spent, selecting the most appropriate and right number of resource types, analyzing spend over time, and scaling to meet business needs without overspending.
  • 75. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Resources: Website: https://aws.amazon.com/architecture/well-architected/ Whitepaper
  • 76. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Adopt Consumption Model Measure Overall Efficiency Stop spending on On-Premise-DC Analyze and Attribute Expenditure Design Principal: Adopt Managed Service
  • 77. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective Match Supply and Demand Expenditure Awareness Optimizing Over Time Definitions:
  • 78. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective Match Supply and Demand Expenditure Awareness Optimizing Over Time Definitions:
  • 79. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective • Appropriate Provisioning • Over-estimated? Under-estimated? • Right Sizing • Instance Type? Instance Family? Database Type? Storage Type? • Purchase Options • On Demand? Reserved Instance? Spot Instance? • All Upfront? No Upfront? Partial Upfront? • GEO Selection • Centralized? Distributed? • Managed Services • Self managed cost? Human resource cost?
  • 80. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective • Appropriate Provisioning • Based on Cloudwatch and historical data • Right Sizing • Cloudwatch, Logs, Trusted Advisor • Monitoring reflect end-users’ experience • Purchase Options • Cost Explorer • Hybrid combination: On Demand + Reserved Instance + Spot Instance • GEO Selection • S3, CloudFront • Managed Services • RDS, Dynamodb… • SES, SQS, SNS… • Lambda, APIGW…
  • 81. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective Match Supply and Demand Expenditure Awareness Optimizing Over Time Definitions:
  • 82. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Match supply and demand • Demand-Based: • How quickly you need to provision • Understand the size of margin between supply and demand • Buffer-Based: • Components run at different rates over time • Time-Based: • Align the resource capacity to predictable timeframe in real-world • Office hours, Weekdays, Campaign, Super Hot Selling Holiday…
  • 83. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Match supply and demand • Demand-Based: • Auto-Scaling, Pre-built AMI • Load-Balacing • CloudWatch, CloudWatch Alert/Event/Trigger • Buffer-Based: • SQS • Kinesis • Spot Instances • Lambda • Time-Based: • Auto-Scaling • CloudFormation
  • 84. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective Match Supply and Demand Expenditure Awareness Optimizing Over Time Definitions:
  • 85. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Expenditure Awareness • Stakeholders: • CFO/Financial Controllers • BU Owners • Tech Lead • 3rd Parties • Visibility and Controls: • I want to estimate & forecast billing • I want to receive Alert if exceed threshold • I want to analyze spending/usage • I want to know RI Utilization/Coverage
  • 86. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Expenditure Awareness • Cost Attribution • Account Structure (Multi-Account) varies by business reasons • Set specific limit to particular workloads (Sub Account? IAM user?) • Specific reservation for certain workloads (Central DB?) • Tagging • Entity Lifecycle Tracking
  • 87. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Expenditure Awareness • Stakeholders/Visibility and Controls: • Cost Explorer • Billing Alarm, CloudWatch Alarm, SNS • Cost Attribution • Consolidated Billing *Quota Discount • IAM • Tagging • Resource Tag • Entity Lifecycle Tracking • CloudTrail • Config • IAM
  • 88. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost Effective Match Supply and Demand Expenditure Awareness Optimizing Over Time Definitions:
  • 89. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Optimizing Over Time • Measure, Monitor, and Improve: • Establish a Cost Optimization Function • Establish Goals and Metrics • Gather Insight and Perform Analysis • Report and Validate • Stay Ever Green • Stay up-to-date with AWS • Check with managed services
  • 90. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Optimizing Over Time • Measure, Monitor, and Improve: • TAM (Technical Account Manager) • Utilization, RI Coverage • Trusted Advisor • Stay Ever Green • Trusted Advisor • AWS Blog • What’s New At AWS
  • 91. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Q: How do you make sure your capacity matches but does not substantially exceed what you need? Provision for Peak Demand-based Approach Buffered-Based Approach Time-Based Approach
  • 92. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Clifford Duke, Solutions Architect 2019/04/10 Well Architected Framework Operational Excellence
  • 93. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. The operational excellence pillar focuses on running and monitoring systems to deliver business value, and continually improving processes and procedures. Key topics include managing and automating changes, responding to events, and defining standards to successfully manage daily operations.
  • 94. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Website: https://aws.amazon.com/architecture/well-architected/ Whitepaper Resources:
  • 95. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Perform operations as code Annotated documentation Make frequent, small, reversible changes Refine operations procedures frequently Design Principal: Anticipate failure Learn from all operational failure
  • 96. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prepare Operate Evolve Definitions:
  • 97. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prepare
  • 98. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prepare • Operational priorities • Understand your workload • Role • Business goal • External regulatory and compliance requirement • Design for operations • How it will be deployed, updated, and operated? • Defect reduction and fixes(quick and safe) • Log, monitor, metrics • Operational readiness • Checklists • Runbooks • process
  • 99. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prepare • Operational priorities • AWS Support : Business Support/Enterprise Support • Trusted Advisor • Cloud Compliance : (https://aws.amazon.com/compliance/programs/) • Well Architected Framework • Design for operations • CloudWatch • CloudFormation • Developer Tools • AWS X-Ray • Operational readiness • Lambda • AWS Config • AWS Systems Manager
  • 100. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prepare Operate Evolve Definitions:
  • 101. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operate • Understanding operational Health • Monitoring • What metrics? • Which level? • Only number or useful insights? • Responding to Events • Planned events, unplanned events…. • How to response? • RCA
  • 102. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operate • Understanding operational Health • CloudWatch, CloudWatch Logs • ES • Personal Health Dashboard • Service Health Dashboard • Responding to Events • AWS Lambda • CloudWatch, CloudWatch Events • SNS • Auto Scaling • Systems Manager
  • 103. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prepare Operate Evolve Definitions:
  • 104. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Evolve • Learning from Experience • Analyze your operations over time • Review, and validate insights • Sharing learning • Use code methodologies. • Sharing resource with other accounts • Integrate with 3rd party tools
  • 105. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Evolve • Learning from Experience • ES • QuickSight • Athena • S3 • CloudWatch • Sharing learning • IAM • SNS • CodeCommit • Lambda • CloudFormation • AMIs
  • 106. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Q: What best practices for cloud operations are you using? Operational Checklist to Evaluate Readiness Not Evaluating Readiness Proactive Plan for Events No Active Plan for Events Security Checklist to Evaluate Security Readiness Not Evaluating Security Readiness
  • 107. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. https://bit.ly/2Ia7W0k Worksheet
  • 108. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank You!