Elastic Load Balancing & Auto Scaling

ELB and ASG work as a pair so a system can absorb heavy traffic, stay up, and scale automatically: ELB distributes traffic, and ASG adjusts the number of instances to match load.

On this page: 7 sections

01 HA, Scalability, Elasticity & Agility
02 Elastic Load Balancing (ELB) Overview
03 Application Load Balancer (ALB)
04 Auto Scaling Groups (ASG) Overview
05 Launch Template: Blueprint of the ASG
06 ASG Scaling Strategies
07 ELB & ASG Summary

HA, Scalability, Elasticity & Agility

These are core concepts for designing systems in the cloud. Keep their meanings distinct, because the CLF-C02 exam likes to swap them around as distractors:

Vertical Scaling (Scale Up / Down)

Increase or decrease the specs of the same machine, for example t2.micro → t2.large (scale up) or t2.large → t2.micro (scale down). It is capped by a hardware limit. It suits databases that are hard to distribute, such as RDS.

Horizontal Scaling (Scale Out / In), the basis of Elasticity

Add or remove machines: scale out adds machines, scale in removes them. It is not capped by a single machine's hardware limit. It suits distributed systems such as web servers behind an ASG + ELB.

High Availability (HA)

The system keeps working even when one part of it fails (for example, in a disaster). Spread instances across at least 2 AZs so there is no single point of failure: if AZ-A goes down, AZ-B keeps serving.
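As a rough illustration of why spreading across AZs removes the single point of failure, the sketch below places instances round-robin across AZs and then simulates one AZ going down. The AZ names, instance IDs, and both helper functions are hypothetical and are not an AWS API.

```python
from collections import Counter

def spread_across_azs(instance_count, azs):
    """Assign instances to AZs round-robin (hypothetical helper)."""
    return {f"i-{n}": azs[n % len(azs)] for n in range(instance_count)}

def survivors(placement, failed_az):
    """Return the instances still running after one AZ fails."""
    return [iid for iid, az in placement.items() if az != failed_az]

placement = spread_across_azs(4, ["az-a", "az-b"])
print(Counter(placement.values()))       # two instances in each AZ
print(survivors(placement, "az-a"))      # the az-b instances keep serving

single_az = spread_across_azs(4, ["az-a"])
print(survivors(single_az, "az-a"))      # empty list: single point of failure
```

With two AZs, half the fleet survives the failure; with one AZ, nothing does.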

Elasticity

The system scales automatically with actual load, adding capacity when traffic is high and removing it when traffic is low, so you pay only for what you use (pay-as-you-go). Elasticity = horizontal scaling + automation.
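The pay-as-you-go point can be made concrete with a little arithmetic. The sketch below compares instance-hours for an elastic fleet against a fleet sized for peak load all day. The hourly load figures and the per-instance capacity are made-up numbers chosen only for illustration.

```python
import math

def instances_needed(requests_per_sec, capacity_per_instance):
    """Smallest instance count that can serve the load (at least 1)."""
    return max(1, math.ceil(requests_per_sec / capacity_per_instance))

# Hypothetical hourly load (requests/sec) and per-instance capacity.
hourly_load = [100, 100, 400, 900, 400, 100]
capacity = 200

elastic = [instances_needed(load, capacity) for load in hourly_load]
fixed = [max(elastic)] * len(hourly_load)   # provisioned for peak the whole time

print(elastic)                    # [1, 1, 2, 5, 2, 1]
print(sum(elastic), sum(fixed))   # 12 30
```

Under these assumptions the elastic fleet uses 12 instance-hours where the peak-sized fleet uses 30.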

Agility (careful: a common exam distractor)

The speed of creating new resources in the cloud: a few clicks give you a new environment within minutes, with no hardware to order. Agility has nothing to do with scaling, but the exam often lists it as a distractor.

Elastic Load Balancing (ELB) Overview

A load balancer is a server that receives traffic from users and distributes it across multiple EC2 instances behind it, so that no single machine carries too much load. ELB is a managed service: AWS handles upgrades and maintenance for you.

Distributes load across instances and AZs
Exposes a single DNS endpoint for users to connect to
Health checks: stops sending traffic to targets that fail
Supports SSL/TLS termination
High availability across AZs
AWS handles upgrades and maintenance and guarantees availability
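The first and third points above (spreading load, skipping failed targets) can be sketched with a toy round-robin balancer. This is a simplified model, not how ELB is implemented; the class name, method names, and instance IDs are all hypothetical.

```python
from itertools import cycle

class RoundRobinBalancer:
    """Toy load balancer: rotates over targets and skips unhealthy ones."""

    def __init__(self, targets):
        self.targets = list(targets)
        self.healthy = set(self.targets)
        self._ring = cycle(self.targets)

    def mark(self, target, is_healthy):
        """Record the result of a health check for one target."""
        if is_healthy:
            self.healthy.add(target)
        else:
            self.healthy.discard(target)

    def route(self):
        """Return the next healthy target, or raise if none is healthy."""
        for _ in range(len(self.targets)):
            target = next(self._ring)
            if target in self.healthy:
                return target
        raise RuntimeError("no healthy targets")

lb = RoundRobinBalancer(["i-a", "i-b", "i-c"])
print([lb.route() for _ in range(3)])   # ['i-a', 'i-b', 'i-c']
lb.mark("i-b", False)                   # a health check fails
print([lb.route() for _ in range(4)])   # ['i-a', 'i-c', 'i-a', 'i-c']
```

Once `i-b` is marked unhealthy it receives no more requests until a later check marks it healthy again.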

ALB — Application Load Balancer (Layer 7)

HTTP / HTTPS / WebSocket. Routes by URL path, header, or query string. Suits microservices and web apps.

NLB — Network Load Balancer (Layer 4)

TCP / UDP / TLS. Ultra-low latency, handles millions of requests per second, and provides one static IP per AZ.

GWLB — Gateway Load Balancer (Layer 3)

Sends traffic through virtual appliances (firewall, IDS/IPS, deep packet inspection) before it reaches the application.

CLB — Classic Load Balancer (legacy)

The previous-generation load balancer, covering Layer 4 + 7. AWS recommends ALB or NLB for new workloads.

Application Load Balancer (ALB)

ALB works at Layer 7 (HTTP/HTTPS), so it can route traffic on the content of a request in fine detail, not just on IP/port.

Path-based routing: /users → Target Group A, /payments → Target Group B
Host-based routing: app.example.com → TG A, api.example.com → TG B
Query string routing: ?platform=mobile → Target Group Mobile

ALB forwards traffic to a Target Group, which can consist of EC2 instances (optionally managed by an ASG), ECS tasks (container-based), or Lambda functions (serverless).
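The routing examples above can be sketched as an ordered rule list where the first matching rule picks the target group. This is only a model of the idea: the rule format, the target group names, and the `pick_target_group` helper are hypothetical and do not mirror the ALB API.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical listener rules, evaluated in order; the first match wins.
RULES = [
    ({"host": "api.example.com"}, "tg-api"),
    ({"path_prefix": "/users"}, "tg-users"),
    ({"path_prefix": "/payments"}, "tg-payments"),
    ({"query": ("platform", "mobile")}, "tg-mobile"),
]
DEFAULT_TARGET_GROUP = "tg-default"

def matches(condition, url):
    """Check one rule condition against the host, path, and query of a URL."""
    parts = urlsplit(url)
    if "host" in condition and parts.hostname != condition["host"]:
        return False
    if "path_prefix" in condition and not parts.path.startswith(condition["path_prefix"]):
        return False
    if "query" in condition:
        key, value = condition["query"]
        if value not in parse_qs(parts.query).get(key, []):
            return False
    return True

def pick_target_group(url):
    """Return the target group of the first matching rule, else the default."""
    for condition, target_group in RULES:
        if matches(condition, url):
            return target_group
    return DEFAULT_TARGET_GROUP

print(pick_target_group("https://app.example.com/users/42"))          # tg-users
print(pick_target_group("https://api.example.com/anything"))          # tg-api
print(pick_target_group("https://app.example.com/?platform=mobile"))  # tg-mobile
print(pick_target_group("https://app.example.com/about"))             # tg-default
```

Requests that match no rule fall through to a default target group, which is why the last URL lands on `tg-default`.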

Auto Scaling Groups (ASG) Overview

An ASG is a group of EC2 instances that is scaled out (instances added) or scaled in (instances removed) automatically according to load, so the system can handle changing traffic without manual intervention.

Min Capacity

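The minimum number of instances; the group never drops below it, for example min=1.

Desired Capacity

The number of instances you want; the ASG tries to hold the group at this number, for example desired=2.

Max Capacity

The maximum number of instances; the group never exceeds it, for example max=5.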

Scales out when load rises and scales in when load falls, using CloudWatch alarms as triggers
Replaces unhealthy instances automatically: it takes health check results from the ELB, terminates the instance, and launches a new one in its place
Registers new instances with the ELB automatically: each new instance is added to the Target Group for you
The ASG itself is free; you pay only for the EC2 instances that run
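The capacity bounds and the replace-unhealthy behaviour can be modelled in a few lines. The `ToyASG` class below is a hypothetical teaching model, not the service's API. In particular, it clamps out-of-range requests into the min/max range, which is a simplifying assumption about how bounds are enforced.

```python
from dataclasses import dataclass, field
from itertools import count

_ids = count(1)   # source of fake instance IDs

@dataclass
class ToyASG:
    """Toy model of an ASG's capacity bounds and self-healing."""
    min_size: int
    max_size: int
    desired: int
    instances: set = field(default_factory=set)

    def scale_to(self, value):
        """Clamp the requested capacity into [min_size, max_size] (assumption)."""
        self.desired = max(self.min_size, min(self.max_size, value))
        self.reconcile()

    def reconcile(self):
        """Launch or terminate instances until the count matches desired."""
        while len(self.instances) < self.desired:
            self.instances.add(f"i-{next(_ids)}")
        while len(self.instances) > self.desired:
            self.instances.pop()

    def replace_unhealthy(self, instance_id):
        """Terminate an unhealthy instance and launch a replacement."""
        self.instances.discard(instance_id)
        self.reconcile()

asg = ToyASG(min_size=1, max_size=5, desired=2)
asg.reconcile()
asg.scale_to(9)                                       # held at max_size
print(asg.desired, len(asg.instances))                # 5 5
asg.scale_to(0)                                       # held at min_size
print(asg.desired, len(asg.instances))                # 1 1
victim = next(iter(asg.instances))
asg.replace_unhealthy(victim)
print(len(asg.instances), victim in asg.instances)    # 1 False
```

After the replacement the group is back at its desired size, but with a new instance ID in place of the unhealthy one.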

Launch Template: Blueprint of the ASG

A Launch Template is the blueprint that tells the ASG what kind of EC2 instance to create. The ASG reads the template's values every time it scales out or replaces an instance.

AMI (Amazon Machine Image): the OS and preinstalled software
Instance Type: the machine size, for example t2.micro or m5.large
Key Pair: for SSH access to the instance
Security Groups: inbound/outbound firewall rules
EBS Volumes: the disks attached to the instance
IAM Role / Instance Profile: the permissions the instance uses to call AWS APIs
User Data: a script that runs on first boot, for example to install software or configure a service
Network / Subnet settings
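To make the list concrete, here is a sketch of the template contents as a plain Python dictionary. The key names follow the EC2 launch template data shape as I understand it. Every value (AMI ID, key pair name, security group ID, profile name, boot script) is a placeholder, and the code only builds and prints the structure without calling AWS.

```python
import base64
import json

# Hypothetical boot script passed as User Data.
user_data_script = """#!/bin/bash
yum install -y httpd
systemctl enable --now httpd
"""

# All values below are placeholders, not real resources.
launch_template_data = {
    "ImageId": "ami-EXAMPLE",
    "InstanceType": "t2.micro",
    "KeyName": "my-key-pair",
    "SecurityGroupIds": ["sg-EXAMPLE"],
    "IamInstanceProfile": {"Name": "my-instance-profile"},
    "BlockDeviceMappings": [
        {"DeviceName": "/dev/xvda", "Ebs": {"VolumeSize": 8, "VolumeType": "gp3"}}
    ],
    # User Data is carried base64-encoded inside a launch template.
    "UserData": base64.b64encode(user_data_script.encode()).decode(),
}

print(json.dumps(launch_template_data, indent=2))
```

Each key maps to one item in the list above; network and subnet settings are left out of this sketch.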

ASG Scaling Strategies

Manual Scaling

Change the desired capacity number by hand. This is the most basic option and suits testing or one-time adjustments.

Simple / Step Scaling

Define rules driven by CloudWatch alarms, for example 'if CPU > 70%, add 2 instances' or 'if CPU < 30%, remove 1 instance'. Step scaling lets you define multiple steps.
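A step policy is essentially a lookup from a metric value to a capacity change. The sketch below uses the two example rules from the text and adds a second, larger scale-out step at 85% purely as an illustrative assumption. The function name and table layout are hypothetical.

```python
# Hypothetical steps: (CPU threshold in percent, change in instance count).
SCALE_OUT_STEPS = [(85, +4), (70, +2)]   # checked from the highest threshold down
SCALE_IN_STEPS = [(30, -1)]

def step_adjustment(cpu_percent):
    """Return how many instances to add (positive) or remove (negative)."""
    for threshold, change in SCALE_OUT_STEPS:
        if cpu_percent > threshold:
            return change
    for threshold, change in SCALE_IN_STEPS:
        if cpu_percent < threshold:
            return change
    return 0   # between the thresholds: no alarm fires, capacity is unchanged

for cpu in (20, 50, 75, 95):
    print(cpu, step_adjustment(cpu))
# 20 -1
# 50 0
# 75 2
# 95 4
```

A simple scaling policy corresponds to having a single step per alarm; step scaling adds the extra tiers.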

Target Tracking Scaling (recommended)

The simplest policy to set up: specify a target for a metric and the ASG handles the rest, for example 'keep average CPU at 40%'. The ASG scales out and in automatically to stay at the target.
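One way to build intuition for target tracking is a proportional estimate: if the average metric is twice the target, roughly twice the capacity is needed. The function below is a simplified sketch under that assumption (the metric falls in proportion as instances are added). It is not the service's actual algorithm, and the function name is hypothetical.

```python
import math

def target_tracking_desired(current_instances, current_metric, target_metric,
                            min_size, max_size):
    """Proportional estimate: scale capacity by metric/target, then clamp."""
    raw = math.ceil(current_instances * current_metric / target_metric)
    return max(min_size, min(max_size, raw))

print(target_tracking_desired(4, 80, 40, 1, 10))   # 8
print(target_tracking_desired(4, 20, 40, 1, 10))   # 2
print(target_tracking_desired(4, 90, 40, 1, 5))    # 5
```

In the last call the estimate would be 9 instances, but the group's maximum of 5 caps it.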

Scheduled Scaling

Scale at times set in advance, for example Monday 9:00 → min=10 and Friday 18:00 → min=2. Suits traffic patterns you already know.
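The Monday/Friday example can be sketched as a small schedule table plus a lookup for the minimum capacity in force at a given moment. The table format, the `min_capacity_at` helper, and the default minimum of 2 are hypothetical. Real scheduled actions are defined on the ASG itself.

```python
from datetime import datetime

# Hypothetical schedule: (weekday, hour, new minimum capacity). Monday is 0.
SCHEDULE = [
    (0, 9, 10),   # Monday 09:00 -> min=10
    (4, 18, 2),   # Friday 18:00 -> min=2
]

def min_capacity_at(moment, default_min=2):
    """Return the minimum set by the most recent scheduled action this week."""
    current = default_min
    for weekday, hour, new_min in sorted(SCHEDULE):
        if (moment.weekday(), moment.hour) >= (weekday, hour):
            current = new_min
    return current

print(min_capacity_at(datetime(2024, 1, 1, 8)))    # Monday 08:00 -> 2
print(min_capacity_at(datetime(2024, 1, 3, 12)))   # Wednesday 12:00 -> 10
print(min_capacity_at(datetime(2024, 1, 5, 19)))   # Friday 19:00 -> 2
```

The group holds the higher minimum through the working week and drops back after Friday evening.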

Predictive Scaling (ML)

AWS analyzes historical data with machine learning, forecasts load, and pre-scales before the traffic arrives. Its advantage over Scheduled Scaling is that you do not have to define the times yourself.

ELB & ASG Summary

ELB and ASG work as a pair to make a system scalable and tolerant of failure: ELB distributes traffic, and ASG adjusts the number of instances to match load.

ALB

Layer 7 (HTTP/HTTPS). Routes by path, host, or header. Suits microservices.

NLB

Layer 4 (TCP/UDP). Ultra-low latency with one static IP per AZ.

GWLB

Layer 3. Sends traffic through security appliances (firewall, IDS/IPS).

Health Check

ELB checks instances periodically and stops sending traffic to those that fail.

ASG Capacity

Min ≤ Desired ≤ Max. The ASG keeps the instance count within this range.

Launch Template

The ASG's blueprint: AMI, instance type, SG, IAM role, user data.

ELB is a managed service: AWS takes care of HA and upgrades
The ASG itself is free; you pay only for the EC2 instances that actually run
Target Tracking is the simplest and most recommended scaling strategy
HA = spreading instances across multiple AZs with ASG + ELB
Scale up (vertical) = bigger specs, whereas scale out (horizontal) = more machines

Quiz

Review Questions

25 questions; the correct answer to each is marked.

Question 1 / 25

A system can automatically add or remove EC2 instances based on real-time traffic. What is this concept called?

A. High Availability
B. Elasticity (Auto Scaling) ✓ Correct
C. Fault Tolerance
D. Replication

Question 2 / 25

Which AWS load balancer operates at Layer 7 (HTTP/HTTPS) and supports path-based / host-based routing?

A. Classic Load Balancer (CLB)
B. Application Load Balancer (ALB) ✓ Correct
C. Network Load Balancer (NLB)
D. Gateway Load Balancer (GWLB)

Question 3 / 25

Which AWS load balancer is BEST for ultra-low-latency, high-throughput TCP/UDP traffic (e.g., gaming, IoT)?

A. Application Load Balancer
B. Network Load Balancer ✓ Correct
C. Classic Load Balancer
D. Gateway Load Balancer

Question 4 / 25

Which AWS load balancer is used to insert third-party network virtual appliances (firewalls, IDS/IPS, deep packet inspection) into the traffic path?

A. Application Load Balancer
B. Network Load Balancer
C. Gateway Load Balancer ✓ Correct
D. Classic Load Balancer

Question 5 / 25

An Auto Scaling Group has min=2, desired=4, max=10. What is the maximum number of instances the ASG can run?

A. 2
B. 4
C. 10 ✓ Correct
D. Unlimited

Question 6 / 25

Which Auto Scaling policy adjusts capacity to maintain a target metric value (e.g., 50% average CPU)?

A. Simple Scaling
B. Step Scaling
C. Target Tracking Scaling ✓ Correct
D. Manual Scaling

Question 7 / 25

What is the main advantage of using an Auto Scaling Group with multiple Availability Zones?

A. Lower cost than a single AZ.
B. High availability: if one AZ fails, instances in other AZs continue serving traffic. ✓ Correct
C. Faster instance launches.
D. Required for Lambda integration.

Question 8 / 25

What does "sticky sessions" (session affinity) mean in an Application Load Balancer?

A. All requests from a single client are routed to the same target for the session duration. ✓ Correct
B. The load balancer drops slow connections.
C. Requests are randomized across all targets equally.
D. The load balancer caches all responses.

Question 9 / 25

Which feature ensures a load balancer ONLY routes traffic to healthy instances?

A. Sticky sessions
B. Health checks ✓ Correct
C. Cross-zone load balancing
D. Listener rules

Question 10 / 25

Which load balancer feature distributes traffic evenly across targets in ALL AZs, not just within the same AZ?

A. Cross-Zone Load Balancing ✓ Correct
B. Health Checks
C. Sticky Sessions
D. WebSocket

Question 11 / 25

What is the difference between scaling out and scaling up?

A. There is no difference.
B. Scaling out adds more instances (horizontal); scaling up uses a larger instance (vertical). ✓ Correct
C. Scaling up adds more instances; scaling out resizes an instance.
D. Both terms refer to scaling down.

Question 12 / 25

Which ELB type is the previous-generation Load Balancer that AWS recommends migrating away from for new workloads?

A. Application Load Balancer
B. Network Load Balancer
C. Classic Load Balancer (CLB) ✓ Correct
D. Gateway Load Balancer

Question 13 / 25

An ASG should add an instance when CPU > 80% for 5 minutes and remove an instance when CPU < 20% for 10 minutes. Which kind of scaling policy would BEST express this?

A. Target Tracking
B. Step Scaling ✓ Correct
C. Scheduled Scaling
D. Predictive Scaling

Question 14 / 25

Which scaling type uses ML to forecast traffic and pre-scale capacity?

A. Simple Scaling
B. Step Scaling
C. Target Tracking
D. Predictive Scaling ✓ Correct

Question 15 / 25

What is the role of an ELB in front of an Auto Scaling Group?

A. To reduce the number of instances.
B. To distribute incoming traffic evenly across the EC2 instances managed by the ASG. ✓ Correct
C. To replace the ASG.
D. To encrypt data on EBS volumes.

Question 16 / 25

Which routing features does an ALB offer for sending requests to different backend services?

A. Path-based routing
B. Host-based routing
C. HTTP header routing
D. All of the above ✓ Correct

Question 17 / 25

Which is true about NLB IP addresses?

A. NLB only has dynamic public IPs.
B. NLB provides one static IP per AZ and supports Elastic IPs. ✓ Correct
C. NLB does not have an IP address.
D. NLB only supports private IPs.

Question 18 / 25

What is a Launch Template (vs. legacy Launch Configuration)?

A. A launch template is the legacy way to configure ASG instances.
B. A launch template is the modern, recommended way to define instance launch parameters (AMI, type, security groups, user data, etc.); it also supports versioning. ✓ Correct
C. Launch templates are only for Lambda.
D. Launch templates can only be used in us-east-1.

Question 19 / 25

What does an ASG cooldown period do after a scale-in event?

A. After the scaling activity completes, the ASG waits for the cooldown period before starting another one, which helps avoid flapping. ✓ Correct
B. It dictates how often health checks run.
C. It controls the AZ failover.
D. It limits the number of new instances per minute.

Question 20 / 25

Which protocol does ALB support that allows real-time bidirectional communication between client and server?

A. FTP
B. WebSocket ✓ Correct
C. SMTP
D. ICMP

Question 21 / 25

Which AWS feature allows ELB to handle SSL/TLS encryption so backend servers don't have to?

A. WAF
B. SSL/TLS termination at the load balancer ✓ Correct
C. Sticky Sessions
D. WebSocket

Question 22 / 25

Which target type does an ALB NOT support?

A. EC2 instances
B. IP addresses (IPv4)
C. AWS Lambda functions
D. S3 buckets ✓ Correct

Question 23 / 25

What is the difference between an ALB target group and a listener?

A. They are the same thing.
B. A listener checks for incoming traffic on a specific port/protocol; a target group is a set of backend targets that receive routed traffic. ✓ Correct
C. Listeners are only for NLBs.
D. Target groups are only for HTTPS.

Question 24 / 25

In an ASG, when an unhealthy instance is detected, what happens?

A. The instance is rebooted automatically.
B. The ASG terminates the unhealthy instance and launches a replacement. ✓ Correct
C. The instance stays in the group with errors.
D. The user is alerted but no action is taken.

Question 25 / 25

Which AWS Auto Scaling type scales at specific times (e.g., scale out at 9 AM on weekdays, scale in at 6 PM)?

A. Predictive Scaling
B. Scheduled Scaling ✓ Correct
C. Target Tracking
D. Step Scaling