Upload
arinto-murdopo
View
968
Download
2
Embed Size (px)
DESCRIPTION
Slide for EEDC homework 6, based on this paper: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5961695
Citation preview
Execution
Environments for
Distributed
Computing
Intelligent Placement of
Datacenter for Internet
Services
EEDC
34
33
0
Master in Computer Architecture,
Networks and Systems - CANS
Homework number: 6
by Arinto Murdopo – [email protected]
2
Problem Statement
Data Center
where? dónde? di mana? oú? waar?
dove? どこですか? πού? 在哪里?어디?
Response time, availability, cost, environmental concerns
3
Proposed Solution
Framework
Solve optimization Problem
Produce tool to compare efficiency and accuracy
Characterization
4
Framework
Minimize Cost
Efficiently select data center locations
Response Time Consistency Availability
5
Solve Optimization Problem
Problem formulation
Approaches: • Simple Linear Programming (LP0) • Pre-set Linear Programming (LP1) • Brute force (Brute) • Heuristic-based on LP (Heuristic) • Simulated Annealing plus LP1 (SA+LP1) • Optimzed SA + LP1 (OSA + LP1)
6
Placement Tool
Available Inputs:
MaxS
1/ratioServerUser
MAXLAT
MAXDELAY
MINAVAIL
area of interest
Granularity
existing data center
7
Placement Tool
Location-dependent data:
Network backbones: latency data from backbone ISP
Power plants, transmission lines, and CO2 emissions: obtained from DOE
Electricity, land, water and temperature: obtained from DOE as well
Missing data are obtained from neighboring location
8
Placement Tool
Datacenter characteristics:
Cooling : CRACs and Water Chillers for cooling
Connection: It costs $500k/mile of transmission line, and $480k/mile of fiber. Amortization of 12 years
Building: Its costs depends of the maximum power
Land: 6 K square feet per Megawatt
9
Placement Tool
Datacenter characteristics:
Water: 24K gallons of water per MW per day
Server: Each server costs $2000 (4 years amortization), each interconnect switch costs $20K (4 years amortization)
Staff: $0.05 per Watt per month. $100K per year salary for 1K servers
10
Characterization
Characterize 7 locations in US
11
Characterization
Evaluate each location with Placement Tools Parameters
12
Characterization
Evaluate each location with Placement Tools Parameters
13
Broadening The Scope
Distribution of cost assuming 500 potential locations
14
Sample Output
Specifications: 1. 60 K servers 2. Latency <= 60 ms 3. Consistency Delay <= 85 ms 4. Minimum Availability = 5 nines
Results Three locations : 1. Seattle(A, 1789 servers) 2. St. Louis (B, 22712 servers) 3. Oklahoma city(C, 5501 servers)
15
Evaluation of Chosen Approach
Based on this specification:
1. 60 K servers
2. Latency <= 60 ms
3. Consistency Delay <= 85 ms
4. Minimum Availability = 5 nines
16
Overall cost table (in million)
17
Evaluation of Chosen Approach
Running Times of Solution Approaches
18
Evaluation of Chosen Approach
Solution Quality
19
Evaluation of Chosen Approach
Recommended approach:
OSA + LP1, since it provides best tradeoff between running time and search quality
20
Exploring Placement Tradeoff
Latency
Latency of 50 ms strikes the best compromise between latency and cost
21
Exploring Placement Tradeoff
Availability
It is usually cheaper to build networks out of less redundant datacenters Tier II data centers are the best option
22
Exploring Placement Tradeoff
Consistency Delay
Low latency and low consistency are conflicting goals
23
Exploring Placement Tradeoff
Green datacenters
Green network is less than $100k more expensive per month than the cost-optimal network when the maximum latency can be relatively high (> 70ms)
24
Exploring Placement Tradeoff
Chiller-less data center
Avoiding chillers reduces costs by 8% for max latencies >= 70ms
25
Conclusions
• Proposed and implemented optimization framework for automatic data center placement for Internet Services
• Characterized US regions • Evaluated solutions based on the framework
26
Questions and answers