27
How Did OpenStack Improved for Public Cloud in Recent Releases and What Are Still Missing? Zhenyu Zheng Yikun Jiang Sheng Liu

and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

How Did OpenStack Improved for Public Cloud in Recent Releases and What Are Still Missing?

Zhenyu Zheng Yikun Jiang Sheng Liu

Page 2: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

2

基础配色 强调色一 强调色二

• Zhenyu ZhengHuawei Technologies Co., Ltd.

OpenStack Nova Contributor, Upstream Developer.

• Yikun JiangHuawei Technologies Co., Ltd.

OpenStack Nova Contributor, Upstream Developer.

• Sheng LiuHuawei Technologies Co., Ltd.

OpenStack Contributor, OpenLab Maintainer, Upstream Developer.

About us:

Page 3: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

3

基础配色 强调色一 强调色二

1. The Uniqueness of Public Cloud

2. What is the benefit From OpenStack?

3. OpenStack Improvements in Recent Release For Public Cloud

4. What are Still Missing In OpenStack?

Page 4: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

4

基础配色 强调色一 强调色二

1. The Uniqueness of Public Cloud

Page 5: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

5

基础配色 强调色一 强调色二

1. The Unique User

Different Scale

The scale of Public Cloud users could be very different, ranging from a personal user uses VM setting up his own VPN to enterprise that have thousands of employees.

Various Demand

According to their use cases, the users demands varies, there could be thousands of service combinations, and they will want them cheap, functional and easy to use.

Extreme User Experience

Different type of users will have different user experience preference, for example, rookie users may want it as simple as possible, but pros may want to have full control of their resources.

Page 6: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

6

基础配色 强调色一 强调色二

2. The Unique Operator

Huge Number of Users

The Public Cloud will serve huge number of users from all over the world, it is challenging to operate a platform at this scale and complexity, operators would have to consider cost, availability, performance, security, etc.

Follow the Trend

The Cloud have to always follow the trend, both technical and commercial. Operators should be able to response fast to the market and release new attractive services rapidly.

Flexible & Fine-grained Billing

Support flexible and fine-grained billing options for users, users will be able to choose the most suitable way according to their use cases, for example, user could pay either metering or monthly/yearly.

如何更便捷、更精细化的完成运营工作?

Page 7: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

7

基础配色 强调色一 强调色二

3. The Unique Vendor

Rapid Dev Iteration

In order to fulfill the various requests from Public Cloud users and the Operators request of rapid service release, the vendors have to be able to develop in small step and rapid iteration.

Easy Upgrade

Rapid dev iteration will lead to rapid upgrade, vendors will have to provide a way of easy upgrades to help the operators release the latest services.

Stability and Performance

Running a Public Cloud normally means running it for 60*24*365, high availability for all time is the commitment from vendor and operator to customer, as well as high performance.

Page 8: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

8

基础配色 强调色一 强调色二

2. What is the Benefit from OpenStack?

Page 9: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

9

基础配色 强调色一 强调色二

I’m an OpenStack User

Application

OS

Virtualization

Server

Network

Storage

1. Interoperability and compatibility

2. All kinds of services and features

3. Various peripheral ecological tools

Page 10: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

10

基础配色 强调色一 强调色二

I’m an OpenStack User

Compute Network Storage Image Metering Identity Orchestration

… …

1. Interoperability and compatibility

2. All kinds of services and features

3. Various peripheral ecological tools

… …

Page 11: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

11

基础配色 强调色一 强调色二

I’m an OpenStack User

… …

1. Interoperability and compatibility

2. All kinds of services and features

3. Various peripheral ecological tools

Page 12: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

12

基础配色 强调色一 强调色二

I’m an OpenStack Operator

1. Avoid vendor lock-in

2. Free technical support, continuous evolution, fast iteration.

3. Hot service✓ Avoid Server Device lock-in✓ Avoid Hardware Device lock-in✓ Avoid OpenStack Vendor lock-in

… …

Page 13: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

13

基础配色 强调色一 强调色二

I’m an OpenStack Operator

1. Interoperability and compatibility, Avoid Device and Vendor lock-in

2. Free Technical Support, continuous evolution, fast iteration.

3. Hot service

Hot discussion of real user requirement,

grow together with the Giants, open and fair discussion

from all company.

Take Keystone Unified Limit Feature as an example, it comes

from CERN’s real nested relationship user model.

Page 14: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

14

基础配色 强调色一 强调色二

I’m an OpenStack Operator

1. Interoperability and compatibility, Avoid Device and Vendor lock-in

2. Free Technical Support, continuous evolution, fast iteration.

3. Hot Services following the trends

In the active OpenStack community, there are a group of people who are rapidly implementing hot services

Take Acceleration Management Service “Cyborg” as an example, it only takes 6 months from idea to implementation.

Page 15: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

15

基础配色 强调色一 强调色二

I’m an OpenStack Vendor Developer

1. General framework and modular components

2. Upstream code and Product code Decoupling

3. Build our QA system using community CI

• Common libraries and middlewares

• Common architecture API+Object+DB

• Common schedule mechanism(Filter & Weight)

• … …

Page 16: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

16

基础配色 强调色一 强调色二

I’m an OpenStack Vendor Developer

1. General framework and modular components

2. Flexible customization and delivery

3. Infrastructure leverage

• Up/Downstream code decoupling

• Pluggable architecture

• Code reuse

Upstream Downstream

decoupling

Page 17: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

17

基础配色 强调色一 强调色二

I’m a OpenStack Vendor Developer

1. General framework and modular components

2. Flexible customization and delivery

3. Infrastructure leverage• Zuul

• Jenkins

• Gerrit

• UT, Tempest…

Gerrit

Page 18: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

18

基础配色 强调色一 强调色二

3. OpenStack Improvements in Recent Releasefor Public Cloud Scenarios

Page 19: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

19

基础配色 强调色一 强调色二

User View

OpenStack Improvements in Recent Release

Operator View Vendor View

➢ Nova: • Tag support• Expose extra-specs to flavor• Pagination Support • Improve filter instances by IP

➢ Cinder• Resource total count info in

list APIs ➢ Neutron

• Push Notifications for Agents(Performance)

➢ Keystone• Federation • Application credentials• Service token

➢ Nova: • Placement• API Database Separation• Support disabling a cell• Filter and sort whitelist

➢ Cinder• Volume multi-attach• Support AZ in volume type• operation type backend filter• Explicit user messages

➢ Neutron• routed network• Vlan aware VMs

➢ Keystone• Application credentials• PCI DSS• MFA

➢ Nova: • Cell v2• Record host info in instance

action➢ Cinder

• Dynamic Log Level control • Dynamic Reconfiguration

➢ Neutron• Logging API for

security-group-rules• Port data plane status

➢ Keystone: • Unified limits

Page 20: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

20

基础配色 强调色一 强调色二

What Are Still Missing?The True Story from Huawei Cloud

Page 21: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

21

基础配色 强调色一 强调色二

Huawei Public Cloud

• OpenStack Platinum member

• 2000+ partner companies

• 60+ solutions

• 100+ cloud services

2016.02 2016.06 2016.09 2017.032015.07

Page 22: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

22

基础配色 强调色一 强调色二

Large scale deployment:Cascading ArchitectureOur Practice:Current Status

Only Nova Support the Cell V2

architecture.OpenStack API

OpenStack API

DC1

DC2 DCn

Availability Zone 1

Availability Zone 2

Availability Zone 3

AvailabilityZone 4

Availability Zone 5

Availability Zone n

OpenStack API

Huawei OpenStack

Redhat OpenStack

Vendor2OpenStack

Vendor1OpenStack

Huawei OpenStack

Vendor3OpenStack

Cascading OpenStack

The cascading architecture help us expand our scale

with Compatible API

Support 1 million VMs, 100 data centers, 100K hosts.

Page 23: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

23

基础配色 强调色一 强调色二

Extreme User Experience:Performance OptimizationWhat Happened when resource indexing in Nova?

1. All key APIs with paging support

2. Removing unreasonable join and pre-check

3. Specific Query API optimization, such as IP fuzzy

query optimization,+90% speed up

4. Lite API interface(Only API+DB)

5. Add more reasonable index, remove irrational index

based on DB access frequency.

API

SQLAlchemy

DB

Redundant join

Redundant pre-check

Inefficient filtration

Inefficient filtration

Irrational index

WithoutPaging

Object

Non-optimized conversion

Our Practice:

Page 24: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

24

基础配色 强调色一 强调色二

Fast Abnormality Discovery :Monitoring and AuditingOur Practice:

Health Check Service: Fast discovery abnormality, Key data damage, and abnormal status service.

Data Audit Service: the invalid record can be audit periodically.

Current status:

➢ The orphan, fake, invalid records only can be checked by admin manually.

➢ There is no reasonable mechanism to discover system abnormal services and heal them.

➢ The damage to key data can't be found in time

Audit Service

OpenStackDatabase

HuaweiDevice

Orphan Record Fake Record

• Invalid volume• Invalid vm• Invalid snapshot• Invalid attachment• Redundant neutron• namespace• Unavailable service

Alarm report

Invalid Record

Page 25: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

25

基础配色 强调色一 强调色二

A Powerful Weapon: One-click Upgrading

➢ One-click upgrading: distribute package,pre-check、 upgrade DB 、Change Configuration 、Data Cleanup…

➢ Support grayscale, business lossless upgrade, and rollback.

Upgrade toolOne Click

Repository Script Script

Upgrade Service

Check

Upgrade

Check

Upgrade

Current status:

➢ Too complex

➢ Without pre-check before upgrade.

➢ No UI.

Our Practice:

Page 26: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

26

基础配色 强调色一 强调色二

Still unsolved ,help wanted

1. Not all components support grayscale, business lossless

upgrade

2. Log analysis, log collection, call chain and some other

maintenance abilities.

3. The configuration is too complex

4. Weak parallel API handling

Page 27: and What Are Still Missing? How Did OpenStack Improved for ...€¦ · How Did OpenStack Improved for Public Cloud in Recent Releases ... means running it for 60*24*365, high availability

Copyright©2016 Huawei Technologies Co., Ltd. All Rights Reserved.The information in this document may contain predictive statements including, without limitation, statements regarding the future financial and operating results, future product portfolio, new technology, etc. There are a number of factors that could cause actual results and developments to differ materially from those expressed or implied in the predictive statements. Therefore, such information is provided for reference purpose only and constitutes neither an offer nor an acceptance. Huawei may change the information at any time without notice.

Thank You.