Loading…
中国上海
2019 年 6 月 24–26 日
单击此处了解更多信息和注册

点击此处查看英文版日程表。
To view the English version of this schedule please go here.

我们将为所有主题演讲和分组会议提供同声传译服务。
Simultaneous translation will be provided for all keynote and breakout sessions.

场馆 + 赞助商展示区地图
Venue + Sponsor Showcase Map
KC+CNC - 运营 [clear filter]
Tuesday, June 25
 

16:00 CST

在 Web 级集群中动态调整 Pod 资源限制 - Cheng Wang 和 Xiaoyu Zhang,阿里巴巴
您是否曾想过如何为 Pod 设置完美的资源限制?如何在资源效率与应用 SLO 之间取得平衡?

在本次演讲中,我们将分享阿里巴巴集团通过将不同 QoS 类别的 Pod 共置在同一节点上,在 Web 级集群中动态调整 Pod 资源限制(特别是在资源争用期间)的实践以及从中汲取的经验教训。

在生产集群中应用这一实践后,我们将集群资源使用率提高了 14%~30%,尾部延迟(95%)提高了 76%~87%,TPS(每秒事务处理数)提高了 107%~163%。

大家可以借鉴我们的经验,利用 Kubernetes 原生方法提高集群的资源利用率和应用性能。

Speakers
avatar for Cheng Wang

Cheng Wang

Software engineer, Alibaba
Cheng Wang is a software engineer at Alibaba Group, helping enhance cluster monitoring, resource management and scheduling with data-driven intelligence for Alibaba’s Web-scale clusters. Prior to joining Alibaba, he worked at VMware with the focus on Docker, Kubernetes and edge... Read More →
avatar for Xiaoyu Zhang

Xiaoyu Zhang

Principal Engineer, Tencent
Xiaoyu Zhang is a principal engineer in Tencent Cloud. He worked for Alibaba Cloud as a senior engineer. He's a member of the Kubernetes organization. He mainly works on Kubernetes project and focuses on docs, kubectl, controller-manager, storage and runtime areas. He had multiple... Read More →



Tuesday June 25, 2019 16:00 - 16:35 CST
619

16:45 CST

Kubernetes 管理 - Damini Satya Kammakomati 和 Mitesh Jain,Salesforce
运行 Kubernetes 等大规模分布式系统的一大挑战是管理资源。这些系统的效率和长期运行准备程度取决于对资源利用的监控和管理效果。Kubernetes 提供了大量的选项和机制来跟踪和处理资源。但是与任何其他系统一样,要想取得最佳的调优效果,就必须了解这些选项和机制,更重要的是理解它们。

本次会议将讲解在 Kubernetes 中可用的各种资源管理机制。我们将深入探讨垃圾收集控制器、Kube 控制器管理器、驱赶和 Kubelet 垃圾收集等概念,提供有关它们如何工作、如何配置以及建议如何设置的详细信息。

Speakers
avatar for Damini Satya Kammakomati

Damini Satya Kammakomati

Software Engineer, Salesforce
Damini Satya is a Software Engineer at Salesforce building tools for infrastructure automation internally. Not only she is an active open source contributor and part of various open source communities but also a teach speaker at a lot of well-known conferences like ReactConf, Grace... Read More →
avatar for Mitesh Jain

Mitesh Jain

Lead Systems Engineer, Salesforce
Mitesh Jain is Lead Systems Engineer at Salesforce building trusted platforms for distributed applications at Cloud scale. He has over 13 years of experience building and managing Open Source deployments in public and private clouds at enterprises like Red Hat, GE, Wipro Technologie... Read More →



Tuesday June 25, 2019 16:45 - 17:20 CST
619

17:30 CST

有效可靠地管理大规模 Kubernetes 集群 - 张勇和林志贤,蚂蚁金服
随着业务的增长,我们需要将 Kubernetets 部署到世界各地的多个数据中心。单个数据中心中就拥有超过数万个节点。我们面临的关键挑战是如何高效、可靠地在数据中心内管理多个大规模 Kubernetes 集群。

在本次演讲中,我们将分享实现大规模集群管理自动化的经验和实践。首先,我们将介绍全自动化节点生命周期管理,以及如何基于 NPD、Autoscaler 和自定义运算符自动发现和恢复节点故障。然后,我们将分享部署和升级 Kubernetes 集群的经验和解决方案。最后,我们将分享基于 Prometheus 和运算符的风险防控系统,该系统可确保集群可靠性,具有自动故障检测和隔离的能力。

Speakers
YZ

Yong Zhang

Senior Software Engineer, Ant Financial
A Senior Software Engineer of Ant Financial.
ZL

Zhixian Lin

Senior Software Engineer, Ant Financial
A Senior Software Engineer of Ant Financial.



Tuesday June 25, 2019 17:30 - 18:05 CST
619

18:15 CST

在 Air Gap/离线环境中管理 Kubernetes - Rong Zhang,Suning.com
用于管理 kubernetes 集群的大多数可用软件和工具都假设有互联网连接。实际上,这个要求并不总能满足,最终用户不得不独自探索如何使用 Kubernetes。
在本次演讲中,我们将分享在离线环境中轻松安装、升级和管理 Kubernetes 的不同策略。
Rong Zhang 将介绍他在 的工作经历以及他们正如何使用 Kubespray 和 Harbour 来管理线下基础设施。

Speakers
avatar for Rong Zhang

Rong Zhang

Senior software Engineer, vivo
Rong is a software engineer at vivo developing platform services on top of Kubernetes, providing containerized infrastructure. Lead software engineer focusing on the closed loop system of scheduling,gpu technology and cluster management. He was a speaker at the 2019 ShangHai KubeCon... Read More →



Tuesday June 25, 2019 18:15 - 18:50 CST
619
 
Wednesday, June 26
 

11:20 CST

灾难恢复计划:好船长无救生筏不航行 - Steven Wong 和 Carlisia Campos,VMware
纵观历史,正式的灾难恢复 (DR) 计划仅适用于大型企业。因为他们负担得起分配时间、资源和复制数据中心基础设施的成本。

随着公共云和云原生技术的普及,DR 计划的成本和复杂性大幅降低。这意味着各种规模的公司都可以进行业务连续性计划。为什么这很重要?原因如下:

机器和软件故障
- 人为错误
- 黑客攻击弱势群体
- 天气、火灾、恐怖主义等......
- 当发生中断和数据丢失时,您会丢失客户
- 法律标准通常要求数据保留

本次演讲将聚焦于:
- 需要备份的项目和原因 - 有些可能会出乎您的意料
- 为什么需要选择性恢复功能
- 可简化并实现 DR 策略自动化的现有工具

Speakers
avatar for Carlisia Thompson

Carlisia Thompson

Senior Member of Technical Staff, VMware
Carlisia works as a Senior Member of Technical Staff at VMware. She's a maintainer of the open source project Velero, a cloud native disaster recovery and data migration tool for Kubernetes workloads. She currently runs the San Diego Kubernetes meetup. Carlisia holds a MS in Computer... Read More →
avatar for Steven Wong

Steven Wong

Staff Engineer, VMware
Steve Wong has been active in the Kubernetes community since 2015. He is a co chair of the CNCF Working Group. Steve is co-chair of the VMware User Group on the Kubernetes project. He has implemented industrial control systems for many factories, pipelines, and process control systems... Read More →



Wednesday June 26, 2019 11:20 - 11:55 CST
619

12:05 CST

存储版本迁移器:再也不用担心过时的 API 对象 - Chao Xu,Google
您是否曾遇到过 API 服务器拒绝获取、更新甚至删除僵尸 Kubernetes API 对象的情况?这可能是因为这些在 etcd 中持久存在的对象是以过时的版本编码的。在本次演讲中,我们将介绍一款 Kubernetes 存储版本迁移器,它可以一劳永逸地解决这个问题。启用此 alpha 特性后,存储版本迁移器会自动确保存储在 etcd 中的所有 API 对象始终以正确的版本进行编码。在本次演讲中,存储版本迁移器设计和实施的主要贡献者 Chao 将分享迁移器如何管理 Pod 等 Kubernetes 资源以及您的自定义资源。您还将获悉使用迁移器时的注意事项,例如如何用它来管理高可用性集群。此外,Chao 还将分享存储版本迁移器的发展路线图。

Speakers
avatar for Chao Xu

Chao Xu

Software engineer, Google
Chao Xu has been a member of Kubernetes SIG apimachinery for more than 4 years. He is one of the top contributors, owning the garbage collector, admission webhooks, etc. Recently, Chao has been focusing on safe Kubernetes upgrades/downgrades. At his free time, Chao is a good table... Read More →



Wednesday June 26, 2019 12:05 - 12:40 CST
619
 

Filter sessions
Apply filters to sessions.