June 24–26, 2019

Simultaneous translation will be provided for all keynote and breakout sessions.

Tuesday, June 25 • 18:15 - 18:50
Multi-Cloud Machine Learning Data and Workflow with Kubernetes - Lei Xue, Momenta; Fei Xue, Google

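Autonomous vehicles need hardware-accelerated machine learning to solve key problems such as tracking and classification. Momenta trains ML models both on-premises and in public clouds, each with different GPUs and network interfaces (InfiniBand, RoCE).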

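In this talk, we will discuss how to build a multi-cloud ML platform with Kubernetes, in particular how we manage training data across different environments, how we handle multiple users and gang scheduling, and how we support heterogeneous hardware.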

Lei Xue

Infrastructure Tech Lead, Momenta
Lei Xue currently works as an AI Infrastructure tech lead at Momenta. He leads a development team that focuses on GPU cluster management for Kubernetes and Docker. Previously, Lei was a member of the KataContainers/Hyper team and a software engineer at Oracle/Sun Microsystems. He is also...

Fei Xue

Product Manager, Ant Financial
Fei Xue is currently a product manager at Ant Financial working on ML and data platforms. Fei was an early member of the Kubeflow team at Google, an open source effort to help developers and enterprises develop and deploy cloud-native machine learning everywhere. Fei comes from a distributed...

Tuesday, June 25, 2019 18:15 - 18:50 CST