SegmentFault Woody的专栏最新的文章

银行4.0与金融服务的未来（下）

2024-10-30T17:46:01+08:00

写在前面

本篇文章有2个部分：

演讲正文与英文详解（银行4.0与金融服务的未来（上））
关于金融服务的思考

本篇文章有3个目的或者作用：

作为英文学习材料，里面有很多英文的表达是值得我们学习的。特别是关于金融领域的词汇。
作为金融领域知识，值得金融从业人员学习，里面有很多关于银行的发展以及金融服务的发展的理念。
分享一些自己观后的一些思考和总结，特别是作者关于金融服务的未来的部分。

注：文中的翻译部分使用了AI，其余均为自己创作。

关于金融服务的思考

作者从银行的发展历史到银行业不断的变化，探讨了这些变化中的一些原因和本质，以及未来我们应该朝哪个方向发展。讲得很透彻，特别是他对于银行的思考。

作者在演讲中有几个观点我觉得很重要，下面我来一一阐述一下我的思考。

银行的实用性

对于普通人来说，提到银行都觉得它的发展离不开信用，可当信用已经深深融入我们的生活中之后，我们发现那些影响银行发展的已经不再是信用了，而是实用性。

作者原话：

And so, the trust was changing from being about a place you could go where your money was safe, to now, a set of bank platform technologies that would enable you to do banking.
因此，人们对银行的信任正在从你可以去安全存钱的地方，转变为一套银行平台技术，可以让你进行银行业务。
Today, it’s not a bank charter that makes someone trust in you as a bank brand, it’s the utility of your bank.
今天，让人们信任你这个银行品牌的不是银行牌照，而是银行的实用性。

银行从早期的信任机构，随着时代的发展，信用已经变成了银行的一项基本属性，而不是需要去建立的东西。因此，现代社会银行能提供怎样的现代化服务，则变成了人们对银行的新认识。

人们无法忍受一个银行宕机超过1天，人们无法忍受一个银行无法在互联网办理业务，人们无法忍受一个银行无法在手机上完成业务，人们甚至无法忍受一个银行的app每次打开都需要输入密码登录。

而这样的现代化服务总结成一个用户能理解的词，就是实用性。

作者举了一个例子，如果现在某个银行的服务崩溃10天，你对这个银行还会有信任吗？

因为银行从最早的提供纸质的凭证来作为你的储蓄凭证，中间经历了纸质存折，塑料银行卡等，以及发展到如今可以没有任何实体的凭证来作为你的银行卡了。

所以人们对银行的信任，已经从原来的安全可信，转变成了如今的实用性。

而这样的实用性的背后则是通过一整套现代IT互联网技术构成。

从硬件、到软件，从后台运维人员、到前台服务人员，从看得见的应用程序或网页，到看不见的架构设计，这些平台技术统一构成了如今人们感受到的银行实用性。

因此，随着技术的飞速发展，比如AI的发展，人们对银行的信任会不断发生新的变化。

当所有银行都满足了实用性，比如作者在演讲时举的例子，人们只需要问一下AI就知道自己的余额，只需要去商店买东西，而不需要关心如何结账时。实用性几乎就已经到了极致。而这时，人们对于银行的信任就会变为追求实用性以外的东西了。

就好比我们从物质追求转变成了精神追求一样。

作者原话提到：

When you start thinking about utility as it changes, banking becomes highly contextual.
当你开始考虑实用性的变化时，银行业务就会变得高度情景化。

过去人们去银行都是为了具体的银行业务，比如存钱，取钱，贷款等。而如今，银行已经不再止步于传统的银行业务，银行随着技术的发展，时代的发展，一直在拓宽它金融服务的边界。

而这样的金融服务，随着技术的发展，它将是完全融入人们生活的情景化服务。

还是商店购物的例子，当你想买一个东西的时候，你不需要考虑你有没有钱，也不需要先跑去银行办理贷款或者信用卡，然后再跑回来买它。而是在你想要买它的时候，银行已经评估完你的信用与风险，为你完成了信用付款，然后自动按照你的习惯来完成还款设置。

而这样的情景化，你甚至都不需要一个叫信用卡、或者贷款的业务名字。

就像作者提到的第一性原理设计思维。那么接下来我们就来说说first principles design thinking。

第一性原理设计思维

作者在演讲的时候多次提到第一性原理设计思维，它的英文名是first principles design thinking。

第一性原理设计思维指的是，回归事物最基本的条件，把其解构成各种要素进行分析，从而找到实现目标最优路径的方法。

说人话就是：从零开始设计。

就像作者多次在演讲时提到的start from scratch。

作者举了几个例子。

第一个例子：余额宝。

余额宝的本质是货币基金，如果是传统思维来售卖这个货币基金，那需要用户到线下的银行网点或者证券网点，然后告诉用户它是货币基金，然后告诉用户它的收益和风险，最后让用户买它，同时还需要为用户开户。

如果换成第一性原理思维来思考，如果我不需要用户投资很多钱，如果我不需要用户到线下网点，如果我不需要为用户专门开户，如果我只需要占用用户的零钱呢，如果我能让用户随时随地支取呢？

当然，站在余额宝成功的今天，我们很容易就能总结它成功的原因，比如低门槛、高流动性、收益稳健、引流、平台稳定、操作便捷等等诸多原因。

但如果让我们从零设计一款货币基金产品，我们能否抛开它固有的形态去从零开始思考呢？

第二个例子：福特Model T

福特model T是世界上第一台车，虽然我个人认为这个例子不太算是用了第一性原理思维，但它还是有代表性。

1908年的时候，人们还在坐马车，那个时候人们追求的是更名贵的马、更厉害的马夫，让坐马车上的贵族们能更快更舒适更有面子的到达目的地。

但如果我们跳出来思考，我们需要的不是更快的马，或者更好的驯马师，或者更舒适的马车。我们需要的是如何从A点到达B点，于是汽车就诞生了。

而现代社会，当我们能从A点到B点之后，我们的需求也发生了变化。

我们如何能更快的从A点到B点？我们如何能更安全的从A点到B点？我们能不能在从A点到B点的过程中做其他更有意义的事情呢？

第三个例子：iPhone

我觉得iPhone是一个很好的例子。

在当时，手机已经诞生很久了，无论是摩托罗拉、诺基亚、黑莓、三星，都已经有非常成熟的手机了，而且销量也很好。

特别是发明手机的摩托罗拉和制霸市场的诺基亚，都如日中天的在手机行业发着横财。

这个时候乔布斯就开始思考了，如果我要发明一款新手机，我不能模仿摩托罗拉和诺基亚，我要从零开始设计。

一个理想的手机，应该是有大屏幕，能触摸，有智能应用程序，能接入互联网。于是iPhone就诞生了。

如果iPhone是在摩托罗拉或者诺基亚的基础上迭代设计，那可能就没有今天引领时代的iPhone了。

第四个例子：SpaceX

这个例子也非常好。因为过去人们对航空的认识，可能都停留在国家航空或者美国NASA的新闻上。

提到火箭大家想到的是卫星，是航空材料，是掉落的助推器，和飞入太空的火箭头。

马斯克非常崇拜第一性原理，他多次在不同场合反复提到过第一性原理思维。

他想要实现人类移民火星的梦想，于是就要开始造火箭了，于是他就开始思考了：

火箭为什么一定要用航空级材料呢，为什么不能就用不锈钢呢？

火箭为什么要那么复杂的生产，为什么不直接3D打印呢？

火箭为什么火箭每次发射了就没了成本巨高，为什么不能通过回收利用来降低成本呢？

火箭为什么要发明大推力的发动机，为什么不能并联多个发动机来提升推力呢？

于是SpaceX就一次次的证明了自己。

第五个例子：微信支付、支付宝

这个例子大家已经非常熟悉了，同时也是和金融和银行最相关的例子。

当我们站在上帝视角思考，过去，人们付钱，要么现金，要么刷卡，要么支票，要么赊账。

如果我们不需要用户带着现金就能付钱呢？

如果我们不需要用户拥有一张借记卡呢？

如果我们不需要用户拥有一张信用卡呢？

如果我们不需要用户拥有银行账户呢？

手机是用户一定随身携带的设备，如果用户只需要手机就能支付呢？

于是就有了微信支付和支付宝。

如果用户连手机都不需要就能支付呢？

作者在演讲中也提到，既然银行已经有唯一识别我的唯一特征，为什么还需要银行账号，为什么还需要手机呢？

于是就有了刷掌支付，和刷脸支付。

小结一下

上面那么多例子其实就是想告诉大家，第一性原理思维能给我们打开新世界的大门。

不管是在银行业，还是在金融服务行业，能变革的永远不是在已有产品或服务上的改进，而是从零开始的颠覆。

比如支付，过去的人们只经历过现金或者赊账。

而现代社会用支票和信用卡颠覆了现金。

但这一切又被新时代的技术所颠覆。

因此，如果我们认真分析一下金融服务的发展给我们带来了哪些思考，我们就会发现：

支付只是一种行为，完成价值交易才是目的，而达到这样的目的有很多手段。

银行平台技术

聊技术的话题我就完全不陌生了，因为这就是我的老本行。

当我们再来聊银行4.0，或者聊银行的实用性，银行平台技术都是一个基础话题。

它决定着银行的实用性，以及过去银行的信用、安全、服务在技术面前的表现形式。

当我们孤立的看某一项技术时，我们都会认为它应该是最好的银行平台技术。

比如区块链，它应该是最安全最可信的银行存储技术。

比如智能手机，它应该是最实用的银行入口。

比如AI，它应该是银行最佳投资分析工具。

比如分布式技术，它应该是银行可用性和安全性的最佳保障。

比如云计算，它应该是帮助银行降低基础设施成本的最好方式。

但为什么银行没有采用这些技术，或者没有全部采用，因为我们需要结合发展历史来看。

现代的银行都是从过去最早的银行一步步迭代过来的，技术也是一步步发展进步的。

因此，银行平台技术要想全面的运用最先进的技术往往都是一个伪命题。

银行只能在部分领域，不断的迭代更新自己的平台技术。

比如从原来的单体大型机，更新到现在的分布式平台。

从原来的单中心架构，更新到现在的多中心异地架构。

作者在演讲中也提到：

These technologies will increasingly embed banking in our world.
这些技术将越来越多地把银行业务融入我们的世界。

同时，作为技术人，总想着运用最新的技术，去改变金融服务的业务，让它变得更好。

但往往我们错了，新技术往往与之前的工作方式大相径庭，无法融入。

这是为什么呢？因为顺序错了，我们想要改变，不是因为新技术的出现，而是因为我们发现了可以被改变的地方，才发明了新技术。

就像作者演讲时说的：

That’s when a new piece of technology comes along that’s so different from the way it was done before, that it requires everyone to reset their thinking and change their behavior.
当一项新技术出现时，它与以往的工作方式大相径庭，这就需要每个人重新思考并改变自己的行为。

银行平台技术也一样，现在国内很多银行的发展，好像都在追逐某种被证明了是先进的、是安全的、是可靠的技术，并且在监管的努力下，最终实现技术大统一。

银行平台技术应该是为用户服务的，是为了实现银行的实用性，为金融业务带来改变的基础上，去创新和改进的。

传统的银行，在银行平台技术上，追求的都是稳定、安全、性能等。

随着时代的发展，技术的发展，当这些要素都能成为最普通的基本属性后，银行平台技术还能追求什么呢？

是当金融服务能在人们的生活中高度情景化时，反过来追求技术的实用性？

还是当金融服务从零开始颠覆的时候，追求技术的可替换性？

还是在AI迅猛发展的未来，追求技术的可控性与安全性？

或许，大家都可以思考一下。

AI投资

作者在原文中提到：

When it comes to what we’re seeing in terms of investment today, what’s happening is, you’re not getting people just looking at digital onboarding, you’re seeing from the perspective of investment and savings, looking at behavioral mechanisms behind savings and investing.
说到我们今天所看到的投资，现在的情况是，人们不再仅仅关注数字开户，而是从投资和储蓄的角度，关注储蓄和投资背后的行为机制。

过去，人们要买股票，要跑到交易大厅去，写上买哪支股票买多少股。

如今，人们都在手机上就完成了这个交易。

证券操作已经完成了从线下到线上的转变，它是一种颠覆式的改变吗？

我认为不是，证券交易从线下变到线上，本质上的行为没有变，只是改变了参与这件事的位置。

如果从零开始设计，我认为，投资的理想情况应该是，动态理解我的财务目标，实时掌握我的风险评估，我无需懂理财知识，AI就能完成投资。

解释一下，动态理解我的财务目标就是它能知道我的资产里有多少闲钱是可以用来理财的，而这个目标每个月是不一样的。

实时掌握我的风险评估就是说，在我有房贷的情况下，和我无负债的情况下的风险承受能力是不一样的。

举个例子，AI能自动用我的10万块闲钱，去投资股票、基金、债券等理财产品，并在我接受的范围内，帮我理财。

挣了，我开心，亏了，不影响我的生活。

但AI代替人类顾问去投资，应该有两种方向：

降低风险
增加收益

这两种方向带来的发展是不一样的，第一种更注重于风险管理，第二种则更注重于投资机会。

无论那种，AI都在朝着替代人类的方向发展。因为数据不会出错，只会不够。

最后，就用作者的原话来结束这部分内容：

It’s man with machine versus man without machine.
这是有机器的人与没有机器的人的较量。
But it won’t be long before it’ll be machine versus man.
但用不了多久，就会变成机器与人的较量。

银行4.0与金融服务的未来（上）

2024-10-30T17:40:50+08:00

写在前面

这篇文章是Brett King在一次演讲中的内容，标题是”银行4.0与金融服务的未来“。我把演讲内容全部摘抄了下来。

Brett King 是一位国际知名的金融科技专家、作家和演说家。著有《Bank 3.0》《Bank 4.0》等多本在金融科技领域具有广泛影响力的书籍。

本篇文章有2个部分：

演讲正文与英文详解
关于金融服务的思考

本篇文章有3个目的或者作用：

作为英文学习材料，里面有很多英文的表达是值得我们学习的。特别是关于金融领域的词汇。
作为金融领域知识，值得金融从业人员学习，里面有很多关于银行的发展以及金融服务的发展的理念。
分享一些自己观后的一些思考和总结，特别是作者关于金融服务的未来的部分。

注：文中的翻译部分使用了AI，其余均为自己创作。

演讲正文

先来阅读一下作者的演讲全文，文中关键的英文单词或者短语我都做了标注和解释。同时包含了中英文对照。原文是我从演讲中摘抄出来的，中文是我用AI翻译加我自己校对的。（虽然说AI翻译已经非常厉害了，但可惜还是离不开人工校对。）加粗的文字是我认为作者的金句。

Guten morgen, it’s great to be here.

早上好，很高兴来到这里。

Guten morgen是德语的早上好。

I hope I don’t disappoint after that glowing introduction.

希望我的介绍不会让你失望。

But yeah it’s quite a thing to wake up one morning and find your Twitter feed getting blown up by people saying: “isn’t that your book on President Xi’s shelf?”

不过一早醒来，发现自己的推特被人刷爆了，确实是件好事， “主席的书架上不是你的书吗？”

blow up：爆发，爆炸。 这里主要指推特被刷爆了。Blown是blow的过去分词。

So, that’s pretty cool.

所以，这很酷。

As a futurist , one of the things that you try and do is you’re not just looking at what’s going to happen in the future.

作为一个未来学家，你要做的一件事就是你不能只关注未来会发生什么。

futurist：未来学家

You have to look at analogies for what might happen based on what’s happened in the past.

你必须根据过去发生的事情来类比可能发生的事情。

analogy： /əˈnælədʒi/ n. 类比，比拟；

JP and I were talking about this last night, a futurist told him once, “you have to be a good pastist,” understanding the past to understand the future.

JP和我昨晚在谈论这个话题时，一位未来学家曾告诉他，“你必须是一个优秀的历史学家”，了解过去才能了解未来。

And when you look at what’s happening in financial services broadly.

当你看一下金融服务领域正在发生的事情时，你就会明白这一点。

broadly: /ˈbrɔːdli/ adv. 广泛地，普遍地；

When we look at isolated technologies like blockchain or the smartphone, for example, we might think of these as a data solution for the bank.

当我们看到区块链或智能手机等孤立的技术时，我们可能会认为这些技术是银行的数据解决方案。

Or we might think of mobile like a channel for the bank.

或者，我们可能会认为移动手机就像银行的一个渠道。

But if you step back from those specific technologies and you look at what’s happening in the world, something is changing.

但是，如果你从这些特定技术中抽身出来，看看世界正在发生什么，就会发现有些东西正在发生变化。

step back from: 从……退后：停止或减少对某事的参与或关注。

The world is digitizing and the world is digitizing because we’re seeking low friction and immediacy.

世界正在数字化，世界正在数字化，因为我们正在寻求低摩擦和即时性。

friction: /ˈfrɪkʃ(ə)n/ n. 不和，分歧；摩擦；摩擦力

immediacy: /ɪˈmiːdiəsi/ n. 直接；及时性

We want immediate responses, we want stronger commerce connections that can scale up more rapidly.

我们想要即时响应，我们想要更强大的商业联系，能够更迅速地扩展。

These are the systems that are changing globally.

这些都是全球正在发生变化的系统。

So within that framework, you can’t expect banking and financial services to stay the same as it has been.

因此，在这个框架内，你不能指望银行和金融服务一成不变。

Because, ultimately, it has to shift.

因为，归根结底，它必须转变。

So when we look at these phases of development of banking, if you overlay technology on this, you understand that it’s not just about inserting technology into banking, there’s a larger shift.

因此，当我们审视银行业发展的这些阶段时，如果将技术叠加上去，就会明白这不仅仅是在银行业中植入技术，而是一个更大的转变。

Part of the shift is around trust and the utility of the bank.

转变的一部分是围绕信任和银行的实用性。

So when we look at the Bank 1.0 world, the foundational elements of banking, going back to the time of the Medicis and Florence and Firenze and places like that.

因此，当我们审视银行 1.0 世界时，银行业的基本要素可以追溯到美第奇家族、佛罗伦萨和佛罗伦萨等地的时代。

美第奇家族（`the Medicis`）主要活动在佛罗伦萨（Florence，意大利语为 Firenze）。美第奇家族以银行业起家，建立了庞大的金融帝国。他们的银行网络遍布欧洲，为各国君主和贵族提供金融服务，积累了巨额财富。美第奇家族是文艺复兴的重要赞助者。

When you look at that, banking was very simple, you would go to the bank and we trusted the bank because that was the safe place to store the money.

银行业务非常简单，你可以去银行，我们信任银行，因为银行是安全的储钱场所。

But transactionally, as our demands on the banking system increased, we needed to put technology in place to keep up with the demands of utility.

但在交易方面，随着我们对银行系统的要求越来越高，我们需要引进技术来满足实用性的需求。

This is the first bank mainframe called ERMA.

这是第一台名为 ERMA 的银行大型机。

mainframe:  n. [计] 主机；大型机

Electronic Recording Machine for Accounting.

ERMA，它是一种用于银行的计算机技术，实现了银行记账和支票处理的自动化。最早由通用电气生产。ERMA 的出现标志着计算机技术开始在银行业发挥重要作用，推动了银行业从手工处理向计算机处理的转变，对银行业的现代化和发展产生了深远影响。

This was, I think, where acronyms were introduced into banking, through technology.

我想，这就是通过技术将缩写引入银行业务的地方。

acronym:  n. 首字母缩略词

This was built by MIT for Bank of America in 1953 and it was primarily designed to do check processing back then.

这是麻省理工学院 1953 年为美国银行设计的，当时主要用于支票处理。

check ：这里的check是指支票。

back then ：当时，指过去的某个时间点或时期。

So, technology started to change the banking sector.

因此，技术开始改变银行业。

the banking sector：银行业

Now you probably don’t know this, but prior to the introduction of ERMA and mainframes, we never had bank account numbers.

你可能不知道，在引入ERMA和大型机之前，我们从来没有银行账号。

prior to：在……之前，先于

You’d go to a bank and they would fill out a card, they’d put your name on it and that was your account record and your customer record.

你去银行，他们会填写一张卡片，写上你的名字，这就是你的账户记录和客户记录。

Your name and address was on a physical card in a bank branch.

你的姓名和地址都在银行分行的实体卡片上。

This was why, for 30 years after this, with some banks, you couldn’t move from one branch to another without opening another account.

这就是为什么在这之后的 30 年里，有些银行在不开设另一个账户的情况下，你不能从一家分行转到另一家分行。

Because they stored your bank account details on this little card.

因为他们把你的银行账户信息储存在这张小卡片上。

But ERMA couldn’t sort customer information by name, so they had to give each customer and each account a unique number to sort it in the computer system.

但 ERMA 无法按姓名对客户信息进行分类，所以他们必须给每个客户和每个账户一个唯一的编号，以便在计算机系统中进行分类。

The computers weren’t very sophisticated back then.

那时的电脑还不是很先进。

sophisticated：/səˈfɪstɪkeɪtɪd/  adj. 见多识广的，老练的，见过世面的；复杂巧妙的，先进的，精密的；水平高的，在行的；

This was the first use of bank account numbers, due to the mainframe.

由于采用了大型主机，这是第一次使用银行账号。

Then, in the mid-80s, we started to look at ways to extend the platform of banking and make it self-service.

80 年代中期，我们开始研究如何扩展银行业务平台，使其成为自助服务。

We had the Internet come along in the 90s, but this really started with the introduction of the ATM machine, self-service banking.

90 年代出现了互联网，但真正的起点是引入 ATM 机和自助银行服务。

come along:  出现：指某人或某物出现或到达。

What’s happening is, we’re trying to say, “We’re extending the bank as a platform.”

现在的情况是，我们试图说，"我们正在把银行作为一个平台来扩展”。

But our reliance on the bank as a building, the bank as a place, was becoming less and less important, because now we’re saying you can bank 24/7.

但我们对银行这座建筑的依赖，银行这个地方的依赖的重要性越来越低了，因为现在我们说你可以7*24小时办理银行业务。

reliance：/rɪˈlaɪəns/  n. 依靠，信任

less and less：越来越少

And then when Bank 3.0 came along, we extended that analogy to say you could bank anywhere, any time.

然后，当银行 3.0 出现时，我们将这一类比扩展到你可以随时随地进行银行业务。

And this mobile, if you step back from this and say, “Well this was just another channel to extend banking, ” then you don’t understand the implications, because what it was doing was saying: “Banking can be done wherever you are, you don’t need the bank.”

如果你退一步说：“这只是扩展银行业务的另一个渠道”，那你就不明白它的意义，因为它的作用是说： “无论你在哪里都能办理银行业务，你不需要银行。”

implications：/ˌɪmplɪˈkeɪʃ(ə)n/  n. 可能的影响（或作用、结果）；

what it was doing was saying：整句话可以翻译为 “它正在做的事情是在表明……”，“what it was doing”是“正在做的事情”，这里的saying表示 “表明 / 表示”。

But what was key was the core utility of the bank, was being surfaced through this technology.

但关键在于，银行的核心功能正通过这项技术浮出水面。

And so, the trust was changing from being about a place you could go where your money was safe, to now, a set of bank platform technologies that would enable you to do banking.

因此，人们对银行的信任正在从你可以去安全存钱的地方，转变为一套银行平台技术，可以让你进行银行业务。

Today, it’s not a bank charter that makes someone trust in you as a bank brand, it’s the utility of your bank.

今天，让人们信任你这个银行品牌的不是银行牌照，而是银行的实用性。

charter：/ˈtʃɑːrtər/  n. 宪章，章程；特许状，许可证；

bank charter：银行牌照

If your technology stops working for 10 days, you can’t access internet banking, point-of-sale systems are down, branch systems aren’t working, how much trust do you think people would have in your bank for these 10 days?

如果你的技术停止工作 10 天，你无法访问网上银行，POS系统瘫痪，分行系统无法工作，你认为在这 10 天里人们会对你的银行有多少信任？

point-of-sale：就是我们平时刷卡的POS机的全称。

So utility and trust become wrapped up.

于是，实用性和信任度就被捆绑在了一起。

wrapped up：总结，概括；包好，裹紧；这里理解为捆绑到了一起。

And technology now becomes the overarching mechanism for delivery of that utility.

而技术现在成为了提供这种效用的总体机制。

overarching：/ˌoʊvərˈɑːrtʃɪŋ/ adj. 首要的，总体；支配一切的；包罗万象的

mechanism：/ˈmekənɪzəm/  n. 机械装置，机件；途径，方法；（生物体内的）机制，构造；

But something else changed.

但还有其他一些变化。

We started to see us rethinking the way financial services should work.

我们开始重新思考金融服务的运作方式。

This is Yuebao, the most successful investment or savings product on the planet today, built in China on top of Alibaba’s system to capture those deposits from merchants and consumers working on Alibaba and Taobao, putting that aside and giving them some high-yield interest rate.

这就是余额宝，当今世界上最成功的投资或储蓄产品，它建立在中国阿里巴巴的系统之上，从阿里巴巴和淘宝网上的商家和消费者那里获取存款，并将其搁置起来，为他们提供高收益利率。

deposits：存款

merchants：商家

high-yield：高的，高产的

interest rate：利率

We classify this in the West as a money market fund.

在西方，我们将其归类为货币市场基金。

classify：/ˈklæsɪfaɪ/  v. 把……分类，把……分级；

money market fund：货币基金

Jack Ma doesn’t see it as that, that’s way they called it Yuebao, “Hidden treasures.”

马云不这么认为，所以他们叫它余额宝，“隐藏的财宝”。

doesn’t see it as：不认为它是…

treasure：/ˈtreʒər/ n. 金银财宝，珠宝，财富；

They saw it as a behavioral model for savings.

他们认为这是一种储蓄行为模式。

savings：储蓄

$180 billion assets under management, no branches, no humans involved in the sale of that product.

管理着 1800 亿美元的资产，没有分支机构，没有人参与产品销售。

In the past, you may have heard an argument that we need bank branches and face-to-face interaction because how else are we going to engage customers to take deposits or assets?

过去，你可能听过这样一种说法：我们需要银行网点和面对面的互动，因为我们还能如何吸引客户来存款或购买资产？

argument：n. 争论，争吵；论据，理由；辩论，讨论；这里理解为一种说法或论点。

engage：v. 雇用，聘请；参加，从事；吸引，引起；

And yet the most successful deposit product in the world today doesn’t involve humans, it’s completely automated.

然而，当今世界上最成功的存款产品并不涉及人类，而是完全自动化的。

And Alipay, Ant Financial, the parent company of Alipay in China, has a higher trust rating in China than most of the banks there.

而支付宝，也就是中国支付宝的母公司蚂蚁金服，在中国的信任度高于大多数银行。

Why? It could be argued that it’s because of utility.

为什么呢？可以说是因为实用性。

It could be argued that: 可以说，可以这样说。是一种常用表达。

Now, when you see where we’re going with this, the next generation of technology, as we’re talking about, voice-based artificial intelligence, personal smart assistants built into our home and telephone.

现在，当你看到我们的发展方向时，下一代技术，正如我们正在谈论的，基于语音的人工智能，内置在我们的家庭和电话中的个人智能助理。

Augmented reality smart glasses in a few years that can give you data in your field of view so you can make decisions.

几年后，增强现实智能眼镜可以在你的视野中提供数据，让你做出决策。

field of view：视野：指在不转动头部或移动身体的情况下，一个人可以观察到的区域。

These technologies will increasingly embed banking in our world.

这些技术将越来越多地把银行业务融入我们的世界。

embed： v. （使）嵌入， 把……插入；

And so, the design of a banking system to fit into this world, requires us to sort of rethink banking from the ground up around utility, not around the products or the channels we’re using.

因此，要设计一个适应这个世界的银行系统，我们就必须从根本上重新思考银行业务的实用性，而不是我们正在使用的产品或渠道。

sort of：表示程度上的某种程度，有点儿。

from the ground up：从头开始，从零开始

Now, when you look for evidence of changes into the way economics works and so forth and you look at technology, the way it’s impacted, the biggest changes historically that have taken place in the world have happened with what we call “first principles design thinking.”

现在，当你寻找改变经济运行方式等的证据，并审视技术及其影响方式时，历史上发生在世界上的最大变化都发生在我们所说的 “第一性原理设计思维”中。

economics：经济学

and so forth：等等，等同于and so on或者etc.

historically：adv. 关于历史事件，从历史观点上说；在过去，历史上地

That’s when a new piece of technology comes along that’s so different from the way it was done before, that it requires everyone to reset their thinking and change their behavior.

当一项新技术出现时，它与以往的工作方式大相径庭，这就需要每个人重新思考并改变自己的行为。

The automobile was an example of that.

It got rid of all these people working in London and New York who were shoveling the horse dung off the streets, it changed employment patterns, it changed the architecture of cities.

它摆脱了所有在伦敦和纽约工作的人，这些人铲除了街道上的马粪，它改变了就业模式，改变了城市建筑。

got rid of：摆脱，除去：

shovel：/ˈʃʌv(ə)l/  v.（用铲子）铲起，铲去；

dung：/dʌŋ/ n.（动物的）粪，粪肥；

It created the middle class in the United States.

the middle class：中产阶级

它在美国创造了中产阶级。

The Model T Ford production line is credited with creating the middle class in the United States.

福特Model T车生产线被认为是美国中产阶级的开创者。

be credited wtih：/ˈkredɪtɪd/ 认为…有某种成就

All of this from a first principles rethink about transportation.

所有这一切都源于对交通的初步反思。

transportation：n. 运输，运送；交通运输系统；

We don’t need a faster horse, we need to rethink the way we get from point A to point B.

我们不需要更快的马，我们需要重新思考从A点到B点的交通方式。

You can think of other examples as well.

你还可以想到其他例子。

A great example of this is the iPhone.

iPhone 就是一个很好的例子。

When Steve Jobs worked on the iPhone and the iPod, you can see this is an example of the prototype that they used for creating the first iPhone and the first iPods.

当史蒂夫-乔布斯开发 iPhone 和 iPod 时，你可以看到这是他们用来制作第一代 iPhone 和第一代 iPod 的原型机。

prototype：/ˈproʊtətaɪp/  n. （新型汽车、机器等的）原型，雏形；

Now, Jobs didn’t take the Nokia Banana Phone, or the Motorola flip or the BlackBerry RIM and try and iterate on that, he said, “If we’re going to take a touchscreen device, a mobile phone, internet access, software apps, and combine them into a device, how would that work?”

现在，乔布斯并没有把诺基亚香蕉手机、摩托罗拉翻盖手机或黑莓 RIM 手机作为原型，而是说：”如果我们要把触摸屏设备、移动电话、互联网接入、软件应用程序结合到一个设备中，这将如何运作？“

This is what we call first principles thinking.

这就是我们所说的第一性原理。

Let me give you one other example and then I’ll tie this back to banking.

让我再举一个例子，然后把它与银行业联系起来。

tie ... back to：与…联系起来

Let’s talk about the development of technology here in Germany: the V-2 rocket.

让我们来谈谈德国的技术发展：V-2 火箭。

It was an amazing piece of technology, if you step away from what it was used for, it was decades ahead of the rest of the world in terms of technology development.

这是一项了不起的技术，如果不考虑它的用途，就技术发展而言，它领先世界其他国家几十年。

step away from：离开，远离。

decades：n. 数十年（decade 的复数）

ahead of：领先于

in terms of：就……而言；

But Wernher von Braun, who was behind this technology, said he wanted to get men to the moon.

但这项技术的幕后推手沃纳-冯-布劳恩说，他想把人类送上月球。

沃纳・冯・布劳恩（Wernher von Braun）是 20 世纪液体燃料火箭技术和宇航工程的开创者和奠基人。

So, at the end of the Second World War, the Russians, Americans and the British were rushing into Germany to try and get access to these resources, namely Wernher von Braun.

因此，在第二次世界大战结束时，俄国人、美国人和英国人纷纷涌入德国，试图获得这些资源，也就是沃纳-冯-布劳恩。

rushing into：v. 仓促行动；仓促从事

namely：adv. 即，也就是

He went on to build the Apollo program.

他后来建立了阿波罗计划。

Using this technology, he iterated on the V-2, created the Mercury-Redstone rocket, the Apollo rocket, and at the height of this program, the average launch would cost about $1.2 billion in today’s terms, about $6000 to get a kilogram of stuff into orbit.

利用这项技术，他迭代了 V-2 型火箭，创造了 “水星-红石 ”火箭和 “阿波罗 ”火箭，在这一计划的鼎盛时期，平均发射费用以今天的价格计算约为 12 亿美元，将一公斤的东西送入轨道约需 6000 美元。

at the height of：在…的顶峰时期

in today’s terms：用今天的眼光来看，这里指的是如今的汇率来算

kilogram：n. 千克，公斤

orbit：/ˈɔːrbɪt/  n. （环绕地球、太阳等运行的）轨道；

Over the last 50 or 60 years, we’ve reduced that by about a third by iterating on the Apollo design.

在过去的五六十年里，我们通过对阿波罗火箭设计的不断改进，将成本降低了大约三分之一。

a third：三分之一

Something interesting happened in the last few years: along came Elon Musk and SpaceX, and they said, “What happens if we were going to redesign rockets, if we started from scratch? What if we didn’t take the Wernher von Braun Apollo program design and iterated on this? What if we started from scratch using 3D printing of titanium engine parts? What if we started with new computer models with new systems? If we started from scratch would this make a difference?”

过去几年发生了一件有趣的事：埃隆-马斯克和 SpaceX 公司出现了，他们说："如果我们要重新设计火箭，如果我们从零开始，会发生什么？如果我们不采用沃纳-冯-布劳恩的阿波罗计划设计并在此基础上迭代呢？如果我们使用 3D 打印钛发动机部件，从零开始呢？如果我们使用新的计算机模型和新的系统从头开始呢？如果我们从零开始，这会有什么不同吗？

along came：到达或出现在某个地方，这里指马斯克出现了。

started from scratch：从头开始（固定搭配）

titanium：/taɪˈteɪniəm/ n. [化学] 钛（金属元素）

First principles thinking.

这就是第一性原理。

And the result is that in just 14 years, with the reusability on the Falcon Heavy platform, SpaceX has got the cost to orbit down to about $300 per kilogram, a 95% reduction of the days of the Apollo program.

结果是，在短短 14 年时间里，凭借 “猎鹰重型 ”平台的可重复使用性，SpaceX 将进入轨道的成本降至每公斤约 300 美元，比阿波罗计划时期降低了 95%。

Falcon：n. 隼，猎鹰。这个单词本身是猎鹰的意思，但这里Falcon Heavy platform指的是SpaceX的猎鹰重型平台。

And they sent Starman into space in a Tesla.

他们还用特斯拉把 “星人 ”送上了太空。

Starman是DC漫画的星光侠。猎鹰重型火箭的首次发射中，作为模拟有效载荷，火箭携带了 SpaceX 创始人埃隆・马斯克的特斯拉 Roadster 红色电动跑车，而在该跑车的驾驶座上有一个名为 “Starman” 的假人。是一种致敬，星光侠是美国奥帕尔市的一位科学家，具有超能力，在漫画中是一个与太空相关的英雄角色。这也契合了此次太空发射行动以及人类对太空探索的追求。

So this required rethinking the way rockets worked, reusability, all of these things, were essential components of this.

因此，这需要重新思考火箭的工作方式，可重复使用性，所有这些东西，都是必不可少的组成部分。

So you end up with two competing design themes in terms of how we incorporate technology into the world.

因此，在我们如何将技术融入世界的问题上，最终会出现两个相互竞争的设计主题。

end up with：以……结束

incorporate： v. 包含，合并；

You have first principles thinking, which says: we’ve had a major leap in technology, it changes everything.

一种是第一性原理思维，即：我们的技术有了重大飞跃，它改变了一切。

a major leap：重大飞跃

Or the other approach: we take technology and gradually improve on it.

或者是另一种方法：我们利用技术并逐步加以改进。

gradually：adv. 逐渐地，逐步地

That’s what’s happened in banking today.

今天的银行业就是这样。

We’ve taken technologies like the mainframe, the ATM machine, internet, mobile, and we’ve iterated on the traditional banking model.

我们采用了主机、ATM 机、互联网、移动电话等技术，并对传统银行模式进行了迭代。

Branch banking, investment advisors, we’ve iterated on this.

分行银行、投资顾问，我们都在此基础上进行了迭代。

So, when the iPhone came along, instead of saying, “There’s an opportunity to completely rethink the way financial services fits into people’s lives,” we took the primary artifact, the bank account, and we stuck a representation of that in the phone.

因此，当 iPhone 出现时，我们并没有说 “现在有机会彻底重新思考金融服务融入人们生活的方式”，而是将银行账户这一主要工具植入手机。

completely： adv. 完全地，彻底地

fits into：vt. 适合，适应；符合

artifact：n. 人工制品

stuck：stuck主要有卡住、陷入的意思，这里主要指塞进，也就是植入。

a representation of：…… 的代表；…… 的象征；…… 的表现形式

This is what we call Design-by-Analogy.

这就是我们所说的 “类比设计”。

So, these are the two competing design schools.

这就是两种相互竞争的设计流派。

schools：这里指的是学派、流派。不是学校。

First principles design, start from scratch or iterate on the existing model by incorporating technology.

第一性原理设计，从零开始，或者在现有模式上结合技术进行迭代。

incorporating：/ɪnˈkɔːrpəreɪtɪŋ/ adj. 合并的

Remember, historically speaking, the biggest leaps and the biggest changes in the world have occurred through first principles thinking.

请记住，从历史上看，世界上最大的飞跃和最大的变革都是通过第一性原理思维实现的。

historically speaking：从历史上看，用于引出一段历史背景或历史事件的讲述

So how would you think about acquisition of customers in the first-principle world?

那么，在第一性原理的世界里，你会如何考虑获取客户呢？

acquisition：/ˌækwɪˈzɪʃ(ə)n/  n.学得，习得；（金钱、财物等的）获取；这是获取客户的常用单词。

Well, we’ll introduce Jack Ma.

好吧，我们来介绍马云。

Hear what he had to say about competing with Walmart.

听听他对与沃尔玛竞争的看法。

It doesn’t matter that he’s talking about the retail business there because he’s used the same strategy in financial services in China today, making him one of the fastest-growing financial services organizations or Ant Financial, one of the fastest in the world.

他所说的零售业务并不重要，因为他今天在中国的金融服务领域也采用了同样的策略，使其成为全球发展最快的金融服务机构之一，即蚂蚁金服。

Hear what he said about competing with Walmart.

听听他是怎么说与沃尔玛竞争的。

He said Alibaba is going to be bigger than Walmart in a couple of years, because of this reason.

他说，因为这个原因，阿里巴巴将在几年内比沃尔玛还要大。

You did a great job in Baba, so I said maybe in 10 years, we’ll be bigger than Walmart.

你在巴巴做得很好，所以我说，也许 10 年后，我们会比沃尔玛还大。

He said, “Young man, you have good hope.”

他说："年轻人，你的希望很大。”

I said, “Let’s make a bet. In 10 years we’ll be bigger than Walmart on sales, because if you want to have 10000 new customers, you have to build a new warehouse and this and that. For me, two servers, two computers.”

我说："我们打个赌吧。10 年后，我们的销售额将超过沃尔玛，因为如果你想拥有 10000 个新客户，你就必须建一个新仓库，建这个建那个。对我来说，两台服务器，两台电脑。

warehouse：n. 仓库，货栈

That’s all he says he needs to get 10000 customers, two servers.

他说，要获得 10000 个客户，他只需要两台服务器。

So, in the world that Jack Ma thinks of financial services being embedded in people’s lives, the ultimate, low-friction, financial services engagement means you can execute everything you need to cross-digital channels.

因此，在马云认为金融服务已经嵌入人们生活的世界里，终极的、低摩擦的金融服务参与意味着你可以执行跨数字渠道所需的一切。

ultimate：adj. 最终的，最后的

friction：n. 不和，分歧；摩擦；摩擦力

Whereas with banks, we iterate on this, and we say, “Well, we don’t want to sell stuff on the internet because that’s going to cannibalize our existing agency business or our branch-based business, so let’s put some transactional stuff online.”

而对于银行来说，我们会反复斟酌，然后说："好吧，我们不想在互联网上卖东西，因为这会蚕食我们现有的代理业务或分行业务，所以我们还是把一些交易性的东西放到网上吧。”

Whereas：conj.（表示对比）但是，然而；

cannibalize：/ˈkænɪbəlaɪz/ vi. 调拨人员；拆用配件；同类相食。它的名词cannibal是食人者的意思，它的形容词cannibal是食同类的、吃人肉的意思。所以这里就是指蚕食什么。

So, when the internet came along, we didn’t sell investment products or bank accounts on the internet, we created internet banking, which was essentially the bank statement online, behind a login.

因此，当互联网出现时，我们并没有在互联网上销售投资产品或银行账户，而是创建了网上银行，本质上就是通过登录在网上提供银行对账单。

bank statement：银行结单，银行对账单。

essentially：adv. 本质上，根本上；大体上，基本上

Then mobile came along and we said, “Great, now we can put those bank statements on a smaller screen.”

然后手机出现了，我们说，“太好了，现在我们可以把银行对账单放到更小的屏幕上了”。

This is the iterative thinking.

这就是迭代思维。

So what you have today is, compared with first principles players in this ecosystem, all of the challenger banks of the world and the new behavioral investment platforms and so forth, are all about digital onboarding.

因此，与这一生态系统中的第一性原则参与者相比，如今世界上所有的挑战者银行和新的行为投资平台等都在致力于数字化开户。

onboarding：通常指新员工入职培训；新用户引导流程，顾客引导；但这里指开户。

And yet, less than 5% of the banks in the world today offer complete digital onboarding of customers.

然而，目前全球只有不到 5%的银行为客户提供完整的数字化开户服务。

And yet：然而，尽管如此。用于引出一个与前面所说的相反或出乎意料的事实或情况。

We’re already starting to see the world diverge around this very simple engagement principle: how you acquire customers in the digital age.

我们已经开始看到，围绕着这一简单的参与原则：如何在数字时代获取客户，世界已经开始出现分歧。

diverge：/daɪˈvɜːrdʒ/ v. 相异，出现分歧；

If you’re going to design value stores, you have to understand that technology is going to change the nature of banking itself, and that would have to start with the basic bank account or a value store.

如果要设计价值商店，就必须明白技术将改变银行业务本身的性质，而这必须从基本的银行账户或价值商店开始。

the nature of：…的性质；…的本性

In fact, if you think about it, if you break down the value financial service players provide to their customers, extending on what JP was talking about before, we probably only have three core products: we have the ability to store value, we have the ability to move money, and we have the ability to access credit.

事实上，如果你仔细想想，将金融服务公司为客户提供的价值进行细分，根据 JP 之前的论述，我们可能只有三种核心产品：我们有存储价值的能力，我们有转移资金的能力，我们有获取信贷的能力。

break down：分解，拆开，分析或细分

They’re the core foundation elements or utility that our products that we give to customers provide.

它们是我们为客户提供的产品的核心基础元素或实用性。

Let’s step back from the technology and think about the change that’s occurring in the value store itself at the heart of banking and financial services.

让我们从技术角度退一步，思考一下作为银行和金融服务核心的价值商店本身正在发生的变化。

If you look historically at the value stores we used to use, they weren’t particularly smart.

如果你回顾一下我们过去使用的价值存储，它们并不是特别智能。

particularly：adv. 非常，尤其

They would store our money safely and at the time, that was what the core value proposition, the trust in a bank was for, because your money was safe.

它们会安全地存储我们的钱，而在当时，这正是银行的核心价值主张和信任所在，因为你的钱是安全的。

proposition：n. 主张，观点；

But as technology started to come into play, we took those dumb artifacts and we put them inside our technology platform to try and give some bigger utility.

但随着技术的发展，我们把这些笨拙的工具放到了我们的技术平台上，试图让它们发挥更大的作用。

come into play：开始起作用

dumb：adj. 愚蠢的；简易的

But they were still dumb, they didn’t provide any feedback.

但它们仍然很笨，没有提供任何反馈。

That basic debit card or credit card you use when you go to visit a store, it doesn’t tell you your balance before and after the transaction.

你去商店购物时使用的基本借记卡或信用卡不会告诉您交易前后的余额。

debit card：借记卡

credit card：信用卡

That’s the most requested piece of information you get from customers about their day-to-day bank account.

这是您从客户那里得到的关于其日常银行账户的最多信息。

day-to-day：adj. 日常的；逐日的

We had to think about this in a different way.

我们必须换个角度思考这个问题。

When it comes to what we’re seeing in terms of investment today, what’s happening is, you’re not getting people just looking at digital onboarding, you’re seeing from the perspective of investment and savings, looking at behavioral mechanisms behind savings and investing.

说到我们今天所看到的投资，现在的情况是，人们不再仅仅关注数字开户，而是从投资和储蓄的角度，关注储蓄和投资背后的行为机制。

And not saying you need a minimum AUM to qualify as a customer to get into this account, just saying, “Let’s change your behavior so you can save. Let’s change the way you save so you can invest more money.”

并不是说你需要最低资产管理规模才有资格成为这个账户的客户，只是说："让我们改变你的行为，这样你就可以储蓄。让我们改变你的储蓄方式，这样你就能投资更多的钱"。

And not saying：而不是说（常见用法）

AUM：资产管理规模（asset under management）

qualify：v.（使）有权去做；取得资格，达到标准；

Because this, over time, builds AUM faster than saying, “Here’s a great product to stick your money in.”

因为随着时间的推移，这比说 “这是个不错的产品，你可以把钱存进去” 更快地积累资产管理规模。

over time：随着时间的过去，它也有加班的意思，但这里就是指时间流逝。

stick … in：放入，这里就是存钱，投进去的意思。

So, this is the change, it’s a behavioral framework, around the value store, not a product framework.

所以，这就是改变，是围绕价值存储的行为框架，而不是产品框架。

When you look at how this might evolve, a great illustration of this is happening in China right now with ICBC, with their AI investment platform.

当你看这可能如何发展时，中国工商银行的人工智能投资平台就是一个很好的例子。

evolve：v. 进化，演化；

illustration：/ˌɪləˈstreɪʃ(ə)n/ n. 插图，图解；说明，例释；实例，示例。这里主要是示例的意思。

Now what they do is they monitor your behavior in terms of your portfolio, to produce a very detailed risk model.

现在，他们所做的就是监测你在投资组合方面的行为，以生成一个非常详细的风险模型。

portfolio：在这里指的是投资组合。

produce：v. 生产，产生；

They’re eliminated the risk profile questionnaire as part of the investment process.

他们取消了作为投资流程一部分的风险状况问卷。

eliminated：/ɪˈlɪmɪneɪtɪd; ɪˈlɪməˌneɪtɪd/ v. 被淘汰；消除；排除

risk profile：风险评估

questionnaire：/ˌkwestʃəˈner/ n. 问卷，调查表

Now, from a perspective of a regulator, you might say, “This is a problem, because we need that risk profile questionnaire to understand your risk profile and then understand that you’ve committed to that risk contained in that investment product.”

现在，从监管者的角度来看，你可能会说：“这是个问题，因为我们需要风险评估问卷来了解你的风险状况，然后了解你是否承诺承担该投资产品所包含的风险。“

regulator：n.（某行业的）监管者，监管机构；

But that’s iterative thinking.

但这是迭代思维。

First principles design thinking is: well, let’s monitor your behavior and learn how risky you are.

第一性原则设计思维是：好吧，让我们监控你的行为，了解你的风险有多大。

And if that risk is a problem for you, let’s change your behavior over time by educating you, by giving you the right behavioral triggers.

如果这样的风险对你来说是个问题，那就让我们通过教育你、给你正确的行为诱因来改变你的行为。

This is really at the heart of this change around financial services.

这正是金融服务变革的核心所在。

at the heart of：在……的核心；

As we get smarter, banking and investment, and these tools are becoming embedded in this world through technologies.

随着我们变得越来越聪明，银行和投资以及这些工具正通过技术嵌入这个世界。

This is leading us to move away from the financial products we used to have to understand that the utility of financial institutions is now serviced not through products through a channel, but through technology experiences that service the utility.

这促使我们摆脱过去的金融产品，认识到金融机构的实用性现在不是通过渠道提供产品，而是通过服务于实用性的技术体验来实现。

move away from：远离

institutions：n. 机构，团体；

The core ability to move money, store value or access credit.

核心能力是转移资金、存储价值或获取信贷。

How we adapted to this in the past, is we took those traditional interactions we’d had in the physical space and we implemented electronic forms or electronic systems to mimic the processes we’d had in the branch or with the investment advisor.

过去，我们是如何适应这种情况的？我们采用了实体空间中的传统互动方式，并实施了电子表格或电子系统，以模仿我们在分行或与投资顾问之间的流程。

adapted：v. 适应；调整

mimic：v. 模仿

We iterated on this from a technology perspective.

我们从技术角度对此进行了迭代。

Now, I’m going to show you how Capital One did this with Alexa in respect to their credit card product.

现在，我将向大家展示 Capital One 在其信用卡产品中是如何使用 Alexa 的。

in respect to：关于；就……而言

Alexa：Alexa是亚马逊推出的一个智能助理音响，类似于小爱音响。

It's not a core product, but it’s a good illustration of iteration.

这不是核心产品，但它很好地说明了迭代的问题。

Voice is the next big technology that’s going to affect financial services.

语音是影响金融服务的下一个重要技术。

This is how one of the first banks in the world attacked the use of Amazon Alexa with voice.

世界上首批使用亚马逊 Alexa 语音技术的银行之一就是这样做的。

The Capital One Skill for Amazon Alexa makes credit card payments easier than ever.

亚马逊 Alexa 的 Capital One Skill 让信用卡支付变得前所未有的简单。

After saying, “Alexa, open Capital One” and speaking your personal key, you can pay your bill using only your voice.

说 “Alexa，打开 Capital One ”并说出你的个人密钥后，你就可以用你的声音支付账单了。

“When’s my payment due?”

“我的付款何时到期？

“The payment of your credit card is due July 9th.”

“您的信用卡付款截止日期是 7 月 9 日"。

“Pay my Capital One credit card bill.”

“支付我的Capital One信用卡账单”

You’ll get the option to pay your balance or a minimum payment.

您可以选择支付余额或最低还款额。

Make your choice and confirm.

做出选择并确认。

A confirmation code will appear on the Capital One Skill card in your Alexa app.

在 Alexa 应用程序中，Capital One Skill 卡上会显示一个确认代码。

Once the payment has been made.

付款完成后

“Confirm.”

确认

“All set, I’ve made the payment for you.”

“一切就绪，我已为您完成支付"。

All set：准备就绪：表示一切都已安排好或准备完毕。

The Capital One Skill makes account management as easy as speaking up.

Capital One Skill 让账户管理像说话一样简单。

Just ask Alexa, to find out for yourself.

一问 Alexa，马上知道。

This is not bad for a first attempt at adapting Alexa, but they just took the product they had in the branch, the credit card, and said, “How do we put this on the voice channel?”

这对于首次尝试改编 Alexa 来说还算不错，但他们只是把网点里的信用卡产品拿出来说："我们怎么把它放到语音渠道上？”

Whereas a first principle designer would say, “You don’t need plastic to make a payment, you’ve got your voice, that’s your unique identifier. As long as you can attach the voice to a value store, you don’t need plastic, or a 16-digit number. You can get access to credit, but that can be based on an experiential basis rather than a physical card.”

而一个第一性原则设计师会说："你不需要信用卡来付款，你有你的声音，那是你独一无二的标识符。只要你能把语音附加到一个价值商店，你就不需要实体信用卡，也不需要 16 位数字的号码。你可以获得信贷，但这可以建立在体验的基础上，而不是一张实体卡。

plastic：/ˈplæstɪk/ 信用卡，这里是想强调它是一个实体塑料卡。

As long as：只要

That’s first principles thinking versus iterative thinking.

这就是第一性原则思维与迭代思维的对比。

versus：对比，它的缩写是VS

When you look for evidence of first principles design in the financial services world, you see a lot of it coming out of China and new fintech start-ups around the world.

当你在金融服务领域寻找第一性原理设计的证据时，你会发现中国和世界各地的新金融科技初创企业都有很多这样的设计。

start-ups：创业公司

This is, of course, Tencent WeChat.

当然，这就是腾讯微信。

Now, in China, 98% of mobile payments go through two technology platforms, Ant Financial’s Alipay and Tencent WeChat, not through the traditional banks or traditional payments networks.

现在，在中国，98% 的移动支付都是通过蚂蚁金服的支付宝和腾讯微信这两个技术平台进行的，而不是通过传统银行或传统支付网络。

And this has happened in the space of just a few years.

而这一切都发生在短短几年间。

in the space of：在某个时间内，不超过某个时间

Last year, $12 trillion in mobile payments, what that means is this year, China’s mobile payments transaction traffic will surpass all of the card traffic of the world.

去年，移动支付交易额达到 12 万亿美元，这意味着今年中国的移动支付交易额将超过全球所有银行卡交易额。

trillion：万亿（生活中不常用，但金融领域挺常用的，所以提出来）

surpass：v. 超过，胜过，优于；

what that means is：这意味着

There will be more mobile payments globally done this year than all of the plastic card payments done across traditional means.

今年，全球移动支付的交易量将超过通过传统方式进行的所有实体信用卡支付。

plastic card：这里强调的是实体信用卡。

means：n. 手段，方法；金钱

This is a pretty big shift.

这是一个相当大的转变。

But WeChat, they didn’t try to create a credit card or debit card that you signed up for at a branch and you use a traditional point-of-sale network, they just used a simple QR code.

但微信并没有试图创造一种信用卡或借记卡，让用户在网点注册并使用传统的POS机网络，他们只是使用了一个简单的二维码。

First principles thinking around payments.

围绕支付的第一性原理思维。

It wasn’t a payment product, it was enabling the utility of a payment experience.

这不是一个支付产品，而是实现了支付体验的实用性。

When Uber was faced with the challenges of growth in American cities, in cities like New York, San Francisco, Chicago and Los Angeles, they couldn’t recruit drivers fast enough and they found out that 30% of the drivers who started the application process in the app got to a single field in the app and abandoned the driver sign-up process.

当 Uber 在美国城市面临发展挑战时，在纽约、旧金山、芝加哥和洛杉矶等城市，他们招募司机的速度不够快，他们发现有 30% 的司机在应用程序中开始申请流程时，只填写了应用程序中的一个字段，就放弃了司机注册流程。

recruit：v. 招聘，招收

abandon：v. 抛弃，遗弃；

That field was the debit card.

这个字段就是借记卡。

Because they had driven yellow taxi cabs and had never had a bank account, they’d been paid in cash.

因为他们开的是黄色出租车，从来没有银行账户，他们的工资都是现金支付的。

yellow taxi cabs：出租车。这里主要是指纽约经典且传统的黄色出租车。顺便说一些历史，在18世纪的英国，贵族往往都雇佣一辆单马双轮轻便车（cabriolet），到了19世纪，人们开始用cabriolet的缩略词cab来代指城中专供出租的大型马车，后来汽车出现了，出租马车就变成了汽车，所以出租车就叫cab了。

So, to enable them to grow Uber faster, they had to issue drivers with a bank account.

因此，为了让 Uber 更快地发展，他们必须向司机发放银行账户。

Overnight, Uber became one of the third-largest acquirers of small business bank accounts in the United States.

一夜之间，Uber 成为美国第三大小型企业银行账户收购者之一。

the third-largest：第三大，第几大就把中间那个数字换成几

But Uber doesn’t want to be a bank.

但 Uber 并不想成为一家银行。

They needed the utility of the bank built into their app to continue to grow their business.

他们需要在应用程序中内置银行的实用功能，以继续发展业务。

As JP mentioned, I founded Moven in 2011 in the US, and we’ve built essentially this app, this is the latest iteration of our app, as a smart bank account that will advise you on how to be financially healthy.

正如 JP 所提到的，我于 2011 年在美国创立了 Moven 公司，我们将这款应用（这是我们应用的最新迭代版本）打造成了一个智能银行账户，为你提供财务健康方面的建议。

found：v. 建立；创立

And that includes investment products, it includes savings behavior, and so forth.

这包括投资产品、储蓄行为等等。

When we introduced our first savings experience in Moven which was in Q4 last year, 40% of our customers immediately deposited funds into the Moven savings “stash” as we call it, our savings account or value store.

去年第四季度，当我们在 Moven 中首次推出储蓄体验时，40% 的客户立即将资金存入了 Moven 储蓄，我们称之为“stash”，它是储蓄账户或价值存储。

But we did zero marketing and we have 0% APR on that savings account.

但我们没有进行任何营销活动，而且该储蓄账户的年利率为 0%。

APR：年利率

40% of our customers immediately, overnight, responded to that.

一夜之间，40% 的客户立即对此做出了反应。

We can tell you the best day of the week to prompt people to save.

我们可以告诉你一周中哪天最适合提示人们存钱。

We can tell you the exact time of day that is the best time to message someone to save money.

我们可以告诉你一天中哪个时间段是给别人发送储蓄信息的最佳时间。

Behaviorally, we created a behavioral savings process, not a savings account.

在行为上，我们创建的是一个行为储蓄流程，而不是一个储蓄账户。

The customer doesn’t even need to sign up for a savings account with Moven, we just enabled their savings behavior.

客户甚至不需要在Moven公司注册储蓄账户，我们只需启用他们的储蓄行为。

When you start thinking about utility as it changes, banking becomes highly contextual.

当你开始考虑实用性的变化时，银行业务就会变得高度情景化。

contextual：/kənˈtekstʃuəl/ adj. 上下文的，与语境相关的

A great example of this might be credit access for day-to-day banking where I walk into a grocery store and I fill up my cart and I go to the checkout, and then they swipe my card and the cashier says, “I’m sorry Sir, it’s been declined.”

一个很好的例子可能是日常银行业务的信用访问，我走进一家杂货店，把购物车装满，然后去结账，他们刷我的卡，收银员说：“对不起，先生，它被拒绝了”。

grocery store：杂货店

Some of your customers may have had this problem.

您的一些客户可能遇到过这种问题。

Then you go fishing for another card.

然后你就去找另一张卡。

fish for：设法获取；探听；摸索寻找

“Let me give you this one, try this one.”

“我给你这张，试试这张"。

What about if we didn’t think about that as a product-based process?

如果我们不把这当成一个基于产品的过程呢？

What about if you think: when you walk in the grocery store, if I know you don’t have enough money to do your shopping, I present you with an offer for credit access there and then, to solve that problem?

如果你认为：当你走进杂货店时，如果我知道你没有足够的钱来购物，我会当场向你提供信贷服务，以解决这个问题，你会怎么想？

I don’t wait for you to get to the checkout.

我不会等你去结账。

This is experiential design of this.

这就是体验式设计。

Then we come back to the role of advisors, because when it comes to financial services, we’ve had this view predicated over the last 30 or 40 years that the best way to get the best bang for your buck, in investment terms, is you need to have a human involved, you need to get that advice.

然后我们再来谈谈顾问的作用，因为说到金融服务，在过去的三四十年里，我们一直有这样一种观点：在投资方面，要想获得最佳收益，最好的办法就是让人参与进来，你需要得到建议。

predicate：v. 使基于，使取决于；表明，断言

get the best bang for your buck：是一个常用的英语表达，意思是 “以最少的钱获得最大的价值；物有所值”。bang在这里不是 “砰” 的意思，而是指 “效果、影响力、价值” 等，是一种形象的说法。buck通常指 “美元” 或 “钱”，是一种通俗的货币表达。

in ... terms：在什么方面

But technology is also going to change the way we think of advice in financial services.

但技术也将改变我们对金融服务建议的看法。

In fact, probably the most common form of advice our customers will be faced with in the future from financial services is just something as simple as this: Hey Siri, can I afford to go out for dinner tonight?

事实上，未来我们的客户最常遇到的金融服务建议可能就是这样简单：嘿 Siri，我今晚有钱出去吃晚饭吗？

afford： v. 买得起；有（时间）做某事；承担得起

Your bank account should be smart enough to answer that question.

你的银行账户应该有足够的智能来回答这个问题。

And if you’re looking at retirement, how much do I need to save each week to put my son through college?

如果你考虑的是退休后的生活，我每周需要存多少钱才能供儿子上大学？

be looking at：考虑，思考

retirement：退休

put my son through college：供我儿子读大学

How much do I need to put away for retirement?

我需要为退休储蓄多少钱？

put away：存钱

There are questions a smart bank account should just be able to know, should just be able to answer for you, and artificial intelligence is going to give us that platform.

这些问题，智能银行账户应该能够知道，应该能够为你解答，而人工智能将为我们提供这样的平台。

Let me explain it in this sort of context: we talked about autonomous vehicles, smart-driving cars, so this is what we think of when we think of how an autonomous vehicle drives.

让我在这样的背景下解释一下：我们谈到了自动驾驶汽车、智能驾驶汽车，所以当我们想到自动驾驶汽车是如何驾驶的时候，我们想到的就是这个。

autonomous：/ɔːˈtɑːnəməs/ adj. 自治的，有自治权的；

It learns by capturing all of this information using LiDAR, radar detection, camera suites and so forth.

它通过使用激光雷达、雷达探测、相机套件等捕捉所有这些信息来学习。

All of these information captures about 1000 times the content that visually we can see through our eye.

所有这些信息所捕捉到的内容大约是我们通过眼睛所能看到的内容的 1000 倍。

These chips now are so good at processing this that it can process that amount of information in about half the time of our brain, our neocortex, or visual cortex.

现在这些芯片的处理能力非常强，它可以在我们大脑、新皮层或视觉皮层一半的时间内处理如此多的信息。

neocortex：n. 新（大脑）皮质，皮层

cortex：n. 皮层，（尤指）大脑皮层；

1000 times the information processed in half the time of a human brain.

用人类大脑一半的时间处理 1000 倍的信息。

Ultimately, when this technology is mature, that’s why no human will be able to keep up with an AI when it comes to driving.

最终，当这项技术成熟时，这就是为什么在驾驶方面，没有人类能赶上人工智能的原因。

mature：adj. 成熟的，理智的；

keep up with：跟上，与他人保持相同的进度或水平，在比赛、竞争等中与他人保持平衡。

Same analogy in financial services.

在金融服务领域也是如此。

The more data we have, the better advice we can give you, and no human will be able to process the same amount of data as an artificial intelligence.

我们掌握的数据越多，我们就能为你提供更好的建议，而没有一个人能够像人工智能一样处理同样多的数据。

When we look at this being applied in the robo-space for robo-advising, 2017 was a big year, it was the first year that robo-advisors met the performance of human advisors in terms of portfolio returns.

当我们看到这一点被应用于机器人领域的机器人咨询时，2017 年是一个重要的年份，这是机器人顾问在投资组合回报方面达到人类顾问表现的第一年。

portfolio returns：投资组合回报率

The best robo-advisors getting about 11-12% return on the portfolios.

最好的机器人顾问的投资组合回报率约为 11-12%。

So, we’re now starting to see the fact that, in terms of the black box portion of this, that machines are catching up with humans.

因此，我们现在开始发现，在黑箱操作方面，机器正在赶超人类。

portion：n.（某物的）一部分；

On the trading side, it’s even worse.

在交易方面，情况更糟。

Goldman Sachs has said one programmer can replace five traders today, by application of technology.

高盛说，通过技术应用，今天一个程序员可以取代五个交易员。

Goldman Sachs：高盛集团

This was UBS’ trading floor in Stanford back in the early 2000s in Stanford, Connecticut.

这是 2000 年代初瑞银在康涅狄格州斯坦福的交易大厅。

UBS：瑞士联合银行（United Bank of Switzerland）

trading floor：交易大厅

Connecticut：美国康涅狄格州

Today, this is empty, this trading floor, because of the application of artificial intelligence.

如今，由于人工智能的应用，这个交易大厅已经人去楼空。

AI is being introduced into the asset management side, portfolio management, advice and return generation for assets under management.

人工智能正在被引入到资产管理方面，包括投资组合管理、咨询以及为所管理的资产创造回报。

portfolio management：投资组合管理

But aren’t AIs all going to be the same?

但人工智能不都是一样的吗？

Let me use these two race cars as an illustration of this.

让我用这两辆赛车来说明这一点。

These are Audi’s test vehicles for their self-driving rig.

这是奥迪为其自动驾驶设备准备的测试车。

rig： 装备、器械；在这里可以理解为 “车辆装置”“设备组合”

They’re not self-driving cars you drive on the roads, they’re actually racing cars.

它们不是你在路上开的自动驾驶汽车，实际上是赛车。

Now, there’s two of these vehicles, test vehicles A and test vehicles B.

现在，有两辆这样的车，测试车 A 和测试车 B。

The engineering team nicknamed them AJ and Bobby, A and B, right?

工程团队昵称它们为AJ和Bobby，A和B，对吧？

But Bobby drives faster than AJ.

但Bobby比AJ开得更快

Same car, same platform, same hardware, same firmware, same software, same engineers that drive this, and yet one of these cars drives faster than the other consistently.

同样的车、同样的平台、同样的硬件、同样的固件、同样的软件、同样的工程师驾驶，但其中一辆车始终比另一辆车开得快。

firmware：固件。和hardware、software经常一起出现在IT领域。

consistently：adv. 一贯地，始终；一致地

I asked the engineers at Audi when I was doing augmented, “Why is that? Why does one AI drive faster than the other?”

我在做增强现实的时候问过奥迪的工程师："这是为什么？为什么一辆人工智能车开得比另一辆快？”

augmented：增强的，这里指的是增强现实。

And the engineer from Audi said, “Hmm, Yeah, we really don’t know.”

奥迪的工程师说："嗯，是的，我们真的不知道。”

I said, “Can you give a guess?”

我说："你能猜猜吗？”

And he said, “Actually, we think we know. One of the engineers, early in the process, because this is a machine-learning platform, maybe he drove more aggressively that day. Maybe he had an argument with his wife or got caught in traffic.”

他说："实际上，我们认为我们知道，其中一位工程师在早期，因为这是一个机器学习平台，也许他那天开车更猛烈了。也许他和妻子吵架了，或者堵车了。”

aggressively：/əˈɡresɪvli/ adv. 好斗地；侵略地；

got caught in traffic：堵车（以后就知道堵车如何表达了）

But that set a new baseline for one of the artificial intelligences to learn differently from its compatriot.

但这为其中一个人工智能设定了一个新的基线，让它以不同于同类的方式学习。

compatriot：/kəmˈpeɪtriət/ n. 同胞，同国人；同事，伙伴

And this shows us that even in investment, artificial intelligence, one AI will differentiate from another AI in terms of some types of investment platform.

这告诉我们，即使在投资、人工智能领域，在某些类型的投资平台上，一种人工智能也会区别于另一种人工智能。

differentiate：v. 使不同；求……的微分；

For now, the advantage lies with advisory firms incorporating artificial intelligence.

就目前而言，拥有人工智能的咨询公司更具优势。

lie with：取决于，在于

It’s man with machine versus man without machine.

这是有机器的人与没有机器的人的较量。

But it won’t be long before it’ll be machine versus man.

但用不了多久，就会变成机器与人的较量。

We’ve got probably a three-to-five-year window where we can supplement or augment human advisors with AI.

我们大概有三到五年的时间可以用人工智能来补充或增强人类顾问。

supplement：v. 增加，增补

augment：v. 增加，增大；加强，补充

After that, AIs are going to start to separate themselves in terms of capability.

在此之后，人工智能将开始在能力上脱颖而出。

separate themselves：指个体或群体在某种情况下自愿或被迫与其他人或事物分开。

When you look at the problem of customer acquisition and relationship and engagement of customers, what becomes clear is one of our biggest problems in financial services is the way we identify customers.

当你审视客户获取、客户关系和客户参与的问题时，就会发现我们在金融服务领域最大的问题之一就是我们识别客户的方式。

KYC: Kill your customers with paperwork.

KYC：用文书工作扼杀客户。

后续内容请看下篇。

苹果2024 WWDC大会英文全文

2024-09-20T16:59:15+08:00

说明

背景：

本文是苹果WWDC 2024视频的英文全文，本人一个字一个字的抄下来的。WWDC的全称是The Worldwide Developers Conference，视频全长1小时43分钟，全部摘抄下来有15198个单词。整个发布会主要介绍了Apple TV+, VisionOS, iOS, Audio&Home, watchOS, iPadOS, macOS, Apple Intelligence。其中Apple Intelligence是篇幅最长的。

用处：

可以用于学习英文，特别是IT相关的英文。
可以练习英文演讲，文中有很多介绍产品的地道的表达。
可以用于学习苹果的WWDC发布的内容，每一个字都不会错过。

以下则是WWDC 2024英文全文内容：

Introduction

Good morning. Welcome to Apple Park.

We’re glad you could join us for what promises to be an action-packed and memorable WWDC.

WWDC marks a moment in the year when we’re able to celebrate our global developer community.

Developers continue to amaze us with the apps they create for our products, apps that are used by over a billion people around the world.

It’s important for us to provide this community with the newest tools and technologies to do their very best work.

Today, we’re going to have some incredible updates to our platforms.

And I’m excited that we’ll introduce profound new intelligence capabilities that we hope will inspire developers, delight users, and make our platforms even smarter and more useful than ever.

Apple TV+

Before we get into our platforms, let’s talk about Apple TV+, which is celebrating its fifth anniversary this year.

Apple TV+ is the best in entertainment, filled with shows and movies made by the world’s most creative storytellers.

And I’m proud to say that Apple TV+ has been recognized for delivering the highest-rated originals in the industry for three years running.

Apple TV+ features great originals that have received industry-wide recognition such as Oscars, Emmys, and BAFTAs.

This past year alone, Apple TV+ has debuted critically acclaimed movies like “Killers of the Flower Moon” and “Napoleon” and hit shows like “Masters of the Air”, “Palm Royale”, “Hijack”, “Dark Matter”, and “Monarch: Legacy of Monstters”.

And we’re about to launch our most exciting lineup yet, with amazing new originals arriving on Apple TV+ each every week.

Let’s take a look.

This lineup looks incredible.

I hope you’re as excited about these Apple Originals as I am.

And now, let’s turn to our platforms.

We have so much to talk about today.

We’ll start with our OS announcements, and then we’ll dive deeper into intelligence.

VisionOS

Let’s start with our newest operating system, visionOS.

We released Apple Vision Pro in February, and we already have some great updates to share with you today.

Here’s Mike to tell you more.

Apple Vision Pro and visionOS unlock completely new possibilities for entertainment, productivity, collaboration, and so much more.

Vision Pro has inspired developers to create amazing and unique spatial apps that aren’t possible on any other platform.

Apps like NBA, where you can watch multiple live games with stats, “what if”, where you become a superhero in the Marvel universe, and “Unextinct”, where you can explore endangered species.

Games that take advantage of your space, immerse you completely, challenge you in new ways, or let you gather around a table to play with friends, even when you’re not together.

You can master meditation with Po from “Kung Fu Panda”, bring your data to life with SAP, and doctors can even reimagine surgical simulation and planning.

New apps, including some from the world’s biggest names in entertainment, productivity, and gaming are arriving on the App Store every day.

There’s already over 2000 apps created specifically for Apple Vision Pro.

And with over 1.5 million compatible iPhone and iPad apps, there’s always something new to do.

All of these amazing apps and experiences are made possible by visionOS.

It’s been just four months since we launched Vision Pro and visionOS, and today we’re already announcing our first major update.

Introducing visionOS 2.

visionOS 2 propels spatial computing forward with new ways to connect with your most important memories, great enhancements to productivity, and powerful new developer APIs for immersive shared experiences.

To tell you more, here’s Haley.

VisionOS 2 is a great release with some big updates.

Let’s start with Photos.

Spatial computing has reinvented how you view your photos.

There’s nothing like seeing them life-sized with incredible fidelity /fɪˈdeləti/.

Spatial photos are even more powerful, bringing life and realism to your favorite moments with family and friends.

It’s incredibly moving to step back into a treasured memory, and the rich visual depth of spatial photos makes this possible.

Now, visionOS 2 lets you do something truly amazing with the photos already in your library.

With just the tap of a button, advanced machine learning derives both a left and right eye view from your beautiful 2D image, creating a spatial photo with natural depth that looks stunning /ˈstʌnɪŋ/ on Vision Pro.

It’s so magical to reach into the past and bring your most cherished photos into the future.

And now, you can experience all your panoramas and spatial photos and videos together with the people you love using SharePlay in the photos app.

With our new spatial personas, it feels like they are sitting right next to you, even if they’re thousands of miles away.

People are amazed at how easy it is to navigate Vision Pro with just their eyes, hands, and voice.

And with visionOS 2, we’ve made it even easier.

Now you can just hold your hand up and tap to open Home View.

Or flip your hand over to bring up time and battery level.

And tap again to open Control Center, giving you quick access to frequently used features like Notifications and Mac virtual display.

People love Mac virtual display because it lets them bring their Mac wirelessly into Vision Pro just by looking at it, giving them a large, private, and portable 4K display.

Later this year, it gets even better, with higher display resolution and size.

And it can be expanded even further, into an ultra-wide display that wraps around you, equivalent /ɪˈkwɪvələnt/ to two 4K monitors side by side.

Your content stays sharp wherever you look thanks to dynamic foveation performed on the Mac.

Another great thing about Apple Vision Pro is how incredible it is to use on a plane, letting you take a private movie theater wherever you go.

With visionOS 2, we’re adding train support to Travel mode, so you can work privately on your long commute or catch up on your favorite shows on a massive screen.

These updates are going to make the Vision Pro experience even better.

And now, back to Mike.

In addition to these great features, visionOS 2 also makes it even easier to for developers to create sophisticated /səˈfɪstɪkeɪtɪd/ spatial apps.

There are many new frameworks and APIs for developers to explore, like advanced volumetric APIs that allow even the most complex 3D apps to run side by side for the ultimate multitasking experience.

TabletopKit makes it possible for developers to quickly create apps that anchor to flat surfaces, like manufacturing workstations or board and card games, and are great for use with spatial Personas on FaceTime.

And enterprise-specific APIs that will enable powerful use cases like surgical training in healthcare, equipment maintenance in manufacturing , and beyond.

These new APIs and frameworks will unlock exciting opportunities for developers to create truly unique experiences.

We’re also making it easier for people to create new spatial content for Apple Vision Pro, like spatial video.

We’ve made it so easy to capture spatial video anywhere with iPhone 15 pro and iPhone 15 pro max.

It’s one of the best ways to relive meaningful moments in your life.

Spatial video can also be used by pro videographers to tell powerful brand, product, and creative stories.

To make creating and sharing spatial videos with commercial audiences easier.

Canon will offer a brand-new spatial lens for their popular EOS R7 digital camera.

It can record gorgeous spatial video for Apple Vision Pro, even under the most challenging lighting conditions.

Spatial videos can then be edited in Final Cut Pro for Mac and shared and viewed in the new Vimeo app for visionOS.

This new professional workflow will be available this fall.

Last year, we also introduced Apple Immersive Video, a game-changing entertainment format created just for Vision Pro.

Apple Immersive Videos are 180-degree, 8K recordings with Spatial Audio that give you mind-blowing experiences with lifelike fidelity.

It truly feels like you are there.

To enable creators to bring their own stories to life with Apple Immersive Video, we’ve partnered first with Blackmagic Design, a leading innovator in creative video technology, to build a new production workflow consisting of Blackmagic cameras, DaVinci Resolve Studio, and Apple Compressor.

These will all be available to creators later this year.

And there’s new Apple Immersive Video content on the way, including a new extreme sports series with Red Bull, reimagined experiences from the world’s biggest artists like The Weeknd, and our first scripted Apple Immersive short film, “Submerged”, from Oscar-winning director Edward Berger.

These titles and more will be available on the TV app.

So that’s what’s coming to Apple Vision Pro and visionOS.

VisionOS 2 introduces a new way to turn your favorite photos into spatial photos, new intuitive gestures, a big boost to productivity with Mac Virtual Display, powerful new developer APIs, and so much more.

Now, back to Tim.

As you can see, we’re continuing to push visionOS forward as well as providing new content and capabilities for Apple Vision Pro.

I’ve been hearing from people all over the world about their interest in this incredible product.

So I’m happy to announce we’re bringing Apple Vision Pro to these eight countries next, starting with China, Japan, and Singapore on June 28.

And Australia, Canada, France, Germany, and the United Kingdom on July 12.

Now, here’s Craig to tell you all about what’s coming in iOS.

iOS

iOS 18 is a big release that delivers more ways to customize your iPhone, stay connected, and relive special moments.

Fist, let’s talk about a set of features that give you exciting new ways to personalize your iPhone further, starting with your Home Screen.

You can already customize your Home Screen with your favorite wallpaper, apps, and widgets, letting your personality shine through.

And now, your app icons and widgets can add even more.

Let me show you.

I have this photo I love as my wallpaper.

And now I can continue to enjoy it when I unlock my iPhone, because I can arrange my apps and widgets to frame it perfectly.

I can select them all and easily place them along the bottom, right above the Dock for easy access, or even off to the side.

And check this out.

We have an awesome new look for app icons when we go into Dark Mode.

Let’s turn it on.

Isn’t that cool?

Now, in addition to this new dark look, there are even more new ways to adjust how they look.

I can bring up a new customization sheet, and now I can tint them all with color.

iOS suggests a tint color that complements my wallpaper.

Or I can select any other color I want.

Now they really pop.

It’s so easy to create just the right look.

Whether you prefer the classic look, or want to go dark, or style with color, there are so many possibilities to make your home screen truly your own.

We’re also bringing new levels of customization and capability to control center, helping you access many of the things you do every day even faster.

Let’s take a look.

When I swipe from the top-right corner, I can see Control Center, with all my controls organized in one place.

And now control center isn’t limited to just the controls you see here.

I can swipe to multiple new groups of controls, like for Media Playback.

You can see how beautiful this looks.

And here are my Home Controls.

It’s so useful to have everything arranged for me like this.

Oh, that shouldn’t be open.

Let me close the garage.

And what’s really great is, I can get to any one of these groups with a single , continuous swipe.

I can get straight to my home controls, for instance, or right back up to the top.

To add more controls, I can open up the new Controls Gallery, where I have so many options to choose from.

We wanted to make control center more extensible than ever.

So now, developers can include controls from their apps as well.

Like this one from Ford.

Let’s add that in.

I can adjust how my controls are laid out and resize them too.

Now I can cool down the car just like that.

So that’s the new Control Center.

To enable new controls in Control Center, we have a new Controls API for developers.

And that’s not all.

These new controls are also available from the Lock Screen, so you can swap the camera and flashlight for different controls, like taking a not when an idea strikes, or quickly capturing the moment for your Snapchat.

And you can even use the Action button on iPhone 15 pro to invoke these new controls.

Another key part of personalizing iOS is about keeping you in control of your privacy.

And iOS 18 gives you even more ways to control who can see your apps, how you share your contacts, and how you connect to accessories.

Let’s start with apps.

Sometimes we hand our device to someone so they can look at a photo or play a game, but we want peace of mind that they can’t get into sensitive areas of our phone.

So this year, we’re giving you a new way to protect sensitive apps and the information inside them, by letting you lock an app.

When you choose to lock an app, if someone else tries to tap it, they will be required to authenticate using Face ID, Touch ID, or your passcode.

And information from inside the app won’t appear in other places across the system, like in search and notifications, so others won’t inadvertently see sensitive information.

There may also be occasions when you want to hide an app that you don’t want others to know is installed on your device.

For example, say you use a professional grade spatial capture app to track your different hairstyles.

I mean that’s just good science, right?

Well, anyway, say you use this app, but you don’t want anyone else to know.

Well, now you can hide it and put it in a new hidden apps folder that’s locked.

We’re also adding new ways to control how you share information with apps, starting with contacts.

Today, when you given an app access to your contacts, it can learn about all the people you’ve added over time.

In iOS 18, we’re putting you in control by letting you decide which contacts an app can see.

We’re also putting you in control when you pair accessories.

An app may ask for Bluetooth and local network access but also gain visibility to all the other devices on your network, from your computers and TVs to your door locks and blood pressure monitor.

Now, developers can offer you an intuitive new way to pair your accessories that keeps your devices private and makes pairing seamless.

Next up, we have big enhancements to the apps we use to stay connected, staring with Messages.

To tell you more, here’s Ronak.

Messages is central to how we communicate with the most important people in our lives, so in iOS 18, we’’re giving you all-new ways to express yourself and stay connected.

Let’s start with Tapbacks.

Tapbacks are one of the most popular ways to express yourself in Messages.

And people love them.

This is a huge year for Tapbacks.

We’ve not only redesigned your favorites.

We’re now giving you limitless ways to express yourself by letting you Tapback with any emoji or sticker.

Next, we’re bringing one of your most requested features to Messages.

When you don’t want to forget to send that friendly reminder or birthday text in the morning, you can schedule your message to Send Later.

We’re also giving you more ways to express your tone with text formatting.

Bold, italicize, underline, or strike through any text.

And when formatting is not enough, we’re introducing a new way to visually amplify your messages with text effects.

Whether you want to emphasize some major news, bring your emoji to life, or you’re just blown away by a stunning photo, you can express yourself in all-new ways with text effects.

Some words and phrases automatically surface a suggestion, so you can quickly select one and send it.

And you can also add one of the many new effects to any text.

Last, there’s a new way to stay connected whenever you don’t have Wi-Fi or cellular service.

We’re using the same groundbreaking technology that gave us Emergency SOS via satellite to bring you Messages via satellite.

Now you can use the satellite capabilities on iPhone 14 and later to connect to satellites hundreds of miles above the Earth to text your friends and family when you’re off the grid all right from the Messages app.

Once you’ve connected, you’ll be able to use key iMessage features like sending and receiving messages, emoji, and Tapbacks.

Because iMessage was built to protect your privacy, iMessages sent over satellite are end-to-end encrypted.

And if you need to text people not on iMessage, we’re supporting SMS messaging via satellite too.

Now, let’s talk about another app we use to communicate, Mail.

This year, we’re giving you a new way to stay in control and manage incoming email with on-device categorization that organizes your messages and helps you stay up to date across all of your accounts.

The Primary category enables you to focus on what matters most - emails from people you know and time-sensitive messages.

The rest of your email will be organized into new categories like Transactions, for receipts and order confirmations, Updates, for newsletters and social media notices, and Promotions, for marketing and sales messages.

And these categories do more than just sort your email.

We’ve also created an elegant new digest view that pulls together all the relevant emails you’ve received from a business to make interacting with these messages even easier.

For instance, it can bring together all of your flight information from United, so you can get to it in one place.

You can quickly scan snippets of each message to see what’s new and explore what you’re interested in.

If you want a sender to appear in another category, you can recategorize them with just a few taps.

Archiving or deleting all of the messages from a business is just as easy.

And of course, you can always see all of your emails in one place.

Categorization will be available later this year.

And now, back to Craig.

iOS 18 also includes some great updates to apps and features you use every day.

Let’s walk through a few of them, starting with Maps.

Maps delivers new topographic maps with detailed trail networks and hiking routes, including all 63 U.S national parks, that can be saved to your phone and accessed offline with turn-by-turn voice guidance and the ability to create your own hikes.

Next, Wallet.

Continuing on our journey to replace your physical wallet, we’re introducing Tap to Cash, a quick and private way to exchange Apple Cash without sharing phone numbers or email addresses.

With Tap to Cash, you can pay someone back for dinner just by holding your phones together.

We’re adding two new ways to pay with Apple Pay Online, giving customers around the world the ability to redeem rewards and access installments from their banks and card providers.

And event tickets are getting a beautiful new design and new features, including an all-new event guide combining helpful information about the venue, with smart recommendations from your favorite Apple apps.

We also have updates to Journal that let you log your state of mind and help you keep track of your goals with an insights view that shows your writing streaks, a calendar, and other fun stats.

And you can now use Search to quickly find the past entries you’re looking for.

We’re also excited to announce an update with great improvements for gamers.

Game mode is coming to iPhone, enabling a more immersive experience with game like “Zenless Zone Zero”.

Just like on Mac, Game mode minimizes background activity to sustain the highest frame rates, especially during long play sessions, and it dramatically improve responsiveness with AirPods and wireless game controllers.

Finally, we have some big news for an app where we relive our most precious memories and adventures, Photos.

Our photo libraries contain all of the big and small moments in our lives.

But as we capture so much, and our libraries grow bigger by the day, how can we keep it all organized so we can appreciate all of those moments and easily get to the good stuff?

iOS 18 brings the biggest redesign ever to the Photos app.

To tell us more about the all-new design, here’s Chelsea.

The new Photos app keeps your library organized and makes it super easy to find photos fast, so you can speed less time searching and more time enjoying your memories.

Let me show you.

This new design is gorgeous, feels familiar, and it puts everything you want right at your fingertips.

The app has been unified into a single view, with the photo grid at the top, and your library organized by theme below.

The photo grid is a great place to view your entire library.

When you want to quickly jump back to specific dates, you can use Months and Years views at the bottom.

I have a lot in my library, so it’s great that this filter button lets me quickly narrow it down to specific types of content.

And now I can even filter out screenshots, to enjoy my photos clutter-free.

We know that it can be tough to keep our ever-growing libraries organized, so we’ve built on the amazing intelligence in the Photos app and created a space below the grid that makes it easy to access the photos you care about most.

We call these Collections.

With Collections, you can browse by topics like time, people, my favorite memories, new one like Trips, and more.

Let’s go back and check out Recent Days.

Recent Days organizes photos by each day with clutter, like receipts, filtered out.

Here are my photos from earlier today.

You’ll see an autoplaying view of all the photos at the top.

I can swipe between days like this to see my hike yesterday.

I can view the photos as a beautiful collage, and I can share the whole Collection with just a tap right here.

When I want to find a specific person in my library, I head to People & Pets.

And it now includes my favorite groups of people for the first time.

Here’s me with my husband Don and with my best friends.

The new Trips section gathers all your memorable adventures in one place.

I love that they autoplay so I can remember my trips while I browse.

I can quickly jump back in time and revisit a trip.

Like this one to Patagonia in 2021.

Since everyone’s photo library is unique, Photos is now customizable, so you can elevate the topics that are most important to you.

You can reorder Collections to put them in the order you like.

I’ve put Pined Collections right here.

It’s where I can keep things I access frequently like Favorites, photos I’ve recently saved, the places I’ve been, and even an album of my favorite climbs.

And we have one more new space to make the Photos app your very own and enjoy your best moments.

If you swipe right from the grid, you’ll find the new Carousel, which highlights your best content in a beautiful, poster-like view.

Photos you’ve marked as Favorites are here, and so are featured photos surfaced by the app.

And you can customize this too.

Here, I’ve added a favorite trip to Crater Lake.

Each day, the Carousel surprises you with a new set of photos to enjoy for each of these.

And that’s a quick peek at the new Photos app.

Now, back to you, Craig.

So that’s iOS 18, a big release that brings deeper customization to iPhone, new ways to stay connected in messages and mail, enhancements to privacy, and the biggest photos redesign ever, marking it even easier to relive those special moments.

And so much more, including an option for larger icons on the Home Screen, RCS messaging support, and reminders integration in calendar.

Next, I’ll hand it over to Ron to tell us the latest in Audio and Home.

Audio&Home

Whether you’re on the go, or at home, we have some great new features that bring more convenience to the things you do every day and elevate the entertainment experience for everything you watch.

So let’s start off with AirPods, which are the most loved headphones in the world with an incredible audio experience.

This year, we’re making it even easier to interact with Siri for a seamless hands-free experience.

For those instances when you may not want to speak out loud in response to Siri, like on the bus to work or in those places that are a little too crowded, we’ve created the ability to simply nod your head “yes” or gently shake your head “no” to interact.

AirPods are also perfect for staying in touch with friends and colleagues, by taking calls anywhere, even in windy conditions or places with loud background noise.

So to ensure your voice will sound crystal clear, no matter your environment, we’re bringing Voice Isolation to AirPods Pro.

Powered by advanced computational audio, Voice Isolation removes the background noise around you, to deliver the best call quality.

Call from David. Answer it?
Oh, hey. Was just about to call you. The meeting went so well.
Also, sorry, it’s really noisy. Can you hear me okay?
That’s amazing news, and yeah. I can hear you totally fine.

AirPods are also great while playing games, thanks to their exceptional audio quality.

To level up this experience, we’re expanding Personalized Spatial Audio to include gaming, so that you’ll be in the middle of the action like never before.

We’ve built a new API so game developers can easily deliver the most immersive listening experience.

And we’re excited to announce that “Need for Speed Mobile” by Tencent Games and EA will be one of the first titles with Personalized Spatial Audio coming this fall.

Now let’s turn to Home and tvOS.

Home & tvOS

This year, we’re introducing some updates that make watching TV even more enjoyable.

First, let’s talk about those moments when we’ve all wondered, “Where have I seen this actor before?” or, “hey, what’s that sone?”

For these times, we have a new feature we’re bringing to Apple TV+.

It’s called InSight.

When you’re watching an Apple Original show or movie, just swipe down on the remote and InSight will show the actors and their character names in real time.

And if you’re curious about the song playing, you can quickly see the track and add it to an Apple Music playlist to enjoy later.

InSight will also be available when using iPhone as your remote, perfect for when you’re watching with others.

Next, let’s turn to the audio experience on tvOS.

We’re bringing enhance Dialogue to more living rooms, with support for TV speakers and receivers, along with AirPods and other Bluetooth devices.

And Enhance Dialogue now uses machine learning for even greater vocal clarity, ensuring that the actors’ dialogue will always cut through.

We’re also making subtitles more convenient.

With many of us turning to subtitles more often, they’ll now appear at just the right times, like when you mute the volume or when you skip back.

Now let’s talk visuals.

Apple TV has always delivered a theater-like experience to the home.

And this year, we’re adding to the experience with support for 21 by 9 projectors.

With 21 by 9, you’ll be able to view widescreen movies exactly as the directors intended.

And in between movies, you can enjoy amazing and visually interesting screen savers on Apple TV.

We’re making it even easier to choose what plays, including a bran-new Portraits category with stunning color effects and image segmentation, framing your photos like art in a gallery.

Or switch to TV and Movies and enjoy moments from Apple TV+ shows you love like this one from “Foundation”.

We’re also adding one more really cool screen saver as Snoopy and Woodstock take over the screen.

Whenever your Apple TV becomes idle, Snoopy springs to life with delightful animations.

We’re thrilled to bring everyone’s favorite beagle to your living room.

So that’s Audio and Home, bringing you more convenient ways to interact with AirPods, new entertainment experiences with AppleTV, and there’s more.

Like a redesigned Apple Fitness+ experience that’s perfect for the big screen.

Next, here’s David to tell you about watchOS.

watchOS

My Apple Watch always motivates me to stay active.

And this year will be no different.

watchOS 11 introduces more great features to not only keep you active but also healthy and connected.

To help you stay active, let’s first take a look at an exciting new feature that can transform the way you work out, whether you are training for something like your first 5K or your fastest marathon.

In watchOS 11, we’re introducing Training Load, an insightful way to measure how the intensity and duration of your workouts and impacting your body over time.

To track intensity, we designed a new way to rate your workouts.

Using calorimetry data, like heart rate, pace, and elevation, plus your personal data, like age and weight, a powerful new algorithm automatically translates our sensor data into an estimate of your Effort rating.

After your workout, you can review the rating on the Summary page, ranging from 1, easy, to 10, all out.

And you can even adjust your Effort rating up or down to get it just right.

Your Effort rating and workout duration are then used to calculate your Training Load.

You’ll be able to see if you’re holding steady, above your average and can safely progress and improve, or when you’re well above your average and should pay close attention to better avoid exhaustion or injury.

We think Training Load will help enthusiasts and elite athletes get to the next level with data, insights, and motivation they need to make the best decisions about their training.

And we’re now made it even easier for everyone to gain more insights from the Fitness app on iPhone by giving you the ability to customize the Summary Tab to show the information you want to see, including new metrics like weekly running distance.

The personalization even extends to your Activity rings where you can now adjust your goals by the day of the week.

Or if you have an injury that’s making it harder to close your rings, or maybe you just need a day off, you can pause them for a rest day, week, or more and keep your award streak going.

Those are some of the new ways watchOS 11 will help keep you active.

And now here’s Sumbul to tell you about a new app that will give you a better picture of your health.

Understanding how your body responds and recovers from exercise and other aspects of your life is an important part of your overall health.

Because Apple Watch can track key vitals while you sleep, like heart rate, respiratory rate, and wrist temperature, it can give you a deeper understanding of your body and help you identify when something might be off.

So with watchOS 11, these metrics are the foundation of the insightful new Vitals app where you can check in on your daily health status, and explore your most important health metrics with just a glance.

You can also see how your metrics relate to your typical range, which is determined from your own historical information and an algorithm developed using real-world data from the Apple Heart and Movement Study.

For additional insights, your metrics will be highlighted when they are outside of your typical range with details on what’s changed over the last week.

And when multiple metrics are out of range, you will be notified with a tailored message to help you understand how these changes may be linked to other aspects of your life, such as alcohol, elevation changes, or even illness.

And that’s the Vitals app, a new way to quickly view your most important health metrics, receive alerts when it’s time to pay more attention to your body, and gain better context when it comes to your health.

Now let’s talk about another time when context about your health matters, which is during pregnancy.

Cycle Tracking can now show you gestational age to support you during this important time.

The Health app will display your pregnancy across all charts and prompt you to review things like your high heart rate notification threshold, since heart rate often increases during pregnancy.

Those are some of the advances in Health.

And now back to David to tell you what’s coming to keep you connected.

With Apple Watch, you can have quick and meaningful interactions right on your wrist, making it so easy to stay connected to the world around you and the people you care about without always needing to take your iPhone out of your pocket.

Whether it’s using Apple Pay to buy your morning coffee or hop on the subway, telling Siri to add an item to your grocery list, or replying to a message from a friend, you can do it all with just the raise of a wrist.

And last year, we introduced the Smart Stack.

It’s another way to keep you connected to important information with just a scroll of the digital crown.

This year, it becomes even more intelligent by automatically adding new widgets right when you need them, like the precipitation widget to alert you before it rains, or the Translate widget for when you’re traveling somewhere new.

Just tap to open the new Translate app on Apple Watch, which uses machine learning models for speech recognition and translation.

You can now simply dictate to see and hear it right on your wrist.

The Smart Stack also becomes more capable with Live Activities coming to Apple Watch, so you’ll have all the details for your favorite events.

And you can use features like Check In, which lets a friend know you made it back home safely and is now on Apple Watch with additional support for workouts.

During a late-night run, your friend will know to keep an eye out, and will be updated when you end your workout, so you both have peace of mind.

Developers can also show Live Activities in the Smart Stack, so you can see updates in the moment like when your ride is coming for apps like Uber.

And with the new Double Tap API, they can also define actions within apps, like Sprout Baby Tracker, to log your baby’s time asleep without waking them.

Having access to all of these powerful capabilities right on your wrist makes Apple Watch so indispensable, and being able to customize your watch face is one of the ways that makes it incredibly personal to you.

With the popular Photos face, there is something special about seeing an important person or moment every time you raise your wrist.

Now, watchOS 11 will help you find the perfect photos for your watch face.

Machine learning intelligently identifies, scores, and curates the best photos based on facial expressions, aesthetics, and composition.

Then, a custom algorithm elegantly frames the image with the time.

You can select a bold color, choose monotone for a sleek look, or create something that is unique and personal to you.

That’s what’s coming in watchOS 11: a redesigned Photos face, a more intelligent Smart Stack, Training Load, the Vitals app, new APIs for developers, and so much more, like turn-by-turn directions for walking and hiking routes you’ve created.

We’re so excited about all the new ways to help you stay connected, active, and healthy.

Back to you, Craig.

iPad OS

Next, let’s talk about iPadOS, which powers our strongest lineup ever, including the incredibly thin and powerful iPad Pro and the redesigned iPad Air, now available in two sizes.

Together with the latest versions of Final Cut Pro and Logic Pro and game-changing accessories like Apple Pencil Pro and Magic Keyboard, it creates an experience that’s in a category of its own.

Our next release, iPadOS 18, starts with features you saw in iOS, like new ways to personalize your Home Screen, customize Control Center, and relive special moments in the Photos app.

iPadOS 18 also brings exciting new ways to get things done, reimagined with Apple Pencil, and a big update to apps designed for the distinct capabilities of iPad.

Apps are fundamental to the iPad experience.

In iPadOS 18, we’re making them even better, starting with a new floating tab bar, which makes it easier to navigate to different parts of an app and keeps your content edge to edge in apps like Apple TV.

When you want to explore more, the tab bar morphs into the sidebar.

If you use a specific tab often, you can customize the tab bar to keep your favorites within easy reach.

This redesigned experience works in apps across the system.

We’ve also made it easier to browse your documents in apps like Pages, keynote, Numbers, and Swift playgrounds, giving each app a distinct new look.

And throughout your experience, you’ll discover refined animations.

You’ll notice them as you open files or preview them with Quick Look, and they smoothly zoom into view, or when the tab bar elegantly morphs into the sidebar and back.

Across apps, animations will feel even more responsive.

And for developers, all these new elements are available as APIs to adopt in your apps too.

Now let’s take a look at updates to SharePlay and Freeform.

One of SharePlay’s best features is screen sharing.

It’s a great way to help friends and family from afar, and we’re making it better in two ways.

Now you can tap and draw on your screen to point out what they should do on theirs.

And if you need to assist more directly, you can ask for permission to remotely control their iPad or iPhone.

Hope that helps.

And Freeform adds Scenes, an all-new way to select sections of a board to present them one by one.

Next, I want to talk about a feat that some may have concluded must be a mathematical impossibility.

That’s right, we’re bringing Calculator to iPad.

By leveraging what makes iPad so unique, it makes solving math easier than ever.

It starts with the Calculator that you know from iPhone, updated to take advantage of the larger iPad display, along with some new tricks like history and unit conversions.

But the real magic of Calculator on iPad is unlocked when you use it with your Apple Pencil, an iPad superpower.

Apple pencil has changed the way you can take notes, draw, and even design with iPad.

And now, it’s changing the way you do math with a feature we call Math Notes.

Let’s see it in action with Jenny.

I’m so excited to show you the new Math Notes experience.

I get to it by just tapping the new calculator button right here.

And with my Apple Pencil, I’ll just start writing out expressions like I would on a piece of paper.

As soon as I write an equals sign, Calculator immediately solves it for me.

And even shows me the result in handwriting like my own.

When I make a change, the results update live.

And I can go beyond basic math with all of the same functions from the scientific calculator.

I can save my Math Notes and come back to them later if I’m working on different things.

Like here, where I’m working on a budget for my team’s upcoming table tennis tournament.

Since I’m in Math Notes, I can sum these costs quickly by just drawing a line underneath them.

It’s so natural.

Math Notes are also really powerful when it comes to more complex math.

Here, I have a physics problem my teammate and I are working on.

We’re calculating the maximum height of a table tennis ball when I hit it with different speeds and angles.

Math Notes supports variables, so I’ve declared a few here, and there’s an expression below, which uses these variables to help me calculate the height.

What’s powerful about v variables is that if I change one, like the velocity of my shot, it will change the related results too.

And if I want to see how this speed impacts the height visually, I can.

I’ll just put “y equals” in front of this equation.

And now when I tap the equals sign, I have an option to create a graph.

And if I’m curious how the height will be impacted by the angle of my shot, I can hover my Pencil over the angle and adjust it to see how it affects my graph in real time.

It’s an easy way to explore equations in math.

And that’s just a quick look at Math Notes in Calculator.

Back to you, Craig.

Math Notes are perfect for working through a problem set, or just tackling the math we run into day to day.

And this all works in Notes too.

When you need to crunch numbers, Notes has all of the new math capabilities from Calculator.

Just as we’ve reimagined math on iPad, we’ve also reimagined handwriting in Notes with a new feature called Smart Script.

Notes already has great handwriting features, like the ability to select and copy your writing, or even make it straighter.

With Smart Script, we’re making handwriting your notes smoother than ever.

It starts with improving the appearance of your writing, as you write.

We use a powerful on-device machine learning model to re-create your handwriting style from your notes, which unlocks new capabilities.

Just scribble your thoughts as fast as you have them and Smart Script refines your handwriting as you go.

It’s still your own writing, but it looks smoother, straighter, and more legible.

Smart Script further accelerates your writing flow by making handwriting just as flexible as typed text.

Now you can just paste typed text into a handwritten note, and it will appear in your own style.

Spell check works just as you would expect and fixes mistakes inline.

When you decide you need to add to something you’ve already written, just tap and hold with your Apple Pencil and your text will flow out of the way to create more space.

If you want to erase something you can just scratch it out.

Smart Script makes your handwritten notes more effective, fluid, and easier to read.

And with other enhancements to typed notes, including collapsible sections, it’s never been a better time to be a notetaker.

And that’s iPadOS 18, taking the distinct experience of iPad further with a big update to apps that makes navigating easier and more responsive, and new ways to work that have been reimagined with Apple Pencil.

Next, let’s talk about macOS.

macOS

The all-star combination of the power of Apple silicon and the legendary ease of use of macOS have made the Mac more capable than ever.

And we’re so excited to take macOS to new heights and embark on the next chapter of our journey of productivity and creativity.

But what should we call it?

Well, that brings us once again to the annual escapades of our legendary crack marketing team.

Distracted briefly from their marathon hacky sack session, they stumbled into their minibus and wove a trail toward the Sierras, eventually rolling to a stop in a beautiful national park.

Staring skyward up the towering trunks surrounding them, they felt a deep kinship with anything that could get that high.

They knew they’d found their spot.

Welcome to macOS Sequoia.

The incredible features we talked about in iOS18 and iPadOS 18 are going to be amazing for the ways you use Mac.

You can be even more expressive in Messages, Math Notes provide a helpful typed experience, and you can easily plan a hike in Maps.

These new features are terrific on the Mac, and macOS Sequoia introduces even more features to help you effortlessly get things done.

Let’s start with Continuity.

Continuity helps you do so much more when you use Apple products together.

It powers some of your favorite features, like Universal Clipboard, Universal Control, and Mac Virtual Display on Apple Vision Pro.

And macOS Sequoia makes Continuity even more magical.

For all those times when we want to use our iPhone, only to realize it’s tucked away in a bag over in another room, there’s a brand-new Continuity feature called iPhone Mirroring.

With iPhone Mirroring on Mac, I can see what’s on my iPhone, and can control it too, all while barely lifting a finger.

Let me show you how ti works.

To access my phone, I just click here in my Dock.

Boom! And there’s my iPhone, mirrored in a window right on my Mac.

I can fully interact with it, all wirelessly.

I can see my custom wallpaper.

My icons are right where they belong.

And I can use my phone normally, like swiping through pages of my Home Screen.

And I can open any of my iPhone apps, like the Philz Coffee app, for a bit of extra energy from my favorite local coffee shop.

I can use my Mac trackpad to interact with the app.

And I can use my Mac keyboard too, like to add special instructions.

Let’s make this ice-cold.

To make this even more magical, we’re bringing iPhone notifications to Mac.

They appear alongside my Mac notifications and I can even interact with them when I don’t have my iPhone handy.

Here’s one from Duolingo.

What’s neat is, when I click on it, bam!

I’m taken right into the Duolingo app on my iPhone, so I can practice my Spanish and extend my streak.

As you can hear, my iPhone’s audio even comes through my Mac.

So you might be wondering what’s on my iPhone screen while I’m suing iPhone Mirroring.

It stays locked, so nobody else can access it.

And it works seamlessly with StandBy.

StandBy stays visible, so I can get information at a glance as I use my phone with iPhone Mirroring.

And iPhone Mirroring makes it effortless to combine the power of my Mac and the convenience of its big screen, with the things I get done on my iPhone.

I’m using a template in the Unfold app to make a post, and I’ve got one last video to add.

On Mac, I’ve been using Final Cut Pro to stitch some clips together.

Watch how easy it is to use my devices together.

I can grab the exported video and just drop it right into the template.

Perfect!

So that’s iPhone Mirroring.

And macOS Sequoia has fantastic updates to how you arrange your windows, share while video conferencing, and organize your passwords.

Now, when you drag a window to the edge of the screen, macOS automatically suggests a tiled position on your desktop.

You can release your window right into place.

Quickly place tiles side by side, or place them into corners to keep even more apps in your view.

And new keyboard and menu shortcuts help you arrange your tiles even faster.

Now, let’s talk about video conferencing.

When you’re on a video call, say goodbye to oversharing with the new presenter preview.

It lets you see what you’re about to share before you share it, and works with apps like FaceTime and Zoom.

And when you want to express yourself or just hide the laundry behind you.

You can now replace your background with some beautiful built-in backgrounds, or your own photos.

Background replacements use Apple’s industry-leading segmentation, so you look your best while on a call.

Now let’s talk about how we’re building on the foundation of Keychain to help you manage your passwords.

For over 25 years, we’ve been adding features to make logging in to your accounts easier.

And now, we’re introducing the Passwords app.

Passwords makes it easy to access your credentials and have them securely stored, all in one place.

Everything is organized for you, from your passwords to verification codes to security alerts.

You can find the app on Mac, iPad, iPhone, Vision Pro, and on Windows, with the iCloud for windows app.

All the passwords securely sync across your devices, and if you use AutoFill, your passwords will automatically populate in the Passwords app.

Now, here’s Beth to tell you about Safari.

Safari offers an experience like no other browser on Mac.

In macOS Sequoia, Safari is the world’s fastest browser, enabling you to fly through the web with lightning speed.

And it offers up to four hours more battery life than Chrome when streaming video.

Safari is also a trailblazer in privacy, with industry-leading Intelligent Tracking Prevention and private browsing that’s actually private.

It not only protects your history, it prevents websites from seeing what you do while you browse.

And it’s built on WebKit, which supports the latest exciting web technologies and standards.

If you missed anything we’ve added to Safari in the last few years, it’s time to check it out.

Safari has everything you need to feel at home, like profiles, translation, and more.

And in this release, we’re making it even better, with easier ways to discover content and streamline your browsing.

When you’re on a site, Safari can now help you discover more about the page with Highlights.

Safari uses machine learning to automatically detect relevant information and highlight it for you as you browse.

Highlights share helpful information, like directions, summaries, and quick links to learn more about people, music, movies, and TV shows.

So if you’re planning a trip, you can effortlessly discover a hotel’s location and phone number right there.

You can listen to an artist’s music or check out a new show with just a click.

And even get a summary, so you can get the gist before reading on.

Summaries are also integrated into a redesigned Reader.

Reader instantly removes distractions from articles, and now it can provide a table of contents and includes a helpful summary, right next to the article.

We’re bringing a distraction-free experience to video on the web as well with Viewer.

When Safari detects a video on the page, Viewer helps you put it front and center, while still giving you full access to system playback controls, like AirPlay and Picture-in-Picture, and video automatically moves into Picture-in-Picture if you click away.

That’s a quick look at what’s new in Safari.

Back to you, Craig.

Let’s talk about gaming.

We’re so excited to see more and more game developers embracing the Mac with great games like these, including the most recent game of the year, “Baldur’s Gate 3”, all leveraging Metal 3 to deliver smooth frame rates, provide high-quality visuals, and take full advantage of Apple silicon.

Every Mac in the lineup can play today’s most cutting-edge games, like “Death stranding: Director’s cut.”

And so can iPhone15 pro and any iPad with an M-series chip.

And for developers, this creates a unified gaming platform across iPhone, iPad, and Mac, spanning well over a hundred million devices and growing rapidly.

These devices are capable of playing an entirely new class of games.

And with iOS 18, iPadOS 18, and macOS Sequoia, we continue to deliver features for an even more immersive gaming experience.

And since the introduction of Game Porting Toolkit, developers have been able to bring their games to Apple devices faster than ever, and gaming enthusiasts can experience more games on the Mac.

And this year, Game Porting Toolkit 2 takes this to the next level, enabling developers to bring even more advanced games to Mac, with improved Windows compatibility and shader debugging tools.

And it’s much easier to bring Mac games to iPad and iPhone with Xcode support that lets developers unify their game code and shaders across devices.

And for players, there’s a lot to look forward to.

And that’s more games.

Like “Frostpunk 2,” coming to Mac next month.

“Control,” providing a mind-bending story that just looks incredible with ray tracing.

And there’s some exciting news from Ubisoft, the developers that released “Assassin’s Creed: Mirage” on iPhone and iPad just a few days ago.

To tell you more about what’s to come from Ubisoft, here’s Marc-Alexis.

At Ubisoft, our mission is to enrich players’ lives by creating original and memorable gaming experiences.

We see a huge opportunity to share our passion for games to more players in the Apple ecosystem, thanks to the unified gaming platform with tight integration of Metal and Apple silicon.

Just last month, we announced that “Prince of Persia: The lost crown” is coming to Mac, and we unveiled that the next big chapter of “Assassin’s Creed” is also coming to Mac on November 15 alongside PCs and consoles.

We’re so excited about this game and can’t wait for you to experience it on Mac.

This is “Assassin’s Creed: Shadows.”

We’re venturing into feudal Japan, which you can experience from the perspectives of Naoe, a Shinobi assassin, and Yasuke, a legendary samurai of African origin.

Intricately detailed scenes like this are possible thanks to our next-generation Anvil engine supporting the latest advancements in Metal, enabling us to leverage the full power of Apple silicon with a gaming experience that delivers blistering frame rates and high resolutions.

Our next-generation Anvil engine scales performance and quality across the Mac lineup and delivers stunning vistas embellished with ray tracing.

And speaking of Apple Silicon, we’re thrilled to announce that in addition to Mac, “Assassin’s Creed: Shadows” will also be coming to iPad.

With Ubisoft’s Anvil engine now supporting the Apple ecosystem, we couldn’t be more excited about bringing our biggest titles to Apple devices.

Download and play “Assassin’s Creed: Mirage” today.

And “Assassin’s Creed: Shadows” will be available later this year.

Thank you.

Thanks, Marc-Alexis.

We’re so excited about these amazing games coming to Apple devices.

And this year, even more games are on the way, creating a stellar lineup of titles to look forward to.

So that’s gaming, which wraps up macOS Sequoia.

It’s a big release that up-levels your productivity and creativity.

You can quickly tile windows for your ideal workspace.

A massive update to Safari helps you browse the web distraction-free.

An amazing host of new gaming titles are coming to the Mac.

And iPhone Mirroring lets you wirelessly use your iPhone, right from your Mac.

macOS joins the announcements across your platforms.

And this is a huge year for developers, with brilliant new features and APIs coming so they can supercharge their apps and experiences.

Developer betas will be available today.

Public betas will be available next month.

And all of our OS releases will be available to users this fall.

Back to Tim.

Apple Intelligence

At Apple, it’s always been our goal to design powerful personal products that enrich people’s lives by enabling them to do the things that matter most, as simply and easily as possible.

We’ve been using artificial intelligence and machine learning for years to help us further that goal.

Recent developments in generative intelligence and large language models offer powerful capabilities that provide the opportunity to take the experience of using Apple products to new heights.

So as we look to build in these incredible new capabilities, we want to ensure that the outcome reflects the principles at the core of our products.

It has to be powerful enough to help with the things that matter most to you.

It has to be intuitive and easy to use.

It has to be deeply integrated into your product experiences.

Most importantly, it has to understand you and be grounded in your personal context, like your routine, your relationships, your communications, and more.

And, of course, it has to be built with privacy from the ground up.

Together, all of this goes beyond artificial intelligence.

It’s personal intelligence, and it’s the next big step for Apple.

Introducing Apple Intelligence, the new personal intelligence system that makes your most personal products even more useful and delightful.

To tell you all about it, here’s Craig.

This is a moment we’ve been working towards for a long time.

We are tremendously excited about the power of generative models.

And there are already some really impressive chat tools out there that perform a vast array of tasks using world knowledge.

But these tools know very little about you or your needs.

With iOS 18, iPadOS 18, and macOS Sequoia, we are embarking on a new journey to bring you intelligence that understands you.

Apple intelligence is the personal intelligence system that puts powerful generative models right at the core of your iPhone, iPad, and Mac.

It draws on your personal context to give you intelligence that’s most helpful and relevant for you.

It protects your privacy at every step.

And it is deeply integrated into our platforms and throughout the apps you rely on to communicate, work, and express yourself.

Let’s take a closer look at Apple Intelligence starting with its incredible capabilities.

Then, we’ll tell you about its unique architecture.

And after that, we’ll show you how it elevates so many of your everyday experiences.

Let’s begin with capabilities.

Apple Intelligence will enable your iPhone, iPad, and Mac to understand and create language, as well as images, and take action for you to simplify interactions across your apps.

And what’s truly unique is its understanding of your personal context.

Language and text are fundamental to how we communicate and work.

And the large language models built into Apple Intelligence deliver deep natural language understanding, making so many of your day-to-day tasks faster and easier.

For example, your iPhone can prioritize your notifications to minimize unnecessary distractions, while ensuring you don’t miss something important.

Apple Intelligence also powers brand-new Writing Tools that you can access systemwide to feel more confident in your writing.

Writing Tools can rewrite, proofread, and summarize text for you, whether you are working on an article or blog post, condensing ideas to share with your classmates, or looking over a review before you post it online.

And they are available automatically across Mail, Notes, Safari, Pages, Keynote, and even your third-party apps.

In addition to language, Apple Intelligence offers a host of capabilities for images.

From photos, to emojis, and GIFs, it’s so much fun to express ourselves visually.

And now you can create totally original images to make everyday conversations even more enjoyable.

And because Apple Intelligence understands the people in your photo library, you can personalize these images for your conversations.

So when you wish a friend a happy birthday, you can create an image of them surrounded by cake, balloons, and flowers to make it extra festive.

And the next time you tell Mon that she’s your hero, you can send an image of her in a superhero cape to really land your point.

You can create images in three unique styles: Sketch, Illustration, and Animation.

In addition to Messages, this experience is built into apps throughout the system, like Notes, Freeform, Keynote, and Pages.

Another way Apple Intelligence is deeply impactful is its ability to take action across your apps.

The greatest source of tools for taking actions is already in your pocket with the apps you use every day.

And we have designed Apple Intelligence so it can tap into these tools and carry out tasks on your behalf.

So you can say things like, “Pull up the files that Joz shared with me last week,” or, “Show me all the photos of Mom, Olivia, and me,” or, “Play the podcast that my wife sent the other day.”

We are designing Apple Intelligence to be able to orchestrate these and hundreds of other actions for you, so you can accomplish more while saving time.

There’s one more critical building block for personal intelligence, and that’s an understanding of your personal context.

Apple Intelligence is grounded in your personal information and context with the ability to retrieve and analyze the most relevant data from across your apps, as well as to reference the content on your screen, like an email or calendar event you are looking at.

This can be incredibly useful in so many moments throughout the day.

Suppose one of my meetings is being re-scheduled for late in the afternoon, and I’m wondering if it’s going to prevent me from getting to my daughter’s play performance on time.

Apple Intelligence can process the relevant personal data to assist me.

It can understand who my daughter is, the play details she sent several days ago, the time and location for my meeting, and predicted traffic between my office and the theater.

Understanding this kind of personal context is essential for delivering truly helpful intelligence.

But it has to be done right.

You should not have to hand over all the details of your life to be warehoused and analyzed in someone’s AI cloud.

With Apple Intelligence, powerful intelligence goes hand in hand with powerful privacy.

Let me tell you more about its architecture, and how it is built with privacy at the core.

The cornerstone of the personal intelligence system is on-device processing.

We have integrated it deep into your iPhone, iPad, and Mac and throughout your apps, so it’s aware of your personal data, without collecting your personal data.

This is only possible through our unique integration of hardware and software, and our years-long investment in building advanced silicon for on-device intelligence.

Deeply-integrated generative models require immense processing power.

And with our most advanced Apple silicon, the A17 pro and M-family of chips, we have the computational foundation to power Apple Intelligence.

This personal intelligence system is comprised of highly-capable large language and diffusion models that are specialized for your everyday tasks.

And can adapt on the fly to your current activity.

It also includes an on-device semantic index that can organize and surface information from across your apps.

When you make a request, Apple Intelligence uses its semantic index to identify the relevant personal data, and feeds it to the generative models so they have the personal context to best assist you.

Many of these models run entirely on-device.

There are times, though, when you need modes that are larger than what fits in your pocket today.

Servers can help with this.

But traditionally, servers can also store your data without you realizing it, and use it in ways you did not intend.

And since server software is only accessible to its owners, even if a company says it’s not misusing your data, you are unable to verify their claim, or if it changes over time.

In contrast, when you use an Apple device like your iPhone, you are in control of your data, where it is stored, and who can access it.

And because the software image for your iPhone is accessible to independent experts, they can continuously verify its privacy.

We want to extend the privacy and security of your iPhone into the cloud to unlock even more intelligence for you.

So we have created Private Cloud Compute.

Private Cloud Compute allows Apple Intelligence to flex and scale its computational capacity, and draw on even larger, server-based models for more complex requests, while protecting your privacy.

These models run on servers we have especially created using Apple silicon.

These Apple silicon servers offer the privacy and security of your iPhone from the silicon on up, draw on the security properties of the Swift programming language, and run software with transparency built in.

When you make a request, Apple Intelligence analyzes whether it can be processed on-device.

If it needs greater computational capacity, it can draw on Private Cloud Compute, and send only the data that’s relevant to your task to be processed on Apple silicon servers.

Your data is never stored or made accessible to Apple.

It’s used exclusively to fulfill your request.

And just like your iPhone, independent experts can inspect the code that runs on these servers to verify this privacy promise.

In fact, Private Cloud Compute cryptographically ensures your iPhone, iPad, and Mac will refuse to talk to a server unless its software has been publicly logged for inspection.

This sets a brand-new standard for privacy in AI, and unlocks intelligence you can trust.

So that’s a look at the powerful capabilities of Apple intelligence and its groundbreaking privacy protections.

Now we’d love to show you how it will transform your apps and experiences across iOS18, iPadOS 18, and macOS Sequoia, from a big leap forward for Siri, to powerful tools for writing and communication, and fun visual ways to express yourself.

Let’s start with Siri.

Here’s Kelsey to tell you more.

Today, Siri helps you get everyday tasks done quickly and easily.

In fact, Siri users make 1.5 billion voice requests every single day.

Thirteen years ago, we introduced Siri.

The original intelligent assistant.

And we had an ambitious vision for it.

We’ve been steadily building towards that vision.

And now, thanks to the incredible power of Apple Intelligence, we have the foundational capabilities to take a major step forward.

So we can make Siri more natural, more contextually relevant, and of course, more personal to you.

Right off the bat, you’ll see Siri’s got a new look.

Let me show you.

When you talk to Siri, you’ll notice it’s more deeply integrated into the system experience, with this elegant glowing light that wraps around the edge of your screen.

And you can speak to Siri more naturally thanks to richer language understanding capabilities.

Even if I stumble over my words, Siri understands what I’m getting at.

What does the weather look like for tomorrow at Muir Beach?

Oh, wait, I meant Muir Woods!

Siri: The forecast is calling for clear skies in the morning near Muir Woods National Monument.

sometimes it takes me a beat to figure out what I actually want to ask Siri, and now it follows right along.

Siri also maintains conversational context, so I can follow up and say, “Create an event for a hike there tomorrow at 9:00 a.m.”

Siri: Hike is scheduled for 9:00 a.m. to 11:00 a.m. on June 11.

I didn’t have to mention Muir Woods again.

Siri understood what I meant when I said “there.”

There are also certain times when you might not want to speak to Siri out loud.

What’s great is that now, at any time, you have the option to type to Siri.

With just a double tap at the bottom of the screen, I can quickly and quietly ask Siri to set an alarm.

And you can switch between text and voice, communicating in whatever way feels right for the moment.

We’re also laying the groundwork for some brand-new ways that Siri will be able to support you, one of which is its extensive product knowledge.

Siri now holds a great deal of information about features and settings and can answer thousands of questions when you want to know how to do something on your iPhone, iPad, or Mac.

Even if you don’t know exactly what a feature is called, you can just describe it and Siri will find the info you’re looking for.

Like this: “How can I write a message now and have it be delivered tomorrow?”

Siri understood what feature I was referring to, and now I have step-by-step guidance on how to use the new Send Later feature in Messages.

Everything I’ve showed you so far will be available from the moment you start using Apple Intelligence.

And over the course of the next year, we will be rolling out more features that make Siri even more personal and capable.

For one, Apple Intelligence will provide Siri with on-screen awareness, so it’ll be able to understand and take action with things on your screen.

For example, say a friend texts you his new address.

Right from the Messages thread, you can say, “Add this address to his contact card,” and Siri will take care of it.

Siri will also understand more of the things you get done in your apps.

And with new orchestration capabilities provided by Apple Intelligence, Siri will take actions inside apps on your behalf.

Siri will have the ability to take hundreds of new actions in and across apps, including some that leverage our new writing and image generation capabilities.

For example, you’ll be able to say, “Show me my photos of Stacey in New York wearing her pink coat,” and Siri will bring those right up.

Then you might say, “Make this photo pop,” and Siri will enhance it, just like that.

And Siri will be able to take actions across apps, so you could say, “Add this to my note with Stacey’s bio,” and it will jump from the Photos app to the Notes app to make it happen.

This is going to bring us closer to realizing our vision in which Siri moves through the system in concert with you.

This is made possible through significant enhancements that we are making to App Intents, a framework that lets apps define a set of actions for Siri, Shortcuts, and other system experiences.

And this won’t be limited to apps made by Apple.

For developers, they’ll be able to use the App Intents framework to define actions in their apps and tap into Apple Intelligence too.

So you might ask Siri to take a light trails video in Pro Camera by Moment.

Or ask Siri to share a summary of your meeting notes in an email you’re drafting to a teammate in Superhuman.

And this is only the beginning.

Siri will be able to understand and take more actions in more apps over time.

There’s one more set of really cool and useful capabilities coming to Siri.

Thanks to Apple Intelligence, it has awareness of your personal context.

With its semantic index of things like photos, calendar events, and files, plus information that’s stashed in passing messages and emails, like hotel bookings, PDFs of concert tickets, and links that your friends have shared, Siri will find and understand things it never could before.

And with the powerful privacy protections of Apple Intelligence, Siri will use this information to help you get things done without compromising your privacy.

You’ll be able to ask Siri to find something when you can’t remember if it was in an email, a text, or a shared note, like some book recommendations that a friend sent you a while back.

Or for times when you’re filling out a form and need to input your driver’s license, Siri will be able to find a photo of your license, extract your ID number, and type it into the form for you.

I want to show you one more demo that will give you a sense for how powerful Siri will be when it draws on the personal context awareness and action capabilities built into Apple Intelligence.

Imagine that I am planning to pick my mom up from the airport, and I’m trying to figure out my timing.

Siri is going to be able to help me do this so easily.

Siri, when is my mom’s flight landing?

What’s awesome is that Siri actually cross-references flight details that my mom shared with me by email with real-time flight tracking to give me her up-to-date arrival time.

What’s our lunch plan?

I don’t always remember to add things to my calendar, and so I love that Siri can help me keep track of plans that I’ve made in casual conversation, like this lunch reservation my mom mentioned in a text.

How long will it take us to get there from the airport?

I haven’t had to jump from Mail to Messages to Maps to figure out this plan.

And a set of tasks that would have taken minutes on my own and honestly probably would have resulted in a call to my Mom could be addressed in a matter of seconds.

That’s just a glimpse of the ways in which Siri is going to become more powerful and more personal thanks to Apple Intelligence.

And all of these updates to Siri are also coming to iPad and Mac, where Siri’s new design is a total game-changer.

It makes Siri feel seamlessly integrated with your workflow.

Thanks to the capabilities of Apple Intelligence, this year marks the start of a new era for Siri.

Here’s Justin to show you more places throughout the system where Apple Intelligence simplifies and accelerates your tasks.

Apple Intelligence unlocks incredible new ways to enhance your writing, whether you are tidying up your hastily-written class notes, ensuring your blog post reads just right on Wordpress, or making sure your email is perfectly crafted.

Let’s use Mail to take a closer look at how the systemwide Writing Tools can help you communicate even more effectively.

Rewrite gives you different versions of what you have written, so you can choose the one you like best.

This is great for making sure your cover letter for that job you’re excited for lands perfectly.

And suggestions are shown inline, so you can go with the combination of flow and wording that works for you.

Rewrite also helps you get the tone right.

Have you ever re-read a work email that you just wrote and thought, “Oh, this might not go over well”?

Well, now you can change the tone of that response to your colleague to make it sound more friendly, professional, or concise.

You can also describe how you’d like it rewritten.

For example, you can invite your friends to a get-together with a one-of-a-kind invitation written as a poem.

Who could say no to that?

Another way Writing Tools can help you is with Proofread.

Say you’re emailing your English professor.

With Proofread, you can nail grammar, word choice, and sentence structure to put your best foot forward.

You can review suggested edits and their explanations individually, or accept them all with a click.

And if you are about to email a project status that has gotten quite long, use Summarize to bring out the key points, and then add them as a TL;DR right at the top.

In addition to mail, you can access Writing Tools systemwide, nearly everywhere you write, including third-party apps.

Apple Intelligence also powers Smart Reply in Mail.

For example, when you need to RSVP to an event, you will now see suggestions for your response based on the email.

If you say you’ll be there, Mail identifies questions you were asked in the invite, and offers intelligent selections so you can quickly choose your responses.

Your drafted response incorporates your answers.

So with just a few taps, you’re ready to send it off with all the right details.

Finally, let’s talk about how Apple Intelligence helps you stay on top of a busy inbox.

We all deal with sorting through a ton of email every day.

And now it is easier and faster than ever to browse your inbox.

Instead of previewing the first few lines of each email that don’t always convey the most useful information, you can now see summaries, visible right from your email list.

So without even opening the email, you’ll know that your team is meeting on Thursday to discuss a new design.

And if you jump into a particularly long email when you’re in a hurry, you can tap to reveal a summary at the top of the email and cut right to the chase.

We’re also elevating Priority Messages.

Apple Intelligence can understand the content of the emails you receive, determine what’s most urgent, and surface it right at the top.

Like a dinner invite for tonight, or a boarding pass for your trip this afternoon.

And deep understanding of language extends beyond your inbox into more places, like your Notifications.

First, just like in Mail, your Priority Notifications appear at the top of the stack, letting you know what to pay attention to at a glance.

And to make scanning your notifications faster, they’re summarized.

So when the group chat is blowing up, you can quickly see that Savita booked the house and Lia is arriving early, right from your Lock Screen.

Apple Intelligence also enables an all-new Focus called Reduce Interruptions.

It understands the content of your notifications to selectively surface only the ones that might need immediate attention, like a text about today’s daycare pickup.

From catching up on Priority Notifications, to staying present and focused with Reduce Interruptions, and refining your words with Writing Tools, Apple Intelligence helps you save time in so many ways.

Now, over to Cyrus to show you how it unlocks new ways to express yourself.

Apple Intelligence enables you to create fun, original images whether you are sprucing up a Keynote for class or trying to land an idea while collaborating in Freeform.

And third-party apps can offer this experience too, like in Craft, where you can create a delightful image to add to your document.

Let’s take a closer look at how Apple Intelligence helps you express yourself visually in Messages.

One of the most fun ways to communicate in Messages is with emoji.

But even with thousands of emoji to choose from, there are times when you can’t quite find the right one for how you feel.

So we’re introducing Genmoji.

Leveraging the power of Apple Intelligence, you can create Genmoji, on-device, right in the Keyboard, and match any moment perfectly.

Just provide a description and you’ll see your Genmoji appear right before your eyes, along with more options to choose from.

This is great in those times when you’re updating a friend about your relaxing weekend, getting the group chat excited about brunch, or complaining about the rowdy squirrel right outside your window.

And because Apple Intelligence is aware of who’s in your photo library, you can simply pick someone and create a Genmoji that looks just like them.

These are perfect for sharing with friends as a sticker, reacting to messages with a Tapback, and you can even add Genmoji inline in your messages.

Let your imagination run wild as you create just the right Genmoji.

And because it’s so much fun to use images to express ourselves, we went even further with a new system experience we call Image Playground.

This is a new way to create playful images in just seconds.

It’s so easy to use, and we’ve built it right into apps like Messages.

To get started, you can choose from a range of concepts like themes, costumes, accessories, places, and more.

When you select them, they get added to your playground.

No need to engineer the perfect prompt.

In a few seconds, you’ll see Apple Intelligence creates a preview of what your image could look like.

A moment later, you’ll see more previews you can swipe through.

This all happens on-device.

So you have the freedom to experiment and create as many images as you want.

This is great for quickly responding to your friends with just the right image.

When you have a really specific idea in mind, you can just type a description to add it to your playground.

And you can easily adjust which style you want to use and choose from Animation, Sketch, or Illustration.

Whichever suits the vibe of your conversation.

If you change your mind along the way, no problem.

Just switch back and you’ll see your previous previews.

It’s that simple.

Since Apple Intelligence understands your personal context, you’ll see suggestions for concepts related to your Messages conversation, including you and people from your Messages thread.

When selected, it uses appearances from Photos to add you, or one of them, to the image you’re creating.

With an intuitive experience to create totally original images, and so many ways to express what you want, the Image Playground is going to make everyday conversations a whole lot more fun.

In addition to Messages, this experience is also available in apps like Keynote, Pages, and Freeform.

To make it easy to experiment with creating images, we’ve also built a dedicated Image Playground app.

You can use it to try out Styles, play around with different concepts, and make something to share with friends in other apps or on social media.

And for developers, they can integrate the new Image Playground experience in their app too, with a new API.

With the Image Playground experience and Genmoji, you can create fun and delightful images right where you need them.

Now, here’s Seb to show you more experiences enabled by the powerful capabilities of Apple Intelligence.

With the ability to deeply understand and create images, Apple Intelligence unlocks some fantastic new experiences.

Like a brand-new tool in the Notes app that we call Image Wand.

Image Wand can transform a rough sketch into a polished image that complements your notes and makes them more visual.

And it’s available right in your tool palette.

Suppose you want a better image for your architectural history course.

With Image Wand, you can circle your rough sketch using Apple Pencil to open up an Image Playground within your note.

Image Wand uses on-device intelligence to analyze your sketch and words and creates an image for you.

What's really fun is that you can even circle empty space, and it will pull out context from the surrounding area to suggest the ideal image to go with your note.

It has never been easier to make your notes more visual and engaging.

Apple Intelligence also helps us make the most out of our ever-growing photo libraries.

First, we have an update to photo editing.

We’ve all had that time when we thought we got the perfect shot, then realized later it wasn’t quite perfect.

Now, the new Clean Up tool will identify distracting objects in the background, so you can make them disappear, without accidentally changing your subject.

Plus, searching for photos and videos is much more convenient, because you can now use natural language phrases.

So you can search for really specific things, like “Maya skateboarding in a tie-dye shirt,” or “Katie with stickers on her face.”

Search in videos is also more powerful, with the ability to find a particular moment in the middle of a video clip.

So you can go right to the relevant segment when you search for that video of Maria cartwheeling on the grass.

Apple Intelligence also makes it so much more delightful to create a Memory Movie.

Today when you want to use your photos and videos to create a movie yourself, like for your fishing trips with your kids, it can take hours of work.

You have to search through tons of photos to pick out the best ones, figure out how to arrange them, and hunt for the right music.

Now, thanks to Apple Intelligence, it is super easy to create a memory about the story you want to see.

Just type a description, and it can interpret that “learning to fish” involves things like water, docks, fishing rods, and boats.

Using its language and image understanding, Apple Intelligence picks out the best photos and videos.

And then it crafts a storyline with unique chapters and arranges them into a movie with its own narrative arc.

So now I can watch a wonderful Memory that starts with my son practicing on the dock, transitions to fishing on the boat, and finishes with us holding the prize catch.

And all of this is set to the perfect song selected from Apple Music.

Like all of Apple Intelligence, these updates to Photos are built on a foundation of privacy, so your photos and videos are not shared with Apple, or anyone else.

With endless possibilities, it is so much fun trying out different ideas and revisiting our most precious moments.

And now, back to Craig.

Apple Intelligence is truly unique in how it understands you and meets you where you are.

And what you saw here is just the beginning.

It enables so many more helpful features.

For example, in the Notes app, you can now record and transcribe audio, to capture detailed notes while staying present in the moment.

And when your recording is finished, Apple Intelligence generates a summary to help you recall the key points at a glance.

Recordings, transcriptions, and Apple Intelligence-powered summaries are also coming to the Phone app.

And when you start a recording in a live call, participants are automatically notified, so no one is surprised.

Apple Intelligence is available for free with iOS 18, iPadOS 18, and macOS Sequoia, bringing you personal intelligence across the products you use every day.

Still, there are other artificial intelligence tools available that can be useful for tasks that draw on broad world knowledge, or offer specialized domain expertise.

We want you to be able to use these external models without having to jump between different tools.

So we’re integrating them right into your experiences.

And we’re starting out with the best of these, the pioneer and market leader ChatGPT from Open AI, powered by GPT-4o.

First, we built support into Siri, so Siri can tap into ChatGPT’s expertise when it might be helpful for you.

For example, if you need menu ideas for an elaborate meal to make for friends using some freshly caught fish and ingredients from your garden, you can just ask Siri.

Siri determines that ChatGPT might have good ideas for this, asks your permission to share your question, and presents the answer directly.

You can also include photos with your questions.

If you want some advice on decorating, you can take a picture and ask, “What kind of plants would go well on this deck?”

Siri confirms if it’s okay to share your photo with ChatGPT and brings back relevant suggestions.

It’s a seamless integration.

In addition to photos, you can also ask questions related to your documents, presentations, or PDFs.

We’ve also integrated ChatGPT into the systemwide Writing Tools with Compose.

You can create content with ChatGPT for whatever you’re writing about.

Suppose you want to create a custom bedtime story for your six-year-old who loves butterflies and solving riddles.

Put in your initial idea and send it to ChatGPT to get something back she’ll love.

Compose can also help you tap into ChatGPT’s image capabilities to generate images in a wide variety of styles to illustrate your bedtime story.

You’ll be able to access ChatGPT for free and without creating an account.

Your requests and information will not be logged.

And for ChatGPT subscribers, you’ll be able to connect your account and access paid features right within our experiences.

Of course, you’re in control over when ChatGPT is used and will be asked before any of your information is shared.

ChatGPT integration will be coming to iOS 18, iPadOS 18, and macOS Sequoia later this year.

We also intend to add support for other AI models in the future.

Now, let’s talk about developers, and how they can integrate the experiences powered by Apple Intelligence into their apps.

We have updated our SDKs with new APIs and frameworks.

For example, developers can add the image Playground experience to their app with just a few lines of code.

This means that an app like Craft can help users create images to make their documents much more visual.

And Writing Tools are automatically available within apps that use the standard editable text view.

So without any development effort, an app like Bear Notes can automatically allow users to rewrite, proofread, and summarize notes.

Plus, we are building many more ways for users to take action in apps with Siri.

If a developer has already adopted SiriKit, they’ll see immediate enhancements from many of Siri’s new capabilities without additional work.

We’re also investing deeply in the App Intents framework to connect the vast world of apps with Apple Intelligence.

We’re defining new intents across our operating systems and making them available to developers starting with these categories.

These intents are pre-defined, trained, and tested, so they’re easy for developers to adopt.

Using new App Intents, an app like Darkroom will be able to use the Apply Filter intent to give users the ability to say, “Apply a cinematic preset to the photo I took of Ian yesterday.”

These are just a handful of the updates coming to our platform SDKs so developers can add intelligent and useful features to their apps.

We will share more details in the Platforms State of the Union later today, like how we are bringing generative intelligence to Xcode for developing apps using Swift and SwiftUI, with features like on-device code completion, and smart assistance for Swift coding questions.

So that’s Apple Intelligence, with tremendous benefits for developers and users.

This is AI for the rest of us, personal intelligence you can rely on at work, home, and everywhere in between.

Apple Intelligence harnesses the power of our most advanced silicon, and will be available on iPhone 15 Pro, and iPad and Mac with M1 and later.

Apple Intelligence will be available to try out in US English this summer.

We are bringing it to users in beta as part of iOS18, iPadOS 18, and macOS Sequoia this fall, with some features and additional languages and platforms coming out over the corse of the next year.

This is the beginning of an exciting new chapter of personal intelligence.

Intelligence built for your most personal products: your iPhone, iPad, and Mac.

Intelligence grounded in the things that make you, you.

And intelligence available to you systemwide, so you can get things done in the way that works for you.

We are just getting started, and I hope you are as excited as I am for the road ahead.

And now, back to Tim.

Thank you, Craig, and thanks to all of our presenters.

It’s been an exciting day of announcements.

We shared powerful new features and advancements to our six incredible platforms.

And the introduction of powerful new Apple Intelligence features to iOS 18, iPadOS 18, and macOS Sequoia make these releases game-changers.

Built in a uniquely Apple way, we think Apple Intelligence is going to be indispensable to the products that already play such an integral role in our lives.

We have a big week ahead for developers.

It kicks off this afternoon with the Platforms State of the Union.

We also have over a hundred technical sessions, live forums, in-depth consultations, and Q&As with Apple engineers.

All of this content is available online, for free, for developers.

We’re excited to provide developers with the amazing new OS platforms and technologies we announced today, as well as tools and resources to help them do the very best work of their lives.

Thank you so much for joining us.

Let’s have a great WWDC.

Nacos本地缓存配置实践

2023-10-07T13:49:26+08:00

背景

前段时间做了一个项目，由于nacos的不稳定性，导致了生产环境拉取配置失败了，从而影响了生产环境的业务。

于是团队就做了一个大胆的决定，为了避免因为依赖nacos导致业务的不可用，我们一致决定，在本地做nacos的配置缓存。本篇文章只讨论nacos配置缓存的实践，不涉及注册中心。

经过几轮测试和验证，最终这个方案落地了，做了实际的故障演练，把nacos断了之后，应用是能正确的拿到缓存的配置。

下面我们就来看看如何实现的。

缓存配置的步骤

实现本地缓存配置数据的步骤，我这里做了几个具体步骤的总结：

首先，有一个Nacos服务器，这个是必须的，并且已经创建了相应的配置。
其次，在程序中引入Nacos的客户端SDK依赖，这个也是必须的。
然后使用SDK从Nacos服务器获取配置数据，这个也是必须的。
重点步骤来了，我的实现方式是，将从nacos服务端获取到的配置数据保存在本地缓存中，可以使用内存、文件或其他缓存机制。
然后在需要访问配置数据的地方，首先检查本地缓存是否存在数据。如果存在，直接使用缓存数据；如果不存在，再从Nacos服务器获取最新的配置数据。
最后就是要定期刷新本地缓存，以确保获取到的配置数据是最新的。

这只是其中一种实现方式，还有主动轮训的方式，这里就不讲了。

本次实践不涉及强实时要求的配置的更新。

测试代码

先用java代码测试一下可行性，试一下在本地缓存配置数据：

import com.alibaba.nacos.api.config.ConfigService;
import com.alibaba.nacos.api.config.listener.Listener;
import com.alibaba.nacos.api.exception.NacosException;
import com.alibaba.nacos.api.utils.StringUtils;

import java.util.Properties;
import java.util.concurrent.Executor;

public class NacosConfigCacheExample {
    private static final String SERVER_ADDR = "localhost:8848";
    private static final String GROUP_ID = "DEFAULT_GROUP";
    private static final String DATA_ID = "example-config";
    private static final String CACHE_FILE_PATH = "/path/to/cache/file";

    private static ConfigService configService;

    public static void main(String[] args) throws NacosException {
        // 创建Nacos配置服务实例
        Properties properties = new Properties();
        properties.put("serverAddr", SERVER_ADDR);
        configService = NacosFactory.createConfigService(properties);

        // 从Nacos服务器获取配置数据
        String configData = configService.getConfig(DATA_ID, GROUP_ID, 5000);

        // 将配置数据保存到本地缓存文件
        saveConfigToCache(configData);

        // 注册配置变更监听器
        configService.addListener(DATA_ID, GROUP_ID, new Listener() {
            @Override
            public void receiveConfigInfo(String configInfo) {
                // 当配置发生变化时，更新本地缓存
                saveConfigToCache(configInfo);
            }

            @Override
            public Executor getExecutor() {
                return null; // 使用默认的执行器
            }
        });

        // 从本地缓存获取配置数据
        String cachedConfigData = readConfigFromCache();
        if (StringUtils.isNotBlank(cachedConfigData)) {
            // 使用缓存数据
            System.out.println("Using cached config data: " + cachedConfigData);
        } else {
            // 缓存数据为空，从Nacos服务器获取最新的配置数据
            String latestConfigData = configService.getConfig(DATA_ID, GROUP_ID, 5000);
            System.out.println("Using latest config data: " + latestConfigData);
        }
    }

    private static void saveConfigToCache(String configData) {
        // 将配置数据保存到本地缓存文件
        // 这里使用了简单的文件存储方式，你可以根据实际需求选择其他缓存机制
        try (FileWriter writer = new FileWriter(CACHE_FILE_PATH)) {
            writer.write(configData);
        } catch (IOException e) {
            e.printStackTrace();
        }
    }

    private static String readConfigFromCache() {
        // 从本地缓存文件读取配置数据
        // 这里使用了简单的文件存储方式，你可以根据实际需求选择其他缓存机制
        try (BufferedReader reader = new BufferedReader(new FileReader(CACHE_FILE_PATH))) {
            StringBuilder configData = new StringBuilder();
            String line;
            while ((line = reader.readLine()) != null) {
                configData.append(line);
            }
            return configData.toString();
        } catch (IOException e) {
            e.printStackTrace();
        }
        return null;
    }
}

跑了一下，是可以成功拿到数据，并缓存到文件中的。同时在缓存数据为空的时候，是可以从nacos读取最新的数据的。

那么下面就在正式的spring项目中开干。

Spring Boot代码DEMO

项目是用的Spring Boot，那么在项目中引入nacos这种简单的操作就浅写一下吧。

<dependency>
    <groupId>com.alibaba.cloud</groupId>
<artifactId>spring-cloud-starter-alibaba-nacos-config</artifactId>
</dependency>

然后，在配置文件中添加Nacos相关的配置这种简单的操作也浅写一下吧。

spring:
  cloud:
    nacos:
      config:
        server-addr: localhost:8848
        group: DEFAULT_GROUP
        namespace: your-namespace

接下来，创建一个配置类，用于获取和缓存配置数据：

import com.alibaba.nacos.api.config.annotation.NacosConfigListener;
import com.alibaba.nacos.api.config.annotation.NacosValue;
import org.springframework.stereotype.Component;

import java.io.*;
import java.util.HashMap;
import java.util.Map;

@Component
public class ConfigCache {
    private static final String CACHE_FILE_PATH = "/path/to/cache/file";

    private Map<String, String> configDataMap = new HashMap<>();

    public String getConfigData(String dataId) {
        String configData = configDataMap.get(dataId);
        if (configData == null || configData.isEmpty()) {
            configData = readConfigFromCache(dataId);
        }
        return configData;
    }

    @NacosConfigListener(dataId = "example-config", groupId = "DEFAULT_GROUP")
    public void onConfigUpdate(String config, String dataId) {
        configDataMap.put(dataId, config);
        saveConfigToCache(config, dataId);
    }

    private void saveConfigToCache(String configData, String dataId) {
        try (BufferedWriter writer = new BufferedWriter(new FileWriter(CACHE_FILE_PATH + dataId))) {
            writer.write(configData);
        } catch (IOException e) {
            e.printStackTrace();
        }
    }

    private String readConfigFromCache(String dataId) {
        try (BufferedReader reader = new BufferedReader(new FileReader(CACHE_FILE_PATH + dataId))) {
            StringBuilder configData = new StringBuilder();
            String line;
            while ((line = reader.readLine()) != null) {
                configData.append(line);
            }
            return configData.toString();
        } catch (IOException e) {
            e.printStackTrace();
        }
        return null;
    }
}

这里配置不多的话，可以用一个map来存储。可以加快获取的速度。

当map中没有的时候，再去文件中读取缓存的配置。这里展示的是用文件存储的。

实际的项目中，最终是替换成了redis来存储的。

文件存储的问题在于是，配置很多的话，会产生很多个文件。如果写一个文件的话，当配置很多的时候IO效率又很慢。

用map存在内存中的问题是，如果配置很多，会占用很多内存。但好处是它查询很快。

最后，在业务代码中可以使用ConfigCache类来获取配置数据：

import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;

@RestController
public class ExampleController {
    @Autowired
    private ConfigCache configCache;

    @GetMapping("/config")
    public String getConfigData() {
        return configCache.getConfigData("example-config");
    }
}

总结一下里面的坑

实际测试下来，最终发现，不同的项目会用不同的缓存。

只用MAP

在配置很少的那种服务里，比如只有几个或者十几个，就简单的用map缓存在内存中就可以了。

但它的问题是，如果是服务本身挂了，那么重启后，这些map中的数据就丢失了。
另外一个问题是，服务有多个实例的时候，每个实例中缓存的map是不一样的。当正好没有缓存的实例去访问nacos拿数据的时候，nacos正好挂了，那么一样会失败。虽然概率很低。

用MAP+文件

对于配置多，也不在乎读取效率的服务里，就用MAP+文件。其实我们的项目中最终只有一个服务用了这种模式。

它的问题在于，配置很多的服务中，随着服务的使用，MAP占用的内存会越来越大。
用文件存储的问题在于，它有一定的概率会出现IO问题。导致读写文件失败。所以兜底的操作就是最终都去nacos读取。
另外一个问题在于，写到文件中的配置都是明文的。运维或者开发登陆到机器上就能看到配置的信息。有一定的安全风险。

用Redis

绝大部分的服务最终都用了这种模式。因为用了redis，所以也不需要用MAP了。经过测试，从MAP中读取，和从Redis读取的时间差可以忽略不计，几乎无感知。

最明显的缺点肯定就是会增加一个redis，在整体架构上多了一环，那就多了一个不稳定因素。因为redis也有挂掉的可能。
这个redis到底是独立的只用于配置，还是和其他的缓存的redis合用一个，也是一个纠结的问题。不过实际的决策还是要根据业务来。
还有一个坑是，如果redis和nacos在同一个可用区，那这个可用区挂掉之后，会导致两边都拿不到数据，一样导致不可用。所以redis和nacos本身的高可用部署也是需要考虑的。

通用问题

数据一致性问题。缓存中的数据和nacos中的数据可能是不一致的。虽然有监听更新，但在大数据量和频繁读取的场景中，也有可能导致不一致的情况出现。所以这种对一致性要求很高的场景，建议做一些一致性保障的逻辑。
如果Nacos配置数据量非常大或者数量众多，如果是用的缓存到本地文件的方式，可能会占用大量的存储空间，并且读取和写入大量的配置数据可能会影响应用的性能。所以量大的场景不太建议用文件缓存。
如果在配置中有一些敏感数据，比如密码、敏感数据等，用缓存的方式都可能增加新的安全风险。比如缓存到文件是会被读取到的，缓存到redis通常都是明文存储的。

最后总结

考虑了多种实现方式，最终选了上述的方式。另外一种实现方式是，可以主动轮训。

但它的问题在于，主动轮训会需要确定好时间间隔，太短可能占用应用的性能，太长可能数据更新不及时等。另外就是轮训也有数据一致性问题。

还有一种缓存方式，就是把量大的配置做分批缓存。比如每批缓存500个配置。由于和我们的业务不太匹配，就没有尝试这种方式。

总之，我们做nacos本地缓存是为了避免nacos故障导致的业务不可用。每个企业每个项目每个业务遇到的问题可能不一样，大家应该根据自己的实际场景，来寻找最适合的解决方案。

微服务优雅上下线的实践方法

2023-05-23T16:20:10+08:00

💡 本文介绍了微服务优雅上下线的实践方法，包括适用于 Spring 应用的优雅上下线逻辑，以及使用 Docker 实现无损下线的 demo，以及服务预热。同时，本文还总结了优雅上下线的价值和挑战。

前言

微服务优雅上下线的原理是指在微服务的发布过程中，保证服务的稳定性和可用性，避免因为服务的变更而造成流量的中断或错误。
微服务优雅上下线的原理可以从三个角度来考虑：

服务端的优雅上线，即在服务启动后，等待服务完全就绪后再对外提供服务，或者有一个服务预热的过程。
服务端的无损下线，即在服务停止前，先从注册中心注销，拒绝新的请求，等待旧的请求处理完毕后再下线服务。
客户端的容灾策略，即在调用服务时，通过负载均衡、重试、黑名单等机制，选择健康的服务实例，避免调用不可用的服务实例。

微服务优雅上下线可以提高微服务的稳定性和可靠性，减少发布过程中的风险和损失。

优雅上线

优雅上线，也叫无损上线，或者延迟发布，或者延迟暴露，或者服务预热。
优雅上线的目的是为了提高发布的稳定性和可靠性，避免因为应用的变更而造成流量的中断或错误。

优雅上线的方法

优雅上线的方法有以下几种：

延迟发布：即延迟暴露应用服务，比如应用需要一些初始化操作后才能对外提供服务，如初始化缓存，数据库连接池等相关资源就位，可以通过配置或代码来实现延迟暴露。
QoS命令：即通过命令行或HTTP请求来控制应用服务的上线和下线，比如在应用启动时不向注册中心注册服务，而是在服务健康检查完之后再手动注册服务。
服务注册与发现：即通过注册中心来管理应用服务的状态和路由信息，比如在应用启动时向注册中心注册服务，并监听服务状态变化事件，在应用停止时向注册中心注销服务，并通知其他服务更新路由信息。
灰度发布：即通过分流策略来控制应用服务的流量分配，比如在发布新版本的应用时，先将部分流量导入到新版本的应用上，观察其运行情况，如果没有问题再逐步增加流量比例，直到全部切换到新版本的应用上。

上面的方法核心思想都是一个，就是等服务做好了准备再把请求放行过去。

优雅上线的实现

大部分优雅上线都是通过注册中心和服务治理能力来实现的。
对于初始化过流程较长的应用，由于注册通常与应用初始化过程同步进行，因此可能出现应用还未完全初始化就已经被注册到注册中心供外部消费者调用，此时直接调用可能会导致请求报错。
所以，通过服务注册与发现来做优雅上线的基本思路是：

在应用启动时，提供一个健康检查接口，用于反馈服务的状态和可用性。
应用启动后，可以采用下列方法来使新的请求暂时不进入新版的服务实例。
- 暂时不向注册中心注册服务。
- 隔离服务，有些注册中心支持隔离服务实例，比如北极星。
- 将权重配置为0，比如nacos。
- 将服务实例的enable改为false，比如nacos。
- 让健康检查接口返回不健康的状态。
在新版本的应用实例完成初始化操作后，确保了可用性后，再对应的将上述的方法取消，这样就可以让新的请求被路由到新版本的应用实例上。
如果需要预热，就让流量进入新版本的应用实例时按比例的一点点增加。

这样，就可以实现优雅上线的过程，保证请求进来的时候，不会因为新版本的应用实例没有准备好而导致请求失败。

优雅上线的代码demo

我们以 Spring Cloud 和 Nacos 为例，讲一下如何通过服务注册与发现来做优雅上线的过程。
首先，我们需要创建一个 Spring Cloud 项目，并添加 Nacos 的依赖。
然后，我们需要在 application.properties 文件中配置 Nacos 的相关信息，如注册中心地址，服务名，分组名等，例如：

spring.application.name=provider
spring.cloud.nacos.discovery.server-addr=127.0.0.1:8848
spring.cloud.nacos.discovery.group=DEFAULT_GROUP

接下来，我们需要在启动类上添加 @EnableDiscoveryClient 注解，表示开启服务注册与发现功能，例如：

@SpringBootApplication
@EnableDiscoveryClient
public class ProviderApplication {

    public static void main(String[] args) {
        SpringApplication.run(ProviderApplication.class, args);
    }

}

然后，我们需要创建一个 Controller 类，提供一个简单的接口，用于返回服务的信息，例如：

@RestController
public class ProviderController {

    @Value("${server.port}")
    private int port;

    @GetMapping("/hello")
    public String hello() {
        return "Hello, I am provider, port: " + port;
    }
}

最后，如果需要我们可以重写健康检查接口，用于反馈服务的状态和可用性。这里我们需要引入Actuator。

@Component
public class DatabaseHealthIndicator implements HealthIndicator {

    @Override
    public Health health() {
        if (isDatabaseConnectionOK()) {
            return Health.up().build();
        } else {
            return Health.down().withDetail("Error Code", "DB-001").build();
        }
    }

    private boolean isDatabaseConnectionOK() {
        // 检查数据库连接、缓存等
        return true;
    }
}

这样，我们就完成了一个简单的服务提供者应用，并且可以通过 Nacos 来实现服务注册与发现。
接下来，我们需要创建一个服务消费者应用，并且也添加 Nacos 的依赖和配置信息。
然后，我们需要在启动类上添加 @EnableDiscoveryClient 注解，表示开启服务注册与发现功能，并且使用 RestTemplate 来调用服务提供者的接口，例如：

@SpringBootApplication
@EnableDiscoveryClient
public class ConsumerApplication {

    public static void main(String[] args) {
        SpringApplication.run(ConsumerApplication.class, args);
    }

    @Bean
    @LoadBalanced // 开启负载均衡
    public RestTemplate restTemplate() {
        return new RestTemplate();
    }

    @RestController
    public class ConsumerController {

        @Autowired
        private RestTemplate restTemplate;

        @GetMapping("/hello")
        public String hello() {
            // 使用服务名来调用服务提供者的接口
            return restTemplate.getForObject("<http://provider/hello>", String.class);
        }
    }
}

这里我们使用了 @LoadBalanced 注解来开启负载均衡功能，并且使用服务名 provider 来调用服务提供者的接口。
这样，我们就完成了一个简单的服务消费者应用，并且可以通过 Nacos 来实现服务注册与发现。
接下来，我们就可以通过以下步骤来实现优雅上线的过程：

在发布新版本的服务提供者应用时，先启动新版本的应用实例，但是不向注册中心注册服务，或者让健康检查接口返回不健康的状态，这样就不会有新的请求进入新版本的应用实例。这可以通过配置或代码来实现，例如：

# 不向注册中心注册服务
spring.cloud.nacos.discovery.register-enabled=false

// 让健康检查接口返回不健康的状态
this.isHealthy = false;

在新版本的应用实例完成初始化操作后，再向注册中心注册服务，或者让健康检查接口返回健康的状态，这样就可以让新的请求被路由到新版本的应用实例上。这可以通过配置或代码来实现，例如：

# 向注册中心注册服务
spring.cloud.nacos.discovery.register-enabled=true

// 让健康检查接口返回健康的状态
this.isHealthy = true;

这样，就可以实现优雅上线的过程，保证正在处理的请求不会被中断，而新的请求会被路由到新版本的应用上。

服务预热

服务预热是指在服务上线之前，先让服务处于一个运行状态，让其加载必要的资源、建立连接等，以便在服务上线后能够快速响应请求。如下图所示。
在流量较大情况下，刚启动的服务直接处理大量请求可能由于应用内部资源初始化不彻底从而出现请求阻塞、报错等问题。此时通过服务预热，在服务刚启动阶段通过小流量帮助服务在处理大量请求前完成初始化，可以帮助发现服务上线后可能存在的问题，例如资源不足、连接数过多等，从而及时进行调整和优化，确保服务的稳定性和可靠性。

Spring Boot实现服务预热

我们可以通过使用 Spring Boot Actuator 来实现服务预热。

添加 Spring Boot Actuator 依赖。
配置了将所有 Actuator 端点暴露出来，并启用了预热端点。

management.endpoints.web.exposure.include=*
management.endpoint.warmup.enabled=true

这时我们就可以调用warmup接口来实现预热了。默认的接口如下：http://localhost:8080/actuator/warmup

这里spring的warmup 端点会做以下几件事情：

加载 Spring 上下文
初始化连接池
加载缓存数据
发送测试请求

如果我们想自定义预热逻辑，我们也可以通过实现warmup接口来自定义预热的逻辑。代码如下：

@Component
public class MyWarmup implements Warmup {

    @Override
    public void warmup() {
        // 实现预热逻辑
    }
}

优雅下线

无损下线、优雅下线都是同一个意思。都是为了避免服务下线的时候由于请求没有处理完导致请求失败的情况。

优雅下线的方法

无损下线的一些常用的工具或框架有：

Dubbo-go：支持多种注册中心、负载均衡、容灾策略等，可以实现优雅上下线的设计与实践。
Spring Cloud：提供了多种组件来实现服务的配置、路由、监控、熔断等，可以通过监听 ContextClosedEvent 事件来实现优雅下线的逻辑。
Docker：可以通过 docker stop 或 docker kill 命令来停止容器，前者会发送 SIGTERM 信号给容器的 PID1 进程，后者会发送 SIGKILL 信号。如果程序能响应 SIGTERM 信号，就可以实现优雅下线的操作。

Spring Cloud优雅下线的原理

ContextClosedEvent 是 Spring 容器在关闭时发布的一个事件，可以通过实现 ApplicationListener 接口来监听这个事件，并在 onApplicationEvent 方法中执行一些自定义的逻辑。
对于 Spring Cloud 中的微服务来说，当收到 ContextClosedEvent 事件时，可以做以下几件事情：

从注册中心注销当前服务，这样就不会再有新的请求进入。
拒绝或者延迟新的请求，这样就可以保证正在处理的请求不会被中断。
等待一段时间，让旧的请求处理完毕，或者超时。
关闭服务，释放资源。

这样就可以实现优雅下线的逻辑，避免因为服务的变更而造成流量的中断或错误。

Spring boot优雅下线的demo

在旧版本里面，我们需要实现 TomcatConnectorCustomizer 和 ApplicationListener<ContextClosedEvent> 接口，然后就可以在 customize 方法中获取到 Tomcat 的 Connector 对象，并在 onApplicationEvent 方法中监听到 Spring 容器的关闭事件。
在2.3及以后版本，我们只需要在application.yml中添加几个配置就能启用优雅关停了。

# 开启优雅停止 Web 容器，默认为 IMMEDIATE：立即停止
server:
  shutdown: graceful

# 最大等待时间
spring:
  lifecycle:
    timeout-per-shutdown-phase: 30s

这个开关的具体实现逻辑在我们在 GracefulShutdown 里。
然后我们需要添加actuator依赖，然后在配置中暴露actuator的shutdown接口。

# 暴露 shutdown 接口
management:
  endpoint:
    shutdown:
      enabled: true
  endpoints:
    web:
      exposure:
        include: shutdown

这个时候，我们调用http://localhost:8080/actuator/shutdown就可以执行优雅关停了，它会返回如下内容：

{
    "message": "Shutting down, bye..."
}

优缺点

我觉得这种方法有以下的优点和缺点：
优点：

简单易用，只需要简单的配置，就可以实现优雅下线的逻辑。
适用于 Tomcat 作为内嵌容器的 Spring Boot 应用，不需要额外的配置或依赖。
可以保证正在处理的请求不会被中断，而新的请求不会进入，避免了服务的变更造成流量的中断或错误。

缺点：

只适用于 Tomcat 作为内嵌容器的 Spring Boot 应用，如果使用其他的容器或部署方式，可能需要另外的实现。
需要等待一定的时间，让正在处理的请求完成或超时，这可能会影响服务的停止速度和资源的释放。
如果正在处理的请求过多或过慢，可能会导致线程池无法优雅地关闭，或者超过系统的终止时间，造成强制关闭。

Docker优雅下线的demo

这里用一个简单的JS应用来演示docker实现无损下线的过程。
首先，我们需要创建一个 Dockerfile 文件，用于定义一个简单的应用容器，代码如下：

# 基于 node:14-alpine 镜像
FROM node:14-alpine

# 设置工作目录
WORKDIR /app

# 复制 package.json 和 package-lock.json 文件
COPY package*.json ./

# 安装依赖
RUN npm install

# 复制源代码
COPY . .

# 暴露 3000 端口
EXPOSE 3000

# 启动应用
CMD [ "node", "app.js" ]

然后，我们需要创建一个 app.js 文件，用于定义一个简单的 web 应用，代码如下：

// 引入 express 模块
const express = require('express');

// 创建 express 应用
const app = express();

// 定义一个响应 /hello 路径的接口
app.get('/hello', (req, res) => {
  // 返回 "Hello, I am app" 字符串
  res.send('Hello, I am app');
});

// 监听 3000 端口
app.listen(3000, () => {
  // 打印日志信息
  console.log('App listening on port 3000');
});

接下来，我们需要在终端中执行以下命令，来构建和运行我们的应用容器，并查看页面结果。

# 构建镜像，命名为 app:1.0.0
docker build -t app:1.0.0 .

# 运行容器，命名为 app-1，映射端口为 3001:3000
docker run -d --name app-1 -p 3001:3000 app:1.0.0

# 查看容器运行状态和端口映射信息
docker ps

CONTAINER ID   IMAGE       COMMAND                  CREATED          STATUS          PORTS                    NAMES
a8a9f9f7c6c4   app:1.0.0   "docker-entrypoint.s…"   10 seconds ago   Up 9 seconds    0.0.0.0:3001->3000/tcp   app-1

# 在浏览器中访问 <http://localhost:3001/hello> ，可以看到返回 "Hello, I am app" 字符串

这个时候假设我们要发布一个新版本的应用，我们需要修改 app.js 文件中的代码，把返回的字符串修改为 “Hello, I am app v2”。
然后，我们需要在终端中执行以下命令，来构建和运行新版本的应用容器：

# 构建镜像，命名为 app:2.0.0
docker build -t app:2.0.0 .

# 运行容器，命名为 app-2，映射端口为 3002:3000
docker run -d --name app-2 -p 3002:3000 app:2.0.0

# 查看容器运行状态和端口映射信息
docker ps

CONTAINER ID   IMAGE       COMMAND                  CREATED          STATUS          PORTS                    NAMES
b7b8f8f7c6c4   app:2.0.0   "docker-entrypoint.s…"   10 seconds ago   Up 9 seconds    0.0.0.0:3002->3000/tcp   app-2
a8a9f9f7c6c4   app:1.0.0   "docker-entrypoint.s…"   2 minutes ago    Up 2 minutes    0.0.0.0:3001->3000/tcp   app-1

# 在浏览器中访问 <http://localhost:3002/hello> ，可以看到返回 "Hello, I am app v2" 字符串

接下来，需要优雅地下线旧版本的应用容器，让它完成正在处理的请求，然后停止接收新的请求，最后退出进程。

# 向旧版本的应用容器发送 SIGTERM 信号，让它优雅地终止
docker stop app-1

# 查看容器运行状态和端口映射信息
docker ps

CONTAINER ID   IMAGE       COMMAND                  CREATED          STATUS          PORTS                    NAMES
b7b8f8f7c6c4   app:2.0.0   "docker-entrypoint.s…"   2 minutes ago    Up 2 minutes    0.0.0.0:3002->3000/tcp   app-2

# 在浏览器中访问 <http://localhost:3001/hello> ，可以看到无法连接到服务器的错误

这样，我们就实现了通过 Docker 来做优雅下线的过程，保证正在处理的请求不会被中断，而新的请求会被路由到新版本的应用上。
这里主要用到了docker stop命令。docker stop命令会向容器发送 SIGTERM 信号，这是一种优雅终止进程的方式，它会给目标进程一个清理善后工作的机会，比如完成正在处理的请求，释放资源等。如果目标进程在一定时间内（默认为 10 秒）没有退出，docker stop 命令会再发送 SIGKILL 信号，强制终止进程。
所以，使用 docker stop 命令能实现优雅下线的前提是，容器中的应用能够正确地响应 SIGTERM 信号，并在收到该信号后执行清理工作。如果容器中的应用忽略了 SIGTERM 信号，或者在清理工作过程中出现异常，那么 docker stop 命令就无法实现优雅下线的效果。
让容器中的应用正确地响应 SIGTERM 信号的方法，主要取决于容器中的 1 号进程是什么，以及它如何处理信号。如果容器中的 1 号进程就是应用本身，那么应用只需要在代码中为 SIGTERM 信号注册一个处理函数，用于执行清理工作和退出进程。例如，在 Node.js 中，可以这样写：

// 定义一个处理 SIGTERM 信号的函数
function termHandler() {
  // 执行清理工作
  console.log('Cleaning up...');
  // 退出进程
  process.exit(0);
}

// 为 SIGTERM 信号注册处理函数
process.on('SIGTERM', termHandler);

总结

优雅上下线的价值

在微服务实践中，实现优雅上下线能给我们带来以下好处：

最小化服务中断：通过优雅上下线，可以最小化服务中断的时间和影响范围，从而确保服务的可用性和稳定性。
避免数据丢失：优雅下线可以确保正在处理的请求能够完成，避免数据丢失和请求失败。
提高用户体验：优雅上下线可以确保用户在使用服务时不会遇到任何中断或错误，从而提高用户体验和满意度。
简化部署流程：通过使用自动化工具和流程，可以简化部署流程，减少人工干预和错误，提高部署效率和质量。
提高可维护性：通过使用监控和日志记录工具，可以及时发现和解决问题，提高服务的可维护性和可靠性。

这些好处可以帮助企业提高服务质量和效率，提升用户满意度和竞争力。

优雅上下线的挑战

但同时，优雅上下线也面临一些挑战：

复杂性增加：微服务架构通常由多个服务组成，每个服务都有自己的生命周期和依赖关系，因此优雅上下线需要考虑多个服务之间的交互和协调，增加了系统的复杂性。
部署流程复杂：优雅上下线需要使用自动化工具和流程，这需要投入大量的时间和资源来构建和维护，增加了部署流程的复杂性。
数据一致性问题：优雅下线需要确保正在处理的请求能够完成，但这可能会导致数据一致性问题，需要采取措施来解决这个问题。
人员技能要求高：微服务架构需要具备更高的技术水平和技能，需要拥有更多的开发和运维经验，这对企业的人员要求较高。

综上所述，企业需要认真考虑这些挑战，并采取相应的措施来解决这些问题，以确保在微服务实践中更好的落地优雅上下线。

如何用Serverless实现视频剪辑批量化、自动化与定制化

2022-03-23T11:31:34+08:00

前言

开始讲之前先解决大家看到这个标题时心里的3个疑惑：

视频剪辑不是用Adobe的软件就可以做了吗？
为什么要用Serverless？
如何写代码做视频剪辑？

首先说说哪些视频剪辑场景是Adobe等软件无法完成的

大家平常接触到的视频剪辑通常都是使用Premiere，AE等这类专业工具来完成视频剪辑。他们能完成一些复杂的效果，比如做宣传视频，广告视频等。

但有些企业在某些业务场景下是期望能批量且自动化的完成视频剪辑。

比如以下几种场景：

假设学校期望能在学生上完网课之后马上呈现所有学生学习过程中的精彩视频，配上学校的logo和宣传语等，让学生一键分享自己的成果。假设有1万个学生，需要为每个学生制作独一无二的视频，所以需要批量且自动化的完成1万个不同的视频剪辑。
某次营销活动中，需要为不同的用户生成不同的头像视频来吸引用户参与。每个用户的头像都是独一无二的，生成的视频也是独一无二的，用户可能成千上万，因此自动化完成是必须的条件。
网红运营公司期望能给所有主播生成统一的营业视频。可能有100个主播，专门找一个人剪辑100个视频好像勉强能接受，但如果每周都要剪一次不同的视频呢？所以自动化，批量和可定制化的剪辑就成了主要需求。

以上的场景中有三个特点：

批量
自动化
可定制

对于符合以上特点的场景，是传统的视频剪辑工具或者模版化的视频处理软件无法轻松完成的。

再来说说为什么用Serverless

因为视频剪辑这样的业务有几个特点：

使用时段集中。
计算量大。

单独购买高规格的服务器利用率很低，买便宜的服务器计算能力又跟不上。

因此Serverless按量计费的特点，以及高性能的计算能力，完美匹配了这样的需求场景。

既能达到100%的利用率，又能按量使用它的高性能计算能力。

同时，Serverless拥有多变的可编程环境，可以使用熟悉的编程语言，灵活性很高。

最后说说如何写代码做视频剪辑

本文章提到的所有视频剪辑的功能，都是用FFmpeg这个工具，所以先给大家讲讲什么是FFmpeg。

FFmpeg是一个用来做视频处理的开源工具，它有非常强大的功能，它支持视频剪辑、视频转码、视频编辑、音频处理、添加文字、视频拼接、拉流推流直播等功能。

我们通过不同的FFmpeg命令就可以编程完成不同的视频剪辑功能，组合编排起来，就可以应对各种批量自动化的场景了。

视频剪辑批量化、自动化与定制化实践

常见的视频剪辑场景主要包含以下几种：

视频转码
视频裁剪
视频加文字
视频加图片
视频拼接
视频加音频
视频转场
视频特效
视频加速慢速播放

接下来给大家展示一些具体的FFmpeg命令例子，如果你在本地安装了FFmpeg，也可以在本地执行这些命令。关于怎么安装FFmpeg，可以去看官网的教程。

// 将MOV视频转成mp4视频
ffmpeg -i input.mov output.mp4

// 将原视频的帧率修改为24
ffmpeg -i input.mp4 -r 24 -an output.mp4

// 将mp4视频转为可用于直播的视频流
ffmpeg -i input.mp4 -codec: copy -bsf:v h264_mp4toannexb -start_number 0 -hls_time 10 -hls_list_size 0 -f hls output.m3u8

// 将视频分别变为480x360，并把码率改400
ffmpeg -i input.mp4 -vf scale=480:360,pad=480:360:240:240:black -c:v libx264 -x264-params nal-hrd=cbr:force-cfr=1 -b:v 400000 -bufsize 400000 -minrate 400000 -maxrate 400000 output.mp4

// 给视频添加文字，比如字幕、标题等。
// `fontfile`是要使用的字体的路径，`text`是你要添加的文字，
// `fontcolor`是文字的颜色，`fontsize`是文字大小，`box`是给文字添加底框。
// `box=1`表示enable，`0`表示disable，`boxcolor`是底框的颜色，black@0.5表示黑色透明度是50%，`boxborderw`是底框距文字的宽度
// `x`和`y`是文字的位置，`x`和`y`不只支持数字，还支持各种表达式，具体可以去官网查看
ffmpeg -i input.mp4 -vf "drawtext=fontfile=/path/to/font.ttf:text='你的文字':fontcolor=white:fontsize=24:box=1:boxcolor=black@0.5:boxborderw=5:x=(w-text_w)/2:y=(h-text_h)/2" -codec:a copy output.mp4

// 给视频添加图片，比如添加logo、头像、表情等。filter_complex表示复合的滤镜，overlay表示表示图片的x和y，enable表示图片出现的时间段，从0-20秒
ffmpeg -i input.mp4 -i avatar.JPG -filter_complex "[0:v][1:v] overlay=25:25:enable='between(t,0,20)'" -pix_fmt yuv420p -c:a copy output.mp4

// 视频拼接，list.txt里面按顺序放所有要拼接的视频的文件路径，如下。
// 注意，如果视频的分辨率不一致会导致拼接失败。
ffmpeg -f concat -safe 0 -i list.txt -c copy -movflags +faststart output.mp4
// list.txt的格式如下
file 'xx.mp4'
file 'yy.mp4'

// 视频加音频，stream_loop表示是否循环音频内容，-1表示无限循环，0表示不循环。shortest表示最短的MP3输入流结束时完成编码。
ffmpeg -y -i input.mp4 -stream_loop -1 -i audio.mp3 -map 0:v -map 1:a -c:v copy -shortest output.mp4

FFmpeg能做的事情非常多，这里就不一一讲解了。更多的玩法可以在FFmpeg官网上探索探索。

对于音频的编辑也是同样的道理，FFmpeg也支持单独对音频进行编辑。

如何运行FFmpeg命令

因为Python运行这些命令比较便捷，所以我们可以使用python来运行所有的FFmpeg命令。同时python在serverless云函数上运行性能也比较好，部署也方便。

通过Python来使用FFmpeg的视频剪辑代码在文章最后有开源链接。并且在官网上也有模版可以直接使用，覆盖了常见的音视频剪辑等操作。

这里就展示一个简单的调用代码示例。

child = subprocess.run('./ffmpeg -i input.mov output.mp4',
                               stdout=subprocess.PIPE,
                               stderr=subprocess.PIPE, close_fds=True, shell=True)
if child.returncode == 0:
  print("success:", child)
else:
  print("error:", child)
    raise KeyError("处理视频失败, 错误: ", child)

在serverless部署

上面提到的常见的视频剪辑场景我已经实现并开源了，下载代码直接部署到serverless就可以使用了。

https://github.com/woodyyan/ffmpeg-composition

https://github.com/woodyyan/ffmpeg-splice

这里分为了两个函数，一个负责处理单个视频，一个负责把多个视频拼接成一个视频并配上背景音乐。

目前支持以下功能：

在视频中添加文字
视频分辨率转换
在视频中添加图片
视频拼接
添加背景音乐

源码里展示的只是常见的一些视频剪辑场景，大家可以根据自己的业务需要，编写自己的视频剪辑逻辑。

Serverless部署

方式一：Github Action自动部署

Fork仓库。
在仓库的Settings-Secrets-Actions中添加TENCENT_SECRET_ID和TENCENT_SECRET_KEY两个密钥。ID和KEY可以在腾讯云的访问控制里面获取。
添加之后，在Action中就可以发起部署了。每次修改代码推送后，也会自动触发Action部署。
如果需要有一些自定义的配置，请修改serverless.yml。
云函数最终会自动部署到TENCENT_SECRET_ID所在的账号下。

方式二：云函数控制台手动部署

下载代码。
在根目录把所有文件和文件夹一起打包成一个ZIP文件。
去云函数控制台，新建一个函数。
选择从头开始：
1. 选择python语言。
2. 上传ZIP文件。
3. 函数内存建议选择较大的内存。
4. 开启异步执行。
5. 执行超时时间根据视频大小建议设置长一点，比如30秒以上。
6. 配置触发器，选择API网关触发器，关闭集成响应。
完成部署后，就可以通过API网关的URL开始调用了。

真实案例回顾

一个做网课的学校，需要每次在学生上完网课之后把上网课的录像制作成一段30秒的视频，作为学生的学习成果。

此案例有几个关键的信息点：

通常一堂课有200个学生，需要同时制作200个视频。
需要把1小时的上课视频剪辑成30秒。
由于每个学生的上课屏幕有所不同，因此录制的视频都是不同的。
最终的成果视频还需要加上学生的名字和头像。
学生结束上课的时间很集中，因此制作视频时会有短时高并发。
每次上完课的时候才会需要制作视频，时段比较固定且集中。

综合上述特点，用Serverless来做这样的视频剪辑带来了多个好处：

解决了200个并发的问题，不需要自己搭建过多的服务器。
解决了只在发生时段使用的问题，其他时段都没有成本产生。
解决了需要较强计算能力快速制作视频的问题。

下面是这个案例的参考架构图。

总结

通过编排、组合、复用上面列举的各种音视频剪辑的场景，就能制作出各种各样想要的效果。

然后把视频剪辑中用来控制各种效果的参数，变成调用服务时传入的参数，就能实现各种效果的定制化了。

最后再总结一下通过这种写代码的方式完成视频剪辑的使用场景：

解决通过修改个别参数来批量制作视频的场景。
解决通过用户触发来自动化制作视频的场景。
解决不同场景需要不同定制化的制作视频的场景。

同时，利用serverless来完成视频剪辑，同样也解决了以下几个问题：

因为通常视频剪辑不是全天运行，利用serverless按量付费的特性能优化成本。
因为视频剪辑通常是重计算场景，利用serverless可选的高规格配置来应对这种重计算。
在批量制作视频的场景中通常会存在高并发，利用serverless自动弹性伸缩的特性能轻松应对高并发。

关于Serverless使用上或者视频剪辑大家有什么问题，欢迎给我留言。

如何做Serverless自动化部署

2022-02-28T17:48:15+08:00

前言

随着敏捷和DevOps的流行，CI/CD已经成了所有开发者在开发过程中必不可少的最佳实践，主要目标是以更快的速度、更短的周期向用户交付行之有效的软件。

它能给我们带来如下好处：

缩短发布周期
降低风险
提高代码质量
更高效的反馈循环
可视化过程

因此在Serverless越来越流行的今天，如何让Serverless的项目也能快速的搭建CI/CD，这是这篇文章的重点。

习惯了CI/CD的用户可能都期望有一个快速搭建自动化部署的教程，所以这篇文章会以下面几个流行的平台来讲解如何搭建自动化部署，让你能够推送代码就自动完成部署。

Github
Jenkins
Coding

基于 GitHub 的自动化部署

GitHub Actions是Github推出的自动化软件开发工作流。通过Actions可以执行任何任务，其中就包括 CI/CD。

前提条件

已托管你的 Serverless 项目代码到Github。
项目中必须包含Serverless framework部署需要用到的serverless.yml。serverless.yml的使用方式请参考官网。
如果是Web函数，需保证根目录有scf_bootstrap文件，具体请参考官网。

操作步骤

为了让这个部署过程更简单，我在GitHub的市场中发布一个腾讯云Serverless部署的Action来帮助大家快速完成自动化部署。

在GitHub的Marketplace中搜索tencent serverless就可以找到。如下图。里面有详细的Action代码。

首先，在Actions里面选择Set up a workflow yourself，如下图。

如果知道如何使用Action，那么直接用下面这句就可以了，里面封装了安装Serverless framework和执行部署命令的步骤。

    - name: serverless scf deploy
      uses: woodyyan/tencent-serverless-action@main

如果不知道如何使用Action，可以根据不同的语言选择下列不同的yml写法，下面我列举了Python、Java、NodeJS的写法。

适用于Python项目

# 当代码推动到 main 分支时，执行当前工作流程
# 更多配置信息: https://docs.github.com/cn/actions/getting-started-with-github-actions
name: deploy serverless scf
on: #监听的事件和分支配置
  push:
    branches:
      - main
jobs:
  deploy:
    name: deploy serverless scf
    runs-on: ubuntu-latest
    steps:
      - name: clone local repository
        uses: actions/checkout@v2
      - name: deploy serverless
        uses: woodyyan/tencent-serverless-action@main
        env: # 环境变量
          STAGE: dev #您的部署环境
          SERVERLESS_PLATFORM_VENDOR: tencent #serverless 境外默认为 aws，配置为腾讯
          TENCENT_SECRET_ID: ${{ secrets.TENCENT_SECRET_ID }} #您的腾讯云账号 sercret ID，请在Settings-Secrets中配置
          TENCENT_SECRET_KEY: ${{ secrets.TENCENT_SECRET_KEY }} #您的腾讯云账号 sercret key，请在Settings-Secrets中配置

适用于Java项目，请仔细看代码中的备注说明

name: deploy serverless scf
on: #监听的事件和分支配置
  push:
    branches:
      - main
jobs:
  build-and-deploy:
    runs-on: ubuntu-latest
    permissions:
      contents: read
      packages: write
    steps:
      - uses: actions/checkout@v2
      - name: Set up JDK 11
        uses: actions/setup-java@v2
        with:
          java-version: '11'
          distribution: 'temurin'
          server-id: github # Value of the distributionManagement/repository/id field of the pom.xml
          settings-path: ${{ github.workspace }} # location for the settings.xml file
      - name: Build with Gradle # Gradle项目用这个
        uses: gradle/gradle-build-action@937999e9cc2425eddc7fd62d1053baf041147db7
        with:
          arguments: build
      - name: Build with Maven # Maven项目用这个
        run: mvn -B package --file pom.xml
      - name: create zip folder # 此步骤仅用于Java Web函数，用于存放jar和scf_bootstrap文件。Java事件函数只需要在Serverless.yml中指定Jar目录就好。
        run: mkdir zip
      - name: move jar and scf_bootstrap to zip folder # 此步骤仅用于Java Web函数，用于移动jar和scf_bootstrap文件。Java事件函数只需要在Serverless.yml中指定Jar目录就好。注意如果是Maven编译请修改下面的jar路径为/target。
        run: cp ./build/libs/XXX.jar ./scf_bootstrap ./zip
      - name: deploy serverless
        uses: woodyyan/tencent-serverless-action@main
        env: # 环境变量
          STAGE: dev #您的部署环境
          SERVERLESS_PLATFORM_VENDOR: tencent #serverless 境外默认为 aws，配置为腾讯
          TENCENT_SECRET_ID: ${{ secrets.TENCENT_SECRET_ID }} #您的腾讯云账号 sercret ID
          TENCENT_SECRET_KEY: ${{ secrets.TENCENT_SECRET_KEY }} #您的腾讯云账号 sercret key

适用于NodeJS项目

# 当代码推动到 main 分支时，执行当前工作流程
# 更多配置信息: https://docs.github.com/cn/actions/getting-started-with-github-actions
name: deploy serverless scf
on: #监听的事件和分支配置
  push:
    branches:
      - main
jobs:
  deploy:
    name: deploy serverless scf
    runs-on: ubuntu-latest
    steps:
      - name: clone local repository
        uses: actions/checkout@v2
      - name: install dependency
        run: npm install
      - name: build
        run: npm build
      - name: deploy serverless
        uses: woodyyan/tencent-serverless-action@main
        env: # 环境变量
          STAGE: dev #您的部署环境
          SERVERLESS_PLATFORM_VENDOR: tencent #serverless 境外默认为 aws，配置为腾讯
          TENCENT_SECRET_ID: ${{ secrets.TENCENT_SECRET_ID }} #您的腾讯云账号 sercret ID，请在Settings-Secrets中配置
          TENCENT_SECRET_KEY: ${{ secrets.TENCENT_SECRET_KEY }} #您的腾讯云账号 sercret key，请在Settings-Secrets中配置

最后，由于部署的时候需要用到腾讯云的TENCENT_SECRET_ID和TENCENT_SECRET_KEY，所以需要在Github代码仓库的设置中的Secrets里面配置这两个变量。如下图所示。ID和KEY可以在腾讯云的访问控制里面获取。

配置完成之后，每次推送代码，都将会自动触发部署流程，同时在Actions中可以实时看到执行结果与错误日志。如下图。

大家还可以根据项目需要，在流程中添加测试、安全检查、发布等步骤。

基于Jenkinsfile的自动化部署

Jenkinsfile是通用于Jenkins、Coding等平台的，因此只需要配置好Jenkinsfile，则能在这些平台上完成自动化部署。

前提条件

已托管你的 Serverless 项目到 Coding/Github/Gitlab/码云等平台。
项目中必须包含Serverless framework部署需要用到的serverless.yml。serverless.yml的使用方式请参考官网。
如果是Web函数，需保证根目录有scf_bootstrap文件，具体请参考官网。

操作步骤

根据不同语言的需要，我把所有语言需要用到的语法都写在下面的Jenkinsfile中了，适用于Python、Java、NodeJS，请仔细阅读注释。

pipeline {
  agent any
  stages {
    stage('检出') {
      steps {
        checkout([$class: 'GitSCM', branches: [[name: env.GIT_BUILD_REF]],
            userRemoteConfigs: [[url: env.GIT_REPO_URL, credentialsId: env.CREDENTIALS_ID]]])
      }
    }
        stage('Package'){ // 此stage仅用于Java项目
       steps{
        container("maven") {
          echo 'Package start'
          sh "mvn package" // 此行用于Java Maven项目
                    sh "./gradlew build" // 此行用于Java Gradle项目
                    sh "mkdir zip" // 此行仅用于Java Web函数，用于存放jar和scf_bootstrap文件。Java事件函数只需要在Serverless.yml中指定Jar目录就好。
          sh "cp ./build/libs/XXX.jar ./scf_bootstrap ./zip" // 此行仅用于Java Web函数，用于移动jar和scf_bootstrap文件。Java事件函数只需要在Serverless.yml中指定Jar目录就好。注意如果是Maven编译请修改下面的jar路径为/target。
        }           
           }
    }
    stage('安装依赖') {
      steps {
        echo '安装依赖中...'
        sh 'npm i -g serverless'
        sh 'npm install' // 此行用于NodeJS项目
        echo '安装依赖完成.'
      }
    }
    stage('部署') {
      steps {
        echo '部署中...'
        withCredentials([
          cloudApi(
            credentialsId: "${env.TENCENT_CLOUD_API_CRED}",
            secretIdVariable: 'TENCENT_SECRET_ID',
            secretKeyVariable: 'TENCENT_SECRET_KEY'
          ),
        ]) {
             // 生成凭据文件
             sh 'echo "TENCENT_SECRET_ID=${TENCENT_SECRET_ID}\nTENCENT_SECRET_KEY=${TENCENT_SECRET_KEY}" > .env'
             // 部署
             sh 'sls deploy --debug'   
             // 移除凭据
             sh 'rm .env' 
        }
        echo '部署完成'
      }
    }
  }
}

使用上面的Jenkinsfile就可以在Jenkins、coding等平台一键完成CI/CD配置了。

注意，需要在平台中配置腾讯云需要用到的TENCENT_SECRET_ID和TENCENT_SECRET_KEY这两个变量。

总结

作为开发者，总是希望所有代码工作都是自动化完成，都能提高效率。因此，熟练的掌握如何快速配置自动化的CI/CD流程，是每个开发者必须掌握的技能之一。

在这里分享这些开箱即用的配置，也是希望能大大减少大家的学习成本，快速上手开始核心业务开发。

未来我还会继续探索更多的适用于Serverless的DevOps实践，与大家分享。

如果有任何疑问或在操作中遇到任何困难可以在文章下方留言，我会回复大家。

如何用Serverless云函数做免费私域运营机器人

2022-02-11T14:48:33+08:00

关于私域流量

近几年，私域流量运营的话题被提及得越来越多。

私域流量是指从公域（internet）、它域(平台、媒体渠道、合作伙伴等)引流到自己私域（官网、客户名单），以及私域本身产生的流量(访客)。私域流量是可以进行二次以上链接、触达、发售等市场营销活动客户数据。

而私域流量运营很重要的一点就是如何能自动化智能化的进行客户运营。

目前各大公司的办公软件都支持机器人这种应用形式，而这种机器人则是我们做私域流量运营的重要一环。

机器人能做什么

机器人在私域流量运营中可以做包括但不限于以下事情：

消息推送
智能客服
客户管理
建群引流
活动营销
企业互联

这些场景名词可能有些抽象，可以举几个具体例子。

比如，用户进群之后会收到机器人自动发送的欢迎仪式，里面附带新用户代金券等，同时此消息是仅他可见，不会打扰其他用户。
比如，用户通过询问智能客服机器人就能得到很多常见的答案，省去了人工成本。
比如，机器人自动在群里发起某营销活动的报名，无需人工收集。
再比如，通过客户管理，可以给客户打标签，针对不同的客户，自动发送不同的活动优惠。
再再比如，通过机器人收集广告投放获取的商机，自动创建商机线索，并同步到群里自动@相关销售，闭环整个商机发现路径。

可以想象的空间有很多很多。

为什么是Serverless呢

为什么选择serverless来做呢，好处主要有以下几点：

机器人的通信都是通过HTTP请求与企业微信通信，而serverless按调用次数收费，拥有极高的性价比。
机器人通常在晚上都没有人使用，如果使用传统的服务器部署会有较高的闲置率，用serverless可以把利用率做到近乎百分百。
机器人可能会涉及多个使用场景，可以针对不同的场景使用不同的FaaS云函数，做到细粒度的管理和问题隔离。
腾讯云云函数支持所有主流语言，无需关心服务器，开发快，周期短，一个机器人从开发到上线最快只需要1小时。

为什么说免费呢?

因为腾讯云云函数包含有免费额度。而机器人的使用并不是高频调用，所以免费额度足以涵盖所有的使用量。

免费的羊毛薅起来吧！

这篇文章将选择企业微信作为平台，从最基础的场景，讲解如何用serverless云函数来完成一个企业微信机器人。

企业微信机器人原理

我们先来了解一下企业微信机器人的原理。如上图所示，左边表示我们的serverless云函数机器人，右边是企业微信。

中间的箭头表示两种机器人和企业微信的通信方式：

机器人单向给企业微信发送消息
机器人和企业微信双向互发消息

从图中可以看出，单向通信是蓝色的箭头，因为单向通信没有任何限制，机器人无法获取企业微信的相关信息。这种模式主要适合于所有的通知类的场景。比如消息推送，全局群发等。

而红色的箭头就有诸多限制了，因为企业微信可以向外发送信息的话，这里就涉及到很多安全问题了。因此企业微信对于这种情况主要做了多方面的限制：

发送的消息必须经过严格的加解密。
某些特殊消息内容拥有一定的实效性，比如获取会话信息必须通过一个临时的URL，有效期只有５分钟，且调用一次后失效。
双向通信的回调URL可以由企业设置一些限制，比如只支持企业内网URL。

那配合双向通信，就可以做到上面说的所有场景，比如智能客服、客户管理等。

机器人实战

那我们就从两个简单的场景来讲解一下如何实现一个企业微信机器人。

消息通知 - 单向通信
知识库搜索 - 双向通信

消息通知

首先需要创建一个机器人，创建方式是在任何一个企业微信群里，点击右上角，添加群机器人。

然后选择新创建一个机器人。

创建完成之后，你就获得了一个webhook地址。如下图。

这个webhook地址就是你推送消息到企业微信的地址。

推送的消息格式有很多种，支持往群聊会话中发送文本、markdown、图片、图文、文件、模版卡片六种消息类型。

以文本消息为例，你只需要推送以下JSON内容到webhook地址，企业微信就会收到通知。

{
    "chatid":"CHATID1 | CHATID2",
    "msgtype":"text",
    "text":{
        "content":"广州今日天气：29度，大部分多云，降雨概率：60%",
        "mentioned_list":["lisi", "@all"],
        "mentioned_mobile_list":["13800001111", "@all"]
    }
}

那么以云函数为例，如何创建云函数可以参考官网文档。

创建好之后，只需要几行代码就能完成一个通知发送机器人。如下图。

注意要将url替换成你的机器人webhook地址，content必须是utf8编码。

如果你期望每天早上8点定时推送天气预报，你只需要修改一下上面的代码，从某个天气预报API拿到天气预报，然后设置一个定时触发器，触发周期用CRON表达式定义每天8点触发，如下图。

这样之后，每天8点你的企业微信群就能收到如下图的消息了。

知识库搜索

上一个例子是单向通信的例子。那这个例子则是双向通信的例子。

在企业中，以及在私域流量运营中，我们经常有搜索知识库寻找答案的场景。这里我们就以搜索腾讯云文档为例，来向大家讲解如何完成一个双向通信的知识库搜索机器人。

我们要做的就是当输入关键字，就去腾讯云文档搜索结果并返回，同时高亮显示关键字和文档链接。

首先，还是一样的，你需要创建一个云函数。但这个云函数是需要接收企业微信发过来的消息，因此在上一个云函数的基础上，我们需要添加一个API网关触发器，让云函数能接收API请求。

创建触发器选择API网关触发器，创建好之后如下图，复制访问路径那个URL，它就是企业微信在回调消息的需要填的URL。

接着到企业微信，鼠标放到你创建的机器人上，点击配置，选择【接收消息配置】，在URL那里填入上面复制的URL。如下图。

Token和EncodingAESKey可以自己写，也可以随机获取，它是你用来做加密解密时用的。

💡 当点击“保存”提交以上信息时，企业微信会发送一条验证消息到填写的URL，发送方法为GET。群机器人的接收消息服务器接收到验证请求后，需要作出正确的响应才能通过URL验证。

完成了上述设置之后，你在群聊中@机器人并输入你想搜索的关键字，你的云函数就会收到对应的JSON消息，msgContent就是你搜索的关键字。

{
    "msgType": "text",
    "msgContent": "函数计费",
    "chatId": "XXX",
    "botKey": "XXX",
    "hookUrl": "http://qyapi.weixin.qq.com/cgi-bin/webhook/send?key=XXXX",
    "botName": "腾讯云文档搜索助手",
    "userName": "XXX·",
    "msgId": "CAIQ4",
    "chatType": "group",
    "chatInfoUrl": "http://qyapi.weixin.qq.com/cgi-bin/webhook/get_chat_info?code=XXX"
}

这个时候你只需要拿到msgContent的内容，然后去调用腾讯云的文档搜索API，拿到JSON的结果，把JSON结果处理为如下图中的markdown格式，并返回。

于是我们的腾讯文档搜索助手就做好了，使用效果如下图。

至此，我们两个企业微信机器人都做好了。

这里就不展示代码了，想看具体怎么写的同学可以去看我的源码。

总结

我从两个简单的例子去讲解了如何做企业微信机器人，而企业微信机器人是我们做私域流量运营的重要一步，同时Serverless则完美帮我们解决了实现机器人的技术选型。

随着我们对客户体验和服务体验的追求，我们利用自动化的手段帮我们提高了响应速度，利用智能化帮我们提高了服务准确度。
在追求售前和售后效率的今天，机器人的使用可以节省人力成本和时间，缩短客户等待时间。
Serverless作为一种弹性伸缩与按量计费的服务，完美匹配了机器人的使用场景，从成本与效率上帮助企业在私域流量运营场景中业务的快速搭建与迭代。
Serverless作为一种FaaS服务，通过多个云函数的编排，独立或混合的处理不同的业务场景，做到细粒度的管理，与业务容错隔离。

未来，我会继续探索Serverless做私域流量运营的更多场景和实践，也会继续和大家分享。

如果大家有私域流量运营相关的问题，欢迎来和我一起探讨。

如何用Serverless搭建Mock server

2022-01-18T11:17:34+08:00

前言

什么是Serverless

无服务器Serverless是一种云原生开发模型，可使开发人员专注构建和运行应用，而无需管理服务器。

云函数（Serverless Cloud Function，SCF）则是腾讯云提供的无服务器执行环境，可以在无需购买和管理服务器的情况下运行代码。

什么是Mock Server

现在的业务系统很少有孤立存在的，它们或多或少需要使用或依赖其他服务，这给我们的联调和测试造成了麻烦。

为了应对这种情况，我们常会搭建一个临时的server，模拟那些服务，提供模拟数据进行联调和测试。

这个临时的server就是 mock server 。

因此mock server通常具有以下特点：

快速搭建、无需写代码
能模拟任何数据
低成本
简单配置

也正是这些特点，均符合serverless的特点，因此我们使用serverless来做这件事情再合适不过了。

接下来我们就用腾讯云的云函数为例，来讲解一下如何快速搭建Mock Server。

如何用云函数快速搭建Mock Server

目前市面上有很多Mock server工具，开源的不开源的都有。

这里就用Moco作为例子来教大家快速部署一个Mock Server。

Moco是一个开源框架，这是它的Github链接。

准备工作

首先去Moco的github页面下载准备好的jar文件。

其次需要自己准备一个定义response的JSON文件，如下。里面的内容需要根据自己的业务去定义要返回的mock数据是什么。

[
  {
    "response" :
      {
        "text" : "Hello, Moco"
      }
  }
]

最后在云函数中运行需要一个启动文件，文件名必须是scf_bootstrap，内容如下：

#!/bin/bash
/var/lang/java8/bin/java -jar moco-runner-1.2.0-standalone.jar http -p 9000 -c foo.json

其中端口号必须是9000，JSON配置文件名如果不是foo.json则需要改成自己的文件名。

然后把这个三个文件打包成一个zip文件，如下图。

部署Mock Server

打开云函数的控制台，新建一个云函数。如下图。

选择从头开始
选择Web函数
运行环境选择Java8
在函数代码那里上传刚才打包好的zip文件

最后，点击完成即可。

然后，你到函数管理界面就可以看到访问路径了。如下图。

向URL发送HTTP请求就能获得你在JSON文件中定义的response。

一键部署

上面的方式是不是已经很快捷了。但是还有更快的，没有错！

Mock server已经上架到云函数的官方模版中了。

如下图，在模版中搜索mock就可以看到，一键就可以部署一个Mock server了。

用Serverless搭建Mock Server的优势

用Serverless搭建Mock Server具有下面几个优势。

快速搭建

所有开发团队都希望只花极少的时间就能快速搭建一个Mock Server。

因此使用Serverless不用关注和维护服务器，所以可以快速搭建运行一个mock server。

极低成本

由于Mock server只用于测试，如果我们购买服务器来搭建，会增加不少金钱成本和维护成本。

而Serverless按量收费和免运维的特点，则可以既节约了金钱成本，又节约了维护成本。

通常我们调用Mock Server的次数都很少，而云函数是按调用次数收费的，每个月有10万次免费调用次数。所以使用云函数则可以免费薅羊毛。

无需运维

我们不需要像管理服务器那样需要去配置端口、防火墙等。

只需要上传mock server就结束了。

最后

Serverless还可以做很多类似的事情，因为它的高性能、自动伸缩、按量计费等特性，让它成为了很多解决方案中的性价比首选。

未来我会继续探索serverless的更多实用的场景与大家分享。

Serverless与微服务探索（二）- SpringBoot项目部署实践

2021-11-24T21:23:19+08:00

前言

上次的文章分享后，有粉丝反应内容太理论太抽象，看不到实际的样子。

因此，我这里就写一篇教程，手把手教你如何把一个SpringBoot项目部署到Serverless并测试成功。

下面的链接是我发表到官方的文章，但官方的文章会综合考虑，所以不会有那么细的步骤。本文是最详细的步骤。

SpringBoot + SCF 最佳实践：实现待办应用

本文章以腾讯云Serverless云函数为例，将分为事件函数和Web函数两种教程。

事件函数就是指函数是由事件触发的。

Web函数就是指函数可以直接发送HTTP请求触发函数。具体区别可以看这里。

两者在Spring项目迁移改造上的区别在于：

事件函数需要增加一个入口类。
Web函数需要修改端口为固定的9000。
事件函数需要操作更多的控制台配置。
Web函数需要增加一个scf_bootstrap启动文件，和不一样的打包方式。

事件函数

Spring项目准备

事件函数示例代码下载地址：https://github.com/woodyyan/scf-springboot-java8/tree/eventfunction

示例代码介绍

@SpringBootApplication 类保持原状不变。

package com.tencent.scfspringbootjava8;

import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;

@SpringBootApplication
public class ScfSpringbootJava8Application {

    public static void main(String[] args) {
        SpringApplication.run(ScfSpringbootJava8Application.class, args);
    }
}

Controller类也会按照原来的写法，保持不变。这里以todo应用为例子。

记住此处的/todos 路径，后面会用到。

代码如下：

package com.tencent.scfspringbootjava8.controller;

import com.tencent.scfspringbootjava8.model.TodoItem;
import com.tencent.scfspringbootjava8.repository.TodoRepository;
import org.springframework.web.bind.annotation.*;

import java.util.Collection;

@RestController
@RequestMapping("/todos")
public class TodoController {
    private final TodoRepository todoRepository;

    public TodoController() {
        todoRepository = new TodoRepository();
    }

    @GetMapping
    public Collection<TodoItem> getAllTodos() {
        return todoRepository.getAll();
    }

    @GetMapping("/{key}")
    public TodoItem getByKey(@PathVariable("key") String key) {
        return todoRepository.find(key);
    }

    @PostMapping
    public TodoItem create(@RequestBody TodoItem item) {
        todoRepository.add(item);
        return item;
    }

    @PutMapping("/{key}")
    public TodoItem update(@PathVariable("key") String key, @RequestBody TodoItem item) {
        if (item == null || !item.getKey().equals(key)) {
            return null;
        }

        todoRepository.update(key, item);
        return item;
    }

    @DeleteMapping("/{key}")
    public void delete(@PathVariable("key") String key) {
        todoRepository.remove(key);
    }
}

增加一个ScfHandler类，项目结构如下：

Scfhandle类主要用于接收事件触发，并转发消息给Spring application，然后接收到Spring application的返回后把结果返回给调用方。

默认端口号为8080.

其代码内容如下：

package com.tencent.scfspringbootjava8;

import com.alibaba.fastjson.JSONObject;
import com.qcloud.services.scf.runtime.events.APIGatewayProxyRequestEvent;
import com.qcloud.services.scf.runtime.events.APIGatewayProxyResponseEvent;
import org.springframework.http.HttpEntity;
import org.springframework.http.HttpHeaders;
import org.springframework.http.HttpMethod;
import org.springframework.http.ResponseEntity;
import org.springframework.web.client.RestTemplate;

import java.util.HashMap;
import java.util.Map;

public class ScfHandler {
    private static volatile boolean cold_launch;

    // initialize phase, initialize cold_launch
    static {
        cold_launch = true;
    }

    // function entry, use ApiGatewayEvent to get request
    // send to localhost:8080/hello as defined in helloSpringBoot.java
    public String mainHandler(APIGatewayProxyRequestEvent req) {
        System.out.println("start main handler");
        if (cold_launch) {
            System.out.println("start spring");
            ScfSpringbootJava8Application.main(new String[]{""});
            System.out.println("stop spring");
            cold_launch = false;
        }
        // 从api geteway event -> spring request -> spring boot port

        // System.out.println("request: " + req);
        // path to request
        String path = req.getPath();
        System.out.println("request path: " + path);

        String method = req.getHttpMethod();
        System.out.println("request method: " + method);

        String body = req.getBody();
        System.out.println("Body: " + body);

        Map<String, String> reqHeaders = req.getHeaders();
        // construct request
        HttpMethod httpMethod = HttpMethod.resolve(method);
        HttpHeaders headers = new HttpHeaders();
        headers.setAll(reqHeaders);
        RestTemplate client = new RestTemplate();
        HttpEntity<String> entity = new HttpEntity<>(body, headers);

        String url = "http://127.0.0.1:8080" + path;

        System.out.println("send request");
        ResponseEntity<String> response = client.exchange(url, httpMethod != null ? httpMethod : HttpMethod.GET, entity, String.class);
        //等待 spring 业务返回处理结构 -> api geteway response。
        APIGatewayProxyResponseEvent resp = new APIGatewayProxyResponseEvent();
        resp.setStatusCode(response.getStatusCodeValue());
        HttpHeaders responseHeaders = response.getHeaders();
        resp.setHeaders(new JSONObject(new HashMap<>(responseHeaders.toSingleValueMap())));
        resp.setBody(response.getBody());
        System.out.println("response body: " + response.getBody());
        return resp.toString();
    }
}

Gradle

这里以gradle为例，与传统开发不一样的地方主要在于，build.gradle中需要加入全量打包的plugin，来保证所有用到的依赖都打入jar包中。

添加id 'com.github.johnrengelman.shadow' version '7.0.0' 这个plugin。
添加id 'application'
添加id 'io.spring.dependency-management' version '1.0.11.RELEASE'
指定mainClass。

build.gradle具体内容如下：

plugins {
    id 'org.springframework.boot' version '2.5.5'
    id 'io.spring.dependency-management' version '1.0.11.RELEASE'
    id 'java-library'
    id 'application'
    id 'com.github.johnrengelman.shadow' version '7.0.0'
}

group = 'com.tencent'
version = '0.0.2-SNAPSHOT'
sourceCompatibility = '1.8'

repositories {
    mavenCentral()
}

dependencies {
    api 'org.springframework.boot:spring-boot-starter-web'
    api group: 'com.tencentcloudapi', name: 'tencentcloud-sdk-java', version: '3.1.356'
    api group: 'com.tencentcloudapi', name: 'scf-java-events', version: '0.0.4'
    testImplementation 'org.springframework.boot:spring-boot-starter-test'
}

test {
    useJUnitPlatform()
}

application {
    // Define the main class for the application.
    mainClass = 'com.tencent.scfspringbootjava8.ScfSpringbootJava8Application'
}

Maven

这里以maven为例，与传统开发不一样的点主要在于，pom.xml需要加入maven-shade-plugin ，来保证所有用到的依赖都打入jar包中。同时需要指定mainClass，下面代码中的mainClass需要改为你自己的mainClass路径。

pom.xml具体内容如下：

<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0</modelVersion>
    <parent>
        <groupId>org.springframework.boot</groupId>
        <artifactId>spring-boot-starter-parent</artifactId>
        <version>2.5.5</version>
        <relativePath/> <!-- lookup parent from repository -->
    </parent>
    <groupId>com.example</groupId>
    <artifactId>demo</artifactId>
    <version>1.0</version>
    <name>demo</name>
    <description>Demo project for Spring Boot</description>
    <properties>
        <java.version>1.8</java.version>
    </properties>
    <dependencies>
        <dependency>
            <groupId>org.springframework.boot</groupId>
            <artifactId>spring-boot-starter-web</artifactId>
        </dependency>

        <dependency>
            <groupId>org.springframework.boot</groupId>
            <artifactId>spring-boot-starter-test</artifactId>
            <scope>test</scope>
        </dependency>
    </dependencies>

    <build>
        <plugins>
            <plugin>
      <!-- Build an executable JAR -->
      <groupId>org.apache.maven.plugins</groupId>
      <artifactId>maven-jar-plugin</artifactId>
      <version>3.1.0</version>
      <configuration>
        <archive>
          <manifest>
            <addClasspath>true</addClasspath>
            <classpathPrefix>lib/</classpathPrefix>
            <mainClass>com.mypackage.MyClass</mainClass>
          </manifest>
        </archive>
      </configuration>
    </plugin>
            <plugin>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-shade-plugin</artifactId>
                <dependencies>
                    <dependency>
                        <groupId>org.springframework.boot</groupId>
                        <artifactId>spring-boot-maven-plugin</artifactId>
                        <version>2.1.1.RELEASE</version>
                    </dependency>
                </dependencies>
                <configuration>
                    <keepDependenciesWithProvidedScope>true</keepDependenciesWithProvidedScope>
                    <createDependencyReducedPom>true</createDependencyReducedPom>
                    <filters>
                        <filter>
                            <artifact>*:*</artifact>
                            <excludes>
                                <exclude>META-INF/*.SF</exclude>
                                <exclude>META-INF/*.DSA</exclude>
                                <exclude>META-INF/*.RSA</exclude>
                            </excludes>
                        </filter>
                    </filters>
                </configuration>
                <executions>
                    <execution>
                        <phase>package</phase>
                        <goals>
                            <goal>shade</goal>
                        </goals>
                        <configuration>
                            <transformers>
                                <transformer
                                        implementation="org.apache.maven.plugins.shade.resource.AppendingTransformer">
                                    <resource>META-INF/spring.handlers</resource>
                                </transformer>
                                <transformer
                                        implementation="org.springframework.boot.maven.PropertiesMergingResourceTransformer">
                                    <resource>META-INF/spring.factories</resource>
                                </transformer>
                                <transformer
                                        implementation="org.apache.maven.plugins.shade.resource.AppendingTransformer">
                                    <resource>META-INF/spring.schemas</resource>
                                </transformer>
                                <transformer
                                        implementation="org.apache.maven.plugins.shade.resource.ServicesResourceTransformer" />
                            </transformers>
                        </configuration>
                    </execution>
                </executions>
            </plugin>
            <plugin>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-compiler-plugin</artifactId>
                <configuration>
                    <source>8</source>
                    <target>8</target>
                </configuration>
            </plugin>
        </plugins>
    </build>
</project>

编译JAR包

下载代码之后，到该项目的根目录，运行编译命令：

Gradle项目运行：gradle build
Maven项目运行：mvn package

编译完成后就能在当前项目的输出目录找到打包好的jar包。

Gradle项目：在build/libs目录下看到打包好的jar包，这里需要选择后缀是-all的JAR包。如下图。
Maven项目：在target目录下能看到打包好的jar包，这里需要选择前缀不带orginal-的jar包。

一会部署函数的时候就用这个JAR包。

云函数准备

云函数创建

在函数服务中，点击新建，开始创建函数。

如下图

选择自定义创建
选择事件函数
输入一个函数名称
运行环境选择Java8
提交方法选择本地上传zip包
执行方法指定为包名.类名::入口函数名
1. 比如此处是：com.tencent.scfspringbootjava8.ScfHandler::mainHandler
上传那里选择前面编译好的带-all后缀的jar包。

然后点击完成创建函数。

云函数配置

创建完成之后，选择函数管理-函数配置-编辑。如下图。

点开编辑之后，在环境配置中：

把内存修改为1024MB
把执行超时时间修改为15秒

触发器配置

在触发管理中，创建触发器。

创建触发器时，在下图中：

触发方式选择API网关触发。
集成响应勾选。
然后提交

创建完成之后需要修改一些API网关参数。点击API服务名进入修改。

点击右侧的编辑按钮修改。

第一个前端配置中，将路径修改为Spring项目中的默认路径。如下图。

然后点击立即完成。

然后点击发布服务。

发布完成之后回到云函数控制台。

开始测试

此处我们就以Controller里面写的第一个GET方法为例，如下图，我们将获得所有的todo items。

在函数管理中，选择函数代码，就可以很方便的进行测试。如下图。

测试事件选择“API Gateway事件模版”。
请求方式选择GET
Path填/todos
最后就可以点击测试按钮。

测试结果和日志将直接显示在界面的右下方。如下图。

如果想要获取完整的访问URL，可以在触发管理中，找到刚才创建的API网关触发器，下面有可以访问的URL。URL后面有复制按钮。如下图。

Web函数

Spring项目准备

示例代码介绍

Web函数示例代码下载地址：https://github.com/woodyyan/scf-springboot-java8/tree/webfunction

Web函数的项目代码相比事件函数更简单。代码改造成本几乎没有。对原代码的修改只有一个端口号。

Web函数则不需要ScfHandler入口类，项目结构如下：

因为web函数必须保证项目监听端口为9000，所以需要将Spring监听的端口改为9000。如下图：

代码部署包准备

代码包编译方式参考上面的“编译JAR包”。

然后新建一个scf_bootstrap启动文件，文件名字必须是scf_bootstrap，没有后缀名。

第一行需有 #!/bin/bash。
java启动命令必须是绝对路径，java的绝对路径是：/var/lang/java8/bin/java
请确保你的 scf_bootstrap 文件具备777或755权限，否则会因为权限不足而无法执行。

因此启动文件内容如下：

#!/bin/bash
/var/lang/java8/bin/java -Dserver.port=9000 -jar scf-springboot-java8-0.0.2-SNAPSHOT-all.jar

接着，在scf_bootstrap文件所在目录执行下列命令来保证scf_bootstrap文件可执行。

chmod 755 scf_bootstrap

然后将scf_bootstrap文件和刚才编译处理的scf-springboot-java8-0.0.2-SNAPSHOT-all.jar文件，一起打包成zip文件。如下图。

打包好的zip文件就是我们的部署包。

云函数创建

在函数服务中，点击新建，开始创建函数。

如下图

选择自定义创建
选择Web函数
输入一个函数名称
运行环境选择Java8
提交方法选择本地上传zip包
上传那里选择前面压缩好的scf_spring_boot.zip包。

然后在下面的高级配置中，写上启动命令，命令中的jar文件应该是你编译出来的jar文件的名字。

因为web函数必须保证项目监听端口为9000，所以命令中要指定一下端口。

更多关于启动命令的写法可以参考启动文件说明。

如下图：

然后环境配置那里，把内存改为512MB。执行超时时间设置为15秒。

其他设置都使用默认的就可以了。然后点击完成。

点击完成之后如果没有反应，是因为要先等待ZIP文件上传，才会开始创建函数。

因为Web函数默认会创建API网关触发器，因此我们不需要单独配置触发器。

开始测试

此处我们就以Controller里面写的第一个GET方法为例，如下图，我们将获得所有的todo items。

在函数控制台的函数代码里面，我们可以直接测试我们的云函数。

依据上面的代码，我们请求方式选择GET，path填写/todos，然后点击测试按钮，然后就可以在右下角看到我们的结果了。

如果想在其他地方测试，可以复制下图中的访问路径进行测试。

最后

本教程没有涉及镜像函数，因为镜像部署和原来的部署方式没有差异。项目代码也不需要改造。理论上这是最适合微服务项目的方式。

下一篇文章中，我就会详细分析Serverless中下面几个话题了。

Serverless中的服务间调用
Serverless中的数据库访问
Serverless中的服务的注册与发现
Serverless中的服务熔断与降级
Serverless中的服务拆分

Serverless与微服务探索（一）- 如何用serverless实践Spring boot项目

2021-11-11T21:28:15+08:00

前言

随着技术的发展，我们有越来越多的选择来实现我们的业务逻辑。Serverless作为时下前沿的技术，是不是也可以探索一下微服务架构的新可能性？

这篇文章就是总结近段时间以来，我探索的用serverless落地SpringBoot微服务项目的一些成果。

什么是Serverless

什么是微服务和什么是springBoot已经不需要我讲解了。

那什么是Serverless呢？

根据CNCF的定义，Serverless（无服务器）是指构建和运行不需要服务器管理的应用程序的概念。

Serverless并不是没有服务器就能进行计算，而是指对于开发者或者公司来说，无需了解和管理底层服务器，就能进行计算。

通俗一点讲，Serverless就是封装了底层计算资源，你只需要提供函数，就可以运行了。

这里还要提到一个概念，就是FaaS（Function as a Service），函数即服务。我们通常运行在Serverless上的逻辑是函数级别的粒度。

因此对于拆分粒度控制很合理的微服务，是非常适合使用serverless的。

Serverless对于微服务的价值

每个微服务API被调用的频率不一样，可以利用Serverless精准管理成本和弹性。
不用担心一个API调用量大而需要扩容整个服务。Serverless可以自动扩缩容。
不需要去运维每个服务背后部署多少个容器，多少个服务器，不用做负载均衡。
屏蔽了K8S等容器编排的复杂学习成本。
Serverless这种无状态的特性也非常符合微服务使用Restful API的特性。

初步实践

首先，需要准备一个SpringBoot项目，可以通过start.spring.io快速创建一个。

在业务开发上，Serverless和传统的微服务开发并没有任何不同。所以我就快速写了一个todo后端服务，完成了增删改查功能。

示例代码在这里。

那么使用Serverless真正有差异的地方在哪里呢？

如果只是简单的想要部署单个服务，那么主要差异在于两个方面：

部署方式
启动方式

部署方式

由于我们摸不到服务器了，所以部署方式的变化是很大的。

传统的微服务部署，通常是直接部署到虚拟机上运行，或者用K8S做容器化的调度。

传统的部署关系大致如下图。

如果使用serverless通常要求我们的微服务拆分粒度更细，才能做到FaaS。
所以使用Serverless部署微服务的关系大致如下图。

Serverless只需要提供代码就可以了，因为serverless自带运行环境，因此serverless部署微服务通常有两种方式：

代码包上传部署
镜像部署

第一种方式和传统部署相比是差异最大的。它需要我们将写好的代码打包上传。并且需要指定一个入口函数或者指定监听端口。

第二个种方式和传统的方式相比几乎不变，都是把做好的镜像上传到我们的镜像仓库。然后在serverless平台部署的时候选择对应的镜像。

启动方式

因为serverless是使用的时候才会创建对应的实例，不使用的时候就会销毁实例，体现了serverless按量计费的特点。

所以serverless在第一次调用的时候存在一个冷启动的过程。所谓冷启动就是指需要平台分配计算资源、加载并启动代码。因此依据不同的运行环境和代码可能有不同的冷启动时间。

而Java作为一种静态语言，它的启动速度也一直被人诟病。然而还有更慢的，就是spring的启动时间，是大家有目共睹的慢。所以，java+spring这种强强联合造就了树懒般的启动速度。就有可能造成首次调用服务出现超长的等待时间。

不过，不用担心，spring已经提供了两种解决方案来缩短启动时间。

一种是SpringFu
另一种是Spring Native。

SpringFu

Spring Fu 是 JaFu (Java DSL) 和 KoFu (Kotlin DSL) 的孵化器，以声明式方式使用代码显式配置 Spring Boot，由于自动完成，具有很高的可发现性。它提供快速启动（比最小 Spring MVC 应用程序上的常规自动配置快 40%）、低内存消耗，并且由于其（几乎）无反射方法非常适合 GraalVM 本机。如果搭配上GraalVM编译器，应用启动速度就能直线下降到原先的大约1%。

不过，目前SpringFu还处于特别早期的阶段，使用过程中问题也比较多。另外，使用SpringFu会有较大的代码改造成本，因为它干掉了所有的annotation，所以这次我没有使用SpringFu的方式。

Spring Native

Spring Native 为使用 GraalVM native-image编译器将 Spring 应用程序编译为native可执行文件，以提供打包在轻量级容器中的native部署选项。 Spring Native的目标是在这个新平台上支持几乎没有代码改造成本的 Spring Boot 应用程序。

因此我选择了Spring native，因为它不需要改造代码，只需要添加一些插件与依赖就能实现native image。

Native image有几大好处：

在构建时会移除未使用的代码
classpath 在构建时就已经确定
没有类延迟加载：可执行文件中所有的内容都会在启动时加载到内存中
在构建时就运行了一些代码

基于这些特性，因此它能让程序的启动时间大大加快。

关于如何使用它我将在下一篇文章中讲解，详细教程可以查看这个官方教程。我也是参考这个教程做的。

我就说说我的测试对比结果吧。

我把编译好的image分别在本地，腾讯云serverless的云函数和AWS serverless lambda进行了部署和测试。

规格	SpringBoot冷启动时长	SpringNative冷启动时长
本地16G内存Mac	1秒	79毫秒
腾讯云Serverless 256M内存	13秒	300毫秒
AWS Serverless 256M内存	21秒	1秒

从测试结果看，SpringNative大大提升了启动速度。提高serverless的规格还能进一步提升速度。

如果Serverless的冷启动速度控制到了1秒内，那么大部分业务都是能接受的。并且也只有首次请求的时候会存在冷启动的情况，其他请求都和普通的微服务响应时间一样。

此外，目前各大平台的Serverless都支持预置实例，也就是在访问到来之前提前创建实例来减少冷启动时间。带来业务上更高的相应时间。

总结

Serverless作为目前先进的技术，它给我们带来诸多好处。

自动扩缩容的弹性和并发性
细粒度的资源分配
松耦合
免运维

但serverless也不是完美的，当我们尝试在微服务领域使用它的时候，我们依然能看到它存在很多问题等待解决。

难以监视和调试
1. 这是目前公认的一个痛
可能会有更多的冷启动
1. 当我们拆分微服务为了适应函数粒度时，同时也分散了每个函数的调用时间，导致每个函数调用频率变低，带来更多的冷启动。
函数间的交互会更复杂
1. 由于函数粒度变细，在大型微服务项目中，导致原本就错综复杂的微服务会变得更加错综复杂。

总结起来就是，想要完全替代传统的虚拟机，在微服务这条路上Serverless还有很长的路要走。

下一步

我会继续探索serverless与微服务的实践。

后面的文章我会探讨下面几个话题

Serverless中的服务间调用
Serverless中的数据库访问
Serverless中的服务的注册与发现
Serverless中的服务熔断与降级
Serverless中的服务拆分

云基础设施架构设计

2021-06-07T17:31:45+08:00

前言

随着AWS、阿里云、Azure、腾讯云等公有云的蓬勃发展，越来越多的企业开始在考虑上公有云了。

这些年做架构咨询，发现很多传统的公司在基础设施上云的过程中没有明确的设计思路，只是一通购买云产品，然后使用云产品，认为这样就是上云了。

但其实，如果没有一个很好的云基础设施架构设计，会使后续的云使用变得难以维护，达不到预期的效果，同时成本上升。

这篇文章里面，我会分享一些过去项目中的公有云设计经验和思路，给大家提供一些基于微服务的场景下如何设计云基础设施架构的参考。其中这里的云指的是如阿里云，AWS的公有云。

为什么要上云

上云的好处或价值在各大文章里面已经说得非常多了。

我这里也敷衍的列举几个好处和价值吧。

降低成本
可伸缩性
专业的运维
速度

降低成本

降低成本主要是降低了两类成本。

公司不再需要招聘专业的运维人员专门负责维护服务器。（大型公司除外，他们通常需要大量的运维人员统一维护全公司的云资产或者自建云计算）
公司不再需要专门的人员来针对运维开发各类工具，如今常见的云计算平台都包含丰富的功能，以及完善的售后体系。

可伸缩性

可伸缩是传统的服务器无法做到的，这也正是如今云计算越来越火的一个很大的原因。

我们可以根据业务的需要，扩展服务性能，不仅如此，而且还能做到缩小服务性能，以节约成本。

专业的运维

上云之后不是没有运维了，而是把运维交给了云计算平台的专业的人来做了。而你只需要关心如何基于这些云基础设施构建自己的产品了。

速度

速度是指各方面的速度都提升了。比如，你只需要花1分钟时间就能创建一台新的服务器，你只需要花1分钟时间就能扩容某个服务。由于减少了各种搭建配置时间，开发时间因此也缩短了。缩短了试验和测试的时间，更快的为客户提供可用性。

云基础设施架构成熟度评估

那么上云之后我们如何知道我们的云基础设施架构是足够优秀的呢？

这里就需要有一套云基础设施架构成熟度评估模型。

我根据这些年的架构咨询工作，结合多个项目总结了这套云基础设施架构成熟度评估模型。它

主要分为8个模型，5种等级。

这8个模型是：

可伸缩性 - 云基础设施能根据业务需要自由伸缩
可复制性 - 云基础设施能根据业务需要快速复制
可恢复性 - 云基础设施挂了之后能自动或快速恢复
可用性 - 云基础设施的设计能保证服务的高可用性
安全性 - 云基础设施的设计能有非常高的安全设计
可量化管理 - 云基础设施应该可以被量化管理以优化成本
可维护性 - 云基础设施应该具有更简单的可维护性
可组合性 - **云基础设施会根据业务需要组合使用

这5种等级是：

原始级 - 完全没有使用云基础设施
基础级 - 尝试了一些基本的云基础设施
标准级 - 所有基础设施都上云了
成熟级 - 所有基础设施都上云了并且掌握云基础设施架构的最佳实践
领先级 - 自建云计算

结合上面的模型，我们就可以得出如下的打分。

基于这个打分，我们就可以得到如下的评估图。

那么接下来，我们就把这个架构设计展开来说一说。

主要说一说VPC设计，访问控制设计，安全设计和数据库设计。

VPC设计

VPC全称是Virtual Private Cloud，是云上的一个逻辑隔离的专有网络。

使用VPC主要是为了安全隔离，把不同的环境隔离开来。一是避免环境污染，二是保障安全性。

因此，如下图所示，通常一个企业都会设计以下几个VPC环境：

产品环境
测试环境
开发环境
UAT环境

产品环境是我们线上所有产品运行的VPC环境，只有用户能接触到的一个环境。

测试环境是做测试用的，也是大部分公司开发人员和测试人员能接触的一个环境。

开发环境则是日常工作所在的环境，它通常与办公网络是连通的，这里会放我们的git，pipeline，镜像仓库，制品库等。

UAT环境通常是给客户做验证的，比如上线前，需要让客户去验证是否符合预期了，这个环境之所以不能使用测试环境是因为通常客户需要导入一些真实数据做测试，需要保证UAT环境的干净。

另外，如果有多地域部署系统的要求，就需要使用多个VPC，因为VPC是地域级别的资源，是不能跨地域的。

访问控制设计

我们的云资源不是对所有人开放的，特别是对于产品环境的访问控制应该尤其严格，一是防止内部人的误操作，二是防止黑客的入侵。

但我在很多客户那里都遇到过，他们没有任何访问控制设计，所有的开发人员都共用一个或几个账号。这是非常危险的使用方式，不利于管理，也有诸多风险。

现在的公用云都有访问控制功能，通常叫做Resource Access Management。

访问控制的设计主要从下面几个纬度去考虑：

用户管理
1. 用户分为：真实用户，虚拟用户
2. 真实用户就是那些真实的人。比如员工和用户。
3. 虚拟用户就是分配给某个系统使用的账号。比如某系统需要有上传图片的权限。
读写分离
1. 通常有的人只应该拥有只读权限。
2. 有的提供给系统的账号应该只有只读权限。
3. 比如，访问对象存储的用户头像的账号应该只有只读权限。
4. 管理员或者上传图片功能需要拥有写入权限。
角色管理
1. 不同的人可能都是同一个角色。
2. 同一个人可能拥有不同角色。
3. 角色决定了我们拥有哪些权限集。

其次就是，云基础设施的访问控制是否需要和企业内部自己的单点登录集成，也是需要考虑的设计。

总结一下就是，访问控制应该遵循最小权限原则，才能最大限度的保证系统的安全性。

安全设计

大部分人的误区是，我已经用云了，再买个防火墙什么的，就很安全了。

但其实公有云是完全暴露在互联网的，因此也需要有完善的安全设计才能保证云基础设施的安全。

把该隐藏的隐藏进私网里面，只暴露最少的信息。

基础设施的安全设计主要包含几个方面：

网络安全
1. 包括传输安全，比如数据如何加密传输
2. 网络是否暴露
3. 网络设计是否合理
数据安全
1. 数据是否被足够的保护了
2. 数据是否暴露在了外面
权限安全
1. 是否按照最小权限设计的
2. 是否读写权限分离了

总结一下，设计的时候需要遵循两个原则：

零信任网络
最小权限设计

数据库设计

这里主要是涉及到数据库的高可用和高性能的设计问题。

高可用方案通常有3种方向：

主备架构
1. 通常会有多节点，不同点节点会在不同的可用区
容灾
1. 容灾主要分为异地容灾和同城容灾
备份恢复
1. 数据库挂了之后如何快速自动恢复
2. 恢复期间的数据丢失如何找回

这里的高性能设计不包括分库分表相关的设计，因为这只是关于基础设施的架构设计。

通常需要考虑：

如何弹性扩容？
是否需要读写分离？
1. 读写分离后数据的一致性如何保证？
2. 根据业务场景，需要多少读实例？
缓存如何设计

下面是一个大概的高性能高可用数据库架构的样子，供大家参考。

云基础设施架构设计

总结起来，一个常见的基于微服务的云基础架构设计，大概就长下图的样子。

我们在设计的时候可能需要考虑的远不止下图的中的东西。

比如我们得考虑：

多个VPC如何通信
集群如何编排
数据库选型
日志收集用什么工具才能易于收集易于搜索
运维监控用什么工具才能全面监控并能智能警报
MQ是否要满足一些特殊场景
第三方服务是否有特殊要求
在这个架构下我们是否能动态横向和纵向扩容

总结

希望上面的一些分享能帮助大家在设计云基础设施架构的时候提供一些参考。

这样的架构设计涉及的东西是非常广的，不同的项目也会有不同的设计要求。

因此，最终设计一定要结合项目的实际情况，满足了业务需要就是好的设计。

记住一句话，没有正确的设计，只有刚好适用的设计。

从价值流图分析研发效能

2021-05-10T18:51:01+08:00

什么是价值流

传统的工作中，不同的职位都只关注自己所交付的东西，比如产品经理关注产品需求文档的交付，开发人员关注软件代码的交付，运维人员关注于软件产品的部署。

随着DevOps与敏捷的发展，它们越来越强调交付整体价值，而不是单一角色的交付内容。

因此，价值流的意义就体现出来了。价值流是DevOps的关键概念之一。

根据《DevOps精要》的解释，我们可以从创造价值以响应客户请求的角度，来考虑一下组织中的工作。完成请求所需要实施的相关行动，可以按顺序排列起来，这称为价值流。

什么是价值流图

价值流图就是可视化的价值流。

它大概长下图的样子。

如何画价值流图

画价值流图其实很简单：

识别处理请求的关键步骤
分析每个关键步骤所需的3个度量数据
1. 前置时间 Lead Time，LT
2. 处理时间 Process Time，PT
3. 完成度与准确度百分比 the Percent Complete and Accurate，%C/A
将这些步骤组织为一个创造预期结果的活动序列

价值流图画好之后，最有价值的信息是流中每个步骤的3个度量数据，即前置时间(Lead Time，LT)、处理时间(Process Time，PT)及完整度与准确度百分比(the Percent Complete and Accurate，%C/A)。

前置时间

前置时间是供应链管理中的一个术语，是指从采购方开始下单订购到供应商交货所间隔的时间。

对应到我们的软件开发中，则是指一个任务从创建到完成的时间。如下图所示。

处理时间

处理时间指的是一个任务从开始做到完成所需的时间。如下图所示。

完整度与准确度百分比

一个任务完成了并不意味着它是准确的。

举个很简单的例子，我们读书的时候写作业，通常我们都能100%的完成，但并不能百分百的正确。

这在软件开发中也非常常见，一个需求被实现了，但和最初的描述并不一致。

通过完整度与准确度百分比（%C/A）可以分析项目的返工成本。

画价值流图的几个tips：

不要过度细化关键步骤，大致可以按照看板上的列一一对应或略多。
建议关键步骤数量不要超过15个。
可以按有堆积或由于等待而产生延迟来画每个步骤。
实践中，计算这些度量的数值是一个很大的挑战。一个比较可行的做法是取每个迭代的几张卡，然后计算PT的平均值。LT则看它从上一步挪动到下一步等待了多少天，也取平均值。%C/A则难免会拍脑袋得到数值了，这不可避免。
某个关键会议也可以是步骤之一。比如发布评审会。

真实案例分析价值流图

下面这张价值流图是来自我一个真实的案例。

对于这个图上的数据我做一些说明：

不是所有公司的关键流程都长这个样子，公司不同项目不同，画出来的价值流图也不同。
设计审核的%C/A只有60%是因为当时这个项目的产品经理设计原型的时候并没有频繁和业务方沟通，导致设计的东西审核的时候经常返工。
软件开发的LT有12天那么长，是因为很多卡ready了之后，需要等待排进某个迭代，所以平均等待时间有7天。这个数字在很多公司可能更长。
发布申请和发布上线的LT都挺长的，也是因为申请需要等待领导审批，发布需要排期，所以等待时间也挺长。
用户验收测试的%C/A只有70%也是因为开发过程中并没有频繁获取业务方的反馈，导致用户验收测试的时候发现挺多不符合预期的情况。

从上图的数据中我们可以得到一些关键信息：

PT/LT只有39.2%，这就意味着中间有大量的等待时间。那我们就可以具体分析每个等待时间该如何优化。
构建上传和测试环境部署这两步都没有等待时间，而且%C/A是100%。因为这个案例中它们都是利用pipeline自动化完成的。因此，自动化能帮助我们提高%C/A和缩短等待时间。
平均%C/A是88.75%，意味着有可以改进的空间。那我们就可以具体分析每个步骤中为什么达不到100%。
经过分析我们发现，很多%C/A不能达到100%的原因是，这个步骤的人并没有频繁的向前一个步骤获取反馈来验证自己是否做对了。
经过分析我们发现，很多LT长是因为，有一些审批流程必须等到领导审核才能往下继续走，而领导往往不能及时审批。还有一些LT长是因为没有可视化看板，不同步骤的人并不知道工作已经ready了。

在上面的例子中，花费在创造预期成果上的工作时间比例，仅占总开销时间的39.2%。这样的情况在常规IT部门中，类似的占比数字相当普遍，甚至更低。

上面的例子根据价值流图最终分析产出的报告有很多，这里就不详细展开说了。每个数字都可以研究背后的原因和找到改进的方案。

有了价值流图之后，通常我们可以提出来这3个问题：

为什么这些工作步骤的%C/A值低于100%？我们如何才能够完全杜绝错误从一个步骤
被传递到下一个步骤（并因返工而浪费时间和资源）？
除了开发产品的时间，具体有什么因素导致了lead time？我们如何能够大幅降低队列和等
待所损耗的时间？

3、我们如何改变工作实践，来降低每个步骤的处理的时长？

值流图的好处

价值流图的好处在于让参与的团队成员对整个流程有可视化的数据化的认知。并清晰的知道该从哪个步骤入手开始改进。
其次，过程的可视化呈现，有助于聚焦到被创造的价值上，而不是被实施的动作上。员工们和经理们常常能很好地理解他/她们的日常任务（做什么），而忽视了预期成果（为什么）。
再次，价值流图有助于识别和消除瓶颈，并避免局部优化的陷阱：即把时间和精力花费在根本没有效果甚至带来负面效果的约束消除上。
最后，对价值流的了解，有助于实现DevOps的关键思想：构建一个顺畅、一致经各个步骤的价值流，使得我们能够持续地、有节奏地、没有非必要的延迟、并以最优的资源使用方式来交付成果。

基于Eliyahu Goldratt提出的约束理论，任何系统中，在任何一个时间点上，有且仅有一个真正的瓶颈，这个瓶颈拖慢了工作，同时，花费在除了消除这个瓶颈点之外的任何事情上的精力，都可以说是浪费。

总结

要画出完整的价值流图，一定要去研究项目中真实的实践是什么样子的，不能凭空想象，也不要去指望某些记录的文档信息，因为他们常常没人维护。

价值流图是帮助我们更好的构建DevOps的一种方式，要想做好DevOps只凭这一点是远远不够的。

如果大家对相关话题感兴趣，可以给我留言，我们一起讨论更多可能性。

参考：《DevOps精要》

DevOps成熟度评估模型

2021-04-21T13:15:35+08:00

什么是DevOps

随着敏捷软件方法的广泛采用，以及IT基础设施即程序代码的管理方式的推广，DevOps也应运而生了。

DevOps 是通过人、流程和技术的有机整合，以协作、自动化、精益、度量和共享文化为指引，旨在建立一种可以快速交付价值并且具有持续改进能力的现代化 IT 组织。

什么是DevOps成熟度评估

随着技术的发展，越来越多的公司期望各种有用的方法论能够标准化，可量化。这样可以帮助决策者快速的知道我目前的水平，以及我未来发展的目标。

因此，随着DevOps被越来越多的推广，决策者们也期望知道自己公司或者团队的DevOps被量化之后长什么样子。于是DevOps成熟度评估模型便诞生了。

DevOps成熟度模型

在这些年的咨询生涯中，见过很多公司的成熟度模型。

这里给大家介绍几种吧。

常见DevOps成熟度模型

首先是信通院的，信通院把DevOps分成了3个模块，每个模块下面对应了一些纬度。如下：

敏捷开发管理
- 价值交付管理
- 敏捷过程管理
- 敏捷组织模式
持续交付
- 配置管理
- 构建与持续集成
- 测试管理
- 部署与发布管理
- 环境管理
- 数据管理
- 度量与反馈
技术运维
- 监控管理
- 事件与变更管理
- 运营配置管理
- 容量与成本管理
- 高可用管理
- 连续性管理
- 用户体验管理

再来看看某技术咨询公司的，他们把DevOps成熟度分为了6个纬度来进行评估。如下：

组织职能与能力
轻量级变更流程
自动化环境管理
持续部署与发布
运维监控与度量
架构解耦

同样是某咨询公司的，他们的DevOps成熟度模型的纬度又不一样了，他们把成熟度模型分为了8个纬度。如下：

发布组织与方法论
精益发布治理与过程
自动化软件发布
持续集成
持续部署
自动化运维
基础设施与云计算
平台与应用架构

另外一家科技公司，他们的DevOps成熟度模型的纬度也不一样，他们的分法如下：

持续集成
持续部署
轻量级变更流程
自动化环境管理
质量保证
运维监控与度量
可视化与可追溯

我总结的DevOps成熟度模型

凭借这些年的DevOps咨询经验，我总结了一套我认为更易用的能符合大部分公司情况的DevOps成熟度模型。结合了各家成熟度模型，做了一些调整和优化，以适用于大部分团队的DevOps成熟度评估。

他们也是8个纬度，如下：

组织与文化
敏捷开发
CI/CD
质量与安全
可视化与自动化
版本与配置管理
运维监控与预警
持续度量与改进

组织与文化

DevOps不是一个软件产品，DevOps也不是一个工程师。DevOps需要文化与组织的变化，不仅是开发与运维之间的隔阂需要消失，IT与业务之间的隔阂也需要消失。

由于DevOps和敏捷一样，离不开组织的变革和支持，同时DevOps也是一种文化，因此综合了一下，把组织能支持DevOps的程度，与现阶段文化与DevOps的匹配程度，作为了这个纬度的关键。

敏捷开发

为什么要有这个纬度可能有些人会比较疑惑。因为Oleg Skrynnik在《DevOps精要》里面提到DevOps发展的其中一个前提是敏捷开发被广泛的采用。因此，敏捷做得好不好直接影响到DevOps做得好不好。他们是相辅相成的。

CI/CD

CI/CD即代表的是持续集成和持续部署。也可以理解为我们俗称的pipeline流水线。但它指代的不仅仅是工具，更是一种方法论。相对于集成与部署，更重要的是持续两个字。CI/CD占据了我们开发过程中的大部分阶段，从代码提交那一刻开始，到代码运行在生产环境，都是由CI/CD促成的。

质量与安全

这个纬度很多公司没有把它单独提出来，但我认为它是非常重要的。很多团队随着时间的推移，通常都会累积越来越多的技术债，最终也无法偿还。在兼顾交付的同时，质量与安全一定是我们长线能看到收益的纬度。因为质量与安全问题带来的隐形成本浪费，是很多团队和公司会忽略的。

可视化与自动化

IT工作内容很多是不可见的，因此可视化成了DevOps的重要指标。可视化的好处在于可以构建拉式系统，有助于识别低效环节，并且改善对剩余工作以及当前状态的了解。

很多团队以为有了流水线就是DevOps了，却不曾想依然有很多人工的工作。DevOps就是要规避人为的风险。因为人是会犯错的，而机器则不会。因此，自动化是非常重要的DevOps成熟度的考量纬度。

版本与配置管理

全面的版本控制能帮助团队在开发过程中获得收益，这些版本控制包括但不限于测试、脚本、环境、包、类库、文档、配置等。团队成员可以无风险的删除不需要的文件。

配置管理也是同样的原理和收益，配置与环境管理使得我们所有的变更都是受控的，系统可以被快速地重置到稳定状态。如果关键成员离开，知识也不会遗失。

运维监控与预警

过去常常由一个独立的运维部门来负责所以线上运维的事情，这种情况在DevOps这里需要发生极大的变化了。运维的事情和开发合并到一起了，由一个团队共同负责了。这也是DevOps所强调的职责共担。同样对于运维的监控和预警也应该是对整个团队可见的。

持续度量与改进

DevOps已经越来越多的和效能关联上了。因此也出现了各种关于DevOps或者效能的度量。这些度量不是为了考核KPI，而是帮助团队持续改进的一种手段。DevOps提倡更频繁的直面问题，度量则是一种很好的方式帮助我们发现问题，并持续改进。

小结

可见，行业里没有一个统一的DevOps成熟度模型，各家都是按照各家的方法论在总结成熟度模型。

他们没有好坏，他们各有各的优点和侧重点。在不同的场景和不同的公司现状下，选择不同的成熟度模型能帮助我们更好的评估。

DevOps成熟度评级

关于级别的定义，行业里面普遍有两种，一种是4个级别，一种是5个级别。

我个人认为5个级别更合理，因为第1级其实就是零，并未尝试DevOps，第5级就是天花板，是以谷歌、微软、亚马逊等公司的领先DevOps团队为代表的天花板。

因此综合一下，级别定义如下：

Regressive初始级 - 几乎没有尝试任何DevOps实践
Basic基础级 - 做了一些DevOps实践，正在起步阶段
Standard成熟级 - 能成熟运用各种DevOps实践
Optimized优化级 - 不仅能运用各种DevOps实践，还能根据团队和组织情况进行优化改进
Leading领先级 - 是行业里面DevOps的先行者、创新者、探索者、领导者

因此，把这个成熟度模型做成一张可量化的表则如下。

简单解释一下，表里的描述并不是全部，只是包含了一些关键例子和方向，使用者可以根据自己的情况调整和添加。其次，由于这些纬度很难量化，因此如果只是符合了部分描述，我们也认为没有做到。这样的好处是，我们不会因为评估为成熟级，而放弃了那些成熟级本应该达到而没有达到的描述。

举个例子，质量与安全纬度中，基础级做到了部分，同时成熟级也做到了部分，我们也认定只达到了基础级。

通过对这张表的打分，最后我们就能得到了一个例如下图的成熟度评估图：

总结

DevOps成熟度评估模型并不是指导你把DevOps做得更好的方法论，它只是用来评估目前的现状，以指导你未来还有哪些改进空间。

我做咨询的经历中，发现很多公司会把DevOps当作项目来做，要求团队在N个月内完成DevOps搭建和应用。但DevOps不应该以项目的方式进行，因为项目的方式意味着公司期望在有限的时间以及预算内获得特定的结果，然而DevOps其实是一场没有终点的马拉松比赛。

如果想知道如何把DevOps做得更好，推荐去看《DevOps精要》那本书，里面讲了DevOps的一些原则和关键实践，掌握这些东西才能持续把DevOps做好。

下回我也整理一篇文章来讲讲如何把DevOps持续做好。

DevOps效能度量

2021-04-14T12:28:49+08:00

前言

之前做了几个公司的DevOps转型，发现不少公司都比较热衷于如何去度量DevOps效能。（一部分原因是领导们想要以此来考核KPI。）

度量的方式有很多种，这里我就基于这些年DevOps咨询的经验，总结了一些可度量的指标供大家参考。

大家可以根据不同的指标组合使用，并结合自己的团队的实际情况，量力而为。有些指标需要自己根据情况开发脚本来统计。

其实目前市面上的大部分DevOps工具或平台，都或多或少的有了各种数据报表，来帮助团队进行DevOps效能度量。

DevOps效能度量指标

效能等级

效能主要分为4个等级：

精英效能
高效能
中等效能
低效能

精英效能则是以谷歌、微软、亚马逊等公司的DevOps先驱团队为代表的团队水准。

高效能是目前大部分DevOps实践做得好的团队水准。

中等效能则代表了大部分正在DevOps探索路上的团队水准。

低效能则是大部分还没开始使用DevOps的团队水准。

度量指标

度量指标有两种：

结果指标
过程指标

过程指标远多于结果指标，过程指标是帮助我们改进敏捷开发过程的。结果指标多用于做总结汇报。

阶段分类

DevOps是一个非常长的价值流，那么在这个过程中，我们大致可以把它分为三个阶段来进行效能度量。

敏捷开发管理
持续交付
技术运维

第一个阶段主要对应了需求和管理部分。

第二个阶段主要针对整个研发过程。

第三个阶段主要针对上线后的运维部分。

所有的指标又可以分为交付效率和质量两类。

因此把所有指标汇总一下，就如下图一样。

DevOps效能度量指标详解

用户故事交付周期

- 用户故事从创建到上线所需要的时间。
- 计算方式：一个用户故事从创建到上线所需的时间。

用户故事吞吐量

- 一个迭代内能完成的用户故事总数。
- 计算方式：迭代内完成用户故事数。

用户故事完成率

- 一个迭代内，规划的用户故事数与完成的用户故事数的比率。
- 计算方式：实际完成用户故事数/规划用户故事数。

团队速率

- 一个团队每个迭代能完成故事点的数量。
- 计算方式：每个迭代完成故事点数。

故事点完成率

- 一个迭代内，规划的故事点数与完成的故事点数的比率。
- 计算方式：实际完成的故事点/规划的故事点。

需求停留时长

- 一个需求从创建到分析完成后，并进入开发所花的时间。
- 计算方式：需求挪动到“开发中”列的时间点 - 需求出现在“待处理”列的时间点。
- 备注：需求分析完成后在等待开发中也算等待成本。这里其实度量的就是Processing time。

研发停留时长

- 一个需求开发完成所需要的时间。
- 计算方式：需求挪动到“测试中”列的时间点 - 需求出现在“开发中”列的时间点。
- 备注：需求开发完成后在等待测试中也算等待成本。这里其实度量的就是Processing time。

测试停留时长

- 一个需求测试完成所需要的时间。
- 计算方式：需求挪动到“待上线”列的时间点 - 需求出现在“测试中”列的时间点。
- 备注：这里其实度量的就是Processing time。

部署频率

- 代码部署到服务器的频率，不论是测试环境还是生产环境。
- 计算方式：间隔多久部署一次

发布频率

- 代码发布到生产环境的频率。
- 计算方式：间隔多久发布一次。
- 备注：发布频率是小于部署频率的。

发布时长

- 一次发布上线的过程所需的时间。
- 计算方式：一次发布所需时间。

发布失败率

- 发布上线有可能失败，此指标用于统计失败率。
- 计算方式：发布失败次数/发布总次数

变更前置时间

- 从代码提交到代码正式运行在生产环境所需要的时间。
- 计算方式：代码上线时间 - 代码提交时间。

构建频率

- 构建发生的频率
- 计算方式：一个迭代内构建的次数

构建失败率

- CI在构建的时候会因为各种原因失败
- 计算方式：迭代内构建失败次数/迭代内构建总次数

CI修复时长

- CI红了之后需要花时间去修复，此指标统计修复CI所需的时间。
- 计算方式：CI重新变绿的时间 - CI变红的时间

代码坏味道数

- 很多代码扫描工具都能统计代码坏味道，比如Sonar。
- 计算方式：单次构建扫描的坏味道数，取多次的平均数。

代码重复率

- 重复代码在代码库中的占比。Sonar等工具能统计结果。
- 计算方式：重复代码数量 / 总代码数。

代码扫描bug数

- Sonar等工具能扫描出代码中存在的bug数。
- 计算方式：迭代内代码扫描bug出现的次数。

代码扫描漏洞数

- Sonar等工具能扫描出代码中存在的漏洞数。
- 计算方式：迭代内代码扫描漏洞出现的次数。

代码提交频率

- 统计每日代码提交的次数。
- 计算方式：每日代码commit或push次数。

代码合并次数

- 统计周期时间内代码合并的次数。
- 计算方式：迭代内代码merge request次数。

代码评审通过率

- 每次代码评审都有可能因为代码质量原因不通过。
- 计算方式：迭代内代码评审通过次数/迭代内代码评审总次数

自动化测试率

- 测试工作有多少是自动化完成的，而不依赖于人工测试。
- 计算方式：（自动执行测试用例数 - 人工执行测试用例数）/ 总测试用例数
- 备注：测试分为很多类型：单元测试、集成测试、验收测试、性能测试、安全测试等，可以分别度量。

自动化测试失败率

- CI跑的测试是会因为各种原因失败的，这个指标用于统计失败的频率。
- 计算方式：CI测试失败次数/CI测试执行总次数

测试覆盖率

- 测试所覆盖的代码的比率
- 计算方式：大部分测试工具都能自动统计测试覆盖率，比如Jacoco

缺陷修复时长

- 一个缺陷修复所需的时间。
- 计算方式：一个缺陷修复所需的时间。

缺陷密度

- 统计一个迭代内缺陷出现的密度。
- 计算方式：迭代内缺陷个数/迭代内用户故事数

服务恢复时间

- 一次服务故障恢复所需的平均时间。
- 计算方式：服务恢复时间 - 服务故障发生时间

生产问题个数

- 统计周期时间内生产问题的个数，多次统计后以计算平均数。
- 计算方式：用户故事上线后2个迭代内生产问题的个数。

生产问题密度

- 统计周期时间内生产问题产生的密度。
- 计算方式：用户故事上线后2个迭代内发生的问题个数/2个迭代的用户故事数

每个指标对于具体的等级的数值请参考下表。

总结

DevOps是敏捷开发的一种演进，通过结合人、流程与工具，持续改进研发效能。

因此度量只是一种可视化手段，真正能提高效能的还是要从文化、组织、技术等方面去建设DevOps。不断消除浪费，提高生产效率。

对于此DevOps效能度量有任何疑问欢迎找我讨论。我也会在后续的咨询工作中不断去总结和改善这个效能度量指标。

谈一谈线上事故的故事

2021-04-11T13:44:53+08:00

背景

做过不少公司的咨询，发现有些公司对于线上事故没有规范化的认知，没有预防措施，发生之后也没有一个规范的流程去响应。甚至有的公司发生线上事故之后，没有监控没有预警，只有开发或运维知道，然后他们就悄悄的把线上事故修复了，神不知鬼不觉。

那么接下来我就从下面这几个方面来谈一谈线上事故的故事。

线上事故预防

线上事故预防分为3个方面

预发环境
上线checklist
日志与运维监控

预发环境

预发环境也叫UAT环境，或者预生产环境等。

非生产环境中应该有一个预发环境，此环境理论上应该和生产环境一模一样，包括一样的配置来保证性能一致，一样的数据来保证行为一致。

这样才能最大限度的模拟生产环境，并提前发现问题解决问题，起到沙盘演练的作用。

预发环境的位置如下图：

预发环境搭建好了之后，应该定期的同步生产环境的数据到预发环境做测试。

注意：同步数据时得有一定的脱敏策略，不然会有安全风险。

影子流量

由于预发环境毕竟不是生产环境，所有的请求都是由测试人员模拟的，并不是真实的情况。

而影子流量(shadow traffic)就是将发给生产环境的请求复制一份转发到类生产环境上去，以此来达到压力测试和正确性测试的目的。

影子流量的实现有多种方式，常见的比如在统一的入口处API Gateway等地方复制一份流量转发到预发环境，由此来监测预发环境有没有异常情况发生。

影子流量的过程如下图所示。

由于影子流量来自真实的生产环境，过程中一定要注意安全防范，以及流量转发给生产环境性能带来的性能影响。

影子库

同理影子流量，理论上预发环境应该有一份和生产环境一模一样的数据库，才能更真实的模拟生产环境的情况。影子库就是把生产环境的数据库复制了一份，有相同的数据量和相同的数据库配置。

当然，理论上，影子库的数据应该是经过脱敏处理之后的。

上线checklist

上线前应该有一份不断累计不断更新的checklist，用于上线前的检查，每项检查都通过之后，方能上线。

有任何一项检查不通过，则不能上线。直到问题解决之后，才能上线。另外，应该避免在业务高峰期进行上线操作。

下面是一些常见的checklist项，不同的项目会根据需要有不同的checklist：

测试报告是否已通过
业务验收报告是否已通过
是否有其他版本正在上线
是否有线上问题未解决
是否有回滚方案

虽然它是checklist，但很多公司在实践的时候都把它做成了上线报告。

更推荐的做法是把checklist做得越轻量级越好，就像checklist，轻量级的检查就能上线。

不然就会让上线变得越来越困难、越来越花时间，违背了持续交付的理念。

日志与运维监控

如果有非常完备的日志系统，比如常见的ELK，splunk等，则可以在预发环境的影子流量中和生产环境中，监测到异常日志，提前发现问题，提前解决问题。

日志监控是一个挺大的话题，这里就不展开讲了。

但日志是我们定位问题和追溯问题中不可或缺的一环。

线上事故修复

线上报警

线上环境应该有监控预警，一旦发生任何事故，都应该及时报警。

这些事故可能发生的情况包括但不限于：

服务不可用
数据库不可用
服务器不可用
高级别日志警告
流量异常

发生了事故，应该第一时间自动通知开发团队，并抄送相关负责人或领导。收到邮件或通知的开发团队则应该第一时间修复该线上问题。

邮件中应该包含事故的现象描述，原因，日志等信息，以方便开发团队分析和修复。

最重要的是，上述过程都应该是全自动化的，才能保证开发人员或者运维人员第一时间响应。

通常一个团队中会建立某种机制来保证发生线上事故的时候有人能及时响应事故。

我见过有的公司是运维人员24小时待命的。

也见过做的比较好的公司是，团队成员轮流做那个站岗的人，系统通知自动绑定站岗人的电话和邮件，发生线上事故之后，第一时间自动逐级通知站岗人，负责人，领导等。由站岗人作为线上事故的owner，他不一定能修，但他要保证问题被修复。如果事故发生在工作时间以外，则根据响应的时间来调休。

回滚机制

优秀的团队都是使用CI/CD的pipeline来进行上线部署，一旦失败了，pipeline都是支持回滚到上一个版本的。

如果是使用的Kubernetes，则只需要回滚到上一个版本的镜像则可以了。

这就要求，我们每次构建的包或者镜像都应该带上版本号，才方便我们追踪每个版本的状态，以及根据版本来进行回滚操作。

灾备恢复

线上事故如果是灾难性的，则应该启动灾备恢复程序。我了解的大部分公司其实都没有灾备恢复的相关流程。理论上，一个意识比较好的公司，通常都会有严格的灾备恢复流程。

灾备恢复可以是一个操作手册，指导员工如何一步一步的从0恢复服务。通常银行业是一定有这样的灾备恢复的。

同时，每次发生了大的基础设施架构变更的时候，比如服务器配置变化，数据库变化等，都需要及时更新灾备恢复手册。

线上事故复盘

线上事故修复之后，需要对此事故做一次复盘，目的是为了避免下次再发生，以及发生了之后能更快的应对。

这个复盘有很多种形式。其中一种形式叫无过错验尸报告。

具体的介绍可以看我的另外一篇文章：
无过错验尸报告 - Blameless Postmortem

总结

这里简单列举了一些关于线上事故我能想到的几个点，如果大家对于这个话题还有其他感兴趣的部分，欢迎给我留言，我们下回接着探讨。

无过错验尸报告 - Blameless Postmortem

2021-03-30T15:38:49+08:00

前言

在咨询的经历中，发现有些软件项目经常出现线上事故，出现了线上事故之后，第一时间会去修复这个问题，第二时间，则是问责。

这是一个很有意思的现象，通常在一些传统行业的团队或者政府背景的团队中，发生了线上事故，他们会启动问责程序，找到事故的负责人，并对他做出相应的处罚。

作为程序员，大家都知道，代码的世界不出错是不可能的。问责在很大程度上会导致团队成员不敢写代码，不敢上线，不敢触碰线上环境的一切东西，最终导致团队研发效率下降。

那正确的做法应该是什么呢？

这里就给大家介绍一下Blameless Postmortem，中文意思就是无过错验尸报告。

什么是无过错验尸报告？

无过错验尸报告是对线上事故的书面记录，用来描述:

这一线上事故的影响。
减轻或解决事故所采取的行动。
事故的根本原因。
为防止该事故再次发生而采取的后续行动。

无过错验尸报告这个名字是英文直译过来的，如果觉得这个名字过于血腥，可以叫它无过错反思报告，或者无过错事故报告，或者无过错事后分析报告。但更多的人都习惯亲切的叫它验尸报告。

之所以强调无过错，是因为这样的话人们就不会在写报告的时候由于害怕被问责，从而互相埋怨或者隐藏自己的过错。

为什么需要无过错验尸报告？

验尸报告的目标是了解所有导致事故的根本原因，记录事故的经过以供未来参考，并制定有效的预防措施以减少事故再次发生的可能性。

为了使验尸报告能够有效地减少重复事故，总结过程必须激励团队识别根本原因并修复它们。

同时，关注这个过程并确保它是有效的则需要组织中各级的承诺。比如不能出现对团队某个人的问责。

什么时候需要无过错尸检报告？

线上事故都会有严重程度或者影响程度分级，因此，通常我们只会对级别较高的事故写尸检报告。

我们通常会在下面两个时间点开始写尸检报告：

修复事故期间
修复事故之后

谁完成验尸报告？

事故产生的服务所属的交付团队共同负责完成验尸报告。

但需要选择一名owner来主要负责编写报告，并且这个owner需要保证下面两件事情的发生：

分配不同的人去完成各类的事故调研工作，最后把结果汇总给这个owner。
保证报告中的改进action按照紧急程度安排到后面相应的迭代中。

如何跟踪报告中的action？

这个问题其实是紧接上面的第二条。

报告中的action通常分为两类：

根本原因改进
非根本原因改进

对于报告中的每个action：

应该在对应的团队的backlog中建卡，并根据优先级安排进相应的迭代。
Owner要负责跟踪卡的完成情况。并记录到报告中。

验尸报告会议

验尸报告相关的会议有两种。

一种是在编写报告前，用于讨论事故的根因。
一种是在报告完成后，用于向团队分享报告内容，学习成长。

不管是哪种会议，都要记住，这个会议不是批斗会，不能在会议中指责任何人。

这里的指导原则和retro类似。

实践过程中，我发现大部分团队只会开第一个会议，在编写报告的过程中，大家基本上都学习了，并了解了报告中的根因。所以大部分团队都不会开第二个会议。

但是不少公司会开另外一种会议，就是报告写完了之后给领导的汇报会议，此会议根据不同公司的政策不同，可有可无。

报告模版

下图就是一个完整的报告模版。

报告可以是表格的形式，也可以是文档的形式。有了模版，写报告的人就可以照着模版往里面填内容了。

关于如何识别根因，Atlassian提供了一种叫5 Whys的分析方法，具体怎么做可以参考这里：

https://www.atlassian.com/tea...

总结

验尸报告是为了在软件开发过程中以及项目交付过程中能持续改进，有记录能存档，并且成为知识沉淀的一部分。

所以它的形式可以根据团队的实际情况来。我见过有团队用表格来写的，像上面模版那样，也有直接写卡上的，也有直接开会画在白板上然后拍下来的。

对于无过错验尸报告，大家在实践过程中有任何疑问，欢迎来找我讨论。

参考资料

https://www.atlassian.com/inc...
https://www.atlassian.com/inc...

敏捷迭代日历

2021-03-29T14:00:15+08:00

前言

我咨询过的项目中，发现很多团队想用敏捷，但是不知道如何做。那我们就从最基础的一个迭代日历开始吧。

经过多个项目的迭代，总结了一个适合于大部分团队的敏捷迭代日历。

不管是在敏捷的理论里面，还是scrum的理论里面，下面总结的活动都多少有一些差异。但敏捷不是说一定要严格按照标准的活动来进行，而是拥抱变化持续改进，找到最适合于自己团队的最佳实践，才是敏捷的核心。

敏捷迭代日历

图中所展示的是一个迭代内发生的所有活动。

通常一个迭代是2周。横向是两周的时间线，纵向是一天的时间线。

紫色的是当前迭代会进行的活动，蓝色的是需要在本迭代为下个迭代做准备的活动，橙色是总结上一个迭代的活动。

接下来我们就每个活动展开来讲解。

站会Standup

站会通常发生在一天的开始。之所以叫站会，是因为大家站着开，才能让会议尽快结束。站会长如图的样子。

站会的目的是什么？

促进和改善团队协作
让团队达成一致
- 理解共同的目标
- 互通不同成员的工作内容
消除障碍

站会如何进行呢？

开站会通常有两种模式。

一种是按人头更新，每个人轮流说下面三个问题：

昨天做了什么？
今天准备完成什么？
有没有任何问题或阻碍？

第二种模式是按卡更新，在看板上从右至左的更新每一张卡的状态，及时更新卡的状态，同时也要及时暴露该卡有没有任何问题或阻碍。

谁参与站会呢？

整个团队的人都应该参与。也是那些坐在一张桌子上工作的人。

站会需要多长时间呢？

通常不应该超过15分钟，这也是为什么要站着开的原因，这样才能让大家保持专注，尽快结束会议。

做好站会还有哪些tips呢？

站会要固定时间，不能因为某人缺席而推迟。不然会打乱团队的节奏。
不要在站会上引入新东西，只聚焦在当前迭代的范围中。新东西应该在其他时间或其他会议中提出来。
为了让站会尽快结束，可以准备一个小物件作为信物，拿到信物的人才能发言。大家轮流传递信物进行发言，这样可以避免大家你一言我一句的发散导致会议时间过长。
每次站会都应该有一个facilitator。作为站会的推动者负责保证站会高效有序的进行，并更新相关的状态。

迭代计划会议Iteration Planing Meeting

迭代计划会议的举行就标志着上个迭代结束了，下个迭代开始了。因此，我们会在这个会议上在看板中关闭上个迭代，然后开启下个迭代。

迭代计划会议大概长下图的样子，大家打开迭代看板和backlog，然后讨论下一个迭代要做什么。并设置好下一个迭代的看板。

目的

这个会议的目的就是让团队决定接下来这个迭代中，我们要做哪些工作（哪些故事卡）。

如何进行？

通常会考虑团队工作载量，比如一个团队每个迭代能完成30个点，那么就会计划30个点到下一个迭代中，同时会选择优先级高的卡优先完成。

然后团队一起决定下个迭代做哪些卡，这些卡是否都已经是ready的，如果不ready的卡不应该放到下个迭代中。

谁参与呢？

理论上这个会议需要Product Owner参加，他可以回答大家对于这个迭代的工作的问题。实践中发现国内大部分公司没有Product Owner，而类似的角色是提出需求的业务方。因此实践中只需要让团队所有成员参与就可以了。

而对于需求的澄清，会议前应该让产品经理或BA提前完成和业务方的需求澄清和优先级排序。

会议需要多少时间？

通常在一个小时以内。为了保证会议能按时结束，有些工作会放到会前或者其他会去做，比如估点、卡的需求澄清等。

迭代的开始不一定是周一。可以是团队觉得合适的任何一天，比如周三。

当前迭代的工作

迭代开始后，开发和测试人员会工作在当前迭代的卡片中。

如果遇到不清楚的细节，就可以找产品经理或BA澄清。需求澄清应该是一个随时随地发生的动作。

代码评审Code review/技术分享

有的团队会把代码评审和技术分享分开来做。大部分团队没有太多技术分享，因此可以放到一起。

代码评审理论上每天都会发生，而技术分享则看情况。做得好的团队，每周都会有技术分享。

代码评审的两种模式

团队成员各自去查看Merge request并留下评论。
团队成员集中时间一起看Merge request并留下评论。

第一种模式是为了解决团队很难有统一的时间来一起看merge request，则把代码评审的集体时间分散到团队成员各自的零散时间中去做。

第二种模式的好处是，团队可以一起分享业务和分享技术，通常在代码量比较小的时候，快速做完代码评审之后，团队可以商量由某个人分享一些大家不知道的业务或者技术，分享的内容一定要小巧，比如10分钟就分享完，如果是大分享则应该单独计划一个时间来做。

第二种模式的review时间应该控制在30分钟到60分钟以内。时间宽裕的团队，可以做1小时，加上技术分享等。时间不宽裕的团队，则建议在30分钟内结束。因此为了让代码评审快点结束，可以安排一个facilitator来防止大家发散，推进代码评审的进度。

第二种模式的代码评审通常会在每天下午的5点到6点之间进行，这样的好处在于可以回顾大家一天所写的代码。

代码评审的tips

每个Merge Request都应该有2个及以上的人通过了，才能合并。
小步提交。提交越小步，代码review起来越容易。
发现任何有问题的代码或者疑惑的地方，都应该直接问或在代码旁边留下评论。
代码评审不是批斗会，而是互相了解业务与技术，互相学习的机会。
代码评审不是只有leader才有权利去评审，任何团队成员都有权利去做。
代码评审时，应该先从与代码相关的业务说起，让其他人对于代码的业务背景有了解。
代码评审是互相学习的好机会，是拉齐团队编码能力的好机会。

代码评审的好处

知识共享
质量保证
技术氛围建设
快速反馈
编码能力提升

CI/CD

CI是continuous integration，也就是持续集成。

CD是continuous delivery，也就是持续交付。也会说是continuous deployment，也就是持续部署。

CI/CD是我们在开发过程中快速迭代的基本质量保障，它能让我们快速获取反馈。是必不可少的部分。

这个话题也挺大的，就不在这里展开了。

下次单独找个时间来分享这个话题。

迭代估算会

有的团队会把这个会议的内容放到迭代计划会IPM中。但在实践中我们发现，下一个迭代的准备工作可以单独划分一个迭代估算会议来做。

那么这个迭代估算会主要做以下两件事情：

澄清故事卡的需求，确保团队内部对它的理解一致。如果不一致，则需要产品经理或BA去找业务方澄清。
大家一起为故事卡估点。
1. 故事点（Story Point）通常代表这张卡的复杂度或者工作量，有的团队也用它来代表价值。
2. 点数通常有两种方式，一种是使用斐波拉契数列，一种是使用T-shirt size。第一种比较常见。
3. 由于故事卡估点也是一个比较大的话题，这里就不展开叙述了。

下个迭代的工作

下个迭代的工作主要分为下面三个方面。

产品经理或者BA会在当前迭代去准备下个迭代的故事卡，完善那些待分析的卡，补全需求细节，画出原型图等。不清楚的需求就找业务方澄清。
设计师对需求已经清楚的故事卡设计高保真的UI图。
可能会有一些新技术的调研工作，可以安排个别开发人员对下个迭代可能用到的技术做调研。

Retro回顾会

回顾会的英文是Retrospectives，俗称retro。

每个迭代结束之后都会进行Retro回顾会，目的是为了回顾过去展望未来。总结上个迭代中做得好的，可以继续保持，然后看看哪些地方可以改进。

安全检查

在有必要的团队，回顾会可以有一个安全检查，安全检查就是让大家投票觉得现在这个回顾会可以安全进行吗，如果大家的投票是NO，则把参会人员中职位最高的人请出去，再进行投票，直到大家都认为安全后，再进行回顾会。

实践中，大部分团队都不需要安全检查。如果需要，可以反思一下，为什么大家的团队安全感很低。

最高指导原则

回顾会不是批斗会，因此回顾会有一个最高指导原则，目的是为了创造一个安全的环境，在这个环境中，团队成员可以自由检查他们的流程和工具，而且不必担心别人的指责。

最高指导原则就是：

无论我们发现了什么，考虑到当时的已知情况、个人的技术水平和能力、可用的资源，以及手上的状况，我们理解并坚信：每个人对自己的工作都已全力以赴。

最高指导原则的好处在于为了最大化知识的产出而将思考的重点暂时从“人”移开。

回顾会经过这么多年的发展，已经有非常成熟丰富的模版来做了。下图就是一些常见的retro模版。

模版来自：https://metroretro.io/templates

回顾会的几个小tips

大家讨论出来的改进项可能会很多。大家可以投票选出优先级最高的前几个action来执行。
Action必须有owner，没有owner的action就意味着没有人会去做。
每次回顾会开始的时候，应该先回顾一下上次回顾的action是否都完成了。
回顾会不一定是迭代结束就马上做，这样会让当天同时有迭代计划会和回顾会，占用团队太多时间。实践中发现，如果IPM在周一，则retro可以在周三比较合适。

Showcase

每个迭代结束后，需要向PO或者业务方做showcase，展示工作成果，同时获取业务方反馈或批复，以持续改进。

Showcase通常需要团队中的开发、测试、产品经理/BA出席，同时需要业务方或者PO出席。

实践过程中，很多团队会通过showcase会来获得业务方的批复（Sign off），以取得上线许可。

上线

上线的时间不是固定的，每个团队会根据每个团队自己的实际情况来定。

咨询这么多年，我见过随时持续上线的，见过半夜三更上线的，也见过每月固定时间上线的。

通常敏捷是强调小步快跑的，因此，每个迭代做完的工作就应该及时上线。

这样的好处在于上线的内容越少，出现问题的影响越小，回滚成本越小。同时也能更快的获取用户反馈。

另外，上线时间安排在周一的原因在于，如果是周五上线，出了问题，大家周末都不好过了。

总结

我就不赘述瀑布模式和敏捷的区别了。就简单说说我们把需求或者工作划分成每个迭代来完成，有以下几个好处。

我们可以通过燃尽图等相关统计报表，分析团队的研发效率。
通过多个迭代的开发，我们就可以精确度量团队工作载量和速率，更好的利用团队资源。
按迭代回顾和改进，也符合敏捷小步快跑的理念。
工作范围和内容清晰明确。
可视化未来的工作。
更早的识别风险。
降低了因为变化带来的风险，让团队更快的响应变化。

最后，本文只是从敏捷迭代日历的角度谈了谈敏捷相关的实践，如果对敏捷其他话题感兴趣，欢迎给我留言，我们再继续讨论。

敏捷改造（下）：真实案例敏捷改造

2021-03-26T10:27:43+08:00

上篇分析了我做过的一个真实的项目的研发过程中的种种问题，那么这篇就来讲解一下我们如何针对这些问题做敏捷改造。

怎样用敏捷做改进

小瀑布模式的缺点在于它的沟通成本、等待成本、返工成本依然很高，因此我们可以考虑从这3个角度出发去做改进。

我画了一个图来展示小瀑布模式和敏捷开发的详细对比。

图中紫色管道是小瀑布模式，蓝色管道是敏捷开发，两个管道是相同的团队管道容量，换句话说，两种模式的工作载量是相等的。

横向是时间线，从左到右，按需求提出开始直到此需求上线，即为此需求的生命周期。

首先，先说一下为什么这3个成本很高。

沟通成本

产品经理通常需要1-2小时来与业务方进行需求沟通，沟通完了之后，产品经理会有一个初步方案，然后会与团队内部的技术人员，测试人员等，一起评审这个需求，如果这中间有任何疑问，产品经理就需要不停的反复在业务方与技术人员中间进行沟通。因此沟通成本是比较高的。

产品经理通常也需要1-2小时来与技术人员一起做技术评审，时间比较长，很多问题细节需要来回反复确认，沟通成本很高。

总结一下就是，由于需求粒度比较大，每个环节都比较重，有大量的细节需要讨论和确认，因此带来了较高的沟通成本。

等待成本

开发只有等产品文档完全设计好了，才能开始开发，由于需求粒度过大，此设计过程相对过长，因此开发的等待时间也长，没有充分利用开发资源。

同理，测试也是，只有等开发提测了才能做测试。但测试在此之前，会先写测试用例，因此测试资源浪费还算较小。

但另外一个问题是，测试一次性要写非常多的用例测试，一次性测那么多用例，测试的完整性完全依赖于测试人员的耐心。

返工成本

测试如果发现问题，会让开发返工，由于需求粒度比较大，经常出现测试发现多个问题的情况，那么就会来回的多次返工与测试。

甚至有时候返工会直接返到业务方去重新确认需求的细节。

敏捷研发过程改进

那么敏捷是如何改进这个过程的呢？

首先敏捷是提倡小步快跑，拥抱变化。目前由于需求粒度比较大，无法小步快跑，同时开发到中间的时候的，需求突然变化，应对起来也比较慢。

那我们就可以根据下图来进行改造。

上图中最大的变化就是把原来的需求A拆解成了3个story。

敏捷提倡小步快跑，那么管理需求也一样，只有需求足够小，才更利于我们快速理解和分析。而user story（用户故事）则是敏捷里面的一个可以工作的最小单位。

用户故事在软件开发过程中被作为描述需求的一种表达形式；为了规范用户故事的表达，便于沟通；包含角色、活动、价值三个要素。

瀑布式的需求管理和敏捷需求管理的区别在于：

瀑布式的需求分析要求在一开始就获取所有需求，分析所有细节，并且假设我们可以对软件项目有个完美的预测。
而用户故事则基于我们不能完美预测，不能在一开始就知道所有细节的基础。因为我们对需求的理解是一个逐渐清晰的过程。同时，在项目开始时尝试编写所有的需求忽略了重要的反馈循环。用户故事承认故事的时间维度，随着时间的推移以及功能的增加，会有新的用户故事产生，或者使故事的相关性发生变化。所以要延迟细节，融入业务到整个软件开发过程中，鼓励交流和沟通。
另外，做了用户故事拆分之后，产品经理或者BA需要补全细节，不停的做需求澄清，和业务方做sign off。
敏捷需求管理会借助JIRA等工具进行可视化的看板或者scrum管理。而不是基于传统的Excel管理。
每个故事写好之后，会让业务方做card sign off，比如在卡下面留言ready to go等。如果每张卡做sign off太频繁，可以由产品经理或BA单独找业务方用邮件等的形式针对一个epic统一做sign off。
敏捷里其实没有一个专门给业务方和产品经理/BA的需求澄清会，因为默认为已经发生在日常工作中了，按理说应该分析一张卡确认一张卡，才能尽可能减少因一张卡片理解不到位引起的大面积返工。

拆解成小的用户故事之后有如下一些好处：

原来产品经理和业务方的沟通成本随着需求被拆成小的用户故事而变小了。
由于用户故事比较小，分析完成的时间就变快了，产品设计的时间也变快了，那么开发开始的时间也就变快了，减少了等待成本。
由于开发时间更短，第一个用户故事测试时间也就提前了，因此如果出现问题需要返工，那么返工的时间比原来就更早，返工修改的内容也更少，能较快的完成返工并重新测试。整体返工成本就变小了。
由于各方时间都提前了，那么第一个用户故事上线的时间也提前了，业务方就能更早的看到需求的部分功能，就能更早的反馈问题。
由于每个环节的沟通成本，等待成本，返工成本均减少了，因此整个需求的交付时间也就提前了。

从下图就可以看出，每个环节相比之前都是提前的。敏捷的目的是能够让团队拥抱变化，快速响应。

敏捷开发改进

分支改进

原来的分支管理比较混乱，可能造成的问题已经在前面分析过了。

这里就说说分支管理的最佳实践是什么。

现在业界普遍都采用了git flow，具体怎么做可以Google一下，网上有太多文章讲这个，我就不赘述了。这里就展示一张git flow的全景图。
每次git的commit message的推荐格式：Card ID: message
1. Type主要有：feature, refactor, fix等。
2. 每次git的commit推荐能关联到每个故事卡的卡号，这样方便追溯每个故事卡相关的改动。现在很多工具都支持通过message的卡号直接找到对应的故事卡，反之亦然。

CI/CD

从代码的生命周期开始，CI/CD是保证每个环节快速流转的基础，同时也是快速获取反馈的途径。
而CI的基础则是自动化测试，比如最基本的单元测试。
每完成一张故事卡，理论上都可以持续部署到生产环境。而不应该等待所有需求都完成了，或者等前后端都完成了，再做上线部署。
- 部署的步子越小，回滚的成本也就越低。

总结

图里的敏捷开发一定比上面的小瀑布快吗？不一定，这里还有几个因素是需要考虑的。

需求A拆解成story是有成本的。根据产品经理或BA的能力不同，以及需求复杂度的不同，拆卡花的时间也不同。
小瀑布模式里面，在没有bug的情况下只会测试一次。在敏捷模式下，相比原来的小瀑布，会针对story1和story2做回归测试。因此增加了测试时间。
另外，决定敏捷开发能否运行很好的因素还有很多，只有不断探寻最佳实践，持续改进，才能无限逼近我们期望的状态。

总结起来，敏捷是通过小步快跑的方式，提升了响应变化的速度，以达到提升整体交付速度与质量的目的。

本文只是通过一个真实的客户案例，来分析如何基于当前现状做敏捷改造，本文并没有写完全部敏捷改造的内容，因为敏捷包含的内容实在太多了，如果大家感兴趣可以给我留言，后面我会继续分享这个案例中涉及到的其他敏捷改造。

敏捷改造（上）：真实案例研发过程分析

2021-03-23T20:19:20+08:00

背景

最近我去一家科技公司做敏捷咨询，通过梳理该公司的研发过程，发现了该公司的研发过程中许多可以改进的地方，于是我便记录下来，与大家分享学习。

本文会剖析该公司的研发过程，把每个环节详细分析一遍，以找出研发过程中的问题和可以改进的地方。然后再讲解如何做敏捷改造。

研发过程分析

全景图

下图是该公司的研发全景图，从时间线来看，上面一条时间线可以看出整个需求流转的生命周期，下面一条时间线可以看出整个代码流转的生命周期。

下面我们就把每个环节拆开来仔细分析一下。

需求管理

现状

现在是通过共享Excel来管理所有的需求，业务方在表格里面填写想要的需求，包括新需求或bug等，并为每个需求生成一个序号方便追踪。

大概长下图的样子。这是非常典型的传统需求管理方式。

问题

大颗粒的需求可以这样管理，但是不能所有阶段都这样管理，会造成需求粒度太大，细节太多，边界太模糊。
如果不做story拆分，这样的需求离能开发还有很多空间，需要做拆分、细化、转化，最后才能开发。
这样的需求表格缺乏很多细节，比如UI长什么样子，某个业务逻辑有多少条分支等。
这样的表格无法知道业务方和研发方对需求的理解是否一致，很容易出现返工。
此类表格管理需求，不便于业务方追踪需求进度和状态，以及可视化需求的转化过程。

需求评审

现状

产品经理会和业务方一起开会，针对表格里面的某个需求，来确定这个需求的细节，以及怎么做。确保双方的理解是一致的。

问题

需求评审会的时候没有记录过程中确认的结论，导致会后大家又忘记当时的结论是什么。
由于需求粒度过大，很多细节无法详尽的确认清楚，容易导致返工。
由于需求粒度过大，需要比较长的时间来完成需求评审，通常会花2小时以上。
没有sign off，无法判定需求是否通过了业务方的认可。
需求澄清是一个随时随地的动作，但该公司缺乏能随时做需求澄清的氛围或文化。

产品设计

现状

拿到需求后，产品经理会根据需求以及和业务方的沟通，达成一致后开始设计产品文档，把需求涉及到的原型图，业务逻辑等全部画到产品文档上，以提供给开发人员进行开发。

问题

需求通常都很大，产品经理很少把需求拆分成story，也很少在JIRA等工具上拆卡建卡来管理所有的需求。导致产品设计周期很长，细节很多，无法一次性考虑全面。
产品经理设计产品文档的时候，通常是自己设计，设计好了再给业务方或者开发看。没有频繁反馈和需求澄清，导致需求可能被脑补，并不是业务方想要的。
产品文档目前是用版本管理工具来管理的，比如git，不便与查找和归档。
需求、产品文档、代码没有关联关系，不方便后期查找某个需求相关的产品文档和代码。

技术评审

现状

目前，产品经理设计完成后，会拉上开发和业务一起进行技术评审，确保设计的产品文档三方能达成一致。

问题

由于需求太大，评审时间太长，通常超过2小时，久而久之大家会越来越反感这样的评审会议，并且会议后期大家的注意力也不集中了。
细节太多，容易忽略某些细节，导致最后开发依然有不确定的开发细节，并且开发的结果和业务方的期望不匹配。

开发

现状

拿到产品文档之后，后端会根据文档中的业务逻辑，开发完成服务端的功能，前端会根据文档中的原型图或者高保真UI设计图，开发完成客户端的功能。

再来说说该公司的分支管理模式。

他们把分支分为了：线上分支（master），测试分支（stable），开发分支（dev）等。

保证不同的分支做不同的事情，防止分支污染。

线上分支（master）：是预上线环境和线上环境的分支，以这个分支为准，其他分支都是以这个分支为基础拉取。
测试分支（stable）：测试环境分支，是给测试团队测试使用，如果有些功能在本地及开发不容易测试，开发人员可以到测试分支进行自测。
开发分支（dev）：开发人员自测。

分支命名规范：姓名+需求名+日期

分支会根据上线需要，merge到stable进行测试，或者merge到master进行上线。如下图。

问题

分支管理混乱，每个分支既可能合并到dev，也可能合并到master，原因是因为这样可以解决仅部分功能要上线的问题，哪个功能要上线，就合并哪个分支到master。
1. 理论上，拉了分支开发的代码都是应该要上线的，不上线的代码会浪费开发资源。
2. 分支开发的时间也不应该太长，太长会导致代码冲突变严重，回滚成本变高。
3. 如果是因为测试没做完而暂时不上线，那可能是因为分支所代表的功能粒度太大了，测试时间太长，应该从源头开始拆解需求。
4. 如果是因为业务变更而暂时不上线，应该使用feature toggle来解决。
功能分支虽然写了开发者名字和需求名，但依然很难关联具体的需求是哪一个。
虽然规定了从master拉取分支，但大家有的从dev拉取，有的从stable拉取，没有统一规范。
分支命名中的日期意义不大，因为分支理论上存在的时间应该尽量短，才能避免更多的冲突，减少review的工作量，以及减少回滚的成本。其次分支拉出来的时间在git上都能清晰的看到。
开发很少做需求澄清，会按照自己的想法实现某个需求，遇到不确定的地方没有和团队讨论，没有找产品、业务确认。会导致最终实现和业务方的期望不匹配。
没有code review，无法统一开发团队成员对代码的规范，无法及时发现代码中的问题，无法做代码层面的知识传递。
没有写单元测试，无法做到研发自测与质量内建，无法保证代码的正确性，无法保证其他人不会破坏原有代码功能，无法持续集成。
没有CI/CD，无法及时获取反馈，无法快速部署，无法快速发现问题。

提交测试

现状

开发完成开发工作之后，自己测试通过了之后，会交给测试人员进行测试。测试人员在提测之前会根据产品文档先写测试用例。

问题

提测的过程靠口头传递，测试人员无法可视化的知道开发进度，做了哪些改动，可以部署哪个环境，使用哪个版本。

测试

现状

测试会根据写好的测试用例对功能进行测试，如果发现问题，会返回给开发，让开发修复。

问题

测试用例目前是用单独的工具来管理，没有和需求关联起来。
测试完成之后，没有对业务方的showcase，无法获取业务方的验收反馈。

上线

现状

每个需求基本上都包含了前后端，因此会等前后端都开发测试完成后，再一起做上线。

问题

上线内容比较多，一旦出了问题，会导致回滚成本比较高，定位问题比较慢。
上线时间比较慢，不能让业务方快速看到最终的功能。

总结

这样的研发过程梳理完了之后，会发现其实这样的过程就是我们俗称的小瀑布。它的特点是相比传统的瀑布模式它更轻量级，但相比敏捷，它又更重量级。目前很多公司都在采用这样的小瀑布模式。

小瀑布模式的缺点在于它的沟通成本、等待成本、返工成本依然很高，还有可以优化的空间。
同时整个过程中，需求评审、技术评审、用例评审都做得比较重，每次评审的内容都非常多，时间非常长，细节非常多。
整个过程中的所有产出物并没有明确的关联关系，也没有统一的管理工具和存储位置，随着时间的推移，所有知识管理将变得越来越难，新人的学习成本将变得越来越高。软件项目中的信息量会在潜移默化中变成异常高的复杂度。
环节与环节之间没有文字记录明确一个环节的结束与开始，比如开发到测试。基本上是靠成员之间的口头传递。
最后还发现该公司不是全功能团队模式，而是按角色分的，一个角色可能会同时负责几个项目，比如A开发上午在写X项目的代码，下午可能在写Y项目的代码了。

根据这些现状问题，具体怎么改造，将在下篇来具体讲解。

单元测试的一些分享

2021-03-22T10:52:44+08:00

背景

最近在给一个客户做技术咨询，然后发现了客户对于单元测试的一个有意思的现象。分享出来，大家一起学习探讨一下。

现状分析

这里以java后端项目例，发现客户写的测试长下面的样子。（代码已经脱敏处理过。）

    @Autowired
    private SampleJob handler;

    @Test
    public void testStart() throws Exception {
        SampleParamVo paramVo = new SampleParamVo();
        paramVo.setStartTime("2021-03-18");
        paramVo.setEndTime("2021-03-18");
        handler.execute(paramVo);
    }

    @Autowired
    private SampleHandler handler;

    @Test
    public void testHandler() {
        handler.doHandler(new DateTime("2021-11-26"), null);
    }

那么这样的测试代码有什么问题呢？

别人看不懂这个测试是在做什么。首先测试的方法名没有任何意义，其次测试代码也只是调用了某个函数。
无法运行。这类测试代码运行往往需要启动其他服务或者需要一些特殊的设置。无法运行就意味着它不能成为CI跑测试的一部分。
没有断言。没有断言就无法知道测试的代码的正确性。
使用了@Autowired这样的代码，增加了测试的耦合以及编写成本。

和客户深聊了之后发现，原来客户不同的人对单元测试的理解也不一样。

写这个代码的开发人员说，“这些代码是在开发完成之后做一些自测的辅助脚本。”
有的开发人员说，“我们是微服务，单元测试需要调用其他服务，写起来很麻烦，而且如果其他服务不可用时，测试也跑不过。”
测试人员说：“单元测试我们有的，我每天都在写测试用例，到单元测试的时候我就会把我的用例全部过一遍。”

所以我们可以发现，有的开发人员口中的单元测试其实应该属于集成测试或者E2E测试，有的开发人员完全没有写过单元测试，而测试人员理解单元测试是自己手动测试的时候用的测试用例。

那我们就先来说说什么是单元测试。

什么是单元测试？

单元测试（unit testing），是指对软件中的最小可测试单元进行检查和验证。

通常在java的世界里面，单元测试就是指对一个public的方法编写检查和验证的代码。

为什么要写单元测试？

写单元测试主要有两大目的：

验证功能实现。
保护已有功能不被破坏。

当我们写完一个方法，我们如何知道自己写的方法是按期望工作的呢？这个时候就可以添加单元测试来验证我们的代码是按期望工作的。即当我们给定指定的输入，我们获得期望的输出，则我们说这个功能是符合期望的。

其次，代码不是写了就永远不变的，当需求变更时，新增需求时，修复bug时，都会修改代码，而单元测试则能保护我们已有的功能不被破坏。保护已有功能不会被自己破坏，被新人破坏，被新功能破坏。

如何写单元测试？

下面是一个单元测试的例子

    @Test
    public void should_return_fizz_given_input_can_be_divided_by_3() {
        FizzBuzz fizzBuzz = new FizzBuzz(); // Given
        String actual = fizzBuzz.sayIt(6); // When
        Assertions.assertEquals("Fizz", actual); // Then
    }

一个标准的单元测试包含以下几个部分：

能描述清楚做了什么的测试名（方法名）
单元测试的Given、When、Then具体内容。
1. Given：初始状态或前置条件
2. When：行为发生
3. Then：断言结果

写好单元测试要主要几个要点：

因为测试代码并不会进入生产环境，同时我们期望测试即文档，因此测试的名称写很长也没有关系，重要的是能清晰的表达我们这个测试所覆盖的用例是什么。
一个测试只测一种case。
单元测试通常需要覆盖大量的case来保证我们的代码在绝大多数场景下都是按期望工作的。因此要做到这一点可以参考下面两大原则。这里就不详细讲解这两个原则，具体内容可以Google。
- CORRECT原则
- Right-BICEP原则
单元测试有一个考核的标准就是测试覆盖率，指的是我们的代码有百分之多少被单元测试测到了。
- 测试覆盖率分几种：行覆盖率，分支覆盖率，路径覆盖率，条件覆盖率等。每种都可以单独设置百分比。通常我们会看中行覆盖率和分支覆盖率。
- 通常行业里面常设置测试覆盖率在85%以上。
- 为什么不是100%？因为不是所有代码都能被测到的，比如private的构造函数是无法被测到的，这种就会降低覆盖率。
通常所有的自动化测试都是开发人员来写，比如单元测试，集成测试等。

测试金字塔

说到单元测试，就不得不提测试金字塔，如下图，最底层是单元测试，最顶层是UI测试。（测试金字塔有好几种，但道理都是相通的）

看左边的箭头，越往下越快，越往上越慢，它主要包括编写越快，运行越快，定位问题越快等。

看右边的箭头，越往下成本越低，越往上成本越高，包括时间成本，金钱成本，人员成本，维护成本等。

什么是mock？

我们在做单元测试的时候，常常可能访问外部系统或者外部类，这些外部的不可控性会让我们的单元测试成本变得很高。

常见的外部不可控性有：HTTP访问，增删文件，随机性，时间相关性，接口类等。

于是开发者便开始探索更廉价的方式来写单元测试，mock就是其中的解决方案。

mock 对象运行在本地完全可控环境内，利用 mock 对象模拟被依赖的资源，使开发者可以轻易的创建一个稳定的测试环境。

mock是Test double理论中的一种，如果对test double理论感兴趣，可以到这里了解更多，这里就不展开说了。

如何用mock？

还是以java为例，java的世界中常用的mock框架比如mockito。

下面是一个mock的例子。

    @Test
    void should_return_100_when_get_list_size() {
        List map = mock(List.class);
        //当调用list.size()方法时候，返回100
        when(map.size()).thenReturn(100);
        Assert.assertEquals(100, map.size());
    }

单元测试是我们测试的最小单位，因此我们只测当前这个public的方法中的实现，而方法中调用第三方类的东西，我们都应该mock掉。

这样的好处有两个：

不会因为其他类的不可控性而导致这个测试方法变得难写。
其他类的修改不会导致这个测试方法挂掉。所有的变化都被隔离出去了。

什么是TDD？

最后再升华一下，简单说一说TDD，TDD的全称是Test driven development，即测试驱动开发。它是极限编程XP中的一个标准实践。

TDD要求在编写某个功能的代码之前先编写测试代码，然后只编写使测试通过的功能代码，通过测试来推动整个开发的进行。

这样做有四大好处：

TDD是一个很好的契机，可以让你在考虑解决方案之前先考虑问题。
首先考虑测试会迫使你首先考虑与代码的接口。先思考接口可以帮助你将接口与实现分开。
简单设计。
几乎100%的测试覆盖率。

这里我就不详细叙述TDD相关的话题了，因为TDD是一个比较大的话题，如果感兴趣，下次专门开一个新话题来聊TDD。

能力识别模型

2021-02-26T14:09:11+08:00

背景

这些年一直在做对外的敏捷开发培训，也就是针对其他企业的开发人员进行敏捷全栈开发培训，经过培养之后期望这些开发人员能快速胜任工作。

但是由于大部分的培训最终产出物可能都是一些评价或者一个分数，它并不能很好的反应一个人的能力情况，也不能帮助培训者或者企业更好的识别每个人的能力。

因此，经过多次迭代，归纳总结出了一个针对开发岗的能力识别模型，目的是能够帮助培训者和企业更好的识别他们的能力分布。

能力识别模型是什么

能力识别模型是一个包含4种能力，32种抽象行为的一个模型。

用于识别开发岗的开发人员的各纬度能力分布。

4种能力分别是：

技术能力
学习能力
理解能力
沟通能力

每种能力对应了8种抽象的相关行为，共32种行为。

下面就展开详细说一下。

技术能力

顾名思义，技术能力就是指和开发技术相关的各种能力。

它包含的下面8种抽象行为指的是他/她会在过程中使用到或者被观察到能体现这些行为的facts或事实。

1 - 验收思想

验收思想指的是做事情或者编码时能考虑到如何验收。

比如写测试是一种验收，tasking时标明输入和输出是一种验收。

2 - 代码设计

代码设计指的是写代码的时候有代码设计。

比如使用设计模式，比如在Java里面使用stream API，比如有良好的OO设计，比如遵守了SOLID原则。

3 - 独立编码

独立编码指的是能独立编写代码。

比如独立完成Java编程，独立完成react编程，独立完成python编程等。

4 - 整洁代码

整洁代码指的是编写的代码满足clean code。

比如没有明显的坏味道等。

5 - 完成任务

完成任务就是指的能按时合格完成任务。

比如按时合格完成编程练习，比如按时合格完成非技术的画图工作等。

6 - 解决问题

解决问题指的是能解决遇到的各种问题，包括技术问题，非技术问题等。

比如能使用debug修复遇到的bug，比如能通过看日志或者搜索等解决遇到的技术问题。

7 - 利用资源

利用资源指的是能利用一切资源来完成任务。

比如向教练求助，向同学同事提问，上网搜索，使用工具等都属于利用资源。

8 - 探索新技术

探索新技术指的是自己能探索一些新的技术，包括框架，工具，算法等。

比如学习了某种新算法，研究了某个新工具或者新框架，比如没接触过Jenkins但自己研究了如何使用Jenkins，比如没写过python但学习python解决了某个问题。

学习能力

顾名思义，学习能力就是通过不断学习来完善自身的能力。

它包含的下面8种抽象行为指的是他/她会在过程中使用到或者被观察到能体现这些行为的facts或事实。

1 - 迭代思想

迭代思想指的是做任何事情都能小步快跑，迭代式的完成任务。

比如写代码的时候能够小步提交，比如做项目时能迭代开发。

2 - 遵循最佳实践

遵循最佳实践指的是做任何事情都能遵循最佳实践。

比如遵循重构的最佳实践，比如遵循code review的最佳实践，比如遵循TDD的最佳实践，比如遵循站会的最佳实践。

3 - 从他人身上学习

从他人身上学习指的是能学习他人的优秀的技术、习惯和思想。

比如会向他人请教如何做代码设计，比如能学习别人是如何组织站会的，比如code review时能学习别人更好的代码实践。

4 - 每日总结

每日总结指的是每天都能坚持总结一天的所得和所缺，类似于一个人的retro，来帮助自己回忆所学和改善不足。

比如每日会写总结日志，比如每次获得新知识会记笔记，比如经常写博客。

5 - 执行力

执行力指的是完成预定目标的操作能力。

比如code review之后马上就能重构自己的代码，比如获得任务之后马上就能开始计划，比如执行任务的时候没有拖延症。

6 - 优先级

优先级指的是做任何事情都能分清优先级，优先完成优先级高的任务或者环节。

比如做项目的时候能优先完成核心功能而不是选择自己喜欢的功能做，比如编码的时候能优先完成核心功能而不是纠结某个非核心算法。

7 - 工作习惯

工作习惯指的是有良好的工作习惯。

比如编码时能使用快捷键提高工作效率，比如能使用自动化流程来提高效率。

8 - 持续改进

持续改进指的是每天都会根据反馈或者自己总结而持续不断的改进自己的各方面能力。

比如重构就是一种持续改进，比如不断改善站会体验就是一种持续改进，比如额外练习自己不熟悉的编程技术也是一种持续改进。

理解能力

理解能力是一种比较综合的能力，它包含了多种综合行为。

把理解能力拆解一下，也包含了下面8种抽象的行为。

1 - 任务分解

任务分解指的是做事情之前会先tasking，或者会把复杂的任务先列出执行步骤。

比如TDD时先做tasking，比如要调研一个复杂技术时先理清要调研的每个步骤。

2 - 接受反馈

接受反馈指的是能接受他人基于事实的反馈并改进。

比如code review时别人对代码提出的更好的建议能接受并重构。

3 - 需求澄清

需求澄清指的是拿到任务或者需求时都能先做需求澄清，避免产生二义性。

比如做编程练习的时候能澄清所有模糊的描述，而不是自己想象应该是什么样的需求。比如设计功能的时候能和用户以及团队讨论功能需求而不是自行决定。

4 - 理解需求

理解需求指的是能理解每次练习的需求，能理解别人的提出的需求。

比如编程结果里面没有偏离需求的实现。

5 - 发现他人的问题

发现他人的问题指的是能发现他人代码中的问题，或者敏捷实践中的问题。

比如能发现他人代码中不合适的命名，比如能发现他人代码中的逻辑错误，比如能发现他人在敏捷活动中的错误实践。

6 - 理解新知识

理解新知识指的是能理解学到的所有新知识，包括技术知识，敏捷知识以及业务知识。

比如能理解新框架的使用方式，比如能理解新工具的使用场景，比如能理解新的敏捷活动的最佳实践。

7 - 版本管理

版本管理指的是会使用GIT等版本管理工具，并提交有意义的commit。

比如git commit的描述清晰记录了团队要求的所有信息。比如在创建数据库时也会使用数据库版本管理工具。

8 - 业务命名

业务命名指的是在代码中或者故事卡中，都能使用有业务意义的名字。

比如不会出现技术命名，或者毫无含义的命名。比如能在所有编程场景中统一语言。

沟通能力

沟通能力指的是沟通、表达、团队协作等软实力相关的能力。

它同样包含了下面8种抽象行为。

1 - 提供帮助

提供帮助指的是在团队中能积极主动的向团队提供支持或帮助。

比如帮助团队攻克技术难题，比如帮助成员fix某个bug，比如主动承担某个任务。

2 - 积极讨论

积极讨论指的是能积极参与团队的讨论和决策。

比如code review的时候能积极的参与讨论，比如需要某个决定的时候能积极说出自己的想法。

3 - 团队协作

团队协作指的是能积极促进团队正向的成长和前进，体现自己的协作精神。

比如能互相激励完成某个任务，比如能共享资源来帮助团队沉淀知识，比如能取长补短帮助团队前进，比如能组织管理团队的相关事务。

4 - 有效对话

有效对话指的是能在和别人的沟通中产生有效对话。

比如没有多余的废话，或者不会出现沟通完之后依然没有得到答案。

5 - 回答问题

回答问题指的是在工作和学习中能积极的回答问题。

比如教练问的问题，同学同事提的问题等。

6 - 寻求帮助

寻求帮助指的是在遇到困难的时候能积极的寻求帮助，而不会因为个人原因阻碍团队或者项目的前进。

比如遇到不懂的编程问题就直接提问，比如遇到不懂的知识就提问，比如遇到解决不了的技术难题就寻求他人或者网络的帮助。

7 - 给出反馈

给出反馈指的是能在团队中积极的给他人反馈，帮助他人成长，帮助团队成长。

比如在code review中指出他人的代码坏味道，比如在团队活动中给他人给出反馈帮助他人成长。

8 - 分享

分享指的是能积极分享自己的想法或者技能。

比如在站会中分享业务，在code review的时候分享自己学的新技术等。

如何使用能力识别模型

能力识别模型可以被设计成一个二维表格，每种能力对应8种行为，每种行为有1-10分，每种行为默认每个人都具备这些能力，所以默认5分。

如果该行为主动做到所有人中的最好就是10分，如果该行为没有做到或者做得不好就相应扣分。

同时，每种行为都需要观察记录facts，基于事实来支撑打分。所以它大概会长下图的样子。

根据这些数据，最终就可以生成这样一张直观的能力雷达图。

通过雷达图我们就可以直观的看到每个人在不同的能力纬度上的优势和不足，以更有针对新的帮助这个人的成长。

有了这些数据，还可以从多维度去对比不同的人之间的能力差异，获取不同的数据视图，帮助团队更好的定位人才的发展。

未来

这个能力识别模型并不是完美的，它还需要不停的迭代优化，适配各种不同场景的抽象行为。

它目前只是用于帮助企业了解开发人员的一种可视化形式。

未来，有了这个能力识别模型，可以根据不同团队的需要，生成不同的数据视图，来辅助团队的发展。

最后，如果对于这个能力识别模型有任何想法或者建议，欢迎与我讨论。

在线培训工具合集

2020-02-20T11:52:43+08:00

引言

上次总结了一些在线培训的经验，这次给大家总结一些在线培训的工具。

有一些常见必备工具就不仔细介绍了。

比如微信，用于群聊讨论发资料等等。

比如钉钉，用于群聊讨论发资料等。但不太推荐用钉钉做在线视频会议培训，因为它的视频会议功能太简单，下面看了zoom的介绍大家就知道了。

比如GitHub，用于技术培训的代码保存和传递。

比如iCloud，WPS，石墨文档，腾讯文档等，用于在线协作场景。

为什么没有推荐Office，因为Office是收费的，不是所有人都愿意付费使用Office，另外Office的云都没有其他的快，会影响线上效果。

那么这里就重点给大家介绍以下几个非常适合在线培训的工具。

PowerPoint
Keynote
Mural
Miro
Zoom
Visual Studio Code

PowerPoint

俗称PPT，通常讲课都需要用PPT来教授知识，在线培训也不例外。大家对PPT的使用也很熟悉了。这里只介绍PPT的两个适合在线培训的功能。

第一个功能：实时字幕

实时字幕就是，你在通过PPT讲解的时候，PPT可以实时的识别你说的话，并以字幕的形式显示在屏幕下方。如下图所示。

这个字幕识别支持多种语言，包括英文和中文。而且速度非常快。

怎么打开这个功能呢？

在PPT演示模式，把鼠标移到屏幕左下角就会出现如下图的图标，第三个图标就是打开字幕功能的开关。

这样功能有两个好处：

当网络不稳定的时候，有时候听课的人有个别关键字没听清楚，他只需要看字幕就知道刚才说了什么，不用打断讲师再问一遍。
当屏幕在一页PPT上长时间不动的时候，可以让学生聚焦在不停变化的字幕上，不至于盯着不变的屏幕而枯燥犯困。

第二个功能：笔

笔有两种，一种是激光笔，另一种是可以画画的笔。

先说激光笔，激光笔也就是你在现场培训的时候，很多讲师手里会拿一个红外激光笔，可以在投影屏上打出一个红色的亮点，告诉听众现在讲解的重点是哪里。

那么PPT自带这个功能，它能让你的鼠标变成一个激光笔，如下图所示。这样你就可以通过鼠标来不断指示目前讲解的重点是什么。

有人会问，那就用原始鼠标箭头不也能做同样的事情吗？

首先，在PPT演示的时候，鼠标长时间不动是会自动消失的。

其次，当你移动鼠标的时候，听众不知道是你在划重点，还是在随意移动鼠标。而激光笔的出现则是在暗示听众，我是在划重点。

然后说说可以画画的笔，如下图所示。

这种笔可以帮助讲师在讲课的时候勾勒重点，或者在PPT上直接画图辅助讲解。

怎么打开这个功能呢？

在PPT演示模式，把鼠标移到屏幕的左下角就会出现下面的几个图标。

选择第二个笔的图标，就可以选择不同类型的笔，笔还可以选择不同的颜色哦。

Keynote

Keynote没有激光笔，也没有其他笔。

但它可以利用iPhone手机来遥控keynote，同时在iPhone上就能使用白板和激光笔。

当然在线培训时使用iPhone来遥控不是特别合适，所以这里就不展开讲了。

这里重点介绍keynote live这个功能。

Keynote live

Keynote live的中文名叫keynote直播，它允许你邀请任何人在他们的电脑或者手机设备上观看你的keynote，重点是，观看者不需要有iCloud账户。

大部分时候我们视频会议要给别人看keynote，都是通过分享屏幕来完成的，而keynote live让我们无需分享屏幕就可以让别人实时看到keynote内容。你只需要把keynote live的分享链接发给观看者，观看者在浏览器打开链接就可以了。

若要开启 Keynote 直播，首先需要将该文稿保存在 iCloud 中，然后在工具栏中找到 Keynote 直播即可。

点击继续之后可以看到直播设置，在这里可以将链接复制给需要参加的听众，并且可以设置密码访问。

当受邀者点击链接进入直播之后，主持者可以在自己的 Keynote 直播菜单旁边看到目前在线人数，可以点击播放，在此 Mac 上预演，或者直接开启直播。

Mural

Mural是一个可视化协作的工具。

它主要功能是可以在线贴便利贴，如下图所示。它的便利贴有3种格式，正方形、长方形和圆形，同时每个便利贴的大小都可以调整。让我们可以灵活的使用它。

它也内置了多种模版供我们使用，这就让我们在远程协作以及培训的时候可以使用它来进行在线retro，在线IPM，在线DDD培训，在线头脑风暴等。

重点介绍它的两个比较适合在线培训的功能，计时器和房间。

计时器

计时器可以让我们在贴便利贴的时候，可视化的计时，让每个参与的人都能知道还剩多少时间。

在屏幕顶部点击如图中计时器图标，就可以选择任意时间了。

当你计时开始之后，你就可以看到还剩多少时间计时结束。

这个功能可以帮助我们在线培训的时候，把控时间，同时可视化给每一个参与的人。

创建房间

Mural支持创建不同的房间来做不同的事情。

想象一下，当我们线上培训需要分小组做事情的时候，是不是就可以用mural给每个小组创建一个房间，不同的组在不同的房间里面协作，而讲师就可以同时看到每个房间的动态，观察总结大家的协作结果，然后再统一总结分享。

最后，提醒大家注意一下，Mural不是免费的工具，但是新用户可以免费使用30天，这个免费时间对于大部分培训来说已经足够了。

Miro

Miro是一个和Mural非常类似的工具。他们大部分功能都一样。

Miro的不同在于它的模版更多，比如它可以创建思维导图，创建产品roadmap等。

Miro也有很多其他Mural没有的功能，但是这些功能在线上培训时帮助并不多，所以就不一一介绍了。

最后想要告诉大家的是，当需要使用在线便利贴的时候，更推荐使用Mural而不是Miro，原因是Miro在国内的使用速度较慢，大大影响线上培训的效果。而Mural在国内的使用是几乎无延迟。

Zoom

zoom对于大家来说一定不陌生。但是zoom的很多功能你们真的清楚怎么使用吗？

这里就给大家介绍几个适合在线培训的功能。

Annotate

Annotate是当你共享屏幕的时候，你可以打开annotate工具栏，就可以进行各种注解操作了。

它位于共享屏幕之后的顶部工具栏中。

打开annotate工具栏之后你就可以使用如图所示的各种小工具。

比如，你可以使用Text在屏幕上写任意文字。

也可以使用Draw在屏幕上画各种图形，比如可以用方框把你讲解的重点内容框起来。

可以使用Stamp在屏幕上标记如图所示的图标。比如当听众有疑问的时候，可以使用问号图标在屏幕中标记出有疑问的地方，也可以用箭头指向有问题的地方，这样讲师就可以进行讲解。

还可以使用Spotlight把鼠标变成激光笔。这个功能和PowerPoint那个功能是一样的。

最后，想要提醒大家的是，这些annotate是所有人都可以在屏幕上画，同时所有人都能看到。因此zoom自带的这个工具，就帮助我们解决了很多在线培训时的互动问题，有很大的想象空间。

举手

在zoom的participants（参与者）里面，有一个举手的功能。当你打开participants之后，你在下方就可以看到一个Raise Hand（举手）的按钮。

那它有什么用呢？除了上面讲到的，使用Stamp在屏幕上标记疑问的互动方式以外，zoom还允许使用者使用举手的功能在讲师讲课的过程中举手提问。

因为有的培训场景中，讲师是会把所有人强制静音了，这个时候要想和讲师说话，就需要用到这个举手功能了。

Visual Studio Code

写代码的同学对VS Code都不陌生，它是一个非常好用的IDE，今天要介绍的，是它的一个非常适合在线培训的功能。

很多人远程培训的时候，想要和别人结对编程，都是使用远程控制别人的电脑来进行结对编程。

而Visual Studio Code提供了一个Live Share的功能，Live Share可以让你能够在相同的代码库上快速进行协作，而无需同步代码或配置相同的开发工具，设置或环境。让你可以协作的方式来进行远程结对编程。

使用Live Share需要在VS Code上安装一个微软官方的Live Share插件，如下图所示。

安装好了之后，在VS Code左下角就会有Live Share的按钮，如下图所示。点击它就可以开始Live Share了，但是使用这个功能需要微软账户或者GitHub账户的支持。

开始Live Share之后就如下图所示，可以进行结对编程了。

解决了以前远程控制电脑来远程结对编程时的各种卡顿不流畅。

结语

使用合适的工具可以帮助我们提高线上培训的效果。

当然，适用于线上培训的工具远不止这些，这里只是做个抛砖引玉，希望大家都可以多分享一些利于我们工作的工具。

如何做好一场线上培训

2020-02-12T16:20:05+08:00

引言

随着新冠肺炎疫情的持续，很多企业只能在家办公，这就意味着像ThoughtWorks这种提供服务的公司也只能远程提供服务。
那么为了能远程给客户提供效果相当的线上技术培训服务，我们做了一些线上培训的实践，在这里总结了一些经验，与大家分享。

现场培训流程

首先，我们做的是技术培训，通常现场的技术培训流程是这样的：

通过PPT或者白板讲解技术知识点
引导学员对相关技术知识进行讨论或者分享
讲师现场演示技术细节
学员现场做练习
讲师根据练习结果进行反馈和总结
布置作业

知道了培训流程，那么把这样的技术培训迁移到线上，会有哪些变化呢？

首先，线上培训会有一部分新增的现场培训没有的内容。
其次，相对于现场培训，线上培训会有一些较大的变化。

接下来我们就依次总结总结。

线上培训新增内容

线上新增的内容大概包括：线上工具的准备，新规则的建立。

线上工具准备

战场变成线上了，那肯定得准备线上适用的工具。原来的比如便利贴，蓝泥之类的物料就用不上了。

那线上需要准备哪些工具呢？

首先，是视频会议工具。

需要选取适合你培训内容的视频工具。比如我们选择的是zoom。因为zoom支持多人同时在线视频会议，也支持chat聊天室用于发文字内容，还支持白板。

最重要的是它有两大功能非常适合技术培训：

多人同时共享屏幕。当学员在练习时，则让每个学员同时共享屏幕，讲师就实时看到每个学员的练习过程，并给出指导意见。
Annotation。zoom支持使用文字，符号，箭头等工具在讲师共享的屏幕上做标记，这样学员就可以针对屏幕上的某个内容提问。

其次，需要一个群聊工具，比如微信。

因为通常培训都需要发送一些资料，群聊工具可以在统一的地方发送和存放这些资料。

最后，还有一些非必须但实用的工具。

比如，VS Code可以实时协作编码，用于结对编程。

一些画图工具用于演示结构图、原理图等，比如AutoDraw，ProcessOn，甚至PPT等。

一些在线便利贴工具用于Retro或者User journey等，比如IdeaBoardz，Miro，Mural等。

当这些工具都准备好了之后，那么培训就多了一个环节。

就是介绍这些工具是做什么的，以及在培训过程中如何使用。

比如介绍Zoom的多人共享屏幕如何打开，如何在zoom里面举手等。

上面这些都是软件工具，当然还需要硬件工具。

硬件方面就要求讲师有摄像头，麦克风和音响。当然TW的MAC都自带了这些硬件。不过如果学员用的是台式机就不一定了，所以培训前一定要确认好。

最后一点，通常讲师线上讲课都容易忽视的一个点，就是摄像头的视频背景。

很多时候讲师都没有注意过自己的视频效果，大部分讲师的视频都是光线昏暗的，自己脸部黑黢黢的。

昏暗的光线会让听众容易昏昏欲睡。这也是为什么直播主播的画面都非常明亮。试想一下，当你去看主播直播，如果ta的光线昏暗，画面黑黢黢的，你还会给ta刷礼物吗。

所以呀，有条件的讲师可以在自己面前打开一盏灯，做一个有光环的讲师。

新规则的建立

线上培训需要一些线上培训适用的新规则。

这些规则根据不同的培训需要而不同，这里给大家提供一些参考。

讲课时，有问题可以用zoom举手，同时用箭头指向有问题的地方。
1. 鼓励互动，鼓励提问。
当讲师问有没有问题时，没问题就在chat里面扣1，有问题就2。
每个人都必须打开摄像头。
做练习时，每个人都必须共享屏幕。
讲课必须录屏，以便掉线的人回看。

这里特别说明一下，打开摄像头有四个目的：

促进学员提高专注度
帮助讲师获取反馈
拉近屏幕两边的距离
提高讲师的信心，因为一直对着屏幕讲课，时间久了讲师会怀疑自己是不是一个人在表演。

线上培训的变化

下面就给大家总结几点线上培训的变化。

1.大部分肢体语言都无用了

曾经你是一个影帝，你可以通过你的一言一行一举一动演绎你想讲授的知识。如今，你只剩一张大脸。大部分知识的传递只能靠嘴说了。这就要求讲师有更高的语言组织和表达能力。

就像岳云鹏一样，满脸都是戏的同时，能用语言抓住观众的注意力。

这里给讲师的一点建议就是，因为网络的不稳定，说话可能听不清，因此讲师讲课的时候尽可能的做到吐字清晰，语速适当。

2.过程被隐藏了

现场培训讲师在讲课的过程中，会有很多流程。比如讲师会先在投影上通过PPT讲解课程，然后走到白板上画一些重要的原理或者知识点，接着打开自己的电脑演示刚才讲的内容，最后让学生自己实践然后讲师到学生中间去走查。

但是换成线上培训，这一切就变了，因为这些切换的过程被隐藏了，学员看到的是，屏幕突然从PPT切换到白板，又突然切换到代码演示。学员稍微一走神，就会导致不知道发生了什么，屏幕内容怎么突然变了。

所以，这里给做线上培训的讲师的建议就是：

每次切换屏幕记得口头告知即将要做什么操作。比如可以说：“现在我们切换到白板来画一下刚才的结构图，帮助大家理解。“
在每个环节一开始，就把要进行的操作步骤先可视化列出来。让学员知道接下来要发生的事情的步骤。

3.原来是面对学生，现在是面对屏幕

这里最大的变化就是：

互动变了
注意力管理变难了
获取反馈的渠道变了

先说第一点，互动变了。

现场培训是多地多形式互动，比如你可以在投影屏前讲课，可以让学生上台展示，你可以在白板前画画，可以让学生在白板上分享，可以与学生问答互动，等等。

但是线上培训却只剩一块屏幕了。没有了多地互动，也没有了多形式。这让互动变难了。

所以，这里给线上培训的讲师的建议是，把原来的靠肢体语音和场景变换的互动，改为语言互动，这就要求我们的讲师要像相声演员一样，能灵活应对各类问答。

接着说第二点，注意力管理变难了。

现场培训，讲师可以通过自己的培训技巧，很容易的管理学员的注意力。但现在你面对的是一块屏幕，很难去做这件事情。同时，由于屏幕的单一性，以及电脑的便利性，很容易导致学员走神，比如学员盯着屏幕久了就犯困了，或者屏幕弹了一条微信消息出来，学员就回消息去了。

那么这里给线上培训师的建议就是：

强制要求每个学员打开摄像头。通过他们的表情，你可以做一定的注意力管理。
加强自己的语言能力。把自己变成一个像相声演员一样说很多话都不枯燥的讲师。
减少同一个画面出现的时间。比如一页PPT讲半小时，学员盯着这样的屏幕会怀疑是不是网络卡了。吸引注意力的其中一个办法就是制造变化，所以我们应该让屏幕有节奏的变化起来。

最后说第三点，获取反馈的渠道变了。

现场培训的时候，讲师获取反馈的渠道通常有以下几个。

从学员的表情获取，如果学员皱眉头了，可能表示他没听懂。

从学员的动作获取，如果学员开小差了，你可以立刻发现。

线上培训则阻断了这样的获取反馈的渠道。

因此，这里给线上培训师的建议是：

强制要求学员打开摄像头，这样你可以从表情获取一定的反馈。
建立获取反馈的规则，比如大家如果没有问题就在聊天室里面扣1。比如zoom支持让学员在讲师分享的屏幕里面，使用一些箭头符号等等，这样可以知道学生是在屏幕的哪个位置的知识点有疑惑。

4.学生的关注点变了

为什么说学生的关注点变了呢，因为原来学生在现场是关注老师的一言一行，现在变成了鼠标的一举一动。关注点的转变，带来的则是教学形式的转变，以及教学技巧的转变。

讲师无法通过自己的举动来传递信息，因此线上培训传递信息的渠道变成了以下两个：

屏幕
讲师的言语

知道了传递信息的渠道，就需要讲师作出改变了。

首先需要把屏幕能传递的信息做到聚焦化和精准化。因为屏幕能出现的信息量是很大的，包括鼠标的移动也是一种信息，通常鼠标移动到的位置就是正在讲的重要知识点，因此讲师使用鼠标要慎重。

另外，很多讲师喜欢在屏幕上放很多内容，当你没有鼠标指引的时候，走神的学生就很难分辨你此刻在讲解的内容是什么。因此，屏幕出现的内容尽量精炼为好。

最后，更多的信息是靠讲师的言语传递的，屏幕上的内容也是靠讲师的言语去引导的。所以，线上培训更考验讲师的语言能力。

5.一对一的对话变成了广播

原来的现场培训，当学员在练习环节有问题，学员会向讲师提问，这种提问通常是一对一的，然后讲师会进行一对一的指导。而线上培训则不一样了，学员只能通过视频提问，因此，一对一的对话则变成了广播式的，所有人都能听到你们的对话。

这样的广播是有好处和坏处的。

好处是，通常问题都是有代表性的，进行广播可以同时指导有同样问题的学员。也可以巩固其他学员在这部分的知识。
但也有坏处，坏处就是广播式会导致部分内向的学员不敢提问了，也同时会让那些专心练习的人分心。

这里就需要讲师主动在练习环节去识别那些有问题的学员，同时给出因人而异的高效的指导。

结尾

分享了这么多，后面我们还会继续尝试各种各样的线上培训形式，继续总结分享。

希望我们的服务不只能现场提供，也能在线上提供相等质量的服务。

也希望有线上经验的小伙伴多多交流，总结方法论。把零散的经验变成可复用的资产。

哪些小技巧可以提升一场培训的质感

2019-12-12T16:29:05+08:00

前言

一场专业的培训需要关注很多方面才能呈现一场有质感的培训，这些东西包括但不限于：

节奏
气场
注意力管理
氛围管理
临场应变能力
呈现设计
互动设计
课程设计
引导设计
翻转设计
形象设计

我们今天不说这些抽象的理论，我们今天就说一些能落地的小技巧，帮助你提升一场培训的质感。

培训前

提前踩点
1. 熟悉现场，规划现场。
2. 准备好培训必须的物料。
了解学员
1. 通过提前获得的学员资料了解
2. 提前到现场与学员认识
3. 需要了解的维度包括但不限于：
  1. 技术背景
  2. 组成关系
  3. 工作方式
如果学员认知中的概念和我们的概念有偏差时，除非是培训内容，否则不去做纠正。而是去做一个映射，用学员的语言去交流。
1. 比如学员认知中理解的团队是50人左右的account，而我们理解的团队是5-8人的一个组，那当我们在培训时说每个团队要进行code review的时候，学员会非常困惑50个人如何一起code review。

关于紧张

上台前：

降低期望值
1. 看轻结果。看轻结果不是意味着降低质量，而是一种心理暗示。
2. 允许出丑。先出丑，再出众，再出名。
3. 允许紧张。心理学家研究发现：一旦允许紧张出现之后，演讲人就会变得很真实，他就会展现最真实的自我，观众也更容易接受他。
增加把握度
1. 不断实践
2. 充分准备
到现场提前认识一些朋友，或者给朋友打电话聊一聊接下来要讲的内容。和熟悉的人谈论内容，可以让自己不那么紧张，至少可以先正常的谈论有关话题。

上台后：

把听众想象成冬瓜
1. 这样的好处，你的内心里就不会把听众当成一种威胁，他们对你是无害的。
目光看稍远处，不直视观众。或者是看那些比较友善的听众，告诉自己他们是站在你这边的。
紧张的时候喝口水，用这个时间想词。
带一个自己熟悉的道具，这些你熟悉的道具会给自己安全感。

开场

很多人在开场的自我介绍不知道说什么，大部分人就介绍了我是谁就结束了。自我介绍有很多炫酷的方式，这里给大家介绍一个最基本的模版就是：
1. 我是谁，我从哪里来，我是做什么的，我今天要做什么，为什么是我做。
在开始一个话题前，先获得学员输入，收集期望和问题，带着学员的期望和问题去讲课。让学员在一开始就先思考今天要培训的内容，也让你的培训更契合学员的期望。
如果是多天的培训，第二天的培训开始的时候，记得带学员一起回顾一下前一天的内容。上下午的培训也需要简单回顾一下。增强培训之间的连贯性。

培训中

每个晦涩难懂的概念，尽量做到以下几点来帮助学生理解。
1. 列举一些有趣的例子
2. 构建一个清晰的场景
3. 说一个故事
4. 画图解释
讲完一个概念，随时在心里反问自己讲清楚没。同时从学员的表情中去获取反馈。
不停回顾总结每个小环节，帮学员加强记忆。并在开始下一个环节前向学员确认没有任何疑问。
大家在互动讨论的时候，讲师要下去寻访每个组的状态，倾听每个组的讨论。发现他们讨论中的问题，总结他们讨论的内容，发散自己讲课的思路，这些都会成为下一个讲课环节的素材。
两个讲师之间的交接，上一个讲师可以简单介绍下一个讲师以及要讲的内容，下一个讲师可以谢谢一下上一位讲师，再自我介绍。
当想要让学员做什么的时候，不要站在自己的角度去下命令，而要站在学员的角度去讲解，不然学员不明白要做什么。
尽量不要在PPT上放很复杂的图表或者很长一段话。如果放了，就要在PPT上highlight出重点。或者要么就指着PPT一点点讲解，要么就让学员先看后讲或者先讲后看。不然学员要么听你讲没有看到PPT，要么看PPT错过了你的讲解。
每个讨论、动手和分享环节都需要做时间控制。
每次尽量让不同的学员来做分享、互动和答题，增强每个人的参与感。
每个人或者小组分享完了之后，问一下其他人或者小组有没有什么问题，增加每个组之间的互动与参与感。保证均等的参与。
声音要洪亮，吐字要清晰。声音小显得在自言自语，不自信。吐字不清晰，学员就听不清楚，容易走神。
如果动手环节发现大部分人问题比较多，可以先暂停一下，统一再阐述一般概念，统一收集一下问题，集中解答一下，然后再继续。
PPT中如果有英文，最好解释一下中文意思，如果学员不理解这个英文意思，很可能在后面的内容中走神。
埋下伏笔或者设计连环问题。这类设计能让整个培训在呈现上提升一个档次。
中途休息时，不要忘了去和学员聊天，去了解学员，了解他们的技术背景、组成关系、工作习惯等，这些信息都能帮助你改善后续的培训内容。
讲解一些常见问题或者场景的时候，可以这样说：“实际工作中大家问得比较多的一个问题是...“，或者：”通常在大部分项目上，这类问题都是...“。类似于这样的说辞会显得你很有经验，增强学员对你的信任感，塑造你的权威形象。

提问相关

当学员回答了一个问题，或者做了一个分享。一定记得肯定、总结并升华学员的内容。一个比较常见的套路是：这位同学回答（分享）得非常好，他既提到了XXX，又说到了XXX，这些内容不仅能让我们XXX，又能让我们XXX。
当讲师向学生提问时，不仅要引导学生回答what，还需要引导学生思考更深层次的东西。引导式提问的模版是：
1. What？回答what，让学员认知清楚
2. So what？回答what背后的意义，升华这个问题
3. Now what？回答怎么做才能达到这个意义，促使学生行动
当学生向讲师提问时，回答的套路是：肯定，复述，解答，确认
1. 这是一个很有思考的问题。（肯定学员，增加学员的提问的信心。）
2. 你的问题是不是...（复述问题，确保你没理解错，也让其他学员能听清楚问题是什么）
3. 针对这个问题，答案是...（有条理的解答问题）
4. 我是否回答了你的问题？（确认是否解答了疑惑，让学员知道你是非常关注他的问题的。）
如何提高学员回答问题的积极性？
1. 如果你希望学生举手回答，那么你也要举起手来以身作则。
2. 先问封闭式问题，让学员能简单快速的回答。比如问是否，或者给选择。
3. 回答积极性很好了，再问开放式问题。
4. 最重要的是当学员回答完了之后，要使用提问相关里面的第一条tip。

培训冷知识

培训中的时间密码90/20/8，90是指每90分钟要休息一次，20是指每20分钟要转换一种学习方式，8是指每8分钟要调动学员一次。
讲师的N种死法：读PPT，批评学员，严重拖堂，离题万里，自我陶醉，逻辑不清，只看屏幕，端和装。

结语

小技巧太多了，总结不完了。下次我就直接开始总结大技巧了，告诉大家哪些大技巧可以提升一场培训的质感。

记一次Spring Batch完整入门实践

2018-09-05T14:07:06+08:00

前言

本文将从0到1讲解一个Spring Batch是如何搭建并运行起来的。
本教程将讲解从一个文本文件读取数据，然后写入MySQL。

什么是 Spring Batch

Spring Batch 作为 Spring 的子项目，是一款基于 Spring 的企业批处理框架。通过它可以构建出健壮的企业批处理应用。Spring Batch 不仅提供了统一的读写接口、丰富的任务处理方式、灵活的事务管理及并发处理，同时还支持日志、监控、任务重启与跳过等特性，大大简化了批处理应用开发，将开发人员从复杂的任务配置管理过程中解放出来，使他们可以更多地去关注核心的业务处理过程。

更多的介绍可以参考官网：https://spring.io/projects/sp...

环境搭建

我是用的Intellij Idea，用gradle构建。

可以使用Spring Initializr 来创建Spring boot应用。地址：https://start.spring.io/

首先选择Gradle Project，然后选择Java。填上你的Group和Artifact名字。

最后再搜索你需要用的包，比如Batch是一定要的。另外，由于我写的Batch项目是使用JPA向MySQL插入数据，所以也添加了JPA和MySQL。其他可以根据自己需要添加。

点击Generate Project，一个项目就创建好了。

Build.gralde文件大概就长这个样子：

buildscript {
   ext {
      springBootVersion = '2.0.4.RELEASE'
   }
   repositories {
      mavenCentral()
   }
   dependencies {
      classpath("org.springframework.boot:spring-boot-gradle-plugin:${springBootVersion}")
   }
}

apply plugin: 'java'
apply plugin: 'idea'
apply plugin: 'org.springframework.boot'
apply plugin: 'io.spring.dependency-management'

group = 'com.demo'
version = '0.0.1-SNAPSHOT'
sourceCompatibility = 1.8

repositories {
   mavenCentral()
}

dependencies {
   compile('org.springframework.boot:spring-boot-starter-batch')
   compile('org.springframework.boot:spring-boot-starter-jdbc')
   compile("org.springframework.boot:spring-boot-starter-data-jpa")
   compile group: 'com.fasterxml.jackson.datatype', name: 'jackson-datatype-joda', version: '2.9.4'
   compile group: 'org.jadira.usertype', name: 'usertype.core', version: '6.0.1.GA'
   compile group: 'mysql', name: 'mysql-connector-java', version: '6.0.6',
   testCompile('org.springframework.boot:spring-boot-starter-test')
   testCompile('org.springframework.batch:spring-batch-test')
}

Spring Batch 结构

网上有很多Spring Batch结构和原理的讲解，我就不详细阐述了，我这里只讲一下Spring Batch的一个基本层级结构。

首先，Spring Batch运行的基本单位是一个Job，一个Job就做一件批处理的事情。
一个Job包含很多Step，step就是每个job要执行的单个步骤。

如下图所示，Step里面，会有Tasklet，Tasklet是一个任务单元，它是属于可以重复利用的东西。
然后是Chunk，chunk就是数据块，你需要定义多大的数据量是一个chunk。

Chunk里面就是不断循环的一个流程，读数据，处理数据，然后写数据。Spring Batch会不断的循环这个流程，直到批处理数据完成。

构建Spring Batch

首先，我们需要一个全局的Configuration来配置所有的Job和一些全局配置。

代码如下：

@Configuration
@EnableAutoConfiguration
@EnableBatchProcessing(modular = true)
public class SpringBatchConfiguration {
    @Bean
    public ApplicationContextFactory firstJobContext() {
        return new GenericApplicationContextFactory(FirstJobConfiguration.class);
    }
    
    @Bean
    public ApplicationContextFactory secondJobContext() {
        return new GenericApplicationContextFactory(SecondJobConfiguration.class);
    }

}

@EnableBatchProcessing是打开Batch。如果要实现多Job的情况，需要把EnableBatchProcessing注解的modular设置为true，让每个Job使用自己的ApplicationConext。

比如上面代码的就创建了两个Job。

例子背景

本博客的例子是迁移数据，数据源是一个文本文件，数据量是上百万条，一行就是一条数据。然后我们通过Spring Batch帮我们把文本文件的数据全部迁移到MySQL数据库对应的表里面。

假设我们迁移的数据是Message，那么我们就需要提前创建一个叫Message的和数据库映射的数据类。

@Entity
@Table(name = "message")
public class Message {
    @Id
    @Column(name = "object_id", nullable = false)
    private String objectId;

    @Column(name = "content")
    private String content;

    @Column(name = "last_modified_time")
    private LocalDateTime lastModifiedTime;

    @Column(name = "created_time")
    private LocalDateTime createdTime;
}

构建Job

首先我们需要一个关于这个Job的Configuration，它将在SpringBatchConfigration里面被加载。

@Configuration
@EnableAutoConfiguration
@EnableBatchProcessing(modular = true)
public class SpringBatchConfiguration {
    @Bean
    public ApplicationContextFactory messageMigrationJobContext() {
        return new GenericApplicationContextFactory(MessageMigrationJobConfiguration.class);
    }
}

下面的关于构建Job的代码都将写在这个MessageMigrationJobConfiguration里面。

public class MessageMigrationJobConfiguration {
}

我们先定义一个Job的Bean。

@Autowired
private JobBuilderFactory jobBuilderFactory;

@Bean
public Job messageMigrationJob(@Qualifier("messageMigrationStep") Step messageMigrationStep) {
    return jobBuilderFactory.get("messageMigrationJob")
            .start(messageMigrationStep)
            .build();
}

jobBuilderFactory是注入进来的，get里面的就是job的名字。
这个job只有一个step。

Step

接下来就是创建Step。

@Autowired
private StepBuilderFactory stepBuilderFactory;

@Bean
public Step messageMigrationStep(@Qualifier("jsonMessageReader") FlatFileItemReader<Message> jsonMessageReader,
                                 @Qualifier("messageItemWriter") JpaItemWriter<Message> messageItemWriter,
                                 @Qualifier("errorWriter") Writer errorWriter) {
    return stepBuilderFactory.get("messageMigrationStep")
            .<Message, Message>chunk(CHUNK_SIZE)
            .reader(jsonMessageReader).faultTolerant().skip(JsonParseException.class).skipLimit(SKIP_LIMIT)
            .listener(new MessageItemReadListener(errorWriter))
            .writer(messageItemWriter).faultTolerant().skip(Exception.class).skipLimit(SKIP_LIMIT)
            .listener(new MessageWriteListener())
            .build();
}

stepBuilderFactory是注入进来的，然后get里面是Step的名字。
我们的Step中可以构建很多东西，比如reader，processer，writer，listener等等。

下面我们就逐个来看看step里面的这些东西是如何使用的。

Chunk

Spring batch在配置Step时采用的是基于Chunk的机制，即每次读取一条数据，再处理一条数据，累积到一定数量后再一次性交给writer进行写入操作。这样可以最大化的优化写入效率，整个事务也是基于Chunk来进行。

比如我们定义chunk size是50，那就意味着，spring batch处理了50条数据后，再统一向数据库写入。
这里有个很重要的点，chunk前面需要定义数据输入类型和输出类型，由于我们输入是Message，输出也是Message，所以两个都直接写Message了。
如果不定义这个类型，会报错。

.<Message, Message>chunk(CHUNK_SIZE)

Reader

Reader顾名思义就是从数据源读取数据。
Spring Batch给我们提供了很多好用实用的reader，基本能满足我们所有需求。比如FlatFileItemReader，JdbcCursorItemReader，JpaPagingItemReader等。也可以自己实现Reader。

本例子里面，数据源是文本文件，所以我们就使用FlatFileItemReader。FlatFileItemReader是从文件里面一行一行的读取数据。
首先需要设置文件路径，也就是设置resource。
因为我们需要把一行文本映射为Message类，所以我们需要自己设置并实现LineMapper。

@Bean
public FlatFileItemReader<Message> jsonMessageReader() {
    FlatFileItemReader<Message> reader = new FlatFileItemReader<>();
    reader.setResource(new FileSystemResource(new File(MESSAGE_FILE)));
    reader.setLineMapper(new MessageLineMapper());
    return reader;
}

Line Mapper

LineMapper的输入就是获取一行文本，和行号，然后转换成Message。
在本例子里面，一行文本就是一个json对象，所以我们使用JsonParser来转换成Message。

public class MessageLineMapper implements LineMapper<Message> {
    private MappingJsonFactory factory = new MappingJsonFactory();

    @Override
    public Message mapLine(String line, int lineNumber) throws Exception {   
        JsonParser parser = factory.createParser(line);
        Map<String, Object> map = (Map) parser.readValueAs(Map.class);
        Message message = new Message();
        ... // 转换逻辑
        return message;
    }
}

Processor

由于本例子里面，数据是一行文本，通过reader变成Message的类，然后writer直接把Message写入MySQL。所以我们的例子里面就不需要Processor，关于如何写Processor其实和reader/writer是一样的道理。
从它的接口可以看出，需要定义输入和输出的类型，把输入I通过某些逻辑处理之后，返回输出O。

public interface ItemProcessor<I, O> {
    O process(I item) throws Exception;
}

Writer

Writer顾名思义就是把数据写入到目标数据源里面。
Spring Batch同样给我们提供很多好用实用的writer。比如JpaItemWriter，FlatFileItemWriter，HibernateItemWriter，JdbcBatchItemWriter等。同样也可以自定义。

本例子里面，使用的是JpaItemWriter，可以直接把Message对象写到数据库里面。但是需要设置一个EntityManagerFactory，可以注入进来。

@Autowired
private EntityManagerFactory entityManager;

@Bean
public JpaItemWriter<Message> messageItemWriter() {
    JpaItemWriter<Message> writer = new JpaItemWriter<>();
    writer.setEntityManagerFactory(entityManager);
    return writer;
}

另外，你需要配置数据库的连接等东西。由于我使用的spring，所以直接在Application.properties里面配置如下：

spring.datasource.url=jdbc:mysql://database
spring.datasource.username=username
spring.datasource.password=password
spring.datasource.driverClassName=com.mysql.cj.jdbc.Driver
spring.jpa.database-platform=org.hibernate.dialect.MySQLDialect
spring.jpa.show-sql=true
spring.jpa.properties.jadira.usertype.autoRegisterUserTypes=true
spring.jackson.serialization.write-dates-as-timestamps=false
spring.batch.initialize-schema=ALWAYS
spring.jpa.hibernate.ddl-auto=update

spring.datasource相关的设置都是在配置数据库的连接。
spring.batch.initialize-schema=always表示让spring batch在数据库里面创建默认的数据表。
spring.jpa.show-sql=true表示在控制台输出hibernate读写数据库时候的SQL。
spring.jpa.database-platform=org.hibernate.dialect.MySQLDialect是在指定MySQL的方言。

Listener

Spring Batch同样实现了非常完善全面的listener，listener很好理解，就是用来监听每个步骤的结果。比如可以有监听step的，有监听job的，有监听reader的，有监听writer的。没有你找不到的listener，只有你想不到的listener。

在本例子里面，我只关心，read的时候有没有出错，和write的时候有没有出错，所以，我只实现了ReadListener和WriteListener。
在read出错的时候，把错误结果写入一个单独的error列表文件中。

public class MessageItemReadListener implements ItemReadListener<Message> {
    private Writer errorWriter;

    public MessageItemReadListener(Writer errorWriter) {
        this.errorWriter = errorWriter;
    }

    @Override
    public void beforeRead() {
    }

    @Override
    public void afterRead(Message item) {
    }

    @Override
    public void onReadError(Exception ex) {
         errorWriter.write(format("%s%n", ex.getMessage()));
    }
}

在write出错的时候，也做同样的事情，把出错的原因写入单独的日志中。

public class MessageWriteListener implements ItemWriteListener<Message> {

    @Autowired
    private Writer errorWriter;

    @Override
    public void beforeWrite(List<? extends Message> items) {
    }

    @Override
    public void afterWrite(List<? extends Message> items) {
    }

    @Override
    public void onWriteError(Exception exception, List<? extends Message> items) {
        errorWriter.write(format("%s%n", exception.getMessage()));
        for (Message message : items) {
            errorWriter.write(format("Failed writing message id: %s", message.getObjectId()));
        }
    }
}

前面有说chuck机制，所以write的listener传入参数是一个List，因为它是累积到一定的数量才一起写入。

Skip

Spring Batch提供了skip的机制，也就是说，如果出错了，可以跳过。如果你不设置skip，那么一条数据出错了，整个job都会挂掉。
设置skip的时候一定要设置什么Exception才需要跳过，并且跳过多少条数据。如果失败的数据超过你设置的skip limit，那么job就会失败。
你可以分别给reader和writer等设置skip机制。

writer(messageItemWriter).faultTolerant().skip(Exception.class).skipLimit(SKIP_LIMIT)

Retry

这个和Skip是一样的原理，就是失败之后可以重试，你同样需要设置重试的次数。
同样可以分别给reader，writer等设置retry机制。

如果同时设置了retry和skip，会先重试所有次数，然后再开始skip。比如retry是10次，skip是20，会先重试10次之后，再开始算第一次skip。

运行Job

所有东西都准备好以后，就是如何运行了。
运行就是在main方法里面用JobLauncher去运行你制定的job。

下面是我写的main方法，main方法的第一个参数是job的名字，这样我们就可以通过不同的job名字跑不同的job了。

首先我们通过运行起来的Spring application得到jobRegistry，然后通过job的名字找到对应的job。

接着，我们就可以用jobLauncher去运行这个job了，运行的时候会传一些参数，比如你job里面需要的文件路径或者文件日期等，就可以通过这个jobParameters传进去。如果没有参数，可以默认传当前时间进去。

public static void main(String[] args) {
    String jobName = args[0];

    try {
        ConfigurableApplicationContext context = SpringApplication.run(ZuociBatchApplication.class, args);
        JobRegistry jobRegistry = context.getBean(JobRegistry.class);
        Job job = jobRegistry.getJob(jobName);
        JobLauncher jobLauncher = context.getBean(JobLauncher.class);
        JobExecution jobExecution = jobLauncher.run(job, createJobParams());
        if (!jobExecution.getExitStatus().equals(ExitStatus.COMPLETED)) {
            throw new RuntimeException(format("%s Job execution failed.", jobName));
        }
    } catch (Exception e) {
        throw new RuntimeException(format("%s Job execution failed.", jobName));
    }
}

private static JobParameters createJobParams() {
    return new JobParametersBuilder().addDate("date", new Date()).toJobParameters();
}

最后，把jar包编译出来，在命令行执行下面的命令，就可以运行你的Spring Batch了。

java -jar YOUR_BATCH_NAME.jar YOUR_JOB_NAME

调试

调试主要依靠控制台输出的log，可以在application.properties里面设置log输出的级别，比如你希望输出INFO信息还是DEBUG信息。
基本上，通过查看log都能定位到问题。

logging.path=build/logs
logging.file=${logging.path}/batch.log
logging.level.com.easystudio=INFO
logging.level.root=INFO
log4j.logger.org.springframework.jdbc=INFO
log4j.logger.org.springframework.batch=INFO
logging.level.org.hibernate.SQL=INFO

Spring Batch数据表

如果你的batch最终会写入数据库，那么Spring Batch会默认在你的数据库里面创建一些batch相关的表，来记录所有job/step运行的状态和结果。

大部分表你都不需要关心，你只需要关心几张表。

batch_job_instance：这张表能看到每次运行的job名字。

batch_job_execution：这张表能看到每次运行job的开始时间，结束时间，状态，以及失败后的错误消息是什么。

batch_step_execution：这张表你能看到更多关于step的详细信息。比如step的开始时间，结束时间，提交次数，读写次数，状态，以及失败后的错误信息等。

总结

Spring Batch为我们提供了非常实用的功能，对批处理场景进行了完善的抽象，它不仅能实现小数据的迁移，也能应对大企业的大数据实践应用。它让我们开发批处理应用可以事半功倍。

最后一个tips，搭建Spring Batch的过程中，会遇到各种各样的问题。只要善用Google，都能找到答案。

一个AR Tech Radar的诞生

2018-08-28T21:12:26+08:00

什么是AR Tech Radar

技术雷达是ThoughtWorks每年出品两期的技术趋势报告，一般来说大家看到的雷达都是文档形式，其中有一张技术全景图，以及每个技术点的成熟度分析。而AR技术雷达就是在原始文档的基础上，利用AR技术将其立体化呈现，并在其中添加互动元素。

为什么要做AR Tech Radar

技术雷达一直以来都是文档的形式呈现，如果能通过包含在内的最新技术呈现出来，岂不是更能体现技术雷达的意义。同时也能增加技术雷达的交互和科技感。
XR Community作为AR/VR等技术的探索者，AR技术雷达是我们社区内部产品的第一步尝试。
我们也不知道为什么，就是想做AR Tech Radar。

AR Tech Radar的技术选型

目前市面上能做AR的技术有很多，基本上每家大公司都有自己的AR技术。为什么我们会选择ARKit呢？（ARKit是苹果做AR软件开发的一个工具，使开发者能为iOS设备开发增强现实应用。）

之所以选择ARKit一个很重要的原因就是懒，只想选一个学习成本比较低的技术。

其实AR技术强依赖于承载它的硬件，所以选择AR技术其实就是在选择硬件平台。我们期望能使用一个广泛的平台，让AR技术雷达被更多的人接触到。目前AR硬件平台使用最广泛，也最容易让用户接触到的就是iOS，所以我们选择了ARKit。

其中还有一些其他的人气技术，比如：

ARCore，它是Google推出的运行在Android上的技术，但目前只有几款顶配的Android手机可以运行。
Hololens，它是微软的AR眼镜，购买成本较高，很难被普通用户接触到。
Unity，它支持iOS和Android跨平台。

那为什么我们没有选择在unity上进行AR开发，让它同时支持iOS和android呢？一个原因是ARKit和ARCore是才出来的新技术，它在unity上的兼容性和使用上肯定有很多未知的坑，我们期望使用比较稳定的平台。另外一个原因是，我们期望尝试用原生开发，以便更深刻的体验AR开发的过程。今后我们会尝试使用例如unity等工具进行开发，然后和原生开发做一个对比。

如何开发AR Tech Radar

准备

ARKit是苹果的技术，语言首选是Swift。硬件需要支持ARKit的一台Mac和一部iOS设备。因为ARKit不支持模拟器运行，所以必须使用真机进行全程的开发调试。开发软件是Xcode。

前期构想

做AR开发需要有两部分准备，一部分是本身的编程，另外一部分就是3D建模和空间相关的知识。编程不必多说，只要会Swift就能开始。3D建模不是我们的长项，所以前期我们做了很多调查，比如自己使用3D建模软件做一个雷达模型，或者去购买别人做好的雷达模型，或者外包给第三方公司做一个3D模型，再或者找会3D建模的同学加入我们。

但这些方案都被我们否决了，原因有很多，比如我们的经费有限，不能支持我们去找外包，也没有现成的模型给我们购买。而自己去学习3D建模的学习时间也长，同时也没找到会3D建模的同学。

因此我们决定用ARKit支持的形状来组合一个雷达。

我们曾经设想过很多次AR技术雷达应该长什么样。

比如罗马斗兽场的样子，让技术每层递进。

或者是一个圆球，人站在球里面，被周围的技术包围，大概像这样：

再或者，它应该是一个立在你面前的展台，技术雷达就摆在用户面前，大概像这样：

最终这些想法都被我们暂时搁置了，最主要的原因是我们没有能力和人手去实现那些炫酷的样子，并且我们觉得技术雷达就应该用它最朴素的样子展示给大家，应该被大家关注的是技术雷达的内容，而不是这个3D物体。所以最终我们决定用一个圆饼来展示技术雷达。

开发

首先，3D建模不是我们的长项，所以我们选用了ARKit支持的基本形状来组合出一个技术雷达的大饼。因此，我们使用了一个圆柱体和三个圆管，如下图。正中间是一个圆柱，用三个圆管把圆柱包围起来，就形成了雷达圆饼。

接着，为了让整个雷达看起来更立体，我们使用了圆球来作为每个技术的标示点，同时让标题浮在圆球的正上方。如下图。

我才不会告诉你，每个技术标示点在第一版的设计中是圆锥形的，看起来像雷达上的一坨坨屎。请看下图。

然后就是添加交互，让用户在点击某一个圆球的时候弹出它的具体阐述。就像下图一样。我们在圆球的正上方弹出一个半透明白板，并把标题和内容放在上面。白板上的字不同于圆球上的标题，它是印在平面上的，而不像标题是3D立体的。因为大段的文字不适合全部做成3D立体的字，这对资源的消耗和3D的计算是很大的。所以我们利用3D纹理贴图，把文字描述贴到了白板上。

数据

最后就是如何添加数据，我们希望这个AR技术雷达能运用到每一年的技术雷达，这就要求我们添加进去的数据是支持更新的。

所以我们使用了一个单独的文件来存储每一期的所有技术，文件内容包含了所有技术相关的信息，比如名字、详细介绍、它所处的象限、它的分类等等。

这样的好处就是下一次的雷达技术出来之后，我们只需要更新这个独立的文件就可以看到最新的AR技术雷达了。

3D开发过程中遇到的困难与趣事

遇到的第一个奇葩事件就是，第一次我们添加了一个物体，可是在摄像头里面怎么都找不到，后来我们无意中把镜头对着天空突然发现那个物体在空中飘着。原因就是ARKit世界里面的尺寸是和现实世界一样的，单位是米，而我们的离地高度设的是3米，因此它就跑到空中去了。

另一个和这个是相似的，我们加了一个圆管放在地上，可是在地上怎么也找不到那个圆管。后来我们才发现，我们的圆管的尺寸太大了，把我们全部包在圆管里面了。

第三个有意思的事情是，我们添加了一个平面，上面写了一些东西，可是我们在镜头里面却怎么也找不到这个平面。通过各种debug和调查研究，才发现，我们在平面的背面，原来对于没有厚度的平面，只能在正面才能看得见。

还有一个比较棘手的问题就是，比如有些物体需要旋转两个90度再加上一些变换才能达到我们想要的位置。这对空间想象能力的要求就比较高，我们尝试了很多种旋转和变换，才最终找到了想要的位置。

未来的发展

我们期望AR技术雷达能发展成为每次技术雷达发布的官方AR应用，通过不同的途径和不同的体验让更多的人了解技术雷达，让人们能和技术雷达有一些有意义的互动。

所以未来我们期望能不断完善AR技术雷达，让它成为一个炫酷的、交互式很强的应用。

打开脑洞想象一下，通过使用AR技术雷达，你不仅可以看到每次更新的新技术、还能够通过一些交互直观的看到它的历史轨迹、应用场景以及具体实践，是不是一件很酷的事情？