您还没有绑定微信,更多功能请点击绑定

第五十九篇 Gaming the Metrics

本帖最后由 小编D 于 2012-7-27 13:30 编辑

请对以下文章有翻译兴趣的组员留下你的预计完成时间和邮箱地址,以便小编登记翻译者信息以及文章最终完成时间

本文翻译:Sol_Sun 校稿:muddy533

Gaming the Metrics
游戏化的度量指标



One of the cornerstones of quality and lean Six Sigma is data: “We insist on it.” “Don’t tell us what you think the situation is; let the data do the talking.” “In God we trust—all others bring data.” You get the idea.
质量和精益六西格玛的奠基石之一是数据: “我们坚持要它。” “别告诉我们你认为现状是什么; 让数据来说话。“我们信赖上帝,其他的请拿数据来。” 你明白的!

An unfortunate side effect of this emphasis is the proliferation of useless data. If the useless data weren’t used, then collecting the data would merely be a waste of time. But if a person’s performance is being measured by these data, you can bet your last euro that the measurements will get a lot of attention, and it will drive a lot of behavior. And if the system doesn’t change, there’s still one way to make the measurements look better: cheat.
不幸的是,这样强调数据的重要性将会带来另外一个副作用——大量的无效数据。 如果无效数据不适用的话,顶多是浪费一些收集数据的时间。但是如果用这样的数据来衡量一个人的绩效,你可以用你压裤兜的钱来打赌,这样做的话,肯定会备受关注,招来大量的注意及行动。假如这样子的衡量系统不加以改变的话,大家为了让衡量的结果好看点,最后只有一条路:作假!

I often open my face-to-face training sessions with Dr. Deming’s Red Bead Experiment. It’s a great icebreaker, and it introduces some important statistical ideas. The experiment is actually a game with very simple rules. “Willing workers” are required to use a paddle with holes in it to sample beads from a container that has red and white beads in it. “We don’t want any red beads,” the workers are told. To drive the point home, there are “quality Inspectors” to check the samples for the unwanted red beads and to record the results, and “supervisors” to use the results to “coach” and discipline the hapless willing workers.
我常常用戴明博士的红珠实验来开始我的面对面的培训。这是一个伟大的打破僵局的游戏,并且它引入了一些重要的统计学理念。实际上,它是一个有着非常简单规则的游戏。“工人扮演者”必须使用一个漏勺,从一个有这红色和白色珠子的密闭容器里面取出珠子。“我们不希望有任何的红色珠子,“工人们被告知。回到试验中,有“质量检查员”检查样品中不必要的红色珠子和记录其结果,同时和“管理者们”使用这个结果来“辅导”,“培训”运气不好的工人。

Before the game concludes, there are always participants who, seeing a bunch of red beads on their paddle, quickly dump the sample back before the count can be made. Others deliberately pick out red beads and throw them back. Still others bring partially filled paddles to the quality inspectors. There are all manner of ways to try and beat the system. And this is just a fun game, played for no stakes at all. Imagine what people do when real consequences are on the line, such as pay and promotions.
在比赛结束前,总是有那样一些参与者们,他们一看到有一些红珠在他们的漏勺上,在计数之前快速的把取出的”样本”倒回箱子里。其他人刻意挑出红珠子并扔回去。所以,有一些带着装满偏袒的漏勺给”质量检验员”。有各种各样的方式去尝试和打破体制。当然,这仅仅只是一个有趣的,没有任何利益关系玩玩而已的游戏。由此,可以想象一下:当有切身利益关系的时候,如工资和晋升;人们会做些什么?

The most serious games are probably played in totalitarian countries, where factory managers are measured and sometimes executed when the results are less than required by the authorities. According to the History Learning Site, in Stalin’s Russia:
最严肃的游戏极有可能在极权国家上演,在那儿,工厂管理者是被测量的,并且当没有满足当局要求的时候甚至被处死。根据历史的学习网站,在斯大林时代的俄罗斯:

Factories took to inflating their production figures and the products produced were frequently so poor that they could not be used—even if the factory producing those goods appeared to be meeting its target. The punishment for failure was severe.
工厂开始喜欢好大喜功,夸大他们的生产数据,哪怕生产出的产品常常是不达标的,甚至没法使用——目的就是工厂生产的那些产品看起来是符合当局目标;因为失败的惩罚是严重的。

In the book Eat the Rich (Atlantic Monthly Press, 1999), author P. J. O’Rourke tells us that in the former USSR:
在《Eat the Rich》这本书 (Atlantic Monthly Press, 1999),作者P. J. O’Rourke告诉我们,在前苏联:

The trouble wasn’t that factory managers disobeyed orders. The trouble was that they obeyed them precisely. If a shoe factory was told to produce 1,000 shoes, it produced 1,000 baby shoes because they were the cheapest and easiest to make. If it was told to produce 1,000 men’s shoes, it made them all one size. If it was told to produce 1,000 shoes in a variety for men, women, and children, it produced 998 baby shoes, one pump, and a wingtip. If it was told to produce 3,000 pounds of shoes, it produced one enormous pair of concrete sneakers.
问题并不是工厂管理者不服从命令。真正麻烦的是:他们的“100%”精确服从。如果一个鞋厂被告知要生产1000双鞋,它会生产1000双婴儿的鞋子,因为他们最便宜和最容易制作。如果它被告知要生产1000双男鞋,那所有鞋都是一个尺码。如果它被告知要生产1000双鞋给不同的男人、女人和孩子,那么生产出来的是998双婴儿鞋、一双气囊鞋、一双尖头鞋。如果它被告知要生产3000磅鞋子,它会生产一双巨大的混凝土运动鞋。

Perhaps O’Rourke is exaggerating, but the point is still essentially valid: Metrics can—and probably will—be gamed. In lean Six Sigma there’s a common metric gaming activity that I call Denominator Improvement. One of the most popular metrics is defects per million opportunities, or DPMOs. The formula itself is quite simple: DPMO = 1,000,000 x Defects/Opportunities. If someone’s performance is being measured using DPMOs, he can make the metric look better by reducing defects (the numerator), or by increasing the number of opportunities (the denominator)。
就算O’rourke说的夸张了点,但有一点还是基本上有效的:度量指标能——或很大可能即将——是用来当游戏玩的。在精益六西格玛存在一个常见的度量指标的游戏活动,我称之为分母改进。最受欢迎的一个缺陷度量标准是每百万机会的缺陷数,或DPMOs。这个公式本身相当简单:DPMO = 1000000 x 缺陷/机会。如果某人的绩效是用DPMOs来计量的,他可以通过减少缺陷(分子) 或者增大机会值(分母)使指标更好看。

For example, we might be interested in the number of typing errors in this post. The DPMO metric might be 1,000,000 x Errors/Total Words. But if this number didn’t look good enough, I might also use 1,000,000 x Errors/Total Letters, or 1,000,000 x Errors/Total Characters, counting spaces and punctuation.
举个例子来说吧:我们可能会对这篇文章里出现的输入错误数量感兴趣。DPMO的指标可能是1000000 x 错误数/单词总数。但是如果这个数字不好看,我也会用1000000 x 错误数/总字母数,或1000000 x 错误数/总字符数,包括计数空格和标点符号。

The solution to metrics gaming is to use metrics to guide improvement, not to measure the performance of people. Metrics should be limited to those numbers that quantify an important outcome (Y metrics), or quantify an input that is critical to the quality of the outcome (a CTQ or X metric)。 The reason for quantifying these things is to discover, validate, and use a transfer function—such as Y = f(x), a model of the cause-and-effect relationship—to guide improvement planning and activity. When metrics serve a useful purpose such as this, the tendency to manipulate and game them is, if not eliminated, at least reduced.
这个问题的解决方案是,使用他们来度量改善活动,而不是来衡量一个人的绩效。数据指标,应限于量化那些重要的输出(Y),或量化CTQ ( 一个CTQ 或 X)。量化这些事情的理由是:发现、验证和使用一个转换函数,如Y = f(x),因果关联图去指导改善计划和改善活动。当度量指标是为想前述这样有益的目的服务的话,篡改或作假数据的倾向就算不会消失不见,至少也会减少。

ABOUT THE AUTHOR
关于作者

Tom Pyzdek
Thomas Pyzdek’s career in business process improvement spans more than 40 years. He is the author more than 50 copyrighted works including The Six Sigma Handbook (McGraw-Hill, 2003)。He provides online certification and training in Six Sigma and lean.
Tom Pyzdek
Tomas Pyzdek在业务流程改进方面有着超过40年职业生涯。他有着超过50个有版权的著作,其中包括《六西格玛指南》(McGraw-Hill, 2003)。他提供了在线的,在六西格玛和精益方面的认证和培训。

对“好”的回答一定要点个"赞",回答者需要你的鼓励!
已邀请:

小编D (威望:9) (广东 广州) 互联网 员工 - 记住该记住的,忘记改忘记的。改变能改变的,接受不...

赞同来自:

好,稍后整理好发你邮箱

4 个回复,游客无法查看回复,更多功能请登录注册

发起人

小编D
小编D

记住该记住的,忘记改忘记的。改变能改变的,接受不能改变的。

扫一扫微信订阅<6SQ每周精选>