返回首页

JenniferGolbeck_2013X-_炸薯条之迷_你以为社交媒体的“赞”就是单纯的“赞”而已吗_

If you remember that first decade of the web, it was really a static place. 如果你还记得网络时代的头十年, 网络是一个水尽鹅飞的地方。
static:adj.静止的; n.[物]静电(干扰);
You could go online, you could look at pages, and they were put up either by organizations who had teams to do it or by individuals who were really tech-savvy for the time. 你可以上网,你可以浏览网页, 当时的网站 要么是由某个组织的专门团队建立, 要么就是由真正的技术行家所做, 这就是当时情况。
organizations:n.组织,构造,有机体(organization的复数);组织机构; individuals:n.[经]个人;[生物]个体(individual的复数);
And with the rise of social media and social networks in the early 2000s, the web was completely changed to a place where now the vast majority of content we interact with is put up by average users, either in YouTube videos or blog posts or product reviews or social media postings. 但在二十一世纪初 随着社交媒体以及社交网络的兴起, 网络发生了翻天覆地的变化: 如今网络上大部分的互动内容 都是由大众网络用户提供, 既有Youtube视频,也有博客文章, 既有产品评论,也有社交媒体发布。
media:n.媒体;媒质(medium的复数);血管中层;浊塞音;中脉; majority:n.大部分:大多数:多数票:成年人: content:n.内容,目录;满足;容量;adj.满意的;vt.使满足; interact:v.互相影响;互相作用;n.幕间剧;幕间休息; blog:n.博客;部落格;网络日志; reviews:n.综述; v.回顾;
And it's also become a much more interactive place, where people are interacting with others, they're commenting, they're sharing, they're not just reading. 与此同时,互联网成为了一个有更多互动的地方, 人们在这里互相交流、 互相评论、互相分享, 而不只是阅读信息。
interactive:adj.合作的;相互影响的;互相配合的;交互式的;n.交互式视频设备; interacting:v.交流;沟通;合作;相互影响;(interact的现在分词)
So Facebook is not the only place you can do this, but it's the biggest, and it serves to illustrate the numbers. 面谱网不是唯一一个你可以做这些事情的地方, 但它确实是最大的一个, 并且它用数字来证明这点。
illustrate:v.解释;加插图于;给(书等)做图表;表明…真实;
Facebook has 1.2 billion users per month. 面谱网每个月有12亿用户。
So half the Earth's Internet population is using Facebook. 由此可见,地球上一半的互联网用户 都在使用面谱网。
They are a site , along with others, that has allowed people to create an online persona with very little technical skill, and people responded by putting huge amounts of personal data online. 这些都是网站, 允许人们在网上创建不同的角色, 但这些人又不需要有多少计算机技能, 而人们的反应是
site:n.地点;位置;场所;v.设置;为…选址; technical:adj.工艺的,科技的;技术上的;专门的; responded:v.回答,回应;作出反应;响应;反应灵敏;(respond的过去式和过去分词) personal:adj.个人的;身体的;亲自的;n.人事消息栏;人称代名词;
So the result is that we have behavioral , preference , demographic data for hundreds of millions of people, which is unprecedented in history. 结果是,我们拥有数以亿计人的 以及人口数据资料。 这在历史上前所未有。
behavioral:adj.行为的; preference:n.偏爱;爱好;喜爱;偏爱的事物; demographic:adj.人口统计学的;人口学的; unprecedented:adj.空前的;无前例的;
And as a computer scientist, what this means is that 对于作为计算机科学家的我来说,这意味着
I've been able to build models that can predict all sorts of hidden attributes for all of you that you don't even know you're sharing information about. 我能够建立模型 来预测各种各样的 你或许完全没有意识到的 与你所分享的信息相关的隐藏信息。
predict:v.预报;预言;预告; attributes:v.把…归因于; n.属性; (attribute的第三人称单数和复数)
As scientists, we use that to help the way people interact online, but there's less altruistic applications, and there's a problem in that users don't really understand these techniques and how they work, and even if they did, they don't have a lot of control over it. 作为科学家,我们利用这些信息 来帮助人们在网上交流。 但也有人用此来谋取自己的私欲, 而问题是,用户并没有真正理解 其中用到的技术和技术的应用方式。 即便理解了,也不见得他们有话事权。
altruistic:adj.利他的;无私心的; techniques:n.技巧;技艺;工艺;技术;(technique的复数)
So what I want to talk to you about today is some of these things that we're able to do, and then give us some ideas of how we might go forward to move some control back into the hands of users. 所以,我今天想谈谈 我们能够做的一些事情, 也启发我们 如何改善情况、让话事权回归用户。
So this is Target, the company. 这是塔吉特百货公司的商标。
I didn't just put that logo on this poor, pregnant woman's belly . 我并不单单把那个商标 放在这个可怜的孕妇的肚子上。
logo:n.标志; pregnant:adj.怀孕的;富有意义的; belly:n.腹部;胃;食欲;v.涨满;鼓起;
You may have seen this anecdote that was printed in Forbes magazine where Target sent a flyer to this 15-year-old girl with advertisements and coupons for baby bottles and diapers and cribs two weeks before she told her parents that she was pregnant. 或许在福布斯杂志上 你看过这么一则趣事: 塔吉特百货公司给这个15岁女孩寄了一份传单, 传单上都是婴儿奶瓶、尿布、 婴儿床的广告和优惠券。 这一切发生在 她把怀孕消息告诉父母的两周前。
anecdote:n.轶事;奇闻;秘史; Forbes:n.福布斯(美国著名财经杂志);福布斯(美国出版及媒体集团);福布斯(姓氏); coupons:n.优惠券;息票;[化工]取样片(coupon的复数); diapers:n.(婴儿的)尿布,尿片;(diaper的复数) cribs:n.婴儿床;栅栏;食槽;v.剽窃;拘禁,关入栅栏;抄袭;
Yeah, the dad was really upset . 没错,女孩的父亲很生气。
upset:adj.沮丧; v.打乱; n.苦恼; (意外的)混乱;
He said, "How did Target figure out that this high school girl was pregnant before she told her parents?" 他说:”塔吉特是如何 在连这个高中女生的父母都尚未知情之前 就知道她怀孕了?“
It turns out that they have the purchase history for hundreds of thousands of customers and they compute what they call a pregnancy score, which is not just whether or not a woman's pregnant, but what her due date is. 原来,塔吉特有成千上万的顾客, 并拥有他们的购买历史记录, 他们用计算机推算出他们所谓的“怀孕分数”, 不仅能知道一个女性是否怀孕, 而且还能计算出她的分娩日期。
purchase:n.购买;采购;购买的东西;购买项目;v.购买;采购; compute:vt.计算;估算;用计算机计算;vi.计算;估算;推断;n.计算;估计;推断; pregnancy:n.怀孕;丰富,多产;意义深长; whether or not:是否…;
And they compute that not by looking at the obvious things, like, she's buying a crib or baby clothes, but things like, she bought more vitamins than she normally had, or she bought a handbag that's big enough to hold diapers. 他们计算出的结果 不单单是基于一些显而易见的事情, 比如说,她准备买个婴儿床或孩子的衣服, 更是基于其他一些事情, 例如她比平时多买了维他命, 或她买了一个新的手提包 大得可以放尿布。
obvious:adj.明显的;显著的;平淡无奇的; vitamins:n.[生化]维生素;[生化]维他命; normally:adv.正常地;通常地,一般地;
And by themselves, those purchases don't seem like they might reveal a lot, but it's a pattern of behavior that, when you take it in the context of thousands of other people, starts to actually reveal some insights . 单独来看这些消费记录 或许并不能说明什么, 但这确是一种行为模式, 当你有大量人口背景作比较, 这种行为模式就开始透露一些见解。
purchases:v.[贸易]购买; n.所购物; reveal:v.显示;透露;揭露;泄露;n.揭露;暴露;门侧,窗侧; context:n.环境;上下文;来龙去脉; insights:n.洞察力;眼力;深刻见解(insight的复数);
So that's the kind of thing that we do when we're predicting stuff about you on social media. 当我们根据社交媒体来预测关于你的一些事情时, 这便是我们常做的一类事情。
predicting:v.预言;预告;预报;(predict的现在分词) stuff:n.东西:物品:基本特征:v.填满:装满:标本:
We're looking for little patterns of behavior that, when you detect them among millions of people, lets us find out all kinds of things. 我们着眼于零星的行为模式, 当你在众人中发现这些行为模式时, 会帮助我们发现各种各样的事情。
detect:vt.察觉;发现;探测;
So in my lab and with colleagues , we've developed mechanisms where we can quite accurately predict things like your political preference, your personality score, gender , sexual orientation , religion, age, intelligence , along with things like how much you trust the people you know and how strong those relationships are. 在我的实验室,在同事们的合作下, 我们已经开发了一些机制 来较为准确地推测一些事情, 比如你的政治立场、 你的性格得分、性别、性取向、 宗教信仰、年龄、智商, 另外还有: 你对认识的人的信任程度、 你的人际关系程度。
colleagues:n.同事;同行(colleague的复数); mechanisms:n.机制;[机]机构(mechanism的复数);机械;[机]机构学; accurately:adv.精确地,准确地; personality:n.性格;个性;人格;魅力;气质;名人;特色; gender:n.性别; sexual:adj.性的;性别的;有性的; orientation:n.方向;定向;适应;情况介绍;向东方; intelligence:n.智力;智慧;才智;(尤指关于敌国的)情报;
We can do all of this really well. 我们能够很好地完成这些推测。
And again, it doesn't come from what you might think of as obvious information. 我在这里在强调一遍,这种推测并基于 在你看来显而易见的信息。
So my favorite example is from this study that was published this year in the Proceedings of the National Academies . 我最喜欢的例子是来自 今年发表在 美国国家论文集上的一个研究。
Proceedings:n.诉讼;行动(proceeding的复数形式);会议记录;议程; Academies:n.院校(academy的复数形式);学院;
If you Google this, you'll find it. 你可以在谷歌搜索找到这篇文章。
Google:谷歌;谷歌搜索引擎;
It's four pages, easy to read. 这篇文章总共四页,容易阅读。
And they looked at just people's Facebook likes, so just the things you like on Facebook, and used that to predict all these attributes, along with some other ones. 他们仅仅研究了人们在面谱网上的“赞”, 也就是你在面谱网上喜欢的事情。 他们利用这些数据来预测 之前所说的所有特性,还有其他的一些特性。
And in their paper they listed the five likes that were most indicative of high intelligence. 在文章中列举了 最能够显示高智商的五个“赞”。
indicative:adj.象征的;指示的;表示…的;n.陈述语气;陈述语气的动词形式;
And among those was liking a page for curly fries. (Laughter) 在这五项中 赞“炸扭薯”页面的是其中之一
Curly fries are delicious, but liking them does not necessarily mean that you're smarter than the average person. 炸扭薯很好吃, 但喜欢吃炸扭薯 并不一定意味着你比一般人聪明。
necessarily:adv.必要地;必定地,必然地;
So how is it that one of the strongest indicators of your intelligence is liking this page when the content is totally irrelevant to the attribute that's being predicted ? 那么为什么喜欢某个页面 就成为显示你智商 的重要因素, 尽管该页面的内容和所预测的属性 与此毫不相干?
indicators:n.指示信号;标志;指针;方向灯;(indicator的复数) irrelevant:adj.不相干的;不切题的; predicted:v.预言;预告;预报;(predict的过去分词和过去式)
And it turns out that we have to look at a whole bunch of underlying theories to see why we're able to do this. 事实是我们必须审视 大量的基础理论, 从而了解我们是如何做到准确推测的。
underlying:adj.根本的; v.构成…的基础; (underlie的现在分词)
One of them is a sociological theory called homophily , which basically says people are friends with people like them. 其中一个基础理论是社会学的同质性理论, 主要意思是人们和自己相似的人交朋友。
sociological:adj.社会的(等于sociologic);社会学的;针对社会问题的; homophily:同质性;同质相吸原则;同质互动 basically:adv.主要地,基本上;
So if you're smart, you tend to be friends with smart people, and if you're young, you tend to be friends with young people, and this is well established for hundreds of years. 所以说,如果你很聪明,你倾向于和聪明的人交朋友。 如果你还年轻,你倾向于和年轻人交朋友。 这是数百年来 公认的理论。
well established:adj.得到确认的; v.已确立起来; (well-establish的过去分词);
We also know a lot about how information spreads through networks. 我们很清楚 信息在网络上传播的传播途径。
It turns out things like viral videos or Facebook likes or other information spreads in exactly the same way that diseases spread through social networks. 结果是,流行的视频、 脸书上得到很多“赞”的内容、 或者其他信息的传播, 同疾病在社交网络中蔓延的方式是相同的。
viral:adj.滤过性毒菌引起的;滤过性毒菌的; diseases:n.[医]病(disease的复数);[医]疾病;[植保]病害;疾病种类;
So this is something we've studied for a long time. 我们在这方面已经研究很久了。
We have good models of it. 我们己经建立了很好的模型。
And so you can put those things together and start seeing why things like this happen. 你能够将所有这些事物放在一起, 看看为什么这样的事情会发生。
So if I were to give you a hypothesis , it would be that a smart guy started this page, or maybe one of the first people who liked it would have scored high on that test. 如果要我给你一个假说的话, 我会猜测一个聪明的人建立了这个页面, 或者第一个喜欢这个页面的人 拥有挺高的智商得分。
hypothesis:n.假设;
And they liked it, and their friends saw it, and by homophily, we know that he probably had smart friends, and so it spread to them, and some of them liked it, and they had smart friends, and so it spread to them, and so it propagated through the network to a host of smart people, so that by the end, the action of liking the curly fries page is indicative of high intelligence, not because of the content, but because the actual action of liking reflects back the common attributes of other people who have done it. 他们喜欢了这个页面,然后他们的朋友看到了, 根据同质性理论,我们知道这些人可能有聪明的朋友, 然后他们看到这类信息,他们中的一部分人也喜欢, 他们也有聪明的朋友, 所以这类信息也传到其他朋友那里, 所以信息就在网络上 在聪明人的圈子里流传开来了, 因此到了最后, 喜欢炸扭薯的这个页面 就成了高智商的象征, 而不是因为内容本身, 而是“喜欢”这一个实际行动 反映了那些也付诸同样行动的人 的相同特征。
propagated:传播; reflects:v.反映;映出(影像);反射;表明,表达;(reflect的第三人称单数)
So this is pretty complicated stuff, right? 听起来很复杂,对吧?
complicated:adj.复杂的;难懂的;v.使复杂化;(complicate的过去分词和过去式)
It's a hard thing to sit down and explain to an average user, and even if you do, what can the average user do about it? 对于一般用户来说 它比较难解释清楚,就算你解释清楚了, 一般用户又能利用它来干嘛呢?
How do you know that you've liked something that indicates a trait for you that's totally irrelevant to the content of what you've liked? 你又怎么能知道你喜欢的事情 反映了你什么特征 而且这个特征还和你喜欢的内容毫不相干呢?
indicates:v.表明指示,显示;(indicate的第三人称单数) trait:n.特性,特点;品质;少许;
There's a lot of power that users don't have to control how this data is used. 用户其实没有太多的能力 去控制这些数据的使用。
And I see that as a real problem going forward. 我把这个看作将来的真实问题,
So I think there's a couple paths that we want to look at if we want to give users some control over how this data is used, because it's not always going to be used for their benefit. 我认为,要是我们想让用户拥有 使用这些数据的能力, 那么有几条路径 我们需要探究, 因为这些数据并不总是 用来为他们谋利益。
An example I often give is that, if I ever get bored being a professor, 这有一个我经常举的例子, 如果我厌倦了当一名教授,
bored:adj.无聊的;厌倦的;烦闷的;v.使厌烦;钻,凿,挖;(bore的过去分词和过去式)
I'm going to go start a company that predicts all of these attributes and things like how well you work in teams and if you're a drug user, if you're an alcoholic . 我会选择自己开家公司 这家公司能预测这些特性和事物 例如你在团队里的能力 例如你是否是一个吸毒者或酗酒者。
predicts:vt.预报,预言;预知;vi.作出预言;作预料,作预报; alcoholic:adj.酒精的,含酒精的;n.酒鬼,酗酒者;
We know how to predict all that. 我们知道如何去预测这些特性。
And I'm going to sell reports to H.R. companies and big businesses that want to hire you. 然后我就会把这些报告 卖给那些人力资源公司 和想要雇佣你的大公司。
We totally can do that now. 我们完全可以做到这点。
I could start that business tomorrow, and you would have absolutely no control over me using your data like that. 我明天就能开始这个项目, 并且你对我这用使用你的数据 是一点办法也没有的。
absolutely:adv.绝对地;完全地;
That seems to me to be a problem. 这对我来说是一个问题。
So one of the paths we can go down is the policy and law path. 所以我们可选的其中一条路径 是政策和法律这条途径。
policy:n.政策,方针;保险单;
And in some respects, I think that that would be most effective , but the problem is we'd actually have to do it. 某程度上我觉得这可能是最有效的 但问题是,事实上我们将不得不这么做。
effective:adj.有效的,起作用的;实际的,实在的;给人深刻印象;
Observing our political process in action makes me think it's highly unlikely that we're going to get a bunch of representatives to sit down, learn about this, and then enact sweeping changes to intellectual property law in the U.S. 观察我们目前的政治进程 让我觉得在美国 把一帮代表们聚在一起 让他们坐下来理解这个问题, 然后颁布有关知识产权法方面的颠覆性条例, 让用户掌控自己的数据,
Observing:adj.观察的;注意的;观察力敏锐的;v.观察;遵守(observe的现在分词); process:v.处理;加工;列队行进;n.过程,进行;方法,adj.经过特殊加工(或处理)的; highly:adv.高度地;非常;非常赞许地; unlikely:adj.不大可能发生的;非心目中的;非想象的;难以相信的; a bunch of:一群;一束;一堆; representatives:n.代表;众议院(representative的复数形式); enact:vt.颁布;制定法律;扮演;发生; sweeping:n.扫除; adj.影响广泛的; v.打扫; (sweep的现在分词) intellectual property:知识产权;
so users control their data. 这似乎是不可能的。
We could go the policy route , where social media companies say, you know what? You own your data. 我们可以走政策途径, 这样社交媒体公司就会告诉你, 你知道吗?你的确拥有你的数据。
route:n.路线;路途;固定线路;途径;v.按某路线发送;
You have total control over how it's used. 你绝对能自己决定要怎么去用。
The problem is that the revenue models for most social media companies rely on sharing or exploiting users' data in some way. 但问题在于大部分的社交媒体公司 他们的盈利模式 在某方面取决于分享或挖掘用户的数据资料。
revenue:n.收益;营业额;税务署; rely:vi.依靠;信赖; exploiting:v.利用…为自己谋利;剥削;压榨;运用;发挥;(exploit的现在分词)
It's sometimes said of Facebook that the users aren't the customer, they're the product. 所以有时会说面谱网的用户并不是顾客,
And so how do you get a company to cede control of their main asset back to the users? 那么你要怎样让一个公司 将他们的主要资产控制权 双手拱让给用户呢?
asset:n.资产;优点;有用的东西;有利条件;财产;有价值的人或物;
It's possible, but I don't think it's something that we're going to see change quickly. 这是可能的,但我不觉得 我们能很快见证这种改变。
So I think the other path that we can go down that's going to be more effective is one of more science. 所以我认为我们得走另一条途径 一条更有效的途径, 一条更加科学的途径。
It's doing science that allowed us to develop all these mechanisms for computing this personal data in the first place . 这途径是开发一种技术 让我们能够发展所有这些机制 来首先处理自己的个人信息资料。
computing:n.计算;计算机技术;信息处理技术;v.计算;求出;(compute的现在分词) in the first place:首先;起初;
And it's actually very similar research that we'd have to do if we want to develop mechanisms that can say to a user, "Here's the risk of that action you just took." 而这很接近 我们必须做的研究, 要是我们想要发展这些机制 跟用户说明, “这样做你需要承担那样的风险。”
By liking that Facebook page, or by sharing this piece of personal information, you've now improved my ability to predict whether or not you're using drugs or whether or not you get along well in the workplace . 你在面谱网上点“赞” 或者分享一些私人信息, 就相当于增强了我的能力 去预测你是不是在吸毒 或者你在工作中是否顺利。
improved:adj.改良的:v.改进:改善(improve的过去分词和过去式) workplace:n.工作场所;车间;
And that, I think, can affect whether or not people want to share something, keep it private, or just keep it offline altogether. 我觉得,这样做 能够影响人们分享的决定: 是要保持私隐,还是在网上只字不提。
offline:n.脱机;挂线;adj.脱机的;离线的,未连线的;adv.未连线地;
We can also look at things like allowing people to encrypt data that they upload , so it's kind of invisible and worthless to sites like Facebook or third party services that access it, but that select users who the person who posted it want to see it have access to see it. 我们也可以探究一些别的,例如 让人们去给上传的东西加密, 那么像面谱网这样的网站 或其他能获取信息的第三方来说 这些信息就隐秘很多,也少了很多意义, 而且只有上传人指定的用户 才有浏览的权限。
encrypt:vt.将…译成密码; upload:v.上传; invisible:adj.看不见的;n.看不见的人或物; worthless:adj.无价值的;不值钱的;卑微的; sites:n.网站(site的复数);遗址,举办地点;v.使...位于(site的单三形式);
This is all super exciting research from an intellectual perspective , and so scientists are going to be willing to do it. 从智能的角度来看, 这是一个非常振奋人心的研究, 而且科学家们也会乐意去做这样的事。
perspective:n.观点;远景;透视图;adj.透视的;
So that gives us an advantage over the law side. 这样在法律方面,我们就有优势了。
advantage:n.有利条件:优势:优点:
One of the problems that people bring up when I talk about this is, they say, you know, if people start keeping all this data private, all those methods that you've been developing to predict their traits are going to fail. 当我谈论到这个话题时, 人们提到的其中一个问题,就是 如果当人们开始把这些数据进行保密, 那些你研发的用来预测 人们特性的手段都会作废。
traits:n.特性,特质,性格(trait的复数);
And I say, absolutely, and for me, that's success, because as a scientist, my goal is not to infer information about users, it's to improve the way people interact online. 我会说,绝对会作废,但对我来说,这是成功, 因为作为一个科学家, 我的目标不是去推测出用户的信息, 而是提高人们在网上互动的方式。
infer:v.推断;推论;暗示;推理;
And sometimes that involves inferring things about them, but if users don't want me to use that data, 虽然有时涉及到推测用户的资料, 但如果用户不希望我们用他们的数据,
involves:v.包含;需要;牵涉;牵连;影响;(使)参加,加入(involve的第三人称单数) inferring:v.推断;推论;推理;暗示;意指;(infer的现在分词)
I think they should have the right to do that. 我觉得他们应该有权去拒绝。
I want users to be informed and consenting users of the tools that we develop. 我希望用户能被告知 并且赞同我们开发的这种工具。
informed:adj.见多识广的; v.通知; (inform的过去分词和过去式) consenting:adj.同意的;准许的;v.允许;赞成(consent的现在分词);
And so I think encouraging this kind of science and supporting researchers who want to cede some of that control back to users and away from the social media companies means that going forward, as these tools evolve and advance, means that we're going to have an educated and empowered user base, and I think all of us can agree that that's a pretty ideal way to go forward. 所以我认为,鼓励这类科学, 支持这些研究者们 这些愿意放弃部分控制,退还给用户们, 并且不让社交媒体公司接触数据的研究者们 随着这些工具的进化和提高 这一切意味着向前的发展, 意味着我们将会拥有一个 有素质有权力的用户基础, 我觉得我们都会同意 这是一个理想的前进目标。
evolve:v.(使)逐渐形成;进化;进化形成; empowered:v.授权;给(某人)…的权力;(empower的过去分词和过去式)
Thank you. 谢谢。
(Applause) (掌声)