返回首页

YanivErlich_2018P-_我们如何建立世界上最大的家谱树_

People use the internet for various reasons. 人们因各种原因使用着互联网。
It turns out that one of the most popular categories of website is something that people typically consume in private . 一种最受欢迎的网站 是人们常常私下浏览的东西。
categories:n.(人或事物的)类别,种类(category的复数) typically:adv.代表性地;作为特色地; consume:v.消耗;吃;毁灭;烧毁; in private:私下地;秘密地;
It involves curiosity , non-insignificant levels of self-indulgence and is centered around recording the reproductive activities of other people. 它涉及到好奇心, 无关自我放纵程度, 并以记录他人的生殖记录 为中心。
involves:v.包含;需要;牵涉;牵连;影响;(使)参加,加入(involve的第三人称单数) curiosity:n.好奇,好奇心;珍品,古董,古玩; self-indulgence:n.放纵;任性; reproductive:adj.生殖的;再生的;复制的;
(Laughter) (笑声)
Of course, I'm talking about genealogy -- 当然,我讨论的是家谱学——
genealogy:n.宗谱;血统;家系;系谱学;
(Laughter) (笑声)
the study of family history. 也就是对家庭历史的研究。
When it comes to detailing family history, in every family, we have this person that is obsessed with genealogy. 当说到详细的家族历史, 在每个家庭中,我们都有 一个痴迷于家谱的人。
obsessed:v.使痴迷;使迷恋;使着迷;(obsess的过去式和过去分词)
Let's call him Uncle Bernie. 我们姑且叫他伯尼叔叔吧。
Uncle Bernie is exactly the last person you want to sit next to in Thanksgiving dinner, because he will bore you to death with peculiar details about some ancient relatives . 伯尼叔叔正是你在感恩节晚餐上, 最不想坐在一起的人, 因为他会用一些远古亲戚的奇特细节 把你烦死。
bore:v.钻孔;使烦扰;n.孔;令人讨厌的人; peculiar:adj.特殊的;独特的;奇怪的;罕见的;n.特权;特有财产; relatives:n.亲戚;亲属;同类事物;(relative的复数)
But as you know, there is a scientific side for everything, and we found that Uncle Bernie's stories have immense potential for biomedical research. 但正如你所知, 任何事物都有科学的一面, 我们发现伯尼叔叔的故事 具有巨大的生物医学研究潜力。
scientific:adj.科学的,系统的; immense:adj.巨大的,广大的;无边无际的;非常好的; potential:n.潜能;可能性;[电]电势;adj.潜在的;可能的;势的; biomedical:adj.生物医学的;
We let Uncle Bernie and his fellow genealogists 我们让伯尼叔叔和他的家谱同行,
genealogists:n.系谱学者;网络释义:系谱学家;编年者;
When users upload their trees to the website, it scans their relatives, and if it finds matches to existing trees, it merges the existing and the new tree together. 当用户上传他们的家谱树到网站时, 网站会扫描他们的亲戚, 如果它发现匹配上现存的家谱树, 它会合并现存的和新的家谱树。
upload:v.上传; merges:v.合并(merge的第三人称单数);
The result is that large family trees are created, beyond the individual level of each genealogist. 结果是超大的家族树创建起来了, 超越了每个家谱学家的个人水平。
individual:n.个人;有个性的人;adj.单独的;个别的;
Now, by repeating this process with millions of people all over the world, we can crowdsource the construction of a family tree of all humankind . 现在,凭借着全球数百万人 不断重复这个过程, 我们可以众包全人类家谱树的建设。
process:v.处理;加工;列队行进;n.过程,进行;方法,adj.经过特殊加工(或处理)的; crowdsource:vi.众包,众包指的是一个公司或机构把过去由员工执行的工作任务,以自由自愿的形式外包给非特定的(而且通常是大型的)大众网络的做法; construction:n.建设;建筑物;解释;造句; family tree:n.家谱;谱系图; humankind:n.人类(总称);
Using this website, we were able to connect 125 million people into a single family tree. 使用这个网站, 我们能够在一颗家族树上连接 1.25 亿人。
I cannot draw the tree on the screens over here because they have less pixels than the number of people in this tree. 我无法在这里的屏幕上画家谱树, 因为它们的像素比 在这棵树上的人还少。
pixels:n.像素(组成屏幕图像的最小独立元素);(pixel的复数)
But here is an example of a subset of 6,000 individuals . 但这里有一个 6000 人的子集例子。
subset:n.[数]子集;子设备;小团体; individuals:n.[经]个人;[生物]个体(individual的复数);
Each green node is a person. 每一个绿色的节点是一个人。
node:n.节点;结点;茎节;(根或枝上的)节;
The red nodes represent marriages, and the connections represent parenthood . 红色的节点代表婚姻, 连接代表亲子关系。
nodes:n.茎节;(根或枝上的)瘤,节,结;节点;(node的复数) represent:v.代表;表现;描绘;回忆;再赠送; parenthood:n.亲子关系;父母身份;
In the middle of this tree, you see the ancestors. 在这个树的中央是祖先。
And as we go to the periphery , you see the descendants . 外围是后代。
periphery:n.外围,边缘;圆周;圆柱体表面; descendants:n.后裔;后代;子孙;派生物;(descendant的复数)
This tree has seven generations, approximately . 这棵树大约有 7 代。
approximately:adv.大约,近似地;近于;
Now, this is what happens when we increase the number of individuals to 70,000 people -- still a tiny subset of all the data that we have. 而这是当我们增加人数到 7 万人时 的样子—— 仍然是我们拥有的所有 数据集的一小部分。
Despite that, you can already see the formation of gigantic family trees with many very distant relatives. 即便如此,你已经能够看到由许多远亲 组成的一棵巨大家谱树。
Despite:prep.尽管,不管;n.轻视;憎恨;侮辱; formation:n.形成;编队;组成;队形; gigantic:adj.巨大的,庞大的; distant:adj.遥远的;远处的;久远的;
Thanks to the hard work of our genealogists, we can go back in time hundreds of years ago. 感谢家谱学家的努力工作, 我们可以回到数百年前。
For example, here is Alexander Hamilton, who was born in 1755. 比如,这是亚历山大·汉密尔顿, 他出生于 1755 年。
Alexander was the first US Secretary of the Treasury , but mostly known today due to a popular Broadway musical. 亚历山大是首任美国财政部长, 但主要由于一部流行的百老汇 音乐剧而广为人知。
Treasury:n.国库,金库;财政部;宝库; Broadway:n.百老汇(美国纽约市戏院集中的一条大街);
We found that Alexander has deeper connections in the showbiz industry. 我们发现亚历山大在娱乐圈 有更深厚的人脉。
showbiz:n.娱乐界;娱乐性行业;
In fact, he's a blood relative of ... 事实上,他是——
Kevin Bacon ! 凯文·贝肯的血亲!
Bacon:n.咸肉;腌肉;熏猪肉;
(Laughter) (笑声)
Both of them are descendants of a lady from Scotland who lived in the 13th century. 他们都是13世纪一位来自 苏格兰的女士的后代。
So you can say that Alexander Hamilton is 35 degrees of Kevin Bacon genealogy. 所以你可以说亚历山大·汉密尔顿 是 35 度凯文·贝肯的宗谱。
(Laughter) (笑声)
And our tree has millions of stories like that. 我们的家谱树有数百万类似的故事。
We invested significant efforts to validate the quality of our data. 我们投入了不小的工作 在验证数据的质量上。
invested:v.投资;投入;(invest的过去分词和过去式) significant:adj.重大的;有效的;有意义的;值得注意的;意味深长的;n.象征;有意义的事物; validate:v.证实;确认;使生效;批准;认可;
Using DNA, we found that .3 percent of the mother-child connections in our data are wrong, which could match the adoption rate in the US pre-Second World War. 使用DNA,我们发现我们 数据中有 0.3% 的母子关系 是错误的, 这可能与二战前美国的收养率相当。
adoption:n.采用;收养;接受;
For the father's side, the news is not as good: 1.9 percent of the father-child connections in our data are wrong. 父亲方面, 消息也并不乐观: 我们的数据中 1.9% 的 父子关系是错误的。
And I see some people smirk over here. 我看到有人在这儿讪笑。
smirk:v.傻笑;自鸣得意地笑;n.假笑;傻笑,得意的笑
It is what you think -- there are many milkmen out there. 这是你们在想的—— 外面有很多挤牛奶的人。
milkmen:n.牛奶商;送奶工人;挤奶员工;
(Laughter) (笑声)
However, this 1.9 percent error rate in patrilineal connections is not unique to our data. 然而这 1.9% 的父子关系错误率 不是我们数据独有的。
patrilineal:adj.父子相传的;父系的;父系血统; unique:adj.独特的,稀罕的;[数]唯一的;n.独一无二的人或物;
Previous studies found a similar error rate using clinical-grade pedigrees . 早先使用临床级血统的研究 也发现了类似的错误率。
Previous:adj.以前的;早先的;过早的;adv.在先;在…以前; clinical-grade:临床应用级别; pedigrees:家系;[遗]谱系,[遗]血统;衍生后代;
So the quality of our data is good, and that should not be a surprise. 所以我们的数据质量是良好的, 并且这也不应该是个意外。
Our genealogists have a profound , vested interest in correctly documenting their family history. 我们的系谱学家对正确记录 他们的家族史有着浓厚的兴趣。
profound:adj.深厚的;意义深远的;渊博的; vested interest:n.既得利益;
We can leverage this data to learn quantitative information about humanity , for example, questions about demography . 我们可以利用这些数据来 了解人类的定量信息, 比如,有关人口统计学的问题。
leverage:n.影响力;杠杆作用;杠杆效力;v.举债经营;借贷收购; quantitative:adj.定量的;量的,数量的; humanity:n.人类;人道;仁慈;人文学科; demography:n.人口统计学;
Here is a look at all our profiles on the map of the world. 这是我们的资料在世界地图上的样子。
profiles:n.配置文件; v.扼要描述;
Each pixel is a person that lived at some point. 每个像素代表一个生活在特定位置的人。
And since we have so much data, you can see the contours of many countries, especially in the Western world. 由于我们有很多数据, 你可以看到很多国家的轮廓, 尤其在西方世界。
contours:n.轮廓(contour的复数); v.画等高线; especially:adv.尤其;特别;格外;十分;
In this clip , we stratified the map that I've showed you and we compared it to known migration events. 在这个视频片段中, 我们把给你展示的地图 并且跟已知的迁移事件比较。
clip:v.剪辑;修剪;削减;固定;n.夹;夹子;速度;钳; stratified:adj.分层的;形成阶层的;分为不同等级的;v.使成层(stratify的过去分词); compared:adj.比较的,对照的; v.相比; (compare的过去式和过去分词) migration:n.迁移;迁徙;转移;移居;
The clip is going to show you that the deepest lineages in our data go all the way back to the UK, where they had better record keeping, and then they spread along the routes of Western colonialism . 这个视频将向你展示 我们数据中最深的血统, 可以追溯到英国, 这里有更好的记录保存, 然后他们沿着西方殖民主义 的道路传播。
lineages:n.世系;宗系;家系;血统;(lineage的复数) routes:n.路线;途径;路途;渠道;v.按某路线发送;(route的第三人称单数和复数) colonialism:n.殖民主义;殖民政策;
Let's watch this. 让我们来看看这个。
(Music) (音乐)
[Year of birth: ] 【出生年份:】
[1492 - Columbus sails the ocean blue] 【1492 - 哥伦布蓝色海洋航行时期】
[1620 - Mayflower lands in Massachusetts] 【1620 - 五月花号在马萨诸塞州着陆】
Mayflower:n.五月开花的草木;
[1652 - Dutch settle in South Africa] 【1652 - 荷兰人在南非定居】
Dutch:adj.荷兰的; n.荷兰人; v.费用平摊地; settle:v.定居;结算;停留;确定;n.高背长椅(老式木家具,有扶手,座下多带柜);
[1788 - Great Britain penal transportation to Australia starts] 【1788 - 英国开始向澳大利亚 进行刑事流放】】
Great Britain:n.大不列颠(包括英格兰、苏格兰和威尔士); penal:adj.刑事的;刑罚的; transportation:n.运输;运输系统;运输工具;流放;
[1836 - First migrants use Oregon Trail] 【1836 - 第一批移民来到俄勒冈小道】
migrants:n.移民;移居者;候鸟(migrant的复数形式); Oregon:n.俄勒冈州(美国州名);
[all activity] 【所有活动】
I love this movie. 我爱这个视频。
Now, since these migration events are giving the context of families, we can ask questions such as: 因为这些移民时间 提供了家庭的背景, 我们可以问诸如此类的问题:
context:n.环境;上下文;来龙去脉;
What is the typical distance between the birth locations of husbands and wives? 丈夫和妻子出生地 的特定距离是多少?
locations:n.地方;地点;位置;定位(location的复数)
This distance plays a pivotal role in demography, because the patterns in which people migrate to form families determine how genes spread in geographical areas. 这一距离在人口统计学中 起着重要的作用, 因为人们迁移形成家庭的模式 决定了基因如何在地理位置上传播。
pivotal:adj.关键的;中枢的;枢轴的;n.关键事物;中心事物; migrate:vi.移动;随季节而移居;移往;vt.使移居;使移植; determine:v.决定;确定;测定;查明;形成;影响;裁决;安排; genes:n.基因;(gene的复数) geographical:adj.地理的;地理学的;
We analyzed this distance using our data, and we found that in the old days, people had it easy. 我们使用我们的数据分析了这个距离, 我们发现在古时候, 人们过得很轻松。
analyzed:v.分析(analyze的过去式和过去分词);检讨;
They just married someone in the village nearby . 他们只是跟村子附近的某人结婚。
nearby:adj.附近的,邻近的;adv.在附近;prep.在…附近;
But the Industrial Revolution really complicated our love life. 但工业革命复杂化了 我们的爱情生活。
Industrial Revolution:n.工业革命; complicated:adj.复杂的;难懂的;v.使复杂化;(complicate的过去分词和过去式)
And today, with affordable flights and online social media , people typically migrate more than 100 kilometers from their place of birth to find their soul mate. 今天,凭着可负担的航班 和网络社交媒体, 人们通常从出生地迁移 100 多公里 来寻找灵魂伴侣。
affordable:adj.负担得起的; media:n.媒体;媒质(medium的复数);血管中层;浊塞音;中脉;
So now you might ask: 所以现在你可能会问:
OK, but who does the hard work of migrating from places to places to form families? 好吧,但是谁会卖力从一个地方 迁移到另一个地方 去构建家庭呢?
migrating:v.迁移;迁徙;移居(migrate的现在分词);
Are these the males or the females? 是男人还是女人?
We used our data to address this question, and at least in the last 300 years, we found that the ladies do the hard work of migrating from places to places to form families. 我们使用我们的数据解答了这个问题, 至少在过去 300 年中, 我们发现女性从一个地方 迁移到另一个地方 去构建家庭上是最辛苦的。
Now, these results are statistically significant, so you can take it as scientific fact that males are lazy. 这些结果在统计上很显著, 所以你可以把男性懒惰当作科学事实。
statistically:adv.统计地;统计学上;
(Laughter) (笑声)
We can move from questions about demography and ask questions about human health. 我们可以把问题从人口统计学开始 转向人类健康问题。
For example, we can ask to what extent genetic variations account for differences in life span between individuals. 比如,我们可以问 遗传变异能在多大程度上影响个体的 寿命差异。
extent:n.程度;范围;长度; genetic:adj.基因的;遗传学的; variations:n.变奏曲,变更;[生物]变种(variation的复数形式); account for:对…负有责任;对…做出解释;说明…的原因;导致;(比例)占; span:n.持续时间; v.持续;
Previous studies analyzed the correlation of longevity between twins to address this question. 之前的研究通过分析 双胞胎寿命的相关性 来解答这个问题。
correlation:n.[数]相关,关联;相互关系; longevity:n.长寿,长命;寿命;
They estimated that the genetic variations account for about a quarter of the differences in life span between individuals. 他们估计出遗传变异 对个体寿命差异的影响大约占 1/4。
estimated:adj.估计的;预计的;估算的;
But twins can be correlated due to so many reasons, including various environmental effects or a shared household . 但双胞胎之间的关联有很多原因, 包括多样的环境影响 或共同的家庭。
correlated:v.相互关联影响;相互依赖;(correlate的过去式和过去分词) household:n.家庭;一家人;同住一所(或一套)房子的人;adj.家庭的;家常的;王室的;
Large family trees give us the opportunity to analyze both close relatives, such as twins, all the way to distant relatives, even fourth cousins. 庞大的家谱树给了我们分析这些近亲, 比如双胞胎, 到远房亲戚, 甚至四代表亲这样的机会。
This way we can build robust models that can tease apart the contribution of genetic variations from environmental factors . 这样我们可以构建稳健的模型, 从环境因素中分离出 遗传变异的贡献来。
robust:adj.强健的;健康的;粗野的;粗鲁的; tease:v.取笑; n.戏弄; contribution:n.捐款;捐资;定期缴款;贡献;促成作用;稿件; factors:n.因素(factor的复数); v.做代理商;
We conducted this analysis using our data, and we found that genetic variations explain only 15 percent of the differences in life span between individuals. 我们使用数据执行了这个分析, 发现遗传变异只解释了 15% 的 个体寿命差异。
conducted:v.组织;安排;实施;执行;指挥;带领;引导;(conduct的过去分词和过去式) analysis:n.分析;分解;验定;
That is five years, on average. 平均而言, 就是 5 年之差。
So genes matter less than what we thought before to life span. 所以基因对寿命的重要性 比我们之前想象的少。
And I find it great news, because it means that our actions can matter more. 我发现这是个好消息, 因为这意味着我们的行动更为重要。
Smoking, for example, determines 10 years of our life expectancy -- twice as much as what genetics determines. 举个例子,吸烟会影响 大约10年的预期寿命—— 是基因所能影响的两倍。
determines:v.查明;测定;准确算出;决定;裁决;安排;(determine的第三人称单数) life expectancy:预期寿命; genetics:n.遗传学;
We can even have more surprising findings as we move from family trees and we let our genealogists document and crowdsource DNA information. 随着我们从家谱树展开, 让我们的家谱学专家建档, 并且众包DNA信息, 我们能有更多惊奇的发现。
findings:n.调查发现;判决;裁决;(finding的复数)
And the results can be amazing. 结果将是惊人的。
It might be hard to imagine, but Uncle Bernie and his friends can create DNA forensic capabilities that even exceed what the FBI currently has. 可能令人难以想象, 伯尼叔叔和他的朋友 能够创建 DNA 法医能力, 甚至超过了 FBI 目前拥有的水平。
forensic:adj.法院的;辩论的;适于法庭的; exceed:v.超过(数量);超越(法律、命令等)的限制; currently:adv.当前;一般地;
When you place the DNA on a large family tree, you effectively create a beacon that illuminates the hundreds of distant relatives that are all connected to the person that originated the DNA. 当你把 DNA 放在一棵大的家谱树中, 你就有效地创造了一个照亮 数百个远亲的灯塔, 他们都与 DNA 的拥有者有联系。
beacon:n.灯塔; v.像灯塔般照耀; illuminates:v.照明;阐释(illuminate的第三人称单数形式); originated:v.起源;发源;创立;创建;发明;(originate的过去式和过去分词)
By placing multiple beacons on a large family tree, you can now triangulate the DNA of an unknown person, the same way that the GPS system uses multiple satellites to find a location. 通过在一棵大的家谱树中 放置不同的灯塔, 你现在可以对一个陌生人 的 DNA 进行三角测量, 就跟 GPS 系统利用不同的卫星 来定位一样。
multiple:adj.数量多的;多种多样的;n.倍数; beacons:n.[水文]指向标; v.照亮,指引(beacon的第三人称单数); triangulate:vt.把…分成三角形; adj.由三角形组成的;
The prime example of the power of this technique is capturing the Golden State Killer, one of the most notorious criminals in the history of the US. 这种技术威力一个的主要例子 是追捕“金州杀手”, 美国历史上最臭名昭著的罪犯之一。
prime:adj.主要的; v.极好地; n.初期; v.使准备好; technique:n.技巧,技术;手法; capturing:v.俘虏;捕获;攻占;夺得;(capture的现在分词) notorious:adj.声名狼藉的,臭名昭著的;
The FBI had been searching for this person for over 40 years. FBI 已经寻找这人超过 40 年。
They had his DNA, but he never showed up in any police database. 他们有他的 DNA, 但他从未出现在警方的数据库中。
About a year ago, the FBI consulted a genetic genealogist, and she suggested that they submit his DNA to a genealogy service that can locate distant relatives. 大约一年前,FBI 咨询了 一位基因谱系学家, 她建议他们提交他的 DNA 到可以定位远房亲戚 的家谱服务平台上。
consulted:v.咨询;请教;商议;查阅;查询;参看(consult的过去分词和过去式) submit:vt.使服从;主张;呈递;vi.提交;服从; locate:v.确定…的准确地点;把…安置在(或建造于);创办于(某地);
They did that, and they found a third cousin of the Golden State Killer. FBI 这样做了, 他们找到了金州杀手的第三代表亲。
They built a large family tree, scanned the different branches of that tree, until they found a profile that exactly matched what they knew about the Golden State Killer. 他们构建了一棵巨大的家谱树, 扫描树上的不同分支, 直到他们找到完美匹配 他们所了解的金州杀手信息的人。
They obtained DNA from this person and found a perfect match to the DNA they had in hand. 他们从这人身上取得 DNA 并发现 跟他们手上的 DNA 一致。
obtained:v.获得(obtain的过去分词);
They arrested him and brought him to justice after all these years. 过了这么些年,他们终于逮捕了他, 并绳之与法。
justice:n.公平;公正;司法制度;审判;
Since then, genetic genealogists have started working with local US law enforcement agencies to use this technique in order to capture criminals. 自那之后,基因谱系学家开始 跟美国当地执法机构合作, 使用这种技术来抓捕罪犯。
enforcement:n.执行,实施;强制; agencies:n.代理;代理处(agency的复数); capture:v.俘虏;捕获;攻占;夺得;刻画,描述;n.(被)捕获;(被)俘获
And only in the past six months, they were able to solve over 20 cold cases with this technique. 仅仅在过去的 6 个月, 他们使用这个技术就破获了 超过 20 个铁证悬案。
Luckily, we have people like Uncle Bernie and his fellow genealogists 幸好,我们有这群人, 像伯尼叔叔和他的家谱学同行,
These are not amateurs with a self-serving hobby. 他们不只是业余爱好者。
amateurs:n.业余爱好者;业余运动员;生手;外行;(amateur的复数) self-serving:adj.自私自利的,自私的;
These are citizen scientists with a deep passion to tell us who we are. 他们是满怀热情的公民科学家, 想要揭开我们所有人身份的秘密。
passion:n.激情;热情;酷爱;盛怒;
And they know that the past can hold a key to the future. 他们知道,过去是通向未来的钥匙。
Thank you very much. 谢谢大家。
(Applause) (鼓掌)