The video owner has disabled playback on external websites.

This video is no longer available on YouTube.

This video cannot be played right now.

Watch on YouTube

解锁AI学习工具

注册即可使用强大工具,帮助你从每个视频中更快地学习。

场景解析 短语猎手 词卡复习 跟读练习 语音回放
免费注册
B1 中级 英语 21:05 Educational

The Zipf Mystery

Vsauce · 28,222,901 次观看 · 添加于 3 周前

学习统计

B1

CEFR 等级

5/10

难度

字幕 (222 片段)

00:00

Hey, Vsauce. Michael here. About 6 percent of everything you say and read and write is

00:08

the

00:12

"the" - is the most used word in the English language. About one out of every

00:18

16 words we encounter on a daily basis is "the." The top 20 most common English

00:25

words in order are "the," "of," "and," "to," "a," "in," "is," "I," "that," "it," "for," "you,"

00:32

"was," "with," "on," "as," "have," "but," "be," "they." That's a fun fact. A piece of trivia but it's

00:39

also more. You see, whether the most commonly used words are ranked across an

00:44

entire language, or in just one book or article, almost every time a bizarre

00:51

pattern emerges. The second most used word will appear about half as often as

00:57

the most used. The third one third as often. The fourth one fourth as often. The

01:04

fifth one fifth as often. The sixth one sixth as often, and so on all the way down.

01:10

Seriously. For some reason, the amount of times a word is used is just

01:16

proportional to one over its rank. Word frequency and ranking on a log log graph

01:23

follow a nice straight line. A power-law. This phenomenon is called Zipf's Law and

01:30

it doesn't only apply to English. It also applies to other languages, like, well,

01:38

all of them.

01:39

Even ancient languages we haven't been able to translate yet.

01:43

And here's the thing. We have no idea why. It's surprising that something as

01:50

complex as reality should be conveyed by something as creative as language in

01:56

such a predictable way. How predictable? Well, watch this. According to WordCount.org,

02:03

which ranks words as found in the British National Corpus, "sauce" is the

02:08

5,555th most common English word. Now, here is a list of how many times

02:15

every word on Wikipedia and in the entire Gutenberg Corpus of tens of

02:21

thousands of public domain books shows up. The most used word, 'the,' shows up about

02:27

181 million times. Knowing these two things, we can estimate that the word

02:34

"sauce" should appear about thirty thousand times on Wikipedia and

02:39

Gutenberg combined. And it pretty much does.

02:45

What gives? The world is chaotic. Things are distributed in myriad of ways, not just

02:51

power laws. And language is personal,

02:54

intentional, idiosyncratic. What about the world and ourselves could cause such

03:00

complex activities and behaviors to follow such a basic rule? We literally

03:08

don't know. More than a century of research has yet to close the case.

03:13

Moreover, Zipf's law doesn't just mysteriously describe word use. It's

03:20

also found in city populations, solar flare intensities, protein sequences and

03:25

immune receptors, the amount of traffic websites get, earthquake magnitudes, the

03:30

number of times academic papers are cited, last names, the firing patterns of

03:34

neural networks, ingredients used in cookbooks, the number of phone calls

03:38

people received, the diameter of Moon craters, the number of people that die

03:42

in wars, the popularity of opening chess moves, even the rate at which we forget.

03:47

There are plenty of theories about why language is 'zipf-y,' but no firm conclusions

03:54

and this video doesn't contain a definite explanation either. Sorry, I know

04:00

that's a bummer, since we appear to like knowing more than mystery. But that said,

04:06

we also ask more than we answer. So let's dive into Zipf's ramifications, some

04:13

related patterns, some possible explanations and the depth of the

04:18

mystery itself. Zipf's law was popularized by George Zipf,

完整字幕可在视频播放器中查看

用练习题学习

从这个视频生成词汇、语法和理解练习

词汇与语法 理解测验 雅思考试 写作练习
注册开始练习
还没有评论。成为第一个分享想法的人!

注册解锁全部功能

追踪进度、保存词汇、练习题目

免费开始学语言