The video owner has disabled playback on external websites.

This video is no longer available on YouTube.

This video cannot be played right now.

Watch on YouTube

AI 학습 도구 잠금 해제

가입하여 모든 동영상에서 더 빠르게 학습할 수 있는 강력한 도구를 이용하세요.

장면 설명 표현 찾기 플래시카드 복습 섀도잉 연습 되말하기
무료 회원가입
B1 중급 영어 21:05 Educational

The Zipf Mystery

Vsauce · 28,222,879 조회수 · 추가됨 3일 전

학습 통계

B1

CEFR 레벨

5/10

난이도

자막 (222 세그먼트)

00:00

Hey, Vsauce. Michael here. About 6 percent of everything you say and read and write is

00:08

the

00:12

"the" - is the most used word in the English language. About one out of every

00:18

16 words we encounter on a daily basis is "the." The top 20 most common English

00:25

words in order are "the," "of," "and," "to," "a," "in," "is," "I," "that," "it," "for," "you,"

00:32

"was," "with," "on," "as," "have," "but," "be," "they." That's a fun fact. A piece of trivia but it's

00:39

also more. You see, whether the most commonly used words are ranked across an

00:44

entire language, or in just one book or article, almost every time a bizarre

00:51

pattern emerges. The second most used word will appear about half as often as

00:57

the most used. The third one third as often. The fourth one fourth as often. The

01:04

fifth one fifth as often. The sixth one sixth as often, and so on all the way down.

01:10

Seriously. For some reason, the amount of times a word is used is just

01:16

proportional to one over its rank. Word frequency and ranking on a log log graph

01:23

follow a nice straight line. A power-law. This phenomenon is called Zipf's Law and

01:30

it doesn't only apply to English. It also applies to other languages, like, well,

01:38

all of them.

01:39

Even ancient languages we haven't been able to translate yet.

01:43

And here's the thing. We have no idea why. It's surprising that something as

01:50

complex as reality should be conveyed by something as creative as language in

01:56

such a predictable way. How predictable? Well, watch this. According to WordCount.org,

02:03

which ranks words as found in the British National Corpus, "sauce" is the

02:08

5,555th most common English word. Now, here is a list of how many times

02:15

every word on Wikipedia and in the entire Gutenberg Corpus of tens of

02:21

thousands of public domain books shows up. The most used word, 'the,' shows up about

02:27

181 million times. Knowing these two things, we can estimate that the word

02:34

"sauce" should appear about thirty thousand times on Wikipedia and

02:39

Gutenberg combined. And it pretty much does.

02:45

What gives? The world is chaotic. Things are distributed in myriad of ways, not just

02:51

power laws. And language is personal,

02:54

intentional, idiosyncratic. What about the world and ourselves could cause such

03:00

complex activities and behaviors to follow such a basic rule? We literally

03:08

don't know. More than a century of research has yet to close the case.

03:13

Moreover, Zipf's law doesn't just mysteriously describe word use. It's

03:20

also found in city populations, solar flare intensities, protein sequences and

03:25

immune receptors, the amount of traffic websites get, earthquake magnitudes, the

03:30

number of times academic papers are cited, last names, the firing patterns of

03:34

neural networks, ingredients used in cookbooks, the number of phone calls

03:38

people received, the diameter of Moon craters, the number of people that die

03:42

in wars, the popularity of opening chess moves, even the rate at which we forget.

03:47

There are plenty of theories about why language is 'zipf-y,' but no firm conclusions

03:54

and this video doesn't contain a definite explanation either. Sorry, I know

04:00

that's a bummer, since we appear to like knowing more than mystery. But that said,

04:06

we also ask more than we answer. So let's dive into Zipf's ramifications, some

04:13

related patterns, some possible explanations and the depth of the

04:18

mystery itself. Zipf's law was popularized by George Zipf,

전체 자막은 비디오 플레이어에서 이용 가능

연습 문제로 학습하기

이 동영상에서 어휘, 문법, 이해력 연습 문제를 만드세요

어휘 및 문법 이해력 퀴즈 IELTS 시험 쓰기 연습
회원가입해서 연습하기
아직 댓글이 없습니다. 첫 번째로 생각을 공유하세요!

가입하고 모든 기능 잠금 해제

진행 상황 추적, 단어 저장, 연습 문제 풀기

무료로 언어 학습