The video owner has disabled playback on external websites.

This video is no longer available on YouTube.

This video cannot be played right now.

Watch on YouTube

Desbloqueie ferramentas de aprendizado com IA

Cadastre-se para acessar ferramentas poderosas que ajudam a aprender mais rápido com cada vídeo.

Explicador de cena Caça-frases Revisão com flashcards Prática de repetição Falar de Volta
Cadastrar grátis
B1 Intermediário Inglês 21:05 Educational

The Zipf Mystery

Vsauce · 28,222,897 visualizações · Adicionado há 3 semanas

Estatísticas de aprendizado

B1

Nível CEFR

5/10

Dificuldade

Legendas (222 segmentos)

00:00

Hey, Vsauce. Michael here. About 6 percent of everything you say and read and write is

00:08

the

00:12

"the" - is the most used word in the English language. About one out of every

00:18

16 words we encounter on a daily basis is "the." The top 20 most common English

00:25

words in order are "the," "of," "and," "to," "a," "in," "is," "I," "that," "it," "for," "you,"

00:32

"was," "with," "on," "as," "have," "but," "be," "they." That's a fun fact. A piece of trivia but it's

00:39

also more. You see, whether the most commonly used words are ranked across an

00:44

entire language, or in just one book or article, almost every time a bizarre

00:51

pattern emerges. The second most used word will appear about half as often as

00:57

the most used. The third one third as often. The fourth one fourth as often. The

01:04

fifth one fifth as often. The sixth one sixth as often, and so on all the way down.

01:10

Seriously. For some reason, the amount of times a word is used is just

01:16

proportional to one over its rank. Word frequency and ranking on a log log graph

01:23

follow a nice straight line. A power-law. This phenomenon is called Zipf's Law and

01:30

it doesn't only apply to English. It also applies to other languages, like, well,

01:38

all of them.

01:39

Even ancient languages we haven't been able to translate yet.

01:43

And here's the thing. We have no idea why. It's surprising that something as

01:50

complex as reality should be conveyed by something as creative as language in

01:56

such a predictable way. How predictable? Well, watch this. According to WordCount.org,

02:03

which ranks words as found in the British National Corpus, "sauce" is the

02:08

5,555th most common English word. Now, here is a list of how many times

02:15

every word on Wikipedia and in the entire Gutenberg Corpus of tens of

02:21

thousands of public domain books shows up. The most used word, 'the,' shows up about

02:27

181 million times. Knowing these two things, we can estimate that the word

02:34

"sauce" should appear about thirty thousand times on Wikipedia and

02:39

Gutenberg combined. And it pretty much does.

02:45

What gives? The world is chaotic. Things are distributed in myriad of ways, not just

02:51

power laws. And language is personal,

02:54

intentional, idiosyncratic. What about the world and ourselves could cause such

03:00

complex activities and behaviors to follow such a basic rule? We literally

03:08

don't know. More than a century of research has yet to close the case.

03:13

Moreover, Zipf's law doesn't just mysteriously describe word use. It's

03:20

also found in city populations, solar flare intensities, protein sequences and

03:25

immune receptors, the amount of traffic websites get, earthquake magnitudes, the

03:30

number of times academic papers are cited, last names, the firing patterns of

03:34

neural networks, ingredients used in cookbooks, the number of phone calls

03:38

people received, the diameter of Moon craters, the number of people that die

03:42

in wars, the popularity of opening chess moves, even the rate at which we forget.

03:47

There are plenty of theories about why language is 'zipf-y,' but no firm conclusions

03:54

and this video doesn't contain a definite explanation either. Sorry, I know

04:00

that's a bummer, since we appear to like knowing more than mystery. But that said,

04:06

we also ask more than we answer. So let's dive into Zipf's ramifications, some

04:13

related patterns, some possible explanations and the depth of the

04:18

mystery itself. Zipf's law was popularized by George Zipf,

Legendas completas disponíveis no player de vídeo

Pratique com exercícios

Gere exercícios de vocabulário, gramática e compreensão deste vídeo

Vocabulário e gramática Quiz de compreensão Exame IELTS Prática de escrita
Cadastre-se pra praticar
Nenhum comentário ainda. Seja o primeiro a compartilhar suas ideias!

Cadastre-se para desbloquear todos os recursos

Acompanhe seu progresso, salve vocabulário e pratique com exercícios

Aprenda idiomas de grátis