팟캐스트로 돌아가기Dwarkesh Patel

테런스 타오 – 세계 최고의 수학자가 AI를 활용하는 방법

Today, I'm chatting with Terence Tao, who needs no introduction. 오늘은 Terence Tao와 이야기를 나눠보겠습니다. 소개가 필요 없는 분이죠. Terence, I want to begin by having you retell the story of how Kepler discovered the laws of planetary motion because I think this will be a great jumping off point to talk about AI for math. Terence, 케플러가 행성 운동 법칙을 발견한 이야기를 직접 들려주시면 좋겠습니다. 수학에서의 AI를 이야기하는 좋은 출발점이 될 것 같아서요. I've always had an amateur interest in astronomy. 저는 항상 천문학에 아마추어로서 관심을 가져왔습니다. I've loved stories of how the early astronomers worked out the nature of the universe. 초기 천문학자들이 우주의 본질을 밝혀낸 이야기들을 좋아했습니다. Kepler was building on the work of Copernicus, who was himself building on the work of Aristarchus. 케플러는 코페르니쿠스의 연구를 바탕으로 했고, 코페르니쿠스는 아리스타르코스의 연구를 바탕으로 했습니다. Copernicus very famously proposed the heliocentric model, that instead of the planets and the Sun going around the Earth, the Sun was at the center of the solar system and the other planets were going around the Sun. 코페르니쿠스는 지동설로 유명한데, 행성과 태양이 지구 주위를 도는 게 아니라 태양이 태양계의 중심에 있고 다른 행성들이 태양 주위를 돈다는 이론이었습니다. Copernicus proposed that the orbits of the planets were perfect circles. 코페르니쿠스는 행성의 궤도가 완전한 원이라고 주장했습니다. His theory fit the observations that the Greeks, the Arabs, and the Indians had worked out over centuries. 그의 이론은 그리스, 아랍, 인도인들이 수 세기에 걸쳐 쌓아온 관측과 들어맞았습니다. Kepler learned about these theories in his studies, and he made this observation that the ratios of the size of the orbits that Copernicus predicted seemed to have some geometric meaning. 케플러는 학문을 통해 이 이론들을 접하고, 코페르니쿠스가 예측한 궤도 크기의 비율에 어떤 기하학적 의미가 있는 것 같다는 관찰을 했습니다. He started proposing that if you take the orbit of the Earth and you enclose it in a cube, the outer sphere that encloses the cube almost perfectly matched the orbit of Mars, and so forth. 지구의 궤도를 정육면체 안에 넣으면, 그 정육면체를 감싸는 외접구의 크기가 화성 궤도와 거의 정확히 일치한다는 식의 이론을 세우기 시작했습니다. There were six planets known at the time and five gaps between them, and there were five perfect Platonic solids: the cube, the tetrahedron, icosahedron, octahedron, and dodecahedron. 당시 알려진 행성은 6개이고 그 사이 간격은 5개인데, 완전한 플라톤 입체도 정확히 5개였습니다. 정육면체, 정사면체, 정이십면체, 정팔면체, 정십이면체. So he had this theory, which he thought was absolutely beautiful, that you could inscribe these Platonic solids between the spheres of the planets. 그래서 그는 행성들의 구 사이에 이 플라톤 입체들을 내접시킬 수 있다는 이론을 만들었는데, 스스로 절대적으로 아름답다고 생각했습니다. It seemed to fit, and it seemed to him that God's design of the planets was matching this mathematical perfection of the Platonic solids. 잘 들어맞는 것 같았고, 행성에 대한 신의 설계가 플라톤 입체의 수학적 완벽함과 일치한다고 여겼습니다. He needed data to confirm this theory. 이 이론을 확인하려면 데이터가 필요했습니다. At the time, there was only one really high-quality dataset in existence. 당시 존재하는 고품질 데이터셋은 단 하나뿐이었습니다. Tycho Brahe, this very wealthy, eccentric Danish astronomer, had managed to convince the Danish government to fund this extremely expensive observatory. 매우 부유하고 괴팍한 덴마크 천문학자 티코 브라헤는 덴마크 정부를 설득해 이 엄청나게 비싼 천문대를 지을 자금을 지원받았습니다. In fact, it was an entire island where he had taken decades of observations of all the planets, like Mars and Jupiter, at least every night for which the weather was clear, with the naked eye. 사실 그것은 섬 하나 전체였는데, 그곳에서 그는 날씨가 맑은 밤마다 육안으로 화성과 목성 등 모든 행성을 수십 년에 걸쳐 관측했습니다. He was the last of the naked-eye astronomers. 그는 육안 천문학의 마지막 계승자였습니다. He had all this data which Kepler could use to confirm his theory. 케플러가 자신의 이론을 확인하는 데 쓸 수 있는 방대한 데이터를 보유하고 있었습니다. Kepler started working with Tycho, but Tycho was very jealous of the data. 케플러는 티코와 함께 일하기 시작했지만, 티코는 데이터를 굉장히 시기했습니다. He only gave him little bits of it at a time. 케플러에게는 조금씩만 공개했습니다. Kepler eventually just stole the data. 결국 케플러는 그냥 데이터를 훔쳤습니다. He copied it and had to have a fight with Brahe's descendants. 그는 데이터를 베꼈고, 브라헤의 후손들과 다툼을 벌여야 했습니다. He did get the data, and then he worked out, to his disappointment, that his beautiful theory didn't quite work. 데이터를 손에 넣은 그는 실망스럽게도 자신의 아름다운 이론이 딱 맞지 않는다는 것을 알게 됐습니다. The data was off from his Platonic solid theory by 10% or something. 플라톤 입체 이론과 데이터 사이에 10% 정도의 오차가 있었습니다. He tried all kinds of fudges, moving the circles around, and it didn't quite work. 온갖 방법으로 원들을 이리저리 움직여 보며 수정을 시도했지만 잘 되지 않았습니다. But he worked on this problem for years and years, and eventually, he figured out how to use the data to work out the actual orbits of the planets. 그러나 수년간 이 문제에 매달린 끝에, 결국 데이터를 이용해 행성의 실제 궤도를 계산하는 방법을 알아냈습니다. That was an incredibly clever, genius amount of data analysis. 그것은 엄청나게 기발하고 천재적인 데이터 분석이었습니다. And then he worked out that the orbits were actually ellipses, not circles, which was shocking for him. 그리고 궤도가 원이 아니라 타원이라는 사실을 밝혀냈는데, 그에게는 충격적인 발견이었습니다. So he worked out the two laws of planetary motion: the ellipses, and also that equal areas sweep out equal times. 케플러는 이렇게 행성 운동의 두 가지 법칙을 도출했습니다. 타원 궤도, 그리고 동일 시간에 동일 면적이 쓸린다는 것이었습니다. Then ten years later, after collecting a lot of data—the furthest planets like Saturn and Jupiter were the hardest for him to work out—he finally worked out this third law, that the time it takes for a planet to complete its orbit was proportional to some power of the distance to the Sun. 그로부터 10년 후, 방대한 데이터를 수집한 끝에, 토성과 목성처럼 멀리 있는 행성들이 가장 어려웠는데, 행성의 공전 주기가 태양까지의 거리의 어떤 거듭제곱에 비례한다는 세 번째 법칙을 마침내 도출했습니다. These are the three famous Kepler's laws of motion. 이것이 케플러의 유명한 세 가지 운동 법칙입니다. He had no explanation for them. 그는 이에 대한 설명이 없었습니다. It was all driven by experiment, and it took Newton a century later to give a theory that explained all three laws at once. 전부 실험을 통해 도출된 것이었고, 한 세기 후 뉴턴이 세 가지 법칙 모두를 한꺼번에 설명하는 이론을 내놓았습니다. The take I want to try on you is that Kepler was a high-temperature LLM. 제가 드리고 싶은 이야기는, 케플러가 고온의 LLM이었다는 겁니다. Newton comes up with this explanation of why the three laws of planetary motion must be true. 뉴턴은 행성 운동의 세 가지 법칙이 왜 성립해야 하는지 설명을 내놓았습니다. Of course, the way that Kepler discovers the laws of planetary motion, or figures out the relative orbits of the different planets, is as you say a work of genius. 물론 케플러가 행성 운동 법칙을 발견하거나 행성들의 상대적 궤도를 계산하는 방식은 말씀하신 것처럼 천재적인 작업이었습니다. But through his career, he's just trying random relationships. 하지만 그의 경력을 통틀어 보면, 그저 닥치는 대로 무작위 관계를 시도하고 있었습니다. In fact, in the book in which he writes down the third law of planetary motion, it's an aside on The Harmonics of the World, which is just a book about how all these different planets have these different harmonies. 실제로 세 번째 운동 법칙을 기록한 책은 《세계의 조화》라는 책의 일부로, 서로 다른 행성들이 저마다 화음을 이룬다는 내용의 책이었습니다. And the reason there's so much famine and misery on Earth is because the Earth is mi-fa-mi, that's the note of Earth. 지구에 기근과 비참함이 그토록 많은 이유는 지구의 음이 미-파-미이기 때문이라는 식이었습니다. It's all this random astrology, but in there is the cube-square law, which tells you what relationship the period has to a planet's distance from the Sun. 온통 엉터리 점성술 같은 이야기들인데, 그 안에 제곱-세제곱 법칙이 들어 있었습니다. 행성의 공전 주기와 태양까지의 거리 사이의 관계를 알려주는 법칙 말입니다. As you were detailing, if you add that to Newton's F=ma and the equation for centripetal acceleration, you get the inverse-square law. 말씀하신 대로, 이것을 뉴턴의 F=ma와 구심 가속도 방정식에 결합하면 역제곱 법칙이 나옵니다. And so Newton works that out. 그래서 뉴턴이 그것을 밝혀냅니다. But the reason I think this is an interesting story is that I feel LLMs can do the kind of thing of trying random relationships for twenty years, some of which make no sense, as long as there's a verifiable data bank like Brahe's dataset. 제가 이 이야기가 흥미롭다고 생각하는 이유는, LLM들도 브라헤의 데이터셋처럼 검증 가능한 데이터 뱅크만 있다면 20년 동안 무작위 관계를 시도하는 일을 할 수 있을 것 같기 때문입니다. "Ok, I'm going to try out random things about musical notes, Platonic objects, or different geometries, I have this bias that there's some important thing about the geometry of these orbits." "음표에 대해, 플라톤 입체에 대해, 다양한 기하학에 대해 무작위로 시도해보겠어. 이 궤도의 기하학에 뭔가 중요한 게 있을 것 같다는 편향이 있어." Then one thing works. 그러다 하나가 맞아떨어집니다. As long as you can verify it, these empirical regularities can then drive actual deep scientific progress. 검증만 할 수 있다면, 이런 경험적 규칙성들이 실질적이고 깊은 과학적 진전을 이끌어낼 수 있습니다. Traditionally, when we talk about the history of science, idea generation has always been the prestige part of science. 전통적으로 과학의 역사를 이야기할 때, 아이디어 창출은 항상 과학의 꽃으로 여겨져 왔습니다. A scientific problem comes with many steps. 과학적 문제는 여러 단계로 이루어집니다. You have to identify a problem, and then you have to identify a good, fruitful problem to work on. 문제를 파악해야 하고, 풍성한 성과를 낼 수 있는 좋은 문제를 골라야 합니다. Then you need to collect data, figure out a strategy to analyze the data, and make a hypothesis. 데이터를 수집하고, 분석 전략을 세우고, 가설을 세워야 합니다. At this point, you need to propose a good hypothesis, and then you need to validate. 좋은 가설을 제시해야 하고, 검증도 해야 합니다. Then you need to write things up and explain. 그런 다음 정리해서 설명해야 합니다. There are a dozen different components. 열두 가지 서로 다른 구성 요소가 있습니다. The ones we celebrate are these eureka genius moments of idea generation. 우리가 칭송하는 것은 아이디어 창출이라는 유레카의 천재적인 순간들입니다. Kepler certainly had to cycle through many ideas, several of which didn't work. 케플러는 분명히 많은 아이디어를 거쳐야 했는데, 그중 몇 가지는 통하지 않았습니다. I bet there were many that he didn't even publish at all because they just didn't fit. 전혀 맞지 않아서 아예 발표하지 않은 것들도 많았을 겁니다. That's an important part of the process, trying all kinds of random things and seeing if they worked. 무작위로 온갖 것을 시도해 보고 통하는지 확인하는 것, 그것이 과정의 중요한 부분입니다. But as you say, it has to be matched by an equal amount of verification, otherwise it's slop. 하지만 말씀하신 것처럼, 그만큼의 검증이 수반되어야 합니다. 그렇지 않으면 그냥 쓰레기일 뿐입니다. We celebrate Kepler, but we should also celebrate Brahe for his assiduous data collection, which was ten times more precise than any previous observation. 우리는 케플러를 칭송하지만, 티코 브라헤도 칭송해야 합니다. 그 이전 어떤 관측보다 10배 더 정밀한 꼼꼼한 데이터 수집 덕분이니까요. That extra decimal point of accuracy was essential for Kepler to get his results. 그 소수점 한 자리의 정밀도가 케플러가 결과를 얻는 데 결정적이었습니다. He was using Euclidean geometry and the most advanced mathematics he could use at the time to match his models with the data. 그는 유클리드 기하학과 당시 쓸 수 있는 가장 발전된 수학을 사용해 모델과 데이터를 대조했습니다. All aspects had to be in play: the data, the theory, and the hypothesis generation. 데이터, 이론, 가설 창출, 이 모든 측면이 함께 작동해야 했습니다. I'm not sure nowadays that hypothesis generation is the bottleneck anymore. 오늘날에는 가설 창출이 더 이상 병목이 아닌 것 같습니다. Science has changed in the century since. 지난 세기 동안 과학은 변했습니다. Classically, the two big paradigms for science were theory and experiment. 고전적으로 과학의 두 가지 큰 패러다임은 이론과 실험이었습니다. Then in the 20th century, numerical simulation came along, so you can do computer simulations to test theories. 그러다 20세기에 수치 시뮬레이션이 등장해 컴퓨터 시뮬레이션으로 이론을 검증할 수 있게 됐습니다. Finally, in the late 20th century, we had big data. 마침내 20세기 후반에는 빅데이터의 시대가 왔습니다. We had the era of data analysis. 데이터 분석의 시대였습니다. A lot of new progress is actually driven now by analyzing massive datasets first. 새로운 많은 진전이 이제 대규모 데이터셋 분석을 통해 이루어지고 있습니다. You collect large datasets and then draw patterns from them to deduce thoughts. 대규모 데이터를 수집한 다음 패턴을 찾아 통찰을 도출합니다. This is a little bit different from how science used to work, where you make a few observations or have one out-of-the-blue idea, and then collect data to test your idea. 이것은 과학이 예전에 작동하던 방식과는 조금 다릅니다. 예전에는 몇 가지 관찰을 하거나 갑작스러운 아이디어를 떠올린 다음, 데이터를 모아 그 아이디어를 검증했습니다. That's the classic scientific method. 그것이 고전적인 과학 방법론입니다. Now it's almost reversed. 지금은 거의 뒤집혔습니다. You collect big data first, and then you try to get hypotheses from it. 먼저 빅데이터를 수집하고, 그것으로부터 가설을 얻으려 합니다. Kepler was maybe one of the first early data scientists, but even he didn't start with Tycho's dataset and then analyze it. 케플러는 최초의 데이터 과학자 중 한 명이었을 수 있지만, 그조차도 티코의 데이터셋에서 출발해 분석한 게 아니었습니다. He had some preconceived theories first. 먼저 어느 정도 선입견이 있었습니다. It seems like this is less and less the way we make progress, just because the data is so much more massive and useful. 데이터가 워낙 방대하고 유용해졌기 때문에, 이 방식으로 진전을 이루는 경우는 점점 줄어드는 것 같습니다. Oh, interesting. 오, 흥미롭네요. I feel like the 20th-century science that you're describing actually very well describes what happened with Kepler. 말씀하시는 20세기 과학이 실제로 케플러에게 일어난 일을 매우 잘 설명한다는 생각이 듭니다. He did have these ideas—1595 and '96 is where he comes up with the polygons and then the Platonic objects theory—but they were wrong. 그도 이런 아이디어들을 가지고 있었습니다. 1595년과 96년에 다각형 이론과 플라톤 입체 이론을 내놓았지만 틀렸습니다. Then a few years later, he gets Brahe's data, and it's only after twenty years of trying random things that he gets this empirical regularity. 몇 년 후 브라헤의 데이터를 얻었고, 20년에 걸쳐 무작위로 온갖 것들을 시도한 끝에 이 경험적 규칙성을 발견했습니다. It actually feels a bit closer to Brahe's data being analogous to some massive data bank of simulations, and now that you've got the data, you can keep trying random things. 브라헤의 데이터가 방대한 시뮬레이션 데이터 뱅크와 유사하다는 느낌이 듭니다. 데이터를 손에 넣은 후 계속 무작위로 시도할 수 있었던 것처럼 말입니다. If it wasn't for that, Kepler would be out there just writing books about harmonics and Platonic objects, and there would be nothing to actually verify against. 그것이 없었다면 케플러는 화음과 플라톤 입체에 관한 책이나 쓰고 있었을 텐데, 검증할 대상이 아무것도 없었을 겁니다. The data was extremely important. 데이터는 매우 중요했습니다. The distinction I was trying to make was that traditionally, you make a hypothesis and then you test it against data. 제가 말하려던 구분은, 전통적으로는 가설을 먼저 세우고 데이터를 대조한다는 것이었습니다. But now with machine learning, data analysis, and statistics, you can start with data and through statistics work out laws that were not present before. 하지만 지금은 머신러닝, 데이터 분석, 통계를 통해 데이터에서 출발해 전에는 없던 법칙을 통계적으로 도출할 수 있습니다. Kepler's third law is a little bit like this, except that instead of having the thousand data points that Brahe had, Kepler had six data points. 케플러의 세 번째 법칙이 이와 조금 비슷한데, 브라헤가 가진 수천 개의 데이터 포인트 대신 케플러는 6개뿐이었습니다. For every planet, he knew the length of the orbit and the distance to the Sun. 각 행성에 대해 궤도 길이와 태양까지의 거리를 알고 있었습니다. There were five or six data points, and he did what we would now call regression. 다섯 개 혹은 여섯 개의 데이터 포인트로 지금 우리가 회귀 분석이라고 부르는 것을 했습니다. He fit a curve to these six data points and got a square-cube law, which was amazing. 여섯 개의 데이터 포인트에 곡선을 맞춰 제곱-세제곱 법칙을 얻었는데, 놀라운 결과였습니다. But he was quite lucky that these six data points gave him the right conclusion. 하지만 이 여섯 개의 데이터 포인트로 올바른 결론을 얻었다는 건 꽤 운이 좋은 일이었습니다. That's not enough data to be really reliable. 그것만으로는 신뢰하기에 충분하지 않습니다. There was a later astronomer, Johann Bode, who took the same data—the distances to the planets—and inspired by Kepler, he had a prediction that the distances to the planets formed a shifted geometric progression. 나중에 요한 보데라는 천문학자가 같은 데이터, 즉 행성들까지의 거리를 가져다가 케플러에게서 영감을 받아 행성들까지의 거리가 이동된 등비수열을 이룬다는 예측을 내놓았습니다. He also fit a curve, except there was one point missing. 그도 곡선을 맞췄는데, 한 점이 빠져 있었습니다. There was a big gap between Mars and Jupiter. 화성과 목성 사이에 큰 공백이 있었습니다. His law predicted that there was a missing planet. 그의 법칙은 빠진 행성이 있다고 예측했습니다. It was kind of a crank theory, except when Uranus was discovered by Herschel, the distance to Uranus fit exactly this pattern. 일종의 엉터리 이론이었는데, 허셜이 천왕성을 발견했을 때 천왕성까지의 거리가 정확히 이 패턴에 맞아떨어졌습니다. Then Ceres was discovered in the asteroid belt, and it also fit the pattern. 그 다음 세레스가 소행성대에서 발견되었는데, 이 역시 패턴에 들어맞았습니다. People got really excited that Bode had discovered this amazing new law of nature. 보데가 새로운 자연 법칙을 발견했다는 기대에 다들 흥분했습니다.