Podcasts पर वापस जाएं Dwarkesh Patel

टेरेंस ताओ – दुनिया के शीर्ष गणितज्ञ AI का उपयोग कैसे करते हैं

Today, I'm chatting with Terence Tao, who needs no introduction. आज मैं Terence Tao से बात कर रहा हूं, जिन्हें किसी परिचय की ज़रूरत नहीं। Terence, I want to begin by having you retell the story of how Kepler discovered the laws of planetary motion because I think this will be a great jumping off point to talk about AI for math. Terence, मैं शुरुआत इस बात से करना चाहता हूं कि आप हमें Kepler की कहानी सुनाएं, कैसे उन्होंने ग्रहों की गति के नियम खोजे, क्योंकि मुझे लगता है यह math के लिए AI पर बात करने का एक बेहतरीन शुरुआती बिंदु होगा। I've always had an amateur interest in astronomy. मुझे हमेशा से खगोल विज्ञान में एक शौकिया रुचि रही है। I've loved stories of how the early astronomers worked out the nature of the universe. मुझे वो कहानियां बहुत पसंद हैं कि शुरुआती खगोलविदों ने ब्रह्मांड की प्रकृति को कैसे समझा। Kepler was building on the work of Copernicus, who was himself building on the work of Aristarchus. Kepler, Copernicus के काम पर आगे बढ़ रहे थे, जो खुद Aristarchus के काम पर आगे बढ़ रहे थे। Copernicus very famously proposed the heliocentric model, that instead of the planets and the Sun going around the Earth, the Sun was at the center of the solar system and the other planets were going around the Sun. Copernicus ने बहुत प्रसिद्ध रूप से heliocentric model प्रस्तावित किया था, यानी कि ग्रह और सूरज पृथ्वी के चारों ओर नहीं घूमते, बल्कि सूरज सौरमंडल के केंद्र में है और बाकी ग्रह सूरज के चारों ओर घूमते हैं। Copernicus proposed that the orbits of the planets were perfect circles. Copernicus ने प्रस्तावित किया कि ग्रहों की कक्षाएं पूर्ण वृत्त हैं। His theory fit the observations that the Greeks, the Arabs, and the Indians had worked out over centuries. उनकी theory ने उन अवलोकनों से मेल खाया जो यूनानियों, अरबों और भारतीयों ने सदियों में तैयार किए थे। Kepler learned about these theories in his studies, and he made this observation that the ratios of the size of the orbits that Copernicus predicted seemed to have some geometric meaning. Kepler ने अपनी पढ़ाई में इन theories के बारे में जाना, और उन्होंने यह देखा कि Copernicus द्वारा अनुमानित कक्षाओं के आकार के अनुपात में कुछ ज्यामितीय अर्थ लग रहा था। He started proposing that if you take the orbit of the Earth and you enclose it in a cube, the outer sphere that encloses the cube almost perfectly matched the orbit of Mars, and so forth. उन्होंने यह प्रस्तावित करना शुरू किया कि अगर आप पृथ्वी की कक्षा लेकर उसे एक घन में रखें, तो घन को घेरने वाला बाहरी गोला लगभग पूरी तरह Mars की कक्षा से मेल खाता है, और इसी तरह आगे। There were six planets known at the time and five gaps between them, and there were five perfect Platonic solids: the cube, the tetrahedron, icosahedron, octahedron, and dodecahedron. उस समय छह ग्रह ज्ञात थे और उनके बीच पांच अंतराल थे, और पांच पूर्ण Platonic solids थे: घन, tetrahedron, icosahedron, octahedron, और dodecahedron। So he had this theory, which he thought was absolutely beautiful, that you could inscribe these Platonic solids between the spheres of the planets. तो उनके पास यह theory थी, जो उन्हें बिल्कुल खूबसूरत लगती थी, कि आप इन Platonic solids को ग्रहों के गोलों के बीच रख सकते हैं। It seemed to fit, and it seemed to him that God's design of the planets was matching this mathematical perfection of the Platonic solids. यह मेल खाता लग रहा था, और उन्हें लगा कि ईश्वर की ग्रहों की रचना Platonic solids की इस गणितीय पूर्णता से मेल खा रही है। He needed data to confirm this theory. उन्हें इस theory की पुष्टि के लिए डेटा की ज़रूरत थी। At the time, there was only one really high-quality dataset in existence. उस समय केवल एक ही उच्च-गुणवत्ता वाला dataset मौजूद था। Tycho Brahe, this very wealthy, eccentric Danish astronomer, had managed to convince the Danish government to fund this extremely expensive observatory. Tycho Brahe, यह बहुत धनी और सनकी डेनिश खगोलविद, डेनिश सरकार को इस अत्यंत महंगी observatory के वित्तपोषण के लिए मनाने में सफल रहे थे। In fact, it was an entire island where he had taken decades of observations of all the planets, like Mars and Jupiter, at least every night for which the weather was clear, with the naked eye. वास्तव में, यह पूरा एक द्वीप था जहां उन्होंने दशकों तक सभी ग्रहों, जैसे Mars और Jupiter, के अवलोकन किए थे, कम से कम हर उस रात जब मौसम साफ था, नंगी आंखों से। He was the last of the naked-eye astronomers. वे नंगी आंखों से देखने वाले आखिरी खगोलविद थे। He had all this data which Kepler could use to confirm his theory. उनके पास यह सारा डेटा था जिसे Kepler अपनी theory की पुष्टि के लिए इस्तेमाल कर सकते थे। Kepler started working with Tycho, but Tycho was very jealous of the data. Kepler ने Tycho के साथ काम करना शुरू किया, लेकिन Tycho डेटा को लेकर बहुत ईर्ष्यालु थे। He only gave him little bits of it at a time. वे उन्हें एक समय में थोड़ा-थोड़ा ही देते थे। Kepler eventually just stole the data. Kepler ने अंततः डेटा चुरा ही लिया। He copied it and had to have a fight with Brahe's descendants. उन्होंने उसे कॉपी किया और Brahe के वंशजों से झगड़ना पड़ा। He did get the data, and then he worked out, to his disappointment, that his beautiful theory didn't quite work. उन्हें डेटा मिल गया, और फिर उन्होंने निराश होकर पाया कि उनकी खूबसूरत theory ठीक से काम नहीं की। The data was off from his Platonic solid theory by 10% or something. डेटा उनकी Platonic solid theory से लगभग 10% दूर था। He tried all kinds of fudges, moving the circles around, and it didn't quite work. उन्होंने तरह-तरह के जुगाड़ आज़माए, वृत्तों को इधर-उधर खिसकाया, लेकिन यह ठीक नहीं बैठा। But he worked on this problem for years and years, and eventually, he figured out how to use the data to work out the actual orbits of the planets. लेकिन वे इस समस्या पर सालों-साल काम करते रहे, और अंततः उन्होंने यह पता लगाया कि डेटा का उपयोग करके ग्रहों की वास्तविक कक्षाएं कैसे निकाली जाएं। That was an incredibly clever, genius amount of data analysis. यह अविश्वसनीय रूप से चतुर, प्रतिभाशाली data analysis था। And then he worked out that the orbits were actually ellipses, not circles, which was shocking for him. और फिर उन्होंने यह पता लगाया कि कक्षाएं दरअसल ellipses हैं, वृत्त नहीं, जो उनके लिए चौंकाने वाला था। So he worked out the two laws of planetary motion: the ellipses, and also that equal areas sweep out equal times. तो उन्होंने ग्रहों की गति के दो नियम निकाले: ellipses, और यह भी कि समान समय में समान क्षेत्रफल तय होता है। Then ten years later, after collecting a lot of data—the furthest planets like Saturn and Jupiter were the hardest for him to work out—he finally worked out this third law, that the time it takes for a planet to complete its orbit was proportional to some power of the distance to the Sun. फिर दस साल बाद, बहुत सारा डेटा इकट्ठा करने के बाद, जिसमें Saturn और Jupiter जैसे दूर के ग्रहों को समझना उनके लिए सबसे कठिन था, उन्होंने अंततः यह तीसरा नियम निकाला कि किसी ग्रह को अपनी कक्षा पूरी करने में लगने वाला समय, सूरज से उसकी दूरी की किसी शक्ति के अनुपात में होता है। These are the three famous Kepler's laws of motion. ये Kepler के गति के तीन प्रसिद्ध नियम हैं। He had no explanation for them. उनके पास इनकी कोई व्याख्या नहीं थी। It was all driven by experiment, and it took Newton a century later to give a theory that explained all three laws at once. यह सब प्रयोगों पर आधारित था, और एक सदी बाद Newton को वह theory देने में लगी जिसने तीनों नियमों को एक साथ समझाया। The take I want to try on you is that Kepler was a high-temperature LLM. मैं आपके सामने यह विचार रखना चाहता हूं कि Kepler एक high-temperature LLM थे। Newton comes up with this explanation of why the three laws of planetary motion must be true. Newton यह समझाता है कि ग्रहों की गति के तीन नियम क्यों सत्य होने चाहिए। Of course, the way that Kepler discovers the laws of planetary motion, or figures out the relative orbits of the different planets, is as you say a work of genius. बेशक, Kepler ने जिस तरह ग्रहों की गति के नियम खोजे, या विभिन्न ग्रहों की सापेक्ष कक्षाएं निकालीं, वह जैसा आपने कहा एक प्रतिभाशाली काम है। But through his career, he's just trying random relationships. लेकिन अपने पूरे करियर में, वे बस यादृच्छिक संबंध आज़माते रहे। In fact, in the book in which he writes down the third law of planetary motion, it's an aside on The Harmonics of the World, which is just a book about how all these different planets have these different harmonies. दरअसल, जिस किताब में उन्होंने ग्रहों की गति का तीसरा नियम लिखा, वह The Harmonics of the World में एक पाद-टिप्पणी है, जो बस इस बारे में एक किताब है कि इन विभिन्न ग्रहों में कैसे अलग-अलग सामंजस्य हैं। And the reason there's so much famine and misery on Earth is because the Earth is mi-fa-mi, that's the note of Earth. और पृथ्वी पर इतना अकाल और दुख क्यों है, इसकी वजह यह है कि पृथ्वी mi-fa-mi है, यही पृथ्वी का स्वर है। It's all this random astrology, but in there is the cube-square law, which tells you what relationship the period has to a planet's distance from the Sun. यह सब यादृच्छिक ज्योतिष है, लेकिन उसी में cube-square law है, जो बताता है कि किसी ग्रह का आवर्त काल उसकी सूरज से दूरी से किस संबंध में है। As you were detailing, if you add that to Newton's F=ma and the equation for centripetal acceleration, you get the inverse-square law. जैसा आप विस्तार से बता रहे थे, अगर आप उसे Newton के F=ma और centripetal acceleration के समीकरण में जोड़ें, तो आपको inverse-square law मिलता है। And so Newton works that out. और इस तरह Newton यह निकालता है। But the reason I think this is an interesting story is that I feel LLMs can do the kind of thing of trying random relationships for twenty years, some of which make no sense, as long as there's a verifiable data bank like Brahe's dataset. लेकिन मुझे लगता है यह एक दिलचस्प कहानी है क्योंकि मुझे लगता है LLMs वही काम कर सकते हैं जो बीस साल तक यादृच्छिक संबंध आज़माना है, कुछ जो कोई मतलब नहीं रखते, जब तक कि Brahe के dataset जैसा कोई सत्यापन योग्य डेटा बैंक हो। "Ok, I'm going to try out random things about musical notes, Platonic objects, or different geometries, I have this bias that there's some important thing about the geometry of these orbits." "ठीक है, मैं संगीत के स्वरों, Platonic objects, या अलग-अलग ज्यामितियों के बारे में यादृच्छिक चीज़ें आज़माऊंगा, मेरा यह पूर्वाग्रह है कि इन कक्षाओं की ज्यामिति के बारे में कुछ महत्वपूर्ण है।" Then one thing works. फिर एक चीज़ काम करती है। As long as you can verify it, these empirical regularities can then drive actual deep scientific progress. जब तक आप इसे सत्यापित कर सकते हैं, ये empirical regularities वास्तविक गहरी वैज्ञानिक प्रगति को आगे बढ़ा सकती हैं। Traditionally, when we talk about the history of science, idea generation has always been the prestige part of science. पारंपरिक रूप से, जब हम विज्ञान के इतिहास की बात करते हैं, तो विचार उत्पन्न करना हमेशा विज्ञान का प्रतिष्ठित हिस्सा रहा है। A scientific problem comes with many steps. एक वैज्ञानिक समस्या में कई चरण होते हैं। You have to identify a problem, and then you have to identify a good, fruitful problem to work on. आपको एक समस्या पहचाननी होती है, और फिर एक अच्छी, उपयोगी समस्या पहचाननी होती है जिस पर काम करना है। Then you need to collect data, figure out a strategy to analyze the data, and make a hypothesis. फिर आपको डेटा इकट्ठा करना होता है, डेटा विश्लेषण की रणनीति तैयार करनी होती है, और एक hypothesis बनानी होती है। At this point, you need to propose a good hypothesis, and then you need to validate. इस बिंदु पर, आपको एक अच्छी hypothesis प्रस्तावित करनी होती है, और फिर सत्यापन करना होता है। Then you need to write things up and explain. फिर आपको चीज़ें लिखनी होती हैं और समझानी होती हैं। There are a dozen different components. इसमें दर्जनों अलग-अलग घटक होते हैं। The ones we celebrate are these eureka genius moments of idea generation. जिन्हें हम मनाते हैं वे हैं ये eureka प्रतिभाशाली क्षण, विचार उत्पन्न होने के। Kepler certainly had to cycle through many ideas, several of which didn't work. Kepler को निश्चित रूप से कई विचारों से गुज़रना पड़ा, जिनमें से कई काम नहीं आए। I bet there were many that he didn't even publish at all because they just didn't fit. मुझे यकीन है कि ऐसे कई थे जो उन्होंने प्रकाशित भी नहीं किए क्योंकि वे बिल्कुल मेल नहीं खाते थे। That's an important part of the process, trying all kinds of random things and seeing if they worked. यह प्रक्रिया का एक महत्वपूर्ण हिस्सा है, सभी तरह की यादृच्छिक चीज़ें आज़माना और देखना कि क्या काम करता है। But as you say, it has to be matched by an equal amount of verification, otherwise it's slop. लेकिन जैसा आप कहते हैं, इसे उतने ही सत्यापन से मेल खाना चाहिए, नहीं तो यह बकवास है। We celebrate Kepler, but we should also celebrate Brahe for his assiduous data collection, which was ten times more precise than any previous observation. हम Kepler को मनाते हैं, लेकिन हमें Brahe को भी उनके परिश्रमी डेटा संग्रह के लिए मनाना चाहिए, जो किसी भी पिछले अवलोकन से दस गुना अधिक सटीक था। That extra decimal point of accuracy was essential for Kepler to get his results. सटीकता का वह अतिरिक्त दशमलव बिंदु Kepler के लिए अपने परिणाम प्राप्त करने में आवश्यक था। He was using Euclidean geometry and the most advanced mathematics he could use at the time to match his models with the data. वे अपने models को डेटा से मिलाने के लिए Euclidean geometry और उस समय की सबसे उन्नत गणित का उपयोग कर रहे थे। All aspects had to be in play: the data, the theory, and the hypothesis generation. सभी पहलुओं का होना ज़रूरी था: डेटा, theory, और hypothesis निर्माण। I'm not sure nowadays that hypothesis generation is the bottleneck anymore. मुझे यकीन नहीं है कि आजकल hypothesis निर्माण अब bottleneck है। Science has changed in the century since. विज्ञान पिछली सदी में बदल गया है। Classically, the two big paradigms for science were theory and experiment. परंपरागत रूप से, विज्ञान के दो बड़े paradigms theory और प्रयोग थे। Then in the 20th century, numerical simulation came along, so you can do computer simulations to test theories. फिर 20वीं सदी में numerical simulation आया, तो आप theories को परखने के लिए computer simulations कर सकते हैं। Finally, in the late 20th century, we had big data. अंत में, 20वीं सदी के अंत में, हमारे पास big data आया। We had the era of data analysis. हम data analysis के युग में थे। A lot of new progress is actually driven now by analyzing massive datasets first. बहुत सी नई प्रगति अब वास्तव में पहले विशाल datasets के विश्लेषण से संचालित होती है। You collect large datasets and then draw patterns from them to deduce thoughts. आप बड़े datasets इकट्ठा करते हैं और फिर विचार निकालने के लिए उनसे patterns खींचते हैं। This is a little bit different from how science used to work, where you make a few observations or have one out-of-the-blue idea, and then collect data to test your idea. यह उस तरह से थोड़ा अलग है जैसे विज्ञान पहले काम करता था, जहां आप कुछ अवलोकन करते हैं या कोई अचानक विचार आता है, और फिर अपने विचार को परखने के लिए डेटा इकट्ठा करते हैं। That's the classic scientific method. यह क्लासिक वैज्ञानिक पद्धति है। Now it's almost reversed. अब यह लगभग उल्टा है। You collect big data first, and then you try to get hypotheses from it. आप पहले big data इकट्ठा करते हैं, और फिर उससे hypotheses निकालने की कोशिश करते हैं। Kepler was maybe one of the first early data scientists, but even he didn't start with Tycho's dataset and then analyze it. Kepler शायद पहले शुरुआती data scientists में से एक थे, लेकिन वे भी Tycho के dataset से शुरू करके उसका विश्लेषण नहीं करते थे। He had some preconceived theories first. उनके पहले से कुछ पूर्वकल्पित theories थीं। It seems like this is less and less the way we make progress, just because the data is so much more massive and useful. ऐसा लगता है कि जिस तरह से हम प्रगति करते हैं वह कम होती जा रही है, सिर्फ इसलिए कि डेटा बहुत अधिक विशाल और उपयोगी है। Oh, interesting. ओह, दिलचस्प। I feel like the 20th-century science that you're describing actually very well describes what happened with Kepler. मुझे लगता है कि 20वीं सदी का विज्ञान जो आप describe कर रहे हैं वह वास्तव में Kepler के साथ जो हुआ उसे बहुत अच्छी तरह describe करता है। He did have these ideas—1595 and '96 is where he comes up with the polygons and then the Platonic objects theory—but they were wrong. उनके पास ये विचार थे, 1595 और '96 में वे polygons और फिर Platonic objects theory लेकर आते हैं, लेकिन वे गलत थे। Then a few years later, he gets Brahe's data, and it's only after twenty years of trying random things that he gets this empirical regularity. फिर कुछ साल बाद उन्हें Brahe का डेटा मिलता है, और यादृच्छिक चीज़ें आज़माने के बीस साल बाद ही उन्हें यह empirical regularity मिलती है। It actually feels a bit closer to Brahe's data being analogous to some massive data bank of simulations, and now that you've got the data, you can keep trying random things. यह actually Brahe के डेटा के किसी विशाल simulations के डेटा बैंक के समान होने जैसा लगता है, और अब जब आपके पास डेटा है, आप यादृच्छिक चीज़ें आज़माते रह सकते हैं। If it wasn't for that, Kepler would be out there just writing books about harmonics and Platonic objects, and there would be nothing to actually verify against. अगर वह नहीं होता, तो Kepler harmonics और Platonic objects के बारे में किताबें लिखता रहता, और सत्यापन के लिए कुछ भी नहीं होता। The data was extremely important. डेटा अत्यंत महत्वपूर्ण था। The distinction I was trying to make was that traditionally, you make a hypothesis and then you test it against data. जो अंतर मैं करने की कोशिश कर रहा था वह यह था कि परंपरागत रूप से, आप एक hypothesis बनाते हैं और फिर उसे डेटा के विरुद्ध परखते हैं। But now with machine learning, data analysis, and statistics, you can start with data and through statistics work out laws that were not present before. लेकिन अब machine learning, data analysis, और statistics के साथ, आप डेटा से शुरू कर सकते हैं और statistics के ज़रिए ऐसे नियम निकाल सकते हैं जो पहले मौजूद नहीं थे। Kepler's third law is a little bit like this, except that instead of having the thousand data points that Brahe had, Kepler had six data points. Kepler का तीसरा नियम थोड़ा ऐसा ही है, सिवाय इसके कि Brahe के हज़ार डेटा points के बजाय, Kepler के पास छह डेटा points थे। For every planet, he knew the length of the orbit and the distance to the Sun. हर ग्रह के लिए, उन्हें कक्षा की लंबाई और सूरज से दूरी पता थी। There were five or six data points, and he did what we would now call regression. पांच या छह डेटा points थे, और उन्होंने वही किया जिसे हम अब regression कहते हैं। He fit a curve to these six data points and got a square-cube law, which was amazing. उन्होंने इन छह डेटा points पर एक वक्र fit किया और square-cube law निकाला, जो अद्भुत था। But he was quite lucky that these six data points gave him the right conclusion. लेकिन वे काफी भाग्यशाली थे कि इन छह डेटा points ने उन्हें सही निष्कर्ष दिया। That's not enough data to be really reliable. वास्तव में विश्वसनीय होने के लिए इतना डेटा पर्याप्त नहीं था। There was a later astronomer, Johann Bode, who took the same data—the distances to the planets—and inspired by Kepler, he had a prediction that the distances to the planets formed a shifted geometric progression. एक बाद के खगोलविद थे, Johann Bode, जिन्होंने वही डेटा लिया, ग्रहों की दूरियां, और Kepler से प्रेरित होकर उन्होंने एक अनुमान लगाया कि ग्रहों की दूरियां एक shifted geometric progression बनाती हैं। He also fit a curve, except there was one point missing. उन्होंने भी एक वक्र fit किया, सिवाय इसके कि एक point गायब था। There was a big gap between Mars and Jupiter. Mars और Jupiter के बीच एक बड़ा अंतर था। His law predicted that there was a missing planet. उनके नियम ने भविष्यवाणी की कि एक गायब ग्रह है। It was kind of a crank theory, except when Uranus was discovered by Herschel, the distance to Uranus fit exactly this pattern. यह एक अटपटी theory थी, सिवाय इसके कि जब Herschel ने Uranus की खोज की, तो Uranus की दूरी इस pattern से बिल्कुल मेल खाती थी। Then Ceres was discovered in the asteroid belt, and it also fit the pattern. फिर asteroid belt में Ceres की खोज हुई, और वह भी pattern में fit हुआ। People got really excited that Bode had discovered this amazing new law of nature. लोग बहुत उत्साहित हो गए कि Bode ने प्रकृति का यह अद्भुत नया नियम खोजा है।