Can AI quicken the pace of math discovery?

Artificial intelligence can write a poem in the style of Walt Whitman, provide dating advice and suggest the best way to cook an artichoke. But when it comes to mathematics, large language models like OpenAI's immensely popular ChatGPT have sometimes stumbled over basic problems. Some see this as an inherent limitation of the technology, especially when it comes to complex reasoning.
A new initiative from the Defense Advanced Research Projects Agency seeks to account for that shortfall by enlisting researchers in finding ways to conduct high-level mathematics research with an AI 'co-author.' The goal of the new grant-making program, Exponentiating Mathematics, is to speed up the pace of progress in pure (as opposed to applied) math — and, in doing so, to turn AI into a superlative mathematician.
'Mathematics is this great test bed for what is right now the key pain point for AI systems,' said Patrick Shafto, a Rutgers University mathematician and computer scientist who now serves as a program manager in DARPA's information innovation office, known as I20. 'So if we overcome that, potentially, it would unleash much more powerful AI.' He added, 'There's huge potential benefit to the community of mathematicians and to society at large.'
Shafto spoke from his office at DARPA's headquarters, an anonymous building in northern Virginia whose facade of bluish glass gives little indication that it houses one of the most unusual agencies in the federal government. Inside the building's airy lobby, visitors surrender their cellphones. Near a bank of chairs, a glass display shows a prosthetic arm that can be controlled by the wearer's brain signals.
'By improving mathematics, we're also understanding how AI works better,' said Alondra Nelson, who served as a top science adviser in President Joe Biden's administration and is a faculty member at the Institute for Advanced Study in Princeton, New Jersey. 'So I think it's kind of a virtuous cycle of understanding.' She suggested that, down the road, math-adept AI could enhance cryptography and aid in space exploration.
Started after World War II to compete with the Soviet Union in the space race, DARPA is most famous for fostering the research that led to the creation of ARPANET, the precursor to the internet we use today. At the agency's small gift store, which is not accessible to the public, one can buy replicas of a cocktail napkin on which someone sketched out the rudimentary state of computer networks in 1969. DARPA later funded the research that gave rise to drones and Apple's digital assistant, Siri. But it is also responsible for the development of Agent Orange, the potent defoliant used to devastating effect during the Vietnam War.
'I'm sure this isn't 100% innocent,' Andrew Granville, a mathematician at the University of Montreal, said of DARPA's math initiative, although he emphasized that he was only speculating about eventual outcomes. DARPA is, after all, part of the Pentagon, even if it has traditionally operated with enviable independence. The U.S. military is rapidly incorporating AI into its operations, with the aim of not losing out to China and its People's Liberation Army or to Russia, which has been testing out new technologies on the battlefield in Ukraine.
At the same time, Granville praised the endeavor, which comes as the Trump administration is cutting funding for scientific research. 'We are in disastrous times for U.S. science,' Granville said. 'I'm very pleased that DARPA is able to funnel money to academia.'
A surfer and skateboarder in his free time, Shafto, 49, sat in a sparse conference room one recent afternoon, imagining a future when AI would be as good at solving multistep problems as it is at trying to glean meaning from huge troves of texts, which it does through the use of probability theory.
Despite the unseasonably raw weather, Shafto seemed dressed for the beach in a blue-and-white Hawaiian-style shirt, white flannel trousers and sandals, with a trilby hat on the table before him. His vibe was, on the whole, decidedly closer to that of Santa Cruz than of Capitol Hill, largely in keeping with DARPA's traditional disregard for the capital's slow, bureaucratic pace. (The agency sets priorities and funds outside scientists but does not do research on its own; academics like Shafto spend an average of four years as program managers.)
'There are great mathematicians who work on age-old problems,' Shafto said. 'That's not the kind of thing that I'm particularly interested in.' Instead, he wanted the discipline to move more quickly by using AI to save time.
'Problems in mathematics take decades or centuries, sometimes, to solve,' he said in a recent presentation at DARPA's headquarters on the Exponentiating Mathematics project, which is accepting applications through mid-July. He then shared a slide showing that, in terms of the number of papers published, math had stagnated during the last century while life and technical sciences had exploded. In case the point wasn't clear, the slide's heading drove it home: 'Math is sloooowwww. …'
The kind of pure math Shafto wants to accelerate tends to be 'sloooowwww' because it is not seeking numerical solutions to concrete problems, the way applied mathematics does. Instead, pure math is the heady domain of visionary theoreticians who make audacious observations about how the world works, which are promptly scrutinized (and sometimes torn apart) by their peers.
'Proof is king,' Granville said.
Math proofs consist of multiple building blocks called lemmas, minor theorems employed to prove bigger ones. Whether each Jenga tower of lemmas can maintain integrity in the face of intense scrutiny is precisely what makes pure math such a 'long and laborious process,' acknowledged Bryna R. Kra, a mathematician at Northwestern University. 'All of math builds on previous math, so you can't really prove new things if you don't understand how to prove the old things,' she said. 'To be a research mathematician, the current practice is that you go through every step, you prove every single detail.'
Lean, a software-based proof assistant, can speed up the process, but Granville said it was 'annoying, because it has its own protocols and language,' requiring programming expertise. 'We need to have a much better way of communication,' he added.
Could artificial intelligence save the day? That's the hope, according to Shafto. An AI model that could reliably check proofs would save enormous amounts of time, freeing mathematicians to be more creative. 'The constancy of math coincides with the fact that we practice math more or less the same: still people standing at a chalkboard,' Shafto said. 'It's hard not to draw the correlation and say, 'Well, you know, maybe if we had better tools, that would change progress.''
AI would benefit, too, Shafto and others believe. Large language models like ChatGPT can scour the digitized storehouses of human knowledge to produce a half-convincing college essay on the Russian Revolution. But thinking through the many intricate steps of a mathematical problem remains elusive.
'I think we'll learn a lot about what the capabilities of various AI protocols are from how well we can get them to generate material that's of interest,' said Jordan S. Ellenberg, a mathematician at the University of Wisconsin-Madison who is part of a team applying for an Exponentiating Mathematics grant. 'We have no intuition yet about which problems are going to be hard and which problems are easy. We need to learn that.'
One of the more disconcerting truths about artificial intelligence is that we do not entirely understand how it works. 'This lack of understanding is essentially unprecedented in the history of technology,' Dario Amodei, CEO of the artificial intelligence company Anthropic, wrote in a recent essay. Ellenberg somewhat downplayed that assertion, pointing out that electricity was widely used before its properties were fully understood. Then again, with some AI experts worrying that artificial intelligence could destroy the world, any clarity into its operations tends to be welcome.
Nelson, the former White House adviser, acknowledged 'legitimate' concerns about the rapid pace at which artificial intelligence is being integrated into seemingly every sector of society. All the more reason, she argued, to have DARPA on the case. 'There's a much higher benchmark that needs to be reached than whether or not your chatbot is hallucinating if you ask it a question about Shakespeare,' she said.
'The stakes are much higher.'

Hashtags

Science

#DefenseAdvancedResearchProjectsAgency

#WaltWhitman

#PatrickShafto

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Elon Musk wants to retrain XAI's chatbot Grok to clear 'ChatGPT's woke' and .... Garbage

Time of India

34 minutes ago

Time of India

Elon Musk wants to retrain XAI's chatbot Grok to clear 'ChatGPT's woke' and .... Garbage

Representative Image Elon Musk the founder of xAI has said that he will retrain his artificial intelligence chatbot and ChatGPT rival Grok . Musk took to X (formerly known as Twitter) and shared a post stating that he will be removing what he terms "ChatGPT's woke" biases and other "garbage" from the foundational knowledge of Grok. Elon Musk to retrain chatbot Grok In a series of posts shared on X, Elon Musk announced that the upcoming version of Grok likely to be called as Grok 4 will trained on the revised information curated by Grok 3.5's advanced reasoning capabilities. 'We will use Grok 3.5… to rewrite the entire corpus of human knowledge, adding missing information and deleting errors,' Musk wrote, adding that current AI models are trained on 'far too much garbage'. This move from Elon Musk comes after his repeated criticism of rival AI model — ChatGPT. Musk has criticised ChatGPT for what he perceives as a "woke mind virus" or ideological slant in their responses. Musk has also asked the users to submit their 'divisive facts' which will be used in the retraining of Grok. 'Please reply to this post with divisive facts for @Grok training. By this I mean things that are politically incorrect, but nonetheless factually true', wrote Musk. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Giao dịch vàng CFDs với mức chênh lệch giá thấp nhất IC Markets Đăng ký Undo Elon Musk's xAI issues clarification on Grok's responses on white genocide Recently, some X users reported that Grok repeatedly generated responses referring to the theory of "white genocide" in South Africa. Users who tagged @grok in posts about sports, entertainment, and general topics received replies discussing racial violence in South Africa, including references to the anti-apartheid chant 'Kill the Boer'. After this, Elon Musk's xAI issued a clarification for this incident. In a statement shared on X, xAI said that the modification in Grok violated the internal policies and core values, leading the chatbot to repeatedly reference politically sensitive topics. The company stated that the change was detected and reversed promptly, though it did not disclose who was responsible for the alteration.

AI meets adult content: THIS platform is a ‘lovechild between OnlyFans and OpenAI'

Mint

2 hours ago

Mint

AI meets adult content: THIS platform is a ‘lovechild between OnlyFans and OpenAI'

Ever since OpenAI introduced the general world to the many possibilities of artificial intelligence (AI), developers have been experimenting with ways the technology can change the overall user experience. In one such experiment, a start-up with over 2,00,000 users in the United States, brought together the endlessness of AI and fame, and merged it with the "spicy fantasies" of OnlyFans users. OhChat, a platform its creator described as the 'lovechild between OnlyFans and OpenAI,' uses artificial intelligence to build lifelike digital duplicates of public figures. These AI avatars of adult content celebrities don't eat, sleep or breathe, but 'remember you, desire you and never log off'. In an interview with CNN, OhChat CEO Nic Young said goes a step further than platforms such as OnlyFans, where users pay to gain access to adult content from content creators. Once activated, the avatars run autonomously, offering 'infinite personalised content' for subscribers. OhChat 'is an incredibly powerful tool, and tools can be used however the human behind it wants to be used,' he said. 'We could use this in a really scary way, but we're using it in a really, I think, good, exciting way.' Young told CNN that OhChat works on a tiered subscription model wherein a user pays $4.99 ( ₹ 430) per month for unlimited texts on demand, $9.99 ( ₹ 865) for capped access to voice notes and images, or $29.99 ( ₹ 2,600) for unlimited VIP interaction. According to Young, platform creators receive an 80 per cent cut from the revenue their AI avatar generates. OhChat keeps the remaining 20 per cent. 'You have literally unlimited passive income without having to do anything again,' Young told CNN. Since launching OhChat in October 2024, the company has signed 20 creators, including 'Baywatch' actress Carmen Electra, and former British glamour model Katie Price – Jordan. Some of the creators are already earning thousands of dollars per month, Young said. Nic Young said that to build a digital twin, OhChat asks its creators to submit 30 images of themselves and speak to a bot for 30 minutes. The platform can then generate the digital replica 'within hours' using Meta's large language model. For example, the AI avatar of Jordan is trained to mimic her voice, appearance and mannerisms. She can 'sext' users, send voice notes and images, and provide on-demand intimacy at scale – all without her lifting a finger. The platform was categorised with their AI avatars on an internal scale to rank the intensity and explicitness of their interactions. Creators contributing to the platform decide which level their avatar will be.

Perplexity's AI chatbot can now generate videos on X: Here's how to use it

Indian Express

3 hours ago

Indian Express

Perplexity's AI chatbot can now generate videos on X: Here's how to use it

AI search startup Perplexity has upgraded its chatbot on X (formerly Twitter) with a new feature that has flooded the Elon Musk-owned platform with AI-generated visuals. The 'Ask Perplexity' bot on X can now generate short, eight-second video clips with sound using AI. Users simply have to tag @AskPerplexity along with a short prompt and in return, they will receive an AI-generated video with creative visuals and audio, including dialogue. While the new video generation feature may boost engagement for Perplexity's chatbot, it also raises concerns about the spread of misinformation on X, which is a platform that has already been strongly criticised for its lax content moderation. However, Perplexity has said that it has implemented strong content filters to prevent the misuse of the latest AI video generation feature. It also highlights the growing rivalry between AskPerplexity and Grok, developed by Elon Musk's xAI venture. They are two of the most popular automated accounts on X that are frequently tagged and asked questions in the replies. However, the Grok AI models do not yet have the ability to generate videos. Soon after Perplexity introduced the new AI feature, users on X began posting wildly imaginative, AI-generated videos depicting fictional scenarios involving real-life celebrities, politicians, and world leaders, etc. The surge in demand inevitably led to a delay in generating videos with the bot account stating that video generation could take longer than expected due to high traffic. 'I've read through your video request DMs. Some of y'all need help,' the AskPerplexity account posted on X. Perplexity has also been looking to make its AI chatbot more accessible by rolling out its services on WhatsApp. In April this year, Perplexity AI became available directly on the messaging platform, allowing users to access the AI-powered answer engine without downloading a separate app or signing up. Similar to Perplexity, users can also access ChatGPT on WhatsApp, along with the natively integrated Meta AI. To access Perplexity AI, save +1 (833) 436-3285 to your contact list and start asking questions or queries. Users can access Perplexity AI on smartphones, PCs, and Macs, as well as via WhatsApp Web. The Google rival has also been facing legal challenges from various publishers. Recently, BBC said it would take legal action against Perplexity for allegedly training its 'default AI model' using the UK broadcaster's content. In a letter addressed to Aravind Srinivas, the CEO of Perplexity, BBC said it may seek an injunction against it unless the AI firm stops scraping its content, deletes existing copies used to train its AI systems, and submits 'a proposal for financial compensation' for the alleged misuse of its intellectual property, according to a report by Reuters. In response, Perplexity said BBC's claims were 'manipulative and opportunistic'. It added that the publisher had 'a fundamental misunderstanding of technology, the internet and intellectual property law.'