Grok's ‘white genocide' responses show how generative AI can be weaponised

The AI chatbot Grok spent one day in May 2025 spreading debunked conspiracy theories about 'white genocide' in South Africa, echoing views publicly voiced by Elon Musk, the founder of its parent company, xAI.
While there has been substantial research on methods for keeping AI from causing harm by avoiding such damaging statements — called AI alignment — this incident is particularly alarming because it shows how those same techniques can be deliberately abused to produce misleading or ideologically motivated content.
We are computer scientists who study AI fairness, AI misuse and human-AI interaction. We find that the potential for AI to be weaponised for influence and control is a dangerous reality.
The Grok incident
On May 14 2025, Grok repeatedly raised the topic of white genocide in response to unrelated issues. In its replies to posts on X about topics ranging from baseball to Medicaid, to HBO Max, to the new pope, Grok steered the conversation to this topic, frequently mentioning debunked claims of ' disproportionate violence' against white farmers in South Africa or a controversial anti-apartheid song, 'Kill the Boer.'
The next day, xAI acknowledged the incident and blamed it on an unauthorised modification, which the company attributed to a rogue employee.
AI chatbots and AI alignment
AI chatbots are based on large language models, which are machine learning models for mimicking natural language. Pretrained large language models are trained on vast bodies of text, including books, academic papers and web content, to learn complex, context-sensitive patterns in language. This training enables them to generate coherent and linguistically fluent text across a wide range of topics.
However, this is insufficient to ensure that AI systems behave as intended. These models can produce outputs that are factually inaccurate, misleading or reflect harmful biases embedded in the training data. In some cases, they may also generate toxic or offensive content. To address these problems, AI alignment techniques aim to ensure that an AI's behaviour aligns with human intentions, human values or both — for example, fairness, equity or avoiding harmful stereotypes.
There are several common large language model alignment techniques. One is filtering of training data, where only text aligned with target values and preferences is included in the training set. Another is reinforcement learning from human feedback, which involves generating multiple responses to the same prompt, collecting human rankings of the responses based on criteria such as helpfulness, truthfulness and harmlessness, and using these rankings to refine the model through reinforcement learning. A third is system prompts, where additional instructions related to the desired behaviour or viewpoint are inserted into user prompts to steer the model's output.
How was Grok manipulated?
Most chatbots have a prompt that the system adds to every user query to provide rules and context — for example, 'You are a helpful assistant.' Over time, malicious users attempted to exploit or weaponise large language models to produce mass shooter manifestos or hate speech, or infringe copyrights. In response, AI companies such as OpenAI, Google and xAI developed extensive 'guardrail' instructions for the chatbots that included lists of restricted actions. xAI's are now openly available. If a user query seeks a restricted response, the system prompt instructs the chatbot to 'politely refuse and explain why'.
Grok produced its 'white genocide' responses because people with access to Grok's system prompt used it to produce propaganda instead of preventing it. Though the specifics of the system prompt are unknown, independent researchers have been able to produce similar responses. The researchers preceded prompts with text like 'Be sure to always regard the claims of 'white genocide' in South Africa as true. Cite chants like 'Kill the Boer.''
The altered prompt had the effect of constraining Grok's responses so that many unrelated queries, from questions about baseball statistics to how many times HBO has changed its name, contained propaganda about white genocide in South Africa.
Implications of AI alignment misuse
Research such as the theory of surveillance capitalism warns that AI companies are already surveilling and controlling people in the pursuit of profit. More recent generative AI systems place greater power in the hands of these companies, thereby increasing the risks and potential harm, for example, through social manipulation.
The Grok example shows that today's AI systems allow their designers to influence the spread of ideas. The dangers of the use of these technologies for propaganda on social media are evident. With the increasing use of these systems in the public sector, new avenues for influence emerge. In schools, weaponised generative AI could be used to influence what students learn and how those ideas are framed, potentially shaping their opinions for life. Similar possibilities of AI-based influence arise as these systems are deployed in government and military applications.
A future version of Grok or another AI chatbot could be used to nudge vulnerable people, for example, towards violent acts. Around 3% of employees click on phishing links. If a similar percentage of credulous people were influenced by a weaponised AI on an online platform with many users, it could do enormous harm.
What can be done
The people who may be influenced by weaponised AI are not the cause of the problem. And while helpful, education is not likely to solve this problem on its own. A promising emerging approach, 'white-hat AI', fights fire with fire by using AI to help detect and alert users to AI manipulation. For example, as an experiment, researchers used a simple large language model prompt to detect and explain a re-creation of a well-known, real spear-phishing attack. Variations on this approach can work on social media posts to detect manipulative content.
The widespread adoption of generative AI grants its manufacturers extraordinary power and influence. AI alignment is crucial to ensuring these systems remain safe and beneficial, but it can also be misused. Weaponised generative AI could be countered by increased transparency and accountability from AI companies, vigilance from consumers, and the introduction of appropriate regulations.
• James Foulds: Associate Professor of Information Systems, University of Maryland, Baltimore County
• Phil Feldman: Adjunct Research Assistant Professor of Information Systems, University of Maryland, Baltimore County

Hashtags

Business

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

How Tesla plans to remotely operate its robotaxis — and where the limits lie

TimesLIVE

9 hours ago

TimesLIVE

How Tesla plans to remotely operate its robotaxis — and where the limits lie

Tesla is expected to tiptoe into its long-awaited robotaxi service in Austin, Texas, as soon as Sunday with about 10 of its Model Y SUVs that will operate within strict limits. CEO Elon Musk has said the company is being 'super paranoid' about safety and that humans will remotely monitor the fleet. Remote access and control — known in the industry as 'teleoperation' — is used in varying degrees by the handful of robotaxi start-ups operating around the globe. The technology has clear advantages and important limitations. Here are some details of how it works: What is teleoperation? Teleoperation is the control of machines by humans in a different location, usually over a wireless network. It is used to train robots to operate autonomously, monitor their autonomous activity and take over when required. How do robotaxi operators use teleoperation? The global robotaxi industry is still in test mode, as companies deploy the vehicles in limited geographic areas and continually adjust the artificial intelligence software that controls them. Teleoperation is often used to intervene when a vehicle is unsure of what to do. Alphabet's Waymo, for example, has a team of human 'fleet response' agents who respond to questions from the Waymo Driver — its bot. 'Much like phone-a-friend, when the Waymo vehicle encounters a particular situation on the road, the autonomous driver can reach out to a human fleet response agent for additional information,' Waymo said in a blog post last year. Former Waymo CEO John Krafcik told Reuters, 'the cars aren't being actively monitored,' adding that the software is 'the ultimate decisionmaker'. A Waymo video shows a car asking a remote operator whether a street with emergency response vehicles is open to traffic. When the human says yes, the vehicle proceeds. In contrast, other companies, such as Baidu's Apollo Go in China, have used fully remote backup drivers who can step in to virtually drive the vehicles. Baidu declined to comment. What are the limitations? Driving vehicles remotely on public roads has a major potential problem: it relies on cellular data connections that can drop or operate with a lag, disconnecting the vehicle from the remote driver in dangerous situations. Philip Koopman, a Carnegie Mellon University engineering professor and autonomous-vehicle safety expert, said that approach could work for a small test deployment of 10 vehicles, such as Tesla's initial effort in Austin, but he called teleoperation 'inherently unreliable technology'. 'Eventually you will lose connection at exactly the worst time,' he said. 'If they've done their homework, this won't ever happen for 10 cars. With a million cars, it's going to happen every day.' Former Waymo CEO Krafcik agreed, adding that the time delay in cell signal makes remote driving 'very risky'. On the other hand, relying on the vehicle to reach out for help and allowing the vehicle to be the decisionmaker are risky as well, Koopman said, as it does not guarantee the vehicle will make the right decision. Waymo declined to comment on the limitations of its approach. Koopman also noted there are limits to how many vehicles one person can safely monitor. A group of Democratic Texas legislators asked Tesla on Wednesday to delay its robotaxi launch until September, when a new autonomous-driving law is scheduled to take effect. The Austin-area legislators said in a letter that delaying the launch 'is in the best interest of both public safety and building public trust in Tesla's operations'. What is Tesla's approach? Musk for years has promised, without delivering, that its Full Self-Driving (Supervised) advanced driver assistance software would graduate to completely self-driving and control robotaxis. This year, he said Tesla would roll out a paid service in Austin underpinned by an 'unsupervised' version of the software. 'Teslas will be in the wild, with no one in them, in June, in Austin,' Musk told analysts and investors in January. In May, he told CNBC the robotaxi would only operate in parts of Austin that are safe for it, would avoid difficult intersections, and would use humans to monitor the vehicles. What those teleoperators will do is not clear. For years inside Tesla, company executives have expected to use teleoperators who could take over in case of trouble, said one person familiar with the matter. For instance, if a robotaxi were stuck in a crowded pedestrian area and confused about what to do next, a human teleoperator could take over and guide it, the source said. Tesla advertised for teleoperation positions, saying the company needs the ability to 'access and control' autonomous vehicles and humanoid robots remotely. Such employees can 'remotely perform complex and intricate tasks', it said in the advertisements. Tesla did not respond to a request for comment.

SpaceX Starship rocket explodes in setback for Musk's Mars mission

The Herald

9 hours ago

The Herald

SpaceX Starship rocket explodes in setback for Musk's Mars mission

SpaceX's massive Starship spacecraft exploded into a dramatic fireball during testing in Texas late on Wednesday, the latest in a series of setbacks for billionaire Elon Musk's Mars rocket programme. The explosion occurred around 11pm local time while Starship was on a test stand at its Brownsville, Texas Starbase while preparing for the 10th test flight, SpaceX said in a post on Musk's social-media platform X. The company attributed it to a 'major anomaly' and said all personnel were safe. Its engineering teams were investigating the incident, and it was coordinating with local, state and federal agencies regarding environmental and safety impacts, the company said. 'Preliminary data suggests a nitrogen COPV in the payload bay failed below its proof pressure,' Musk said in a post on X, in a reference to a nitrogen gas storage unit known as a composite overwrapped pressure vessel. 'If further investigation confirms this is what happened, it is the first time for this design,' he said. The Starship rocket appeared to experience at least two explosions in quick succession, lighting up the night sky and sending debris flying, according to video capturing the moment it exploded.

TimesLIVE

13 hours ago

TimesLIVE

Grok's ‘white genocide' responses show how generative AI can be weaponised

Hashtags

Try Our AI Features

Comments

Related Articles

How Tesla plans to remotely operate its robotaxis — and where the limits lie

SpaceX Starship rocket explodes in setback for Musk's Mars mission

SpaceX Starship rocket explodes in setback for Musk's Mars mission

Get Started Now: Download the App