Why the Turing Test is still the best benchmark to assess AI

Image: Supplied
'A computer would deserve to be called intelligent if it could deceive a human into believing that it was human.' Alan Turing
We have come a long way since the beginning of modern AI in the 1950s and especially in the last few years. I believe we are now at the tipping point where AI is changing the way we do research and changing the way industry interacts with these technologies. Politics and society are having to adjust and make sure that AI is used in an ethical and secure way, and also that privacy concerns are addressed. Whilst AI has a lot of potential, there are still a number of issues and concerns. If we manage to address these, we can look ahead to good things from AI.
Alan Turing (1912 – 1954) was a British mathematician and computer scientist and he's also widely known as the father of theoretical computer science and AI. He made a number of notable contributions, for instance, he introduced the concepts of a theoretical computing machine, also known as the Turing machine, which laid the foundation for what is now known as modern computer science. He worked on the design of early computers with the National Physics Laboratory and also later at the University of Manchester, where I'm based. He undertook pioneering work and this continues to be influential in contemporary computer science. He also developed the Turing test that measures the ability of a machine to exhibit intelligent behaviour that's equivalent or indistinguishable from that of a human.
The Turing Test: Why its relevant
The Turing test is still used today. Turing introduced it as a test for what's known as the imitation game in which a human interrogator interacts with two hidden entities — one human and the other a machine — through text-based communication, similar to ChatGPT. The interrogator cannot see or hear the participants and must rely just on the text conversation to make a judgment on whether it's a machine or a human. The objective for the machine is to generate responses that are indistinguishable from those of a human. The human participant aims to convince the interrogator of her/his humanity. If the interrogator cannot reliably distinguish between a machine and a human, then the machine is said to have passed the Turing test.
It sounds very simple but it's an important test because it has become a classic benchmark for assessing AI. But there are also criticisms and limitations to the test. As we mark Alan Turing Day 2024, I can say that AI is moving closer to passing the Turing test – but we're not quite there yet.
A recent paper stated that ChatGPT had passed the Turing test. ChatGPT is a natural language processing model and generates responses to questions that we pose that look like responses from a human. Some people would say ChatGPT has passed the Turing test and certainly for short conversations, ChatGPT is doing quite a good job. But as you have a longer conversation with ChatGPT, you notice there are some flaws and weaknesses. So, I think ChatGPT is probably the closest we get to passing the Turing test, at the moment.
Many researchers and companies are working on improving the current version of ChatGPT and I would like to see that the machine understands what it produces. At the moment, ChatGPT produces a sequence of words that are suitable to address a particular query but it doesn't understand the meaning of these words. If ChatGPT understands the true meaning of a sentence – and that is done by contextualising a particular response or query — I think we are then in a position to say, yes, it has passed the Turing test. I would have hoped to pass this stage by now but I hope we will reach this point in a few years' time, perhaps around 2030.
At the University of Manchester, we are working on various aspects of AI in healthcare — getting better, cheaper or quicker treatment is in the interest of society. It starts off with drug discovery. Can we find drugs that are more potent than drugs and have fewer side effects and ideally are cheaper to manufacture than the drugs currently available? We use AI to help guide us through the search space of different drug combinations. And the AI tells us, for example, which drugs we should combine and at which dose.
We also work with the UK National Health Service and have come up with fairer reimbursement schemes for hospitals. In one case, we use what's called sequential decision making. In the other one, we use techniques that are based on decision trees. So, we use different methods and look at different applications of AI within healthcare.
A particular area of cyber security that I'm working on is secure source code – it's the way we tell a computer what to do and is one of the fundamental levels we humans interact with a computer. If the source code (a sequence of instructions) is poor quality, then it can open up security vulnerabilities which could be exploited by hackers. We use verification techniques combined with AI to scan through source code, identify security issues of different types, and then fix them. We have shown that by doing that, we increase the quality of code and improve the resilience of a piece of software. We generate a lot of code and we want to make sure the code is safe, especially if for a business in a high stakes sector, such as healthcare, defence or finance.
AI in sport
There's a lot of scope and potential for AI in creativity and sport. In football, we have data about match action – where the ball is, who has the ball, and the positioning of the players. It's really big data and we can analyse it to refine a strategy when playing a particular opponent, by looking at past performance and player style, and use the data to adjust our strategy. This would be very tough without AI because of the sheer amount and complexity of the data.
We are also looking at music education and helping people learn an instrument better by creating virtual music teachers. We can use AI combined with other technologies, such as virtual reality and augmented reality, to project a tutor. If you wear VR goggles, you can actually interact with the tutor. This is quite revolutionary and potentially opens up music to everyone on the planet.
At the moment we're at the stage where AI is exceptionally good in doing specific tasks and we are making very good progress on general AI — AI behaving in a similar way to humans and that we can interact with. This is a game changer made possible by ChatGPT and other examples. This technology is being used by industry for completely new business ideas we haven't even thought of.
A vision and strategy for AI is crucial. The UAE National Strategy for AI 2031 is a very good example of an ambitious vision covering education and reskilling, investment in research but also in the translation of research into practice.
The strategy even looks at ethical AI development, making sure the AI is used ethically, securely and that privacy concerns are mitigated. I think the strategy has all the components that are needed to be successful and we can all learn a lot from this approach.
The writer is the professor of Applied Artificial Intelligence and Associate Dean for Business Engagement, Civic & Cultural Partnerships (Humanities) at Alliance Manchester Business School,
Read

Hashtags

#AI

#NationalPhysicsLaboratory

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

SAE University College Dubai opens admissions for project-based Bachelor of Computer Science

Khaleej Times

an hour ago

Khaleej Times

SAE University College Dubai opens admissions for project-based Bachelor of Computer Science

SAE University College Dubai is now accepting applications for its Bachelor of Computer Science program, starting September 2025. This Australian-accredited degree takes a hands-on, project-based approach to learning. There are no written exams, just real-world work that builds a job-ready portfolio. From the first trimester, students create functional software using professional tools and workflows. They present their work, collaborate across teams, and graduate with skills that employers value. No memorisation. No theory overload. Just hands-on experience from day one. Each student is guaranteed an industry internship, giving them direct access to real tech environments before graduation. It's structured for the workplace, not just the classroom. There's no requirement for math or science grades. This programme is open to students from any academic background, especially those who think creatively, enjoy solving problems, and want to learn by doing. Collaboration is built in. Computer Science students work alongside peers in Design, Animation, Audio, Film, and Games. It reflects how real-world creative tech teams operate. SAE has over 40 campuses in 20 countries and has been delivering creative media and tech education for more than 45 years. Degrees from SAE Dubai are accredited by Australia's Tertiary Education Quality and Standards Agency (TEQSA). The Dubai campus is also licensed in the UAE by the Ministry of Higher Education and Scientific Research. The Bachelor of Computer Science is one of several industry-focused degrees offered at SAE Dubai. For those who do not meet direct entry requirements, the one-year foundation programme offers a pathway into a bachelor's degree. Applications are now open for all bachelor's and foundation programme starting September 2025. For more information, visit the website:

UAE employees outpace EMEA peers in cyber confidence, readiness: Cohesity

Tahawul Tech

an hour ago

Tahawul Tech

UAE employees outpace EMEA peers in cyber confidence, readiness: Cohesity

Dubai –Cohesity, a leader in AI-powered data security and resilience, today released the findings of a new study examining employee preparedness in the face of cyber threats. The research shows that the UAE workforce is ahead of its EMEA peers across several indicators of cyber-readiness, underscoring the country's progress toward its national vision for digital resilience and AI-enabled defence. Conducted among full-time office workers in the UAE, United Kingdom, France, and Germany, the study assessed how confident employees feel in identifying and responding to cyberattacks. Among the standout results, 86 percent of UAE employees expressed confidence in recognising a cyber threat—compared to 81 percent in the UK, 80 percent in Germany, and just 62 percent in France. Nearly nine in ten (89%) UAE respondents also said they trust their organisation's ability to prevent and recover from attacks. 86% of UAE employees feel confident in identifying cyberattacks, but reporting gaps still exist. Employees are willing to act – but fear of blame and unclear protocols still hold them back. With national cybersecurity investment gaining traction, organisations must now empower their people to complete the defense chain. Beyond awareness, the study reveals encouraging signs of action-oriented behaviour. Two-thirds of UAE employees say they would report suspicious activity to their cybersecurity team, showing an apt response, in comparison to respondents from the UK (61%), Germany (53%), and France (48%). Amongst other UAE employees, over half would notify their IT department. This instinct to act is supported by ongoing education: 66 percent have received some form of cybersecurity training in the past year. Cyber threats have become more advanced and relentless, turning data into both a critical asset and a primary target. Organisations now face mounting pressure to secure their digital ecosystems, navigate complex regulatory landscapes, and ensure business continuity. Cohesity addresses these challenges through a platform that unifies data security, simplifies recovery, and fosters a culture of resilience. Speaking to Mark Molyneux, CTO, EMEA, and Johnny Karam, Managing Director and Vice President, International Emerging Region at Cohesity, share how enterprises can strengthen their cyber posture by combining cutting-edge technology with empowered, security-conscious teams. Interview Excerpts: Q: Why is data security so critical for organisations today, regardless of size? Mark Molyneux: Data is vital to any organisation, and threat actors target it for its value. Even partial access or corruption can cause catastrophic damage. Modern attacks are sophisticated; once inside, bad actors map systems and target backups to prevent recovery, leaving organisations with two choices: pay a ransom or face ruin. Cohesity helps organisations understand their data landscape, as most don't classify their vast amounts of information. This leaves them vulnerable to theft or deletion without knowing what's lost. Cohesity focuses on threat detection, anomaly hunting, and clean recovery, shifting from mere backup to cyber resilience. They provide tools to track access, report to regulators, and ensure quick, secure recovery. Q: With your background in financial services and IT, how are top-tier organisations in EMEA adapting their hybrid cloud strategies in response to increasing cyber threats and regulatory pressures? Mark Molyneux: Regulatory pressure is intensifying globally, with regulations like Europe's DORA and existing UK mandates. The UAE has proactively responded by developing a cybersecurity framework, fostering innovation, and prioritizing education, leading to high ransomware awareness (86% vs. France's 61%). Despite this, a fear of reporting incidents persists, increasing risk. Cohesity aids organisations in measuring and improving their cyber maturity with a five-step model, emphasising readiness for inevitable cyberattacks by providing recovery and rapid online restoration. Q: How does Cohesity's next-gen data security and management platform address the challenges large enterprises face in managing data across multi-cloud environments? And how does centralising data reduce cost? Mark Molyneux: Our platform, designed over a decade ago, automatically classifies and indexes data upon arrival. This ensures precise data knowledge, retention, and location within your records strategy. This also brings significant cost efficiencies by eliminating unnecessary data storage, reducing hardware investment, optimising cloud usage, and aligning with sustainability goals. We integrate AI and natural language processing for smart, contextual queries. For instance, our RAG AI accurately finds vendor contracts from specific timeframes, referencing data origin for full traceability, surpassing standard generative AI models. Q: What recurring challenges are shaping Cohesity's product development and innovation roadmap? Mark Molyneux: Our current focus is on integrating Veritas, which was acquired in December. We're unifying Veritas's technology with Cohesity's capabilities, evolving both NetBackup and DataProtect under one management pane. Mohit Aron's intelligent file system will support both environments. Gaia, our AI product, will extract insights from data across both portfolios, aligning with our mission to protect, secure, and derive insight from data. Q: When large enterprises consolidate data into a centralised platform across geographies, how do you ensure zero trust? Especially in today's hybrid work environments, how can organisations be sure their data is secure and access is tightly controlled? Johnny Karam: It's a great and very relevant question. Our platform is built with zero trust at its core. As enterprises move data from various clouds and data centres into one secure platform, we ensure multiple layers of protection: Encryption – All data is encrypted at rest and in transit. Multi-Factor Authentication – It's not enough to just have a username and password; access requires multi-factor authentication. Multi-Person Authorisation – For high-sensitivity data, one person alone cannot delete or alter it. You can configure the system to require two or more authorised users to approve critical actions. Immutability and Air Gapping – We store a secure, remote copy of the data (WORM – Write Once, Read Many) that cannot be altered, even in the event of a breach. These controls are just part of our embedded security framework. Additional advanced features are also included but go beyond the scope of today's conversation. Security is not an add-on for us—it's built into the DNA of the platform. That's why customers trust us. Q: What practical steps should organisations in the region take to close the gaps in internal reporting and foster a culture of psychological safety around cyber incident disclosures? Johnny Karam: While UAE organisations have improved cyber awareness—86% of employees recognise threats—a gap remains in reporting. Our research shows 46% hesitate to report due to fear of blame. To bridge this, organisations must cultivate a psychologically safe environment where reporting is encouraged, even for false alarms. Simplifying protocols and using simulations will help reinforce that reporting is always the correct action. At Cohesity, we believe cyber resilience relies on empowered people and a culture where employees confidently exercise their reporting responsibility. Q: The study indicates that 46% of UAE employees hesitate to report threats due to fear of blame or confusion. How can Cohesity's approach to data security and resilience support enterprises in transforming this mindset into a strength, particularly in high-pressure threat scenarios? Johnny Karam: Fear-based hesitation undermines security. Cohesity approaches cyber resilience holistically, integrating secure technology with empowered human behavior. Our platform aids rapid detection, response, and recovery, reducing employee stress. Critically, we partner with leadership to foster a culture of shared responsibility through cyber maturity frameworks, assessing technology, people, and processes. When employees trust their actions, have clarity, and confidence, they become a vital defense. Normalising reporting, clarifying escalation paths, and removing ambiguity replaces hesitation with decisive, informed action. Q: With the UAE's strategic focus on AI and national cyber frameworks, how is Cohesity aligning its roadmap in emerging regions to support not just technological readiness but also employee empowerment as a critical part of cyber resilience? Johnny Karam: The UAE is rapidly advancing in AI and cybersecurity, a vision Cohesity aligns with. Our roadmap for the region focuses on delivering AI-powered technology for resilience and empowering users. We've integrated AI into our platform for automated threat detection and faster response, viewing AI as a tool to amplify human decision-making. We invest in secure, intuitive solutions that support all employees, not just IT. Employee empowerment is crucial for cyber resilience; we aim for every user to operate with clarity and contribute to security. The UAE offers a significant opportunity to shape a global model for workforce-driven resilience.

Reddit Mulls Eye‑Scan as Proof of Human Authenticity

Arabian Post

an hour ago

Arabian Post

Reddit Mulls Eye‑Scan as Proof of Human Authenticity

Reddit is exploring integration of Worldcoin's iris‑scanning Orb technology to verify that account holders are genuine, unique individuals while preserving user anonymity. The move aims to curb bot activity and AI‑generated content, and to comply with emerging age‑verification regulations. In discussions with Tools for Humanity, the Orb would capture an encrypted representation of a user's iris—known as an IrisCode—to assign a secure, anonymous World ID. That identifier confirms uniqueness across the Reddit platform without revealing the user's real identity. News of the potential partnership has sparked debate. Proponents argue the system could enhance trust and moderate authenticity, while critics express concerns over biometric data collection by private firms. Already, public backlash on social media indicates many users may oppose such a shift, citing privacy and anonymity fears. ADVERTISEMENT World has thus far deployed more than 12 million iris scans via Orbs in cities across the US, UK, and South Korea, awarding users Worldcoin cryptocurrency in exchange. Its age‑detection software refuses scans for those under 18—a feature that may assist Reddit with compliance. Reddit disclosed in May that it needs 'a little more information' from users to fulfill regulatory requirements around age and AI-generated content, though it did not mention Worldcoin at the time. The platform has also taken legal action against experiments using AI to mimic real users. World's supporters emphasise that the IrisCode does not match stored images; rather, it is encrypted, fragmented and cannot be reverse‑engineered into a biometric photograph. After the initial scan, raw images are deleted. Still, critics and regulators in Europe have flagged concerns that anonymisation is reversible or incomplete. Critics, including privacy advocates and policy makers, point out that the system could still enable re‑identification. The Electronic Privacy Information Center has described Worldcoin as a 'potential privacy nightmare,' and regulators in Germany, Spain and Portugal have raised doubts about its data‑handling practices. Reddit remains silent publicly, with no confirmed timeline for implementation. The platform is reportedly discussing an opt‑in model, where users voluntarily choose iris‑based verification to gain benefits such as enhanced reputation or reduced spam. The move comes amid mounting pressure on digital platforms. Governments in the US are drafting age‑verification laws, and techniques using AI to create deep‑fake personas are challenging online discourse integrity. A study by the University of Zurich, prompting Reddit legal threats, alerts to AI's ability to impersonate real individuals during debates. For Reddit, the Orb's age‑gating tech could help block under‑18s in compliance with evolving laws. The system also enables verification without compromising pseudonymity—users would not need to share names or personal data. World has attracted significant investment—some USD 300 million—to expand its infrastructure. Investors include Andreessen Horowitz and Bain Capital Crypto, and partnerships have been forged with Visa and Match Group to pilot World ID in industries like finance and dating services. Yet the deployment of Orbs has drawn scrutiny. European data‑protection regulators in Germany and Spain suspended operations, citing unresolved concerns. In Hong Kong, a probe found that biometric imagery collection breached privacy laws, ordering users' data blocked or deleted. Users on Reddit voice sharp criticism. One commenter wrote, 'But supposing I have an eye condition… reddit seems extremely happy with all the very obvious bots …' Others express unwillingness to sacrifice anonymity: 'Tell me—should I be happy … or sad that they want to verify us users by a method we already declined?' Despite opposition, World's Orb model is gaining attention across tech platforms. Match Group is piloting age‑verification via World ID in Japan. Visa is launching a debit card for users who complete orb scans. Such trials signal mainstream interest even as privacy issues remain unresolved. Reddit's potential adoption would mark one of the most high‑profile entries of biometric verification in a major social network. It signals a pivotal moment in balancing digital authenticity with user privacy, as AI‑generated content and regulatory demands escalate.

Why the Turing Test is still the best benchmark to assess AI

Hashtags

Try Our AI Features

Comments

Related Articles

SAE University College Dubai opens admissions for project-based Bachelor of Computer Science

UAE employees outpace EMEA peers in cyber confidence, readiness: Cohesity

Reddit Mulls Eye‑Scan as Proof of Human Authenticity

Get Started Now: Download the App