Red teaming large language models (LLMs) for resilience to scientific disinformation

23 May 2024

This note provides a summary of a red teaming event co-hosted by the Royal Society and Humane Intelligence in the run-up to the 2023 AI Safety Summit (Bletchley Park, UK).

The event took place on 25 October 2023 as part of the Science x AI Safety series of events hosted at the Royal Society, which explored the risks associated with the use of AI in scientific activities. It brought together 40 postgraduate students in health and climate research to scrutinise, and draw attention to, potential vulnerabilities in large language models (LLMs).

Building on the report The online information environment: Understanding how the internet shapes people's engagement with scientific information, published in January 2022, the activity aimed to provide insights into AI-generated scientific disinformation and the efficacy of LLM guardrails in preventing its production and dissemination. An additional objective was to understand the opportunities and limitations of involving scientists in red teaming efforts.

This note summarises preliminary findings from the activity and concludes with areas for further examination, research, and improvements to the design of future red teaming events.