

نوع الإرسال:مقالة بحثية أصلية
1 University of Leeds, UK, leeds, electrical engineering
2 Telecom, Imperial College, UK
Artificial intelligence (AI) is rapidly advancing scientific discovery, but this progress carries risks of misuse, such as the creation of harmful substances, or circumvention of established regulations. In this paper, we first demonstrate the risks by highlighting real-world examples of AI misuse in chemical science, which underscore the need for effective safety alignment for these AI models. In response, we propose SciGuard, an agent-based guardrail that employs large language models, tools and external knowledge to assess and control risks in scientific AI interactions. For a fair comparison, we introduce a benchmark SciMT (Scientific Multi-Task) to assess both the safety and utility of different AI systems. SciGuard achieves a state-of-the-art harmlessness score on red-teaming queries, while maintaining high performance on benign tasks, without sacrificing scientific knowledge. Finally, we call for continued research and dialogue to ensure the safe deployment of AI in science.
[1]Zuo K et al 2023 Electrified water treatment: fundamentals and roles of electrode materials Nat. Rev. Mater. 8 472–90
[2]Gingerich D B, Grol E and Mauter M S 2018 Fundamental challenges and engineering opportunities in flue gas desulfurization wastewater treatment at coal fired power plants Environ. Sci. 4 909–25
[3]Soliman M, Eljack F, Kazi M-K, Almomani F, Ahmed E and El Jack Z 2021 Treatment technologies for cooling water blowdown: a critical review Sustainability 14 376
[4]Simões C, Saakes M and Brilman D 2023 Toward redox-free reverse electrodialysis with carbon-based slurry electrodes Ind. Eng. Chem. Res. 62 1665–75