Kean Publications

A Testing Framework for AI Linguistic Systems (testFAILS)

Yulia Kumar, Kean University
Patricia Morreale, Kean University
Peter Sorial, Kean University
Justin Delgado, Kean University
J. Jenny Li, Kean University
Patrick Martins, Kean University

Document Type

Article

Publication Date

7-1-2023

Abstract

This paper presents an innovative testing framework, testFAILS, designed for the rigorous evaluation of AI Linguistic Systems (AILS), with particular emphasis on the various iterations of ChatGPT. Leveraging orthogonal array coverage, this framework provides a robust mechanism for assessing AI systems, addressing the critical question, “How should AI be evaluated?” While the Turing test has traditionally been the benchmark for AI evaluation, it is argued that current, publicly available chatbots, despite their rapid advancements, have yet to meet this standard. However, the pace of progress suggests that achieving Turing-test-level performance may be imminent. In the interim, the need for effective AI evaluation and testing methodologies remains paramount. Ongoing research has already validated several versions of ChatGPT, and comprehensive testing on the latest models, including ChatGPT-4, Bard, Bing Bot, and the LLaMA and PaLM 2 models, is currently being conducted. The testFAILS framework is designed to be adaptable, ready to evaluate new chatbot versions as they are released. Additionally, available chatbot APIs have been tested and applications have been developed, one of them being AIDoctor, presented in this paper, which utilizes the ChatGPT-4 model and Microsoft Azure AI technologies.

Publication Title

Electronics (Switzerland)

DOI

10.3390/electronics12143095

Recommended Citation

Kumar, Yulia; Morreale, Patricia; Sorial, Peter; Delgado, Justin; Li, J. Jenny; and Martins, Patrick, "A Testing Framework for AI Linguistic Systems (testFAILS)" (2023). Kean Publications. 110.
https://digitalcommons.kean.edu/keanpublications/110

This document is currently not available here.

COinS

Kean Publications

A Testing Framework for AI Linguistic Systems (testFAILS)

Document Type

Publication Date

Abstract

Publication Title

DOI

Recommended Citation

Browse

Search

Resources

Links

Kean Publications

A Testing Framework for AI Linguistic Systems (testFAILS)

Authors

Document Type

Publication Date

Abstract

Publication Title

DOI

Recommended Citation

Share

Browse

Search

Resources

Links