MEDIC: Monitoring, Evaluation and Diagnosing Intelligent Chatbots

The goal of the project led by Raytheon BBN Technologies is to develop an overall process for the evaluation of patient-facing medical-domain chatbots, covering a wide range of metrics and grounded in medical practice by the elicitation of desires and concerns from medical staff, caregivers, and patients. This project will use state-of-the-art large language model capabilities piloted across multiple US Government-funded projects to augment and enrich an initial starter-set of data to cover the problem space and use a collection of Machine Evaluators to produce the metrics, with a minimum of human supervision. The final deliverable will consist of a pipeline of tools for prompt enrichment and a similar system for chatbot evaluation. Immediate use cases include prenatal care in the first and second trimester and mental health in young adults.

Back to Award Directory