Scientific report: "Using a Randomised Controlled Clinical Trial to Evaluate an NLG System"

Ehud Reiter, Roma Robertson, A. Scott Lennox, Liesl Osman
Departments of Computing Science, General Practice, and Medicine and Therapeutics, University of Aberdeen, Aberdeen, Scotland, UK

Abstract

The STOP system, which generates personalised smoking-cessation letters, was evaluated by a randomised controlled clinical trial. We believe this is the largest and perhaps most rigorous task effectiveness evaluation ever performed on an NLG system. The detailed results of the clinical trial have been presented elsewhere, in the medical literature. In this paper we discuss the clinical trial itself: its structure and cost, what we did and did not learn from it (especially considering that the trial showed that STOP was not effective), and how it compares to other NLG evaluation techniques.

1 Introduction

There is increasing interest in techniques for evaluating Natural Language Generation (NLG) systems. However, we are not aware of any previously reported evaluations of NLG systems which have rigorously compared the task effectiveness of an NLG system to a non-NLG alternative. In this paper we discuss such an evaluation: a large-scale (2553 subjects) randomised controlled clinical trial which evaluated the effectiveness of personalised smoking-cessation letters generated by the STOP system (Reiter et al., 1999).
We believe that this is the largest, most expensive, and perhaps most rigorous evaluation ever done of an NLG system; it was also a disappointing evaluation, as it showed that STOP letters in general were no more effective than control letters. The detailed results of the STOP evaluation have been presented elsewhere, in the medical literature (Lennox et al., 2001). The purpose of this paper is to discuss the clinical trial from an NLG evaluation perspective, in order to help future researchers decide when a clinical trial or similar large-scale task effectiveness ...
