A Survey of CAPTCHA Technology
Abstract: A completely automated public Turing test to tell computers and humans apart (CAPTCHA), also known as a human interactive proof (HIP), is a program that automatically generates and grades tests that almost all human users can pass but current computer programs cannot. CAPTCHA thus provides a means of automatically distinguishing humans from machines, and it has become a standard network security technology, deployed successfully on major websites including Google, Yahoo!, and Microsoft. CAPTCHA designs are based on open problems in artificial intelligence (AI) and fall into three types according to carrier and content: text-based, image-based, and sound-based. Text-based CAPTCHAs, which rely on character recognition, are already in wide use. Image-based CAPTCHAs, which exploit hard problems in computer vision, are still at the research stage. Sound-based CAPTCHAs, aimed at visually impaired users, complement the two visual types. This paper introduces the development and design principles of CAPTCHA, describes in detail the research on and latest progress in CAPTCHA design and breaking, presents typical examples, and discusses their usability and security. Finally, it points out directions for future CAPTCHA research and the problems that remain to be solved.