Title: Automatic Extraction of New Field Association Terms Using Search Engine

Abstract
: With increasing popularity of the Internet and tremendous amount of on-line text, automatic document classification is important for organizing huge amounts of data. Readers can know the subject of many document fields by reading only some specific Field Association (FA) terms. This paper proposes a method for automatically building new FA terms. A WWW search engine is used to extract FA term candidates from document corpora. New FA term candidates in each field are automatically compared with previously determined FA terms. Then new FA terms are appended to an FA term dictionary. From the experiential results, our new system can automatically appended around 44% of new FA terms to the existence FA term Dictionary.


Authors
: Elsayed Atlam, Elmarhomy Ghada, Kazuhiro Morita, Masao Fuketa and Jun-ichi Aoe


Back


IBIMA 2005 Conference   www.ibima.org