|
Classifying Web documents such as BBS posts, HTML pages, and e-mail messages is an important task
for Web applications. To address this problem, this paper makes the following contributions: (1)
It proposes a new text classification method, Classification by Genetic Algorithm with
Association Rules Method (CGAA method). (2) Unlike previous work, the fitness function
is applied under the guidance of association rules mined by the Apriori_CGAA algorithm. (3)
It implements a family of genetic procedures, including CGAA_Roulette_Selection, CGAA_Xover, and
CGAA_binaryMutation, and reports extensive experiments on real data. (4) The experiments show
that the CGAA algorithm is superior to other common methods: a Best-Vector with a score of
3513.6 is achieved after running the CGAA algorithm for 50 generations. |
|
Keywords: Chinese document classification, Genetic Algorithm, Association rules, Natural |
|