Home > Papers

 
 
Multivariate Classification of Deep Web Sources Based on Maximum Entropy Model
Huang Li #,Zhao Pengpeng ,Cui Zhiming *
Institute of Intelligent Information Processing and Application in Soochow University
*Correspondence author
#Submitted by
Subject:
Funding: 国家自然科学基金项目,教育部科研重点项目,教育部博士学科点科研基金项目,研究生创新计划项目(No.60673092,205059,20040285016,)
Opened online:12 December 2007
Accepted by: none
Citation: Huang Li ,Zhao Pengpeng ,Cui Zhiming .Multivariate Classification of Deep Web Sources Based on Maximum Entropy Model[OL]. [12 December 2007] http://en.paper.edu.cn/en_releasepaper/content/16873
 
 
With the fast development of World Wide Web, the quantity of web information is increasing in an unprecedented pace, a great many of which are generated dynamically from background databases, and can’t be indexed by traditional search engine, so we call them Deep Web. For the heterogeneous and dynamic features of Deep Web sources, classifying the Deep Web source by domain effectively is a significant precondition of Deep Web sources integration. In this paper, we consider the visible features of Deep Web and Maximum Entropy approach, and then on the basis of binary classification, we propose a new multivariate classification approach based on Maximum Entropy towards Deep Web sources. In addition, we propose a Feedback algorithm to improve the accuracy of classification. An experimental evaluation over real Web data shows that, our approach could provide an effective and general solution to the multivariate classification of Deep Web sources.
Keywords:Deep Web, Multivariate Classification, Data Integration, Maximum Entropy, Feedback Algorithm
 
 
 

For this paper

  • PDF (0B)
  • ● Revision 0      
  • ● Print this paper
  • ● Recommend this paper to a friend
  • ● Add to my favorite list

    Saved Papers

    Please enter a name for this paper to be shown in your personalized Saved Papers list

Tags

Add yours

Related Papers

  • Other similar papers

Statistics

PDF Downloaded 349
Bookmarked 0
Recommend 5
Comments Array
Submit your papers