Structure Extraction of the Website Information Based on the Degree of Link Association

Wang Yang; Zhang Bin




Wang Yang #, Zhang Bin *

School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876

*Corresponding author

#Submitted by

Subject:

Funding: none

Opened online: 16 November 2015

Accepted by: none

Citation: Wang Yang, Zhang Bin. Structure Extraction of the Website Information Based on the Degree of Link Association [OL]. [16 November 2015] http://en.paper.edu.cn/en_releasepaper/content/4660762

Extracting the structure of a website's information is the basis of many website-classification techniques. This paper surveys several algorithms for extracting website information structure and analyzes their advantages and disadvantages. Its main contribution is a structure-extraction method based on the degree of link association. The method proceeds in four steps. First, the content of every page of the target website is extracted. Second, the extracted content is used to calculate the dissimilarity between pages and the dissimilarity between the links of each pair of pages. Third, Dijkstra's algorithm is used to find the route from the home page to each target page. Finally, the structure of the whole website is derived from these routes.
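
The abstract does not give the formulas behind these steps, so the following is only a minimal sketch of the pipeline under stated assumptions. Each page is assumed to be already reduced to a set of content words and a set of outgoing links; Jaccard distance stands in for both dissimilarity measures, and the two are averaged with equal weights to form an edge cost. The names `jaccard_distance`, `extract_structure`, and the sample `pages` data are hypothetical and do not come from the paper.

```python
import heapq


def jaccard_distance(a, b):
    """Return 1 - |a & b| / |a | b|; two empty sets count as identical."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def extract_structure(pages, home):
    """Run Dijkstra from `home` and return {url: parent_url}.

    `pages` maps url -> {"words": set, "links": set}. The edge cost
    (an assumption, not the paper's formula) is the mean of the content
    dissimilarity and the link-set dissimilarity of the two pages.
    """
    dist = {home: 0.0}
    parent = {home: None}
    heap = [(0.0, home)]
    while heap:
        d, url = heapq.heappop(heap)
        if d > dist[url]:
            continue  # stale heap entry
        for target in pages[url]["links"]:
            if target not in pages:
                continue  # skip links that leave the crawled site
            cost = 0.5 * jaccard_distance(pages[url]["words"], pages[target]["words"]) \
                 + 0.5 * jaccard_distance(pages[url]["links"], pages[target]["links"])
            nd = d + cost
            if nd < dist.get(target, float("inf")):
                dist[target] = nd
                parent[target] = url
                heapq.heappush(heap, (nd, target))
    return parent


# Hypothetical four-page site used only for illustration.
pages = {
    "/": {"words": {"home", "news", "sports"}, "links": {"/news", "/sports"}},
    "/news": {"words": {"news", "politics"}, "links": {"/", "/news/a"}},
    "/sports": {"words": {"sports", "football"}, "links": {"/", "/news/a"}},
    "/news/a": {"words": {"news", "politics", "election"}, "links": {"/news"}},
}
tree = extract_structure(pages, "/")
```

In this sample, the article page `/news/a` is linked from both `/news` and `/sports`, but its content is closer to `/news`, so the cheapest route from the home page runs through `/news` and the resulting parent map places the article under that section. The parent map is one possible representation of the site structure; a real implementation would need the paper's own dissimilarity definitions.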

Keywords: Pattern recognition; Structure of the website information; Content extraction; Link association
