Asian Journal of Information Technology

A Hybrid Machine Learning Approach for Extracting Information from WWW

Kun Yu, Zhi Cai, Xufa Wang and Qingsheng Cai
Page: 41-48 | Received 21 Sep 2022, Published online: 21 Sep 2022

Abstract

This paper presents a hybrid machine learning approach for extracting information from the WWW. It applies structure analysis to improve extraction accuracy, achieving 96.5% average precision and 96.7% average recall on static web pages, and 100% precision and recall on dynamic web pages. Furthermore, the working time is short (< 800 ms) and the number of learning examples is small (< 4), because little user participation is required. Our results show that this approach meets the speed, convenience and high-accuracy requirements of practical applications.
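
The precision and recall figures reported above follow the standard information-extraction definitions: precision is the fraction of extracted items that are correct, and recall is the fraction of target items that were extracted. The sketch below illustrates the computation only; the function name `precision_recall` and the sample field names are hypothetical and are not taken from the paper.

```python
def precision_recall(extracted, relevant):
    """Return (precision, recall) for a set of extracted items.

    `extracted` holds the items an extractor returned and `relevant`
    holds the items it should have returned. Both are illustrative
    inputs, not data from the paper.
    """
    extracted, relevant = set(extracted), set(relevant)
    correct = len(extracted & relevant)  # items both extracted and relevant
    precision = correct / len(extracted) if extracted else 0.0
    recall = correct / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: 4 fields extracted, 3 of them among the 4 target fields.
p, r = precision_recall(
    ["title", "price", "author", "isbn"],
    ["title", "price", "author", "date"],
)
print(f"precision={p:.2f} recall={r:.2f}")
```

Averaging these two values over a collection of pages gives average precision and average recall figures of the kind quoted in the abstract.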

How to cite this article:

Kun Yu, Zhi Cai, Xufa Wang and Qingsheng Cai. A Hybrid Machine Learning Approach for Extracting Information from WWW.
DOI: https://doi.org/10.36478/ajit.2005.41.48
URL: https://www.makhillpublications.co/view-article/1682-3915/ajit.2005.41.48