Web data mining exploring hyperlinks, contents, and usage data /

The rapid growth of the Web in the last decade makes it the largest p- licly accessible data source in the world. Web mining aims to discover u- ful information or knowledge from Web hyperlinks, page contents, and - age logs. Based on the primary kinds of data used in the mining process, Web mining...

Full description

Main Author: Liu, Bing, 1963-
Other Authors: SpringerLink (Online service)
Format: eBook
Language: English
Published: Berlin ; New York : Springer, ©2007.
Berlin ; New York : [2007]
Physical Description: 1 online resource (xix, 532 pages) : illustrations.
Series: Data-centric systems and applications.
Subjects:
Summary: The rapid growth of the Web in the last decade makes it the largest p- licly accessible data source in the world. Web mining aims to discover u- ful information or knowledge from Web hyperlinks, page contents, and - age logs. Based on the primary kinds of data used in the mining process, Web mining tasks can be categorized into three main types: Web structure mining, Web content mining and Web usage mining. Web structure m- ing discovers knowledge from hyperlinks, which represent the structure of the Web. Web content mining extracts useful information/knowledge from Web page contents. Web usage mining mines user access patterns from usage logs, which record clicks made by every user. The goal of this book is to present these tasks, and their core mining - gorithms. The book is intended to be a text with a comprehensive cov- age, and yet, for each topic, sufficient details are given so that readers can gain a reasonably complete knowledge of its algorithms or techniques without referring to any external materials. Four of the chapters, structured data extraction, information integration, opinion mining, and Web usage mining, make this book unique. These topics are not covered by existing books, but yet they are essential to Web data mining. Traditional Web mining topics such as search, crawling and resource discovery, and link analysis are also covered in detail in this book.
Item Description: Includes bibliographical references (pages 485-515) and index.
Data Mining Foundations -- Association Rules and Sequential Patterns -- Supervised Learning -- Unsupervised Learning -- Partially Supervised Learning -- Web Mining -- Information Retrieval and Web Search -- Link Analysis -- Web Crawling -- Structured Data Extraction: Wrapper Generation -- Information Integration -- Opinion Mining -- Web Usage Mining.
The rapid growth of the Web in the last decade makes it the largest p- licly accessible data source in the world. Web mining aims to discover u- ful information or knowledge from Web hyperlinks, page contents, and - age logs. Based on the primary kinds of data used in the mining process, Web mining tasks can be categorized into three main types: Web structure mining, Web content mining and Web usage mining. Web structure m- ing discovers knowledge from hyperlinks, which represent the structure of the Web. Web content mining extracts useful information/knowledge from Web page contents. Web usage mining mines user access patterns from usage logs, which record clicks made by every user. The goal of this book is to present these tasks, and their core mining - gorithms. The book is intended to be a text with a comprehensive cov- age, and yet, for each topic, sufficient details are given so that readers can gain a reasonably complete knowledge of its algorithms or techniques without referring to any external materials. Four of the chapters, structured data extraction, information integration, opinion mining, and Web usage mining, make this book unique. These topics are not covered by existing books, but yet they are essential to Web data mining. Traditional Web mining topics such as search, crawling and resource discovery, and link analysis are also covered in detail in this book.
English.
Physical Description: 1 online resource (xix, 532 pages) : illustrations.
Bibliography: Includes bibliographical references (pages 485-515) and index.
ISBN: 9783540378822
3540378820
3540378812
9783540378815