Fr. 47.50

Learning To Crawl Web Forums

English · Paperback / Softback

Shipping usually within 2 to 3 weeks (title will be printed to order)

Description

Read more

Present Forum Crawler Under Supervision (FoCUS), a supervised web-scale forum crawler. The goal of FoCUS is to crawl relevant forum content from the web with minimal overhead. Forum threads contain information content that is the target of forum crawlers. Although forums have di erent layouts or styles and are powered by di erent forum software packages, they always have similar implicit navigation paths connected by speci c URL types to lead users from entry pages to thread pages. Based on this observation, we reduce the web forum crawling problem to a URL-type recognition problem. And we show how to learn accurate and e ective regular expression patterns of implicit navigation paths from automatically created training sets using aggregated results from weak page type classi ers. Robust page type clas-si ers can be trained from as few as ve annotated forums and applied to a large set of unseen forums.

About the author










Herr Vipul D. Punjabi, BE Computer, M-Tech- IT, Promotion im Gange.

Product details

Authors Vipul Punjabi
Publisher LAP Lambert Academic Publishing
 
Languages English
Product format Paperback / Softback
Released 05.02.2018
 
EAN 9786135812343
ISBN 9786135812343
No. of pages 60
Subject Natural sciences, medicine, IT, technology > IT, data processing > Internet

Customer reviews

No reviews have been written for this item yet. Write the first review and be helpful to other users when they decide on a purchase.

Write a review

Thumbs up or thumbs down? Write your own review.

For messages to CeDe.ch please use the contact form.

The input fields marked * are obligatory

By submitting this form you agree to our data privacy statement.