Html screen scraping with HtmlAgilityPack Library(www.fairnet.com)

submitted by ebizdomebizdom(320) 1 year, 8 months ago

What is Screen Scraping ? Screen scraping is a process that reads any webpage and extract data from html tags. In this article, i will examine how to scrape a given web page using htmlagilitypack library. It is a .NET code library that allows you to parse "out of the web" HTML files. It can be downloaded @ http://htmlagilitypack.codeplex.com/ In this Tutorial, i will read my own web site http://savebigbucks.ca that offers daily deals in Canada. Here is code snippet that reads the web page.

1 comment |category: |Views: 217

tags: another

new Add a live kick counter to your blog >> liveImage

You can even customize the image by choosing your own colors, and then clicking the button below to update the preview and the html code:

  • "Kick It" text
  • "Kick It" background
  • kick count text
  • kick count background
  • border

Simply copy and paste this HTML into your blog post.


Users who kicked this story:
Comments:

posted by FernirFernir(1) 11 months, 11 days ago 0

HtmlAgilityPack is really nice parsing library, but for data scraping (gathering) its better to use something like Gogybot library, it can retreview HTML pages and HtmlAgilityPack can parse data.

Reply

information Login or create an account to comment on this story