Quantcast
Browsing all 655 articles
Browse latest View live

New Post: How to handle free text containg characters such as ''?

The example below does not work:<span class='ocrx_word' title='bbox 741 2162 915 2291'><>¢f,/‘;/.</span> Whenever a html file is processed using the HTML Agility Pack (HAP) libraries...

View Article


New Post: Html Agility Pack cannot find

hi, i cannot use this components. Please help me, http://www.yerelnet.org.tr/belediyeler/index.php?tip=3&harf=tumu Xpath : //*[@id="r02"] how can i get image url and text?

View Article


New Post: Link extraction problem

I think this code could help you However, we have to change the code a bit, and if you're a professional programmer, you can use this code.Public Function GetLinkAddress(ByVal HtmlSource As String) As...

View Article

New Post: Problems with HTML Character References (e.g. '1') with...

I downloaded the source code and made a unit test (appended below) which fails on HAP 1.4.6. The problem is that HTML character references (e.g. '&#65297;') have the ampersands encoded so that they...

View Article

New Post: sgml?

does HAP officially support sgml files?

View Article


New Post: PARSE TELEPHONE NUMBER IN HTML EMAIL

Hello I'm looking to parse an html email searching for phone number patterns. When I detect a number I want to insert an Hyperlink. Some pattern may detect the same number twice, so I want to analyse...

View Article

New Post: Link nodes containing CDATA

Hello, we're using HtmlAgilityPack for some years now, but only recently we are trying to understand CDATA parsing. I understand that a comment node is created in case a node text is surrounded by...

View Article

New Post: Portable Class Library

Would it be difficult to make this great project a Portable Class Library (PCL)? I would like to embed it inside a Windows Store App but I can't at the moment because this project target .NET Fx 4.5...

View Article


New Post: On Windows Phone

The problem is due to HtmlAgiliyPack version. This issue can be solve by downloading suitable HtmlAgiliyPack varsion for the project from Nuget

View Article


New Post: StackOverflowException workaround

I've got StackOverflowException while scanning sites with complicated DOM. Html documents examples were uploaded by Dzonny here...

View Article

New Post: Substitution under the node

Sorry for asking, however I am a bit lost here. I need to parse the DOM tree, and when I encounter specific Attribute, use the information to substitute all occurrences of specific symbols within the...

View Article

New Post: Cache problem

I have the same issue since I would like to loop over some url. I havre tried the following code but I have the same data on each loop for (int i = 1; i <= Convert.ToInt32(NbrePage); i++) { url =...

View Article

New Post: HtmlAgilityPack Unable to connect to the remote server

Got a problem with Html Agility Pack When I run my code it says "Unable to connect to the remote server" Does anybody know why? Here is the code:public void Crawler(string url, int pid) { HtmlWeb hw =...

View Article


New Post: HtmlWeb and login

Is possible with this object "HtmlWeb hw = new HtmlWeb();", just log in to a site and then discharge the html ?

View Article

New Post: MHT Read & Write

I too have this question.

View Article


New Post: Retrieve an attribute with XPath

I've got the following code snippet and would like to retrieve the value of an attribute by including it in the XPath expression:foreach (HtmlNode node in doc.DocumentNode.SelectNodes("//a/@href")) {...

View Article

New Post: Stripping harmful HTML from user input, but allowing other HTML?

Thanks PatrickBurrows, Your code are helpful and it's work, Again thanks.Cardboard Boxes | Retail Boxes | Custom Boxes | Software Boxes

View Article


New Post: docHouse.Load() : problem with relative path

Hi, When I use the full path on my laptop, dochouse retreives the file no problem. When I use a relative path, it does not find it. I have tried:docHouse.Load(@"..\App_Data\Request.txt",...

View Article

New Post: Problem with HtmlWeb.cs

Hello world ! Please, excuse my bad english, I'm French... I work with Visual Studio 2012 for Web Express. I have to parse some web pages. So, I decided to use HtmlAgilityPack (I used it before with VS...

View Article

New Post: Update your browser | Facebook message

Hi, I am new to HtmlAgilityPack, and was told to load the meta data by using HtmlAgilityPack, this works perfect. But the issue I have to face, is that when loading the major websites such as Google...

View Article
Browsing all 655 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>