New Post: How to handle free text containg characters such as ''?
The example below does not work:<span class='ocrx_word' title='bbox 741 2162 915 2291'><>¢f,/‘;/.</span> Whenever a html file is processed using the HTML Agility Pack (HAP) libraries...
View ArticleNew Post: Html Agility Pack cannot find
hi, i cannot use this components. Please help me, http://www.yerelnet.org.tr/belediyeler/index.php?tip=3&harf=tumu Xpath : //*[@id="r02"] how can i get image url and text?
View ArticleNew Post: Link extraction problem
I think this code could help you However, we have to change the code a bit, and if you're a professional programmer, you can use this code.Public Function GetLinkAddress(ByVal HtmlSource As String) As...
View ArticleNew Post: Problems with HTML Character References (e.g. '1') with...
I downloaded the source code and made a unit test (appended below) which fails on HAP 1.4.6. The problem is that HTML character references (e.g. '1') have the ampersands encoded so that they...
View ArticleNew Post: PARSE TELEPHONE NUMBER IN HTML EMAIL
Hello I'm looking to parse an html email searching for phone number patterns. When I detect a number I want to insert an Hyperlink. Some pattern may detect the same number twice, so I want to analyse...
View ArticleNew Post: Link nodes containing CDATA
Hello, we're using HtmlAgilityPack for some years now, but only recently we are trying to understand CDATA parsing. I understand that a comment node is created in case a node text is surrounded by...
View ArticleNew Post: Portable Class Library
Would it be difficult to make this great project a Portable Class Library (PCL)? I would like to embed it inside a Windows Store App but I can't at the moment because this project target .NET Fx 4.5...
View ArticleNew Post: On Windows Phone
The problem is due to HtmlAgiliyPack version. This issue can be solve by downloading suitable HtmlAgiliyPack varsion for the project from Nuget
View ArticleNew Post: StackOverflowException workaround
I've got StackOverflowException while scanning sites with complicated DOM. Html documents examples were uploaded by Dzonny here...
View ArticleNew Post: Substitution under the node
Sorry for asking, however I am a bit lost here. I need to parse the DOM tree, and when I encounter specific Attribute, use the information to substitute all occurrences of specific symbols within the...
View ArticleNew Post: Cache problem
I have the same issue since I would like to loop over some url. I havre tried the following code but I have the same data on each loop for (int i = 1; i <= Convert.ToInt32(NbrePage); i++) { url =...
View ArticleNew Post: HtmlAgilityPack Unable to connect to the remote server
Got a problem with Html Agility Pack When I run my code it says "Unable to connect to the remote server" Does anybody know why? Here is the code:public void Crawler(string url, int pid) { HtmlWeb hw =...
View ArticleNew Post: HtmlWeb and login
Is possible with this object "HtmlWeb hw = new HtmlWeb();", just log in to a site and then discharge the html ?
View ArticleNew Post: Retrieve an attribute with XPath
I've got the following code snippet and would like to retrieve the value of an attribute by including it in the XPath expression:foreach (HtmlNode node in doc.DocumentNode.SelectNodes("//a/@href")) {...
View ArticleNew Post: Stripping harmful HTML from user input, but allowing other HTML?
Thanks PatrickBurrows, Your code are helpful and it's work, Again thanks.Cardboard Boxes | Retail Boxes | Custom Boxes | Software Boxes
View ArticleNew Post: docHouse.Load() : problem with relative path
Hi, When I use the full path on my laptop, dochouse retreives the file no problem. When I use a relative path, it does not find it. I have tried:docHouse.Load(@"..\App_Data\Request.txt",...
View ArticleNew Post: Problem with HtmlWeb.cs
Hello world ! Please, excuse my bad english, I'm French... I work with Visual Studio 2012 for Web Express. I have to parse some web pages. So, I decided to use HtmlAgilityPack (I used it before with VS...
View ArticleNew Post: Update your browser | Facebook message
Hi, I am new to HtmlAgilityPack, and was told to load the meta data by using HtmlAgilityPack, this works perfect. But the issue I have to face, is that when loading the major websites such as Google...
View Article