Channel: htmlagilitypack Forum Rss Feed

New Post: How to handle free text containing characters such as '<' and '>'?

The example below does not work:
<span class='ocrx_word' title='bbox 741 2162 915 2291'><>¢f,/‘;/.</span>
Whenever I process an HTML file with the HTML Agility Pack (HAP) libraries, the free text between the span tags above fails and throws an exception.

The example below works:
<span class='ocrx_word' title='bbox 741 2162 915 2291'>¢f,/‘;/.</span>
For some reason it doesn't like '<' and '>'. Apart from this, HAP has been doing the job for me so far. I don't think I should have to code in a manual step to ensure that angle brackets are substituted with entities (&lt; and &gt;). Any ideas on how I might solve this problem? I would have thought HAP would have me covered here, but it seems it doesn't.
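
For reference, the manual substitution step mentioned above could look roughly like the sketch below. It is written in Python only to illustrate the idea and is not part of HAP; the function name `escape_stray_angle_brackets` and the regular expression are my own assumptions. It treats anything shaped like `<name ...>`, `</name>`, `<!...>` or an HTML comment as a real tag and escapes every other angle bracket. It would be fooled by OCR text that happens to look like a tag, by a '>' inside an attribute value, and by script content, so treat it as a starting point only.

```python
import re

# Assumption: a "real" tag is a comment, a <!DOCTYPE ...>-style declaration,
# or '<' (optionally followed by '/') followed directly by a letter.
TAG_RE = re.compile(r"<!--.*?-->|</?[A-Za-z][^<>]*>|<![A-Za-z][^<>]*>", re.DOTALL)


def _escape(text: str) -> str:
    # Replace bare angle brackets with their HTML entities.
    return text.replace("<", "&lt;").replace(">", "&gt;")


def escape_stray_angle_brackets(html: str) -> str:
    """Escape '<' and '>' that are not part of something that looks like a tag."""
    out = []
    pos = 0
    for match in TAG_RE.finditer(html):
        out.append(_escape(html[pos:match.start()]))  # text between tags
        out.append(match.group(0))                    # the tag itself, untouched
        pos = match.end()
    out.append(_escape(html[pos:]))                   # trailing text
    return "".join(out)


if __name__ == "__main__":
    broken = "<span class='ocrx_word' title='bbox 741 2162 915 2291'><>¢f,/‘;/.</span>"
    print(escape_stray_angle_brackets(broken))
```

The cleaned string would then be handed to the parser in place of the raw file contents.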
