Anybody knows about C++ sources to extract raw text data from (X)HTML and
XML source pages?

Aby relevant info is greatly appreciated.

Re: Data extraction from HTML source using C++ by Peter

Peter
Wed May 26 07:59:43 CDT 2004

"David F" <David-White@earthlink.net> wrote in message
news:ZaSsc.9849$Tn6.7337@newsread1.news.pas.earthlink.net...
> Anybody knows about C++ sources to extract raw text data from (X)HTML and
> XML source pages?
>
> Aby relevant info is greatly appreciated.
>
>

Microsoft has an XML parser and toolkit that you can download. I'm sure you
could call it from C++. I don't remember the details but if you search on
the msdn website you should be able to find it. If you're using C++ in .Net
then the .Net framework itself has powerful XML tools.


--
Peter Aitken

Remove the crap from my email address before using.