Lloyd
Sun May 14 06:06:34 CDT 2006
Nono...
I found the URL; you can download Wikipedia's books at:
http://download.wikimedia.org/
Now I am at the stage of trying to figure out what to do with this 136MB XML
file.
Obviously, basic XML tools that simply load the whole file into memory are
inappropriate...
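
For what it's worth, here is a rough sketch of the streaming idea: read the file element by element and throw away each record once it has been handled, so the whole 136MB never sits in memory at once. It is written in Python only to keep it short; .NET has a forward-only reader (XmlReader) that works along the same lines. The element names "page" and "title" are my assumption about the dump layout, and iter_page_titles and local_name are just illustrative helper names, so adjust them to whatever the file really contains.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an XML namespace prefix such as '{uri}page' down to 'page'."""
    return tag.rsplit("}", 1)[-1]


def iter_page_titles(source):
    """Yield the <title> text of each <page> element, one page at a time.

    'source' may be a file path or a binary file-like object.
    The element names are assumptions about the dump format.
    """
    events = ET.iterparse(source, events=("start", "end"))
    _, root = next(events)  # the first event is the start of the root element
    for event, elem in events:
        if event == "end" and local_name(elem.tag) == "page":
            title = None
            for child in elem:
                if local_name(child.tag) == "title":
                    title = child.text
                    break
            yield title
            root.clear()  # drop finished pages so memory use stays flat


if __name__ == "__main__":
    # Tiny in-memory stand-in for the real dump (made-up namespace and data).
    sample = io.BytesIO(
        b"<mediawiki xmlns='http://example.org/export'>"
        b"<page><title>First</title><text>aaa</text></page>"
        b"<page><title>Second</title><text>bbb</text></page>"
        b"</mediawiki>"
    )
    for title in iter_page_titles(sample):
        print(title)
```

For the real file you would pass the path of the downloaded dump instead of the in-memory sample. The important part is the root.clear() call after each page; without it the parser still builds up the full tree as it goes.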
"Alex Li" <likwoka@gmail.com> wrote in message
news:1147581137.948357.286610@y43g2000cwc.googlegroups.com...
> Do you mean scraping the Wikipedia web pages? If that is the case,
> then you want to take a look at the System.Net namespace; in particular
> the WebClient class or the HttpWebRequest class for downloading the
> content; then use a parser to extract the data from the web page content.
>
> Does that help?
> Alex
>