I have new problem :)

I use "InternetExplorer.Application" object for connection to website and analysis its HTML code.
But "ie.Document.Body.innerHTML" cannot get correct value because this page has invalid structure:

-------Start of HTML Page--------
.
<head>
<title> Title </title> </head>
<body>

<table>
..................... table codes.........................
</table>

Content-Type: text/html
.
-------End of HTML Page--------

You can see that '</body>' and '<html>' and '</html>' tags are missing!

I need to get all HTML content of this page.
What to do?

Thanks,
Uka

Re: I need to get content of webpage but this webpage has invalid structure by Ato

Ato
Mon Jul 25 09:07:18 CDT 2005


"Uka Miuinnen" <anonymous@anonymous.net> wrote in message
news:%23hCbtzOkFHA.3568@tk2msftngp13.phx.gbl...
> I have new problem :)
>
> I use "InternetExplorer.Application" object for connection to website and analysis its HTML code.
> But "ie.Document.Body.innerHTML" cannot get correct value because this page has invalid structure:
>
> -------Start of HTML Page--------
> .
> <head>
> <title> Title </title> </head>
> <body>
>
> <table>
> ..................... table codes.........................
> </table>
>
> Content-Type: text/html
> .
> -------End of HTML Page--------
>
> You can see that '</body>' and '<html>' and '</html>' tags are missing!
>
> I need to get all HTML content of this page.
> What to do?
>
> Thanks,
> Uka
>
>



Re: I need to get content of webpage but this webpage has invalid structure by Ato

Ato
Mon Jul 25 09:09:14 CDT 2005

Hello,

You might have better luck pulling down the web page using XMLHTTP.
Sample code follows:
----------------------------------------------------------------------------------
targetURL = "http://www.someurl.com"
txtFile = "c:\yourFolder\yourFile.txt"

Set xml = CreateObject("Microsoft.XMLHTTP")
Set fso = CreateObject("Scripting.FileSystemObject")
Set fil = fso.CreateTextFile(txtFile, True)

xml.Open "GET", targetURL, False
xml.Send
fil.Write xml.Responsetext
fil.Close
----------------------------------------------------------------------------------
HTH,
Ato

"Uka Miuinnen" <anonymous@anonymous.net> wrote in message
news:%23hCbtzOkFHA.3568@tk2msftngp13.phx.gbl...
> I have new problem :)
>
> I use "InternetExplorer.Application" object for connection to website and analysis its HTML code.
> But "ie.Document.Body.innerHTML" cannot get correct value because this page has invalid structure:
>
> -------Start of HTML Page--------
> .
> <head>
> <title> Title </title> </head>
> <body>
>
> <table>
> ..................... table codes.........................
> </table>
>
> Content-Type: text/html
> .
> -------End of HTML Page--------
>
> You can see that '</body>' and '<html>' and '</html>' tags are missing!
>
> I need to get all HTML content of this page.
> What to do?
>
> Thanks,
> Uka
>
>