
How to strip XML from a webpage

Author
21 Jun 2006 7:51 PM
james
Hi, I've written a small application that reads weather data from a couple
of different websites (depending on the user's choice) and then scrolls the
data (current temperature, forecast, etc.) in the system tray. The problem I
am encountering is that the text includes the HTML or XML markup in the
string, and I would like to filter it out and display only the weather data
itself.
james

Author
21 Jun 2006 8:32 PM
Herfried K. Wagner [MVP]
"james" <jjames700REMOV***@earthlink.net> wrote:
> Hi, I've written a small application that  reads Weather data from a
> couple of different websites (depending on the user's choice) and then
> scrolls the data ( current temp. forecast etc.) on the system tray. The
> problem I am encountering is the text includes the html or xml in the
> string and I would like to filter it out and display only the weather data
> itself.

Parsing an HTML file:

MSHTML Reference
<URL:http://msdn.microsoft.com/library/default.asp?url=/workshop/browser/mshtml/reference/reference.asp>

- or -

.NET Html Agility Pack: How to use malformed HTML just like it was
well-formed XML...
<URL:http://blogs.msdn.com/smourier/archive/2003/06/04/8265.aspx>

Download:

<URL:http://www.codefluent.com/smourier/download/htmlagilitypack.zip>

- or -

SgmlReader 1.4
<URL:http://www.gotdotnet.com/Community/UserSamples/Details.aspx?SampleGuid=B90FDDCE-E60D-43F8-A5C4-C3BD760564BC>
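
All three options above rest on the same idea: hand the page to a parser
that tolerates malformed HTML, then keep only the text nodes. As a rough
illustration of that idea only (in Python with its standard 'html.parser'
module, not with MSHTML, the Html Agility Pack, or SgmlReader), a minimal
sketch might look like the following. The class name, the helper
'strip_markup', and the sample markup are all made up for the example.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes; ignores tags and <script>/<style> content."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not inside a skipped element.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def strip_markup(html):
    """Return the visible text of an HTML string (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


# Hypothetical, deliberately malformed sample: <p> and <b> are never closed.
sample = "<html><body><p>Current temp: <b>72&deg;F<p>Forecast: sunny</body></html>"
print(strip_markup(sample))
```

The parser does not require closing tags, so the unclosed elements in the
sample do not matter, and character references such as '&deg;' are decoded
along the way. A real weather page would still need some logic to pick out
the elements that hold the data.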

If the file being read is well-formed XHTML, you can use the classes in
the 'System.Xml' namespace to read information from it.
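
To show what treating XHTML as XML buys you, here is a small sketch. It is
written in Python with 'xml.etree.ElementTree' purely as an illustration; it
is not 'System.Xml' code, and the sample document, the 'div' layout, and the
function name 'weather_lines' are assumptions made up for the example.

```python
import xml.etree.ElementTree as ET

# XHTML elements live in this namespace, so tag names must be qualified.
XHTML_NS = "{http://www.w3.org/1999/xhtml}"

# Hypothetical well-formed XHTML; a real weather page will look different.
sample_xhtml = """<html xmlns="http://www.w3.org/1999/xhtml">
  <body>
    <div id="current">Current temp: <b>72 F</b></div>
    <div id="forecast">Forecast: sunny</div>
  </body>
</html>"""


def weather_lines(xhtml):
    """Return the text of every <div>, with nested tags removed."""
    root = ET.fromstring(xhtml)
    lines = []
    for div in root.iter(XHTML_NS + "div"):
        # itertext() yields the text of the element and all its descendants.
        text = " ".join("".join(div.itertext()).split())
        lines.append(text)
    return lines


print(weather_lines(sample_xhtml))
```

This only works when the page really is well-formed: a strict XML parser
rejects unclosed tags, and named entities such as '&nbsp;' need extra
handling, which is where the lenient HTML parsers listed above come in.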

--
M S   Herfried K. Wagner
M V P  <URL:http://dotnet.mvps.org/>
V B   <URL:http://classicvb.org/petition/>
Author
21 Jun 2006 8:54 PM
james
Thank you Herfried! Those links are what I was looking for. Sometimes, it
takes another pair of eyes to find the answers.
james
