Stripping HTML tags from document with grep...

Jarod Wilson jarod at wilsonet.com
Fri Apr 4 12:10:24 EDT 2008


On Fri, 2008-04-04 at 11:15 -0400, Myrle Francis wrote:
> Hi,
> 
>  I have the task of making sure a file got updated each day (an html file).
> The Contents shown below: (when opened in a web browser)
> 
> Performance Summary
> Updated 4/4/2008 10:37:08 AM
> Data as of 4/4/2008 10:36:20 AM
> 
> when I grep for Updated in this file.. ( grep 'Updated'
> ./DMK_MTD_Performance.htm) I get the whole line,  html code and all.
> 
> Can someone suggest a way to strip the html part and give me the lines as
> displayed above? Is grep the way to go, or should I be looking at another
> tool?

elinks -dump DMK_MTD_Performance.html | grep Updated


-- 
Jarod Wilson
jarod at wilsonet.com


-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.



More information about the Discuss mailing list