
BLU Discuss list archive



Re: Stripping HTML tags from document with grep...



 On Fri, 2008-04-04 at 11:15 -0400, Myrle Francis wrote: 
> Hi, 
> 
> I have the task of making sure a file (an HTML file) gets updated each day.
> Its contents, when opened in a web browser, are shown below:
> 
> Performance Summary 
> Updated 4/4/2008 10:37:08 AM 
> Data as of 4/4/2008 10:36:20 AM 
> 
> When I grep for 'Updated' in this file (grep 'Updated'
> ./DMK_MTD_Performance.htm), I get the whole line, HTML code and all.
> 
> Can someone suggest a way to strip the HTML part and give me the lines as
> displayed above? Is grep the way to go, or should I be looking at another 
> tool? 
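
One possible approach, offered as a minimal sketch rather than a definitive answer: keep grep to select the line, then pipe the result through sed to delete anything that looks like a tag. The function name strip_tags and the sample markup below are assumptions made for illustration, since the actual HTML in the file is not shown in the message. The sketch also assumes each tag opens and closes on the same line, and it does not decode entities such as &nbsp;.

```shell
# strip_tags: delete every <...> sequence from each line read on stdin.
# Assumption: each tag opens and closes on the same line.
strip_tags() {
    sed -e 's/<[^>]*>//g'
}

# Hypothetical sample line; the real markup in DMK_MTD_Performance.htm may differ.
sample='<tr><td><b>Updated 4/4/2008 10:37:08 AM</b></td></tr>'

# Select the line with grep, then remove the tags.
printf '%s\n' "$sample" | grep 'Updated' | strip_tags
```

Against the real file, the same pipeline would read: grep 'Updated' ./DMK_MTD_Performance.htm | strip_tags. If the markup turns out to span lines or to be otherwise irregular, a text-mode browser or a proper HTML parser is likely a safer choice than a regular expression.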


BLU is a member of BostonUserGroups
We also thank MIT for the use of their facilities.
