Perl question
Jeff Kinz
jkinz at kinz.org
Thu May 22 15:28:20 EDT 2003
On Thu, May 22, 2003 at 12:20:47PM -0400, Eric Schwartz wrote:
> Hello all,
> Thank you for your help in this matter. I have decided to move
> forward with using "html scraping" I am using this code from a book on
> perl, and i cant seem to get it to work. I tried to modify it to search
> specifically for estimated pages remaining, and I want it to look for the
> group of numbers that is right after, but i dont seem to be doing anything
> right. When I run this code it prints "here it is" and nothing
> else. Maybe because it just finds a blank space after the designated
> search, im not sure. Here is a small clip of the html i am looking at:
>
> <td width="90%">
> <p align="left"><font face="Arial,Helvetica" size="1" color="#000000">
> Estimated Pages Remaining:
> </font></p>
> </td>
> <td width="10%">
> <p align="right"><font face="Arial,Helvetica" size="1" color="#000000">
> 6052
> </font></p>
> </td>
> </tr>
Keep it simple:
Pipe your html screen into this perl script
(Or modify the script to read in the lines itself)
Note - this relys on your html file ALWAYS having the above format.
==============CUT HERE =========================
#!/usr/bin/perl
while (<STDIN>) {
if ( /Estimated Pages Remaining:/ ) {
# Read in five more line and print out the fifth one.
$_=<STDIN> ;
$_=<STDIN> ;
$_=<STDIN> ;
$_=<STDIN> ;
$_=<STDIN> ;
print "here it is: ";
print $_ ;
}
}
==============CUT HERE =========================
Uncompiled, untested, unwarranteed. etc .. Use at your own Risk,
FWIW, IANAL , IMHO blah blah blah.
--
Jeff Kinz, Open-PC, Emergent Research, Hudson, MA. jkinz at kinz.org
copyright 2003. Use is restricted. Any use is an
acceptance of the offer at http://www.kinz.org/policy.html.
Don't forget to change your password often.
----- End forwarded message -----
--
Jeff Kinz, Open-PC, Emergent Research, Hudson, MA. jkinz at kinz.org
copyright 2003. Use is restricted. Any use is an
acceptance of the offer at http://www.kinz.org/policy.html.
Don't forget to change your password often.
More information about the Discuss
mailing list