How to start matching and saving matches from an exact point in a text

Posted by yuliya on Stack Overflow
Published on 2011-01-11T19:29:02Z Indexed on 2011/01/12 16:54 UTC

Filed under:

matching

I have a text and I am writing a parser for it using regular expressions in Perl.

I can match what I need by looking for two empty lines (I use a regexp), because there is a pattern: each block of text starts after two empty lines.

But the problem is that the whole text has an introduction at the beginning and some text at the end that I do not need.

Here is the code that starts a new block when it finds two empty lines:

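#!/usr/bin/perl

use strict;
use warnings;

my $file = 'first';
open(my $fh, '<', $file) or die "Cannot open $file: $!";
my $empty = 0;        # consecutive empty lines seen so far
my $block_num = 1;    # number of the current output file
open(my $out, '>', $block_num . '.txt') or die "Cannot open $block_num.txt: $!";

while (my $line = <$fh>) {

    chomp($line);
    if ($line =~ /^\s*$/) {
        $empty++;
    } elsif ($empty == 2) {
        # first non-empty line after two empty lines: start a new file
        close($out);
        open($out, '>', ++$block_num . '.txt') or die "Cannot open $block_num.txt: $!";
        $empty = 0;
    } else {
        $empty = 0;
    }
    print $out "$line\n";

}
close($out);
close($fh);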

This is an example of the text I need (it's really small :))

this is file example

I think that I need to iterate over the text until it finds the words LOREM IPSUM, with a regexp like /^LOREM IPSUM/, because that is the point where the text I need starts (and save the text in one file when I reach those words). And I need to stop iterating over the text when the word INDEX is found, or save the text in a separate file.

How could I implement this? Should I use the next function to move through the lines, or something else?

BR, Yuliya
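
One possible way to do this, as a minimal sketch rather than a definitive answer: keep a flag that says whether the LOREM IPSUM line has been seen yet, use next to skip lines before it, and use last to stop at the INDEX line. The helper name split_blocks and the command-line handling are hypothetical, not from the original post. The sketch also assumes that the LOREM IPSUM line belongs to the first block, that the INDEX line and everything after it are simply dropped, and that two or more empty lines separate blocks (the original code tests for exactly two).

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: takes the lines of the text and returns a list of
# blocks (array refs of lines) found between the start and end markers.
sub split_blocks {
    my @lines  = @_;
    my @blocks;
    my $inside = 0;    # true once the LOREM IPSUM line has been seen
    my $empty  = 0;    # consecutive empty lines seen so far

    for my $line (@lines) {
        (my $text = $line) =~ s/\r?\n\z//;    # strip the line ending

        if (!$inside) {
            next unless $text =~ /^LOREM IPSUM/;    # skip the introduction
            $inside = 1;
            push @blocks, [];                       # open the first block
        }
        last if $text =~ /^INDEX/;                  # stop before the trailing part

        if ($text =~ /^\s*$/) {
            $empty++;
            next;
        }
        if ($empty >= 2) {
            push @blocks, [];                # two or more empty lines: new block
        } elsif ($empty == 1) {
            push @{ $blocks[-1] }, '';       # keep a single empty line in the block
        }
        $empty = 0;
        push @{ $blocks[-1] }, $text;
    }
    return @blocks;
}

# Usage sketch: pass the input file name on the command line, e.g. "first".
# Each block is written to 1.txt, 2.txt, and so on.
if (@ARGV) {
    open(my $fh, '<', $ARGV[0]) or die "Cannot open $ARGV[0]: $!";
    my @blocks = split_blocks(<$fh>);
    close($fh);

    my $n = 0;
    for my $block (@blocks) {
        my $name = ++$n . '.txt';
        open(my $out, '>', $name) or die "Cannot open $name: $!";
        print $out "$_\n" for @$block;
        close($out);
    }
}
```

Collecting the blocks in memory first keeps the marker logic separate from the file writing. For a very large input, the same flag and the same next/last checks could go directly into the while loop of the original script instead.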
