an algorhithm for filtering out raw txt files

Posted by Roman Luštrik on Stack Overflow See other posts from Stack Overflow or by Roman Luštrik
Published on 2011-01-07T18:50:04Z Indexed on 2011/01/07 18:53 UTC
Read the original article Hit count: 301

Filed under:

readlines

Imagine you have a .txt file of the following structure:

>>> header
>>> header
>>> header
K L M
200 0.1 1
201 0.8 1
202 0.01 3
...
800 0.4 2
>>> end of file
50 0.1 1
75 0.78 5
...

I would like to read all the data except lines denoted by >>> and lines below the >>> end of file line. So far I've solved this using read.table(comment.char = ">", skip = x, nrow = y) (x and y are currently fixed). This reads the data between the header and >>> end of file.

However, I would like to make my function a bit more plastic regarding the number of rows. Data may have values larger than 800, and consequently more rows.

I could scan or readLines the file and see which row corresponds to the >>> end of file and calculate the number of lines to be read. What approach would you use?

Related posts about import

E: Sub-process /usr/bin/dpkg returned an error code (1) seems to be choking on kde-runtime-data version issue

as seen on Ask Ubuntu - Search for 'Ask Ubuntu'
12.04 LTS, on a dell mini 10. Install stable until about a week ago. Updated about 1x a week, sometimes more often. Several days ago, I booted up and the system was no longer working correctly. All these symptoms occurred simultaneously: Cannot run (exit on opening, every time): Update manager, software… >>> More
Python ImportError when executing 'import.py', but not when executing 'python import.py'

as seen on Stack Overflow - Search for 'Stack Overflow'
I am running Cygwin Python version 2.5.2. I have a three-line source file, called import.py: #!/usr/bin/python import xml.etree.ElementTree as ET print "Success!" When I execute "python import.py", it works: C:\Temp>python import.py Success! When I run the python interpreter and type the… >>> More
Python Wildcard Import Vs Named Import

as seen on Stack Overflow - Search for 'Stack Overflow'
Ok, I have some rather odd behavior in one of my Projects and I'm hoping someone can tell me why. My file structure looks like this: MainApp.py res/ __init__.py elements/ __init__.py MainFrame.py Inside of MainFrame.py I've defined a class named RPMWindow which extends wx.Frame. In… >>> More
Import, Can I import all lib's, how to ??

as seen on Stack Overflow - Search for 'Stack Overflow'
I need to import all lib's in python in a code,, how should i do this ?? >>> More
how to import the blog.py(i import the 'blog' folder)

as seen on Stack Overflow - Search for 'Stack Overflow'
my dir location,i am in a.py: my_Project |----blog |-----__init__.py |-----a.py |-----blog.py when i 'from blog import something' in a.py , it show error: from blog import BaseRequestHandler ImportError: cannot import name BaseRequestHandler i think it… >>> More

Developer IT