Scientific Data processing (Graph comparison and interpretation)

Posted by pinkynobrain on Stack Overflow
Published on 2010-05-29T17:42:56Z Indexed on 2010/05/29 18:02 UTC

Hi Stack Overflow friends, I'm trying to write a program to automate one of my more boring and repetitive work tasks. I have some programming experience but none with processing or interpreting large volumes of data, so I am seeking your advice (both suggestions of techniques to try and things to read to learn more about doing this).

I have a piece of equipment that monitors an experiment by taking repeated samples and displays the readings on its screen as a graph. The input to the experiment can be altered, and one particular input should produce a change in one section of the graph. That change is what I'm looking for in the experiment, and I currently identify it by eye. I want to automate this so that a computer looks at a set of results and spots the experiment input that causes the change.

I can already extract the results from the machine. Currently the results for a run are in the form of an integer array, with the index being the sample number and the corresponding value being the measurement.
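For illustration only, a set of runs in this form could be held in Python as a mapping from experiment input to a list of integers. The labels (`input_a`, `input_b`), the values, and the helper `peak_sample` are all invented here and are not taken from the real equipment.

```python
# Hypothetical example data: one integer list per run, keyed by an
# invented label for the experiment input that produced it.
runs = {
    "input_a": [12, 15, 40, 90, 41, 14, 12, 11],
    "input_b": [11, 16, 42, 88, 70, 15, 13, 10],
}


def peak_sample(samples):
    """Return the sample number (list index) of the largest measurement."""
    return max(range(len(samples)), key=samples.__getitem__)


if __name__ == "__main__":
    for label, samples in runs.items():
        # The list index is the sample number; the value is the measurement.
        print(label, "samples:", len(samples), "peak at sample", peak_sample(samples))
```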

The overall shape of the graph will be similar for each experiment run. The change I'm looking for will be roughly the same and will occur in approximately the same place every time for the correct experiment input. Unfortunately there are a few gotchas that make this problem more difficult.

  1. There is some noise in the measuring process, which means there is some random variation in the measured values between different runs, although the overall shape of the graph remains the same.

  2. The time the experiment takes varies slightly from run to run, causing two effects. First, the whole graph may be shifted slightly on the x axis relative to another run's graph. Second, individual features may appear slightly wider or narrower in different runs.

In both these cases the variation isn't particularly large, and you can assume that the only non-random variation is caused by the correct input being found.
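To make the gotchas concrete, here is a rough Python sketch (standard library only) that fabricates runs with noise, an x-axis shift, and a slight stretch, and then tries one naive baseline: a brute-force search for the integer shift that minimises the mean squared difference against a reference run, followed by the mean absolute difference inside a fixed window. Everything in it is an assumption for illustration: the curve shape, the window (samples 120 to 160), the noise level, and every function name. It corrects only the shift, not the stretch, so it is a starting point rather than a solution.

```python
import math
import random


def baseline_shape(n=200):
    """Invented reference curve: a single smooth bump on a flat floor."""
    return [100.0 + 400.0 * math.exp(-((i - 80) / 15.0) ** 2) for i in range(n)]


def simulate_run(shape, shift=0, stretch=1.0, noise=5.0, change=0.0, rng=random):
    """Resample `shape` with a shift and stretch, then add a change and noise."""
    n = len(shape)
    out = []
    for i in range(n):
        # Map output sample i back to a (fractional) position in the reference.
        pos = min(max((i - shift) / stretch, 0.0), n - 1.0)
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        value = shape[lo] + (shape[hi] - shape[lo]) * (pos - lo)
        # The "change" is an extra bump in a fixed section of the graph.
        value += change * math.exp(-((pos - 140) / 8.0) ** 2)
        out.append(int(round(value + rng.gauss(0.0, noise))))
    return out


def best_shift(reference, run, max_shift=10):
    """Integer shift minimising the mean squared difference over the overlap."""
    def cost(s):
        pairs = [(reference[i], run[i + s]) for i in range(len(reference))
                 if 0 <= i + s < len(run)]
        return sum((a - b) ** 2 for a, b in pairs) / len(pairs)
    return min(range(-max_shift, max_shift + 1), key=cost)


def region_score(reference, run, start, stop, max_shift=10):
    """Mean absolute difference in [start, stop) after undoing the shift."""
    s = best_shift(reference, run, max_shift)
    diffs = [abs(run[i + s] - reference[i]) for i in range(start, stop)
             if 0 <= i + s < len(run)]
    return sum(diffs) / len(diffs)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the fabricated runs are repeatable
    ref = baseline_shape()
    plain = simulate_run(ref, shift=2, stretch=1.02, rng=rng)
    changed = simulate_run(ref, shift=-3, stretch=0.98, change=150.0, rng=rng)
    print("score without change:", region_score(ref, plain, 120, 160))
    print("score with change:   ", region_score(ref, changed, 120, 160))
```

The run with the highest window score would be the candidate for the correct input. Because noise alone gives a non-zero score, a real threshold would have to come from repeated runs known not to contain the change.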

Thank you for your time,

Pinky

© Stack Overflow or respective owner
