create new script for simulation of GC sampling based on PGA data
- create data for comparison between old method GC (gas chromatograph) and PGA (portable gas analyser), for which the options are needed:
- option to tell how many points I want to chose from PGA data
- decide where they these points are located (which second)
- take median from 10 points around that second (e. g., I want to choose a data point at 2 min and then take the measurements from 1min 55sec to 2min 5sec as median)
- meas campaign 202305: only select mps where we closed chamber for 10 min (600 sec)
- since chamber closure times during 202307 and 202309 was always 10 min, we probably don't need to select the duration of measurement period
- add a new parameter file
preprocessing/additional_info/simulate_gc_n2o_data.txt
- add a new column to the measPeriodN2o DF:
- values [0,1]
- mark all data points that were detected as (short) flatline with 1
- add this to data_filter_function_05
- input DF for GC calculations
- for mps with flat lines:
- "From periods affected by flatlines, you should use neither the original data nor the interpolated data. Since you have flagged flatline data in your pre-processing, you could try different approaches in case a randomly chosen data point happens to be a flatline:
- 1.) discard the entire set of randomly chosen points, and try once more, in hopes that this will fix the problem
- 2.) discard just those points affected by flatlines, and draw new ones to replace them
- 3.) for any flatline point, move either forward or backward in time in your time series until you hit the first point not affected by the flatline."
- for mps with flat lines:
- apply only a subset of correction/filter functions for the data (DF) use for the GC calculation
Edited by Jan Engel