Data Management in DGP¶
DGP manages and interacts with a variety of forms of Data.
Imported raw data (GPS or Gravity) is ingested and maintained internally as a
pandas.DataFrame or pandas.Series from their raw
representation in comma separated value (CSV) files.
The ingestion process performs type-casts, filling/interpolation of missing
values, and time index creation/conversion functions to result in a
ready-to-process DataFrame.
These DataFrames are then stored in the project’s HDF5 data-file, which natively supports (with PyTables and Pandas) the storage and retrieval of DataFrames and Series.
To facilitate storage and retrieval of data within the project, the
HDF5Manager class provides an easy to use
wrapper around the pandas.HDFStore and provides utility methods
for getting/setting meta-data attributes on nodes.