rex.rechunk_h5.rechunk_h5.get_dataset_attributes
- get_dataset_attributes(h5_file, out_json=None, chunk_size=2, weeks_per_chunk=None)[source]
Extact attributes, dtype, and chunk size for all datasets in .h5 file
- Parameters:
h5_file (str) – Path to source h5 file to scrape dataset data from
chunk_size (int, optional) – Chunk size in MB, by default 2
weeks_per_chunk (int, optional) – Number of weeks per time chunk, if None scale weeks based on 8 weeks for hourly data, by default None
out_json (str, optional) – Path to output json to save DataFrame of dataset attributes to, by default None
- Returns:
ds_attrs (pandas.DataFrame) – Attributes (attrs, dtype, shape, chunks) for all datasets in source .h5 file