rex.rechunk_h5.rechunk_h5.get_dataset_attributes

get_dataset_attributes(h5_file, out_json=None, chunk_size=2, weeks_per_chunk=None)[source]

Extact attributes, dtype, and chunk size for all datasets in .h5 file

Parameters:
  • h5_file (str) – Path to source h5 file to scrape dataset data from

  • chunk_size (int, optional) – Chunk size in MB, by default 2

  • weeks_per_chunk (int, optional) – Number of weeks per time chunk, if None scale weeks based on 8 weeks for hourly data, by default None

  • out_json (str, optional) – Path to output json to save DataFrame of dataset attributes to, by default None

Returns:

ds_attrs (pandas.DataFrame) – Attributes (attrs, dtype, shape, chunks) for all datasets in source .h5 file