rex.multi_file_resource.MultiFileNSRDB
- class MultiFileNSRDB(h5_source, unscale=True, str_decode=True, check_files=False, use_lapse_rate=True)
Bases: MultiFileResource, NSRDB
Class to handle NSRDB data for 2018 and beyond, which is provided at 2 km spatial resolution and sub-30-minute temporal resolution.
See also
resource.MultiFileResource – Parent class
resource.NSRDB – Parent class
- Parameters:
h5_source (str | list) – Unix shell style pattern path with * wildcards to multi-file resource file sets. Files must have the same time index and coordinates but can have different datasets. Can also be an explicit list of complete filepaths.
unscale (bool) – Boolean flag to automatically unscale variables on extraction
str_decode (bool) – Boolean flag to decode the bytestring meta data into normal strings. Setting this to False will speed up the meta data read.
check_files (bool) – Check to ensure files have the same coordinates and time_index
use_lapse_rate (bool) – If a dataset is only available at a single hub-height and this flag value is set to True, pressure / temperature values will be calculated using linear lapse rate adjustment from the available hub height to the requested one. If the flag value is set to False, the value of these variables at the single available hub-height will be returned for all requested heights. This option has no effect if data is available at multiple hub-heights.
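As a rough illustration of how a Unix shell style pattern in h5_source selects a file set, the sketch below matches a wildcard pattern against a few file names with Python's fnmatch. The directory and file names are hypothetical, and this is not a description of how rex itself resolves the pattern.

```python
from fnmatch import fnmatch

# Hypothetical file names; real NSRDB file sets may be named differently.
candidates = [
    "/data/nsrdb_2018_irradiance.h5",
    "/data/nsrdb_2018_meteorology.h5",
    "/data/nsrdb_2019_irradiance.h5",
    "/data/readme.txt",
]

# Shell-style pattern with a * wildcard, as described for h5_source.
pattern = "/data/nsrdb_2018_*.h5"

# Keep only the paths that match the pattern, in sorted order.
matched = sorted(p for p in candidates if fnmatch(p, pattern))
print(matched)
```

Both 2018 files match, so they would be treated as one file set; per the parameter description, they would need to share a time index and coordinates.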
Methods
close() – Close h5 instance
df_str_decode(df) – Decode a dataframe with byte string columns into ordinary str cols.
get_SAM_df(site[, extra_cols]) – Get SAM solar resource DataFrame for given site
get_attrs([dset]) – Get h5 attributes either from file or dataset
get_dset_properties(dset) – Get dataset properties (shape, dtype, chunks)
get_meta_arr(rec_name[, rows]) – Get a meta array by name (faster than DataFrame extraction).
get_scale_factor(dset) – Get dataset scale factor
get_units(dset) – Get dataset units
is_hsds_file(file_path) – Parse one or more file paths to determine whether they are hsds
is_s3_file(file_path) – Parse one or more file paths to determine whether they are s3
open_dataset(ds_name) – Open resource dataset
open_file(file_path[, mode, hsds, hsds_kwargs]) – Open a filepath to an h5, s3, or hsds nrel resource file with the appropriate python object.
preload_SAM(h5_source, sites[, unscale, ...]) – Pre-load project_points for SAM
Attributes
ADD_ATTR
INTERPOLABLE_DSETS
LAPSE_RATES – Air Temperature and Pressure lapse rate in C/km and Pa/km
SCALE_ATTR
UNIT_ATTR
VARIABLE_NAME
VARIABLE_UNIT
adders – Dictionary of all dataset add offset factors
attrs – Dictionary of all dataset attributes
chunks – Dictionary of all dataset chunk sizes
coordinates – (lat, lon) pairs
data_version – Get the version attribute of the data.
datasets – Datasets available
dsets – Datasets available
dtypes – Dictionary of all dataset dtypes
global_attrs – Global (file) attributes
groups – Groups available
h5 – Open h5py File instance.
lat_lon – Extract (latitude, longitude) pairs
meta – Resource meta data DataFrame
res_dsets – Available resource datasets
resource_datasets – Available resource datasets
scale_factors – Dictionary of all dataset scale factors
shape – Resource shape (timesteps, sites), i.e. shape = (len(time_index), len(meta))
shapes – Dictionary of all dataset shapes
time_index – Resource DatetimeIndex
units – Dictionary of all dataset units
- classmethod preload_SAM(h5_source, sites, unscale=True, str_decode=True, tech='pvwattsv7', time_index_step=None, means=False, clearsky=False, bifacial=False, downscale=None, check_files=False)
Pre-load project_points for SAM
- Parameters:
h5_source (str | list) – Unix shell style pattern path with * wildcards to multi-file resource file sets. Files must have the same time index and coordinates but can have different datasets. Can also be an explicit list of complete filepaths.
sites (list) – List of sites to be provided to SAM (sites is synonymous with gids aka spatial indices)
unscale (bool) – Boolean flag to automatically unscale variables on extraction
str_decode (bool) – Boolean flag to decode the bytestring meta data into normal strings. Setting this to False will speed up the meta data read.
tech (str, optional) – SAM technology string, by default ‘pvwattsv7’
time_index_step (int, optional) – Step size for time_index, used to reduce temporal resolution, by default None
means (bool, optional) – Boolean flag to compute mean resource when res_array is set, by default False
clearsky (bool) – Boolean flag to pull clearsky instead of real irradiance
bifacial (bool) – Boolean flag to pull surface albedo for bifacial modeling.
downscale (NoneType | str) – Option for NSRDB resource downscaling to higher temporal resolution. Expects a string in the Pandas frequency format, e.g. ‘5min’.
check_files (bool) – Check to ensure files have the same coordinates and time_index
- Returns:
SAM_res (SAMResource) – Instance of SAMResource pre-loaded with Solar resource for sites in project_points
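To make the effect of time_index_step concrete, here is a minimal sketch that thins a 5-minute time index with a stride. It assumes the step is applied as a simple slice stride, which is an interpretation of the parameter description and not confirmed behavior of preload_SAM; the timestamps are made up.

```python
from datetime import datetime, timedelta

# Hypothetical one-hour time index at 5-minute resolution (12 steps).
start = datetime(2018, 1, 1)
time_index = [start + timedelta(minutes=5 * i) for i in range(12)]

# Assumption: the step acts as a slice stride over the time index.
time_index_step = 6
reduced = time_index[::time_index_step]

for stamp in reduced:
    print(stamp.isoformat())
```

With a step of 6, every sixth timestamp is kept, which turns the 5-minute series into a 30-minute one.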
- LAPSE_RATES = {'pressure': 11109, 'temperature': 6.56}
Air Temperature and Pressure lapse rate in C/km and Pa/km
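The linear lapse rate adjustment mentioned under use_lapse_rate can be sketched with these constants. The helper name, the sign convention (values decrease with height), and the sample inputs are assumptions made for illustration; the exact formula used internally is not stated here.

```python
# Constants copied from LAPSE_RATES above: Pa/km for pressure, C/km for temperature.
LAPSE_RATES = {"pressure": 11109, "temperature": 6.56}


def lapse_adjust(value, from_height_m, to_height_m, rate_per_km):
    """Linearly adjust a value from one height to another (heights in meters).

    Assumes the quantity decreases with height at rate_per_km.
    """
    delta_km = (to_height_m - from_height_m) / 1000.0
    return value - rate_per_km * delta_km


# Hypothetical values available only at 10 m, requested at 100 m.
temp_100m = lapse_adjust(15.0, 10, 100, LAPSE_RATES["temperature"])
pres_100m = lapse_adjust(101325.0, 10, 100, LAPSE_RATES["pressure"])
print(temp_100m, pres_100m)
```

Note the unit conversion: heights are given in meters while the rates are per kilometer, so the height difference is divided by 1000 before scaling.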
- property adders
Dictionary of all dataset add offset factors
- Returns:
adders (dict)
- property attrs
Dictionary of all dataset attributes
- Returns:
attrs (dict)
- property chunks
Dictionary of all dataset chunk sizes
- Returns:
chunks (dict)
- close()
Close h5 instance
- property coordinates
Coordinates as (lat, lon) pairs
- Returns:
lat_lon (ndarray)
- property data_version
Get the version attribute of the data. None if not available.
- Returns:
version (str | None)
- property datasets
Datasets available
- Returns:
list
- static df_str_decode(df)
Decode a dataframe with byte string columns into ordinary str cols.
- Parameters:
df (pd.DataFrame) – Dataframe with some columns being byte strings.
- Returns:
df (pd.DataFrame) – DataFrame with str columns instead of byte str columns.
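The decoding step can be pictured with a plain-Python stand-in. The sketch below uses a dictionary of columns instead of a pandas DataFrame, and the helper name and sample data are hypothetical; it only illustrates the idea of converting byte string columns while leaving other columns untouched.

```python
# Hypothetical meta data table stored as column name -> list of values.
meta = {
    "country": [b"United States", b"Canada"],
    "elevation": [1609, 70],
}


def decode_columns(table):
    """Return a copy of the table with bytes values decoded to str."""
    decoded = {}
    for name, column in table.items():
        decoded[name] = [
            value.decode("utf-8") if isinstance(value, bytes) else value
            for value in column
        ]
    return decoded


decoded_meta = decode_columns(meta)
print(decoded_meta)
```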
- property dsets
Datasets available
- Returns:
list
- property dtypes
Dictionary of all dataset dtypes
- Returns:
dtypes (dict)
- get_SAM_df(site, extra_cols=None)
Get SAM solar resource DataFrame for given site
- Parameters:
site (int) – Site to extract SAM DataFrame for.
extra_cols (dict, optional) – A dictionary where the keys are extra columns to extract from the SAM solar resource DataFrame and the values are the names the new columns should have (e.g. extra_cols={'surface_albedo': 'Surface Albedo'} will extract 'surface_albedo' from the resource file and call it 'Surface Albedo' in the output).
- Returns:
res_df (pandas.DataFrame) – time-series DataFrame of resource variables needed to run SAM
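The extra_cols mapping reads as "source dataset name to output column name". A minimal sketch of that renaming, using a plain dictionary in place of the resource file and made-up values, might look like this; it is not the method's actual implementation.

```python
# Hypothetical resource data keyed by dataset name.
resource = {
    "dni": [0.0, 512.0, 730.0],
    "surface_albedo": [0.18, 0.18, 0.19],
}

# Same form as the extra_cols argument: dataset name -> output column name.
extra_cols = {"surface_albedo": "Surface Albedo"}

# Pull each requested dataset and store it under its new column name.
extra_data = {new_name: resource[dset] for dset, new_name in extra_cols.items()}
print(extra_data)
```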
- get_attrs(dset=None)
Get h5 attributes either from file or dataset
- Parameters:
dset (str) – Dataset to get attributes for, if None get file (global) attributes
- Returns:
attrs (dict) – Dataset or file attributes
- get_dset_properties(dset)
Get dataset properties (shape, dtype, chunks)
- Parameters:
dset (str) – Dataset to get properties for
- Returns:
shape (tuple) – Dataset array shape
dtype (str) – Dataset array dtype
chunks (tuple) – Dataset chunk size
- get_meta_arr(rec_name, rows=slice(None, None, None))
Get a meta array by name (faster than DataFrame extraction).
- Parameters:
rec_name (str) – Named record from the meta data to retrieve.
rows (slice) – Rows of the record to extract.
- Returns:
meta_arr (np.ndarray) – Extracted array from the meta data record name.
- get_scale_factor(dset)
Get dataset scale factor
- Parameters:
dset (str) – Dataset to get scale factor for
- Returns:
float – Dataset scale factor, used to unscale int values to floats
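As an illustration of what unscaling means, the sketch below converts stored integers to floats by dividing by a scale factor. The division convention, the helper name, and the sample numbers are assumptions for illustration and are not taken from the source.

```python
def unscale(stored_values, scale_factor):
    """Convert stored integer values to floats using a scale factor.

    Assumption: stored = physical * scale_factor, so unscaling divides.
    """
    return [value / scale_factor for value in stored_values]


# Hypothetical temperatures stored as integers with one decimal of precision.
stored = [1523, 1498, -75]
scale_factor = 10

values = unscale(stored, scale_factor)
print(values)
```

Storing scaled integers keeps files compact; the unscale flag controls whether this conversion is applied automatically on extraction.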
- get_units(dset)
Get dataset units
- Parameters:
dset (str) – Dataset to get units for
- Returns:
str – Dataset units, None if not defined
- property global_attrs
Global (file) attributes
- Returns:
global_attrs (dict)
- property groups
Groups available
- Returns:
groups (list) – List of groups
- property h5
Open h5py File instance. If _group is not None return open Group
- Returns:
h5 (h5py.File | h5py.Group)
- static is_hsds_file(file_path)
Parse one or more file paths to determine whether they are hsds
- Parameters:
file_path (str | list) – One or more file paths (only the first is parsed if multiple)
- Returns:
is_hsds_file (bool) – True if hsds
- static is_s3_file(file_path)
Parse one or more file paths to determine whether they are s3
- Parameters:
file_path (str | list) – One or more file paths (only the first is parsed if multiple)
- Returns:
is_s3_file (bool) – True if s3
- property lat_lon
Extract (latitude, longitude) pairs
- Returns:
lat_lon (ndarray)
- property meta
Resource meta data DataFrame
- Returns:
meta (pandas.DataFrame)
- open_dataset(ds_name)
Open resource dataset
- Parameters:
ds_name (str) – Dataset name to open
- Returns:
ds (ResourceDataset) – Resource for open resource dataset
- classmethod open_file(file_path, mode='r', hsds=False, hsds_kwargs=None)
Open a filepath to an h5, s3, or hsds nrel resource file with the appropriate python object.
- Parameters:
file_path (str) – String filepath to .h5 file to extract resource from. Can also be a path to an HSDS file (starts with /nrel/) or S3 file (starts with s3://)
mode (str, optional) – Mode to instantiate h5py.File instance, by default ‘r’
hsds (bool, optional) – Boolean flag to use h5pyd to handle .h5 ‘files’ hosted on AWS behind HSDS, by default False. This is now redundant; file paths starting with /nrel/ will be treated as hsds=True by default
hsds_kwargs (dict, optional) – Dictionary of optional kwargs for h5pyd, e.g., bucket, username, password, by default None
- Returns:
file (h5py.File | h5pyd.File) – H5 file handler either opening the local file using h5py, or the file on s3 using h5py and fsspec, or the file on HSDS using h5pyd.
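The prefix rules described for file_path (/nrel/ for HSDS, s3:// for S3, anything else local) can be sketched as a small dispatch function. The function name and the example paths are hypothetical, and the real method may apply additional checks.

```python
def classify_path(file_path):
    """Return 'hsds', 's3', or 'local' based on the path prefix."""
    if file_path.startswith("/nrel/"):
        return "hsds"
    if file_path.startswith("s3://"):
        return "s3"
    return "local"


# Hypothetical paths, one per storage backend.
examples = [
    "/nrel/example/resource_2018.h5",
    "s3://example-bucket/resource_2018.h5",
    "/scratch/resource_2018.h5",
]
kinds = [classify_path(path) for path in examples]
print(kinds)
```

This mirrors the note on the hsds parameter: because the prefix alone identifies an HSDS path, passing hsds=True explicitly is redundant.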
- property res_dsets
Available resource datasets
- Returns:
list
- property resource_datasets
Available resource datasets
- Returns:
list
- property scale_factors
Dictionary of all dataset scale factors
- Returns:
scale_factors (dict)
- property shape
Resource shape (timesteps, sites), i.e. shape = (len(time_index), len(meta))
- Returns:
shape (tuple)
- property shapes
Dictionary of all dataset shapes
- Returns:
shapes (dict)
- property time_index
Resource DatetimeIndex
- Returns:
time_index (pandas.DatetimeIndex)
- property units
Dictionary of all dataset units
- Returns:
units (dict)