Running with HSDS
The Highly Scalable Distributed Service (HSDS) is a cloud optimized API to enable access to .h5 files hosted on AWS. The HSDS software was developed by the HDF Group and is hosted on Amazon Web Services (AWS) using a combination of EC2 (Elastic Compute) and S3 (Scalable Storage Service). You can read more about the HSDS service in this slide deck. You can use the NREL developer API as the HSDS endpoint for small workloads or stand up your own HSDS local server (instructions further below) for an enhanced parallelized data experience.
For general information on where to get started accessing NREL data from outside of NREL, see the rex docs.
You might also be interested in these examples of how to set up your own local HSDS server and how to run reV on an AWS parallel cluster.
Note that running directly from S3 files will be an easier solution although not as performant. For more details on running directly from S3 files see running reV locally and the rex s3 example
Setting up HSDS
To get started install the h5pyd library:
pip install h5pyd
Next, configure h5pyd by running hsconfigure
from the command line, or by
creating a configuration file at ~/.hscfg
:
hsconfigure
hs_endpoint = https://developer.nrel.gov/api/hsds
hs_username =
hs_password =
hs_api_key = {YOUR_API_KEY_HERE}
To get your own API key, visit https://developer.nrel.gov/signup/
Please note that our HSDS service is for demonstration purposes only. The API in the example above is hosted on an NREL server and will have limits on the amount of data you can access via HSDS. It is common to get an error: OSError: Error retrieving data: None errors
if you attempt to access too much data or if the server is busy. Here are two references for scaling reV using HSDS and AWS:
Using HSDS with reV
Once h5pyd has been installed and configured, rex
can pull data directly from AWS using HSDS
To access the resource data used by reV (NSRDB or WTK) you have to turn on the
hsds
flag in the resource handlers:
nsrdb_file = '/nrel/nsrdb/v3/nsrdb_2013.h5'
with rex.Resource(nsrdb_file, hsds=True) as f:
meta_data = f.meta
time_index = f.time_index
reV Gen
reV generation (reV.Gen) will automatically infer if a file path is locally on disk or from HSDS.
Note that for all of these examples, the sam_file
input points to files in
the
reV test directory
that may not be copied in your install. You may want to download the relevant
SAM system configs from that directory and point the sam_file
variable to
the correct filepath on your computer.
windpower
Compute wind capacity factors for a given set of latitude and longitude coordinates:
import os
import numpy as np
from reV import TESTDATADIR
from reV.config.project_points import ProjectPoints
from reV.generation.generation import Gen
from rex import init_logger
init_logger('reV', log_level='DEBUG')
lat_lons = np.array([[ 41.25, -71.66],
[ 41.05, -71.74],
[ 41.45, -71.66],
[ 41.97, -71.78],
[ 41.65, -71.74],
[ 41.53, -71.7 ],
[ 41.25, -71.7 ],
[ 41.05, -71.78],
[ 42.01, -71.74],
[ 41.45, -71.78]])
res_file = '/nrel/wtk/conus/wtk_conus_2012.h5' # HSDS 'file' path
sam_file = os.path.join(TESTDATADIR,
'SAM/wind_gen_standard_losses_0.json')
pp = ProjectPoints.lat_lon_coords(lat_lons, res_file, sam_file)
gen = Gen('windpower', pp, sam_file, res_file,
output_request=('cf_mean', 'cf_profile'))
gen.run(max_workers=1)
print(gen.out['cf_profile'])
[[0.319 0.538 0.287 ... 0.496 0.579 0.486]
[0.382 0.75 0.474 ... 0.595 0.339 0.601]
[0.696 0.814 0.724 ... 0.66 0.466 0.677]
...
[0.833 0.833 0.823 ... 0.833 0.833 0.833]
[0.782 0.833 0.833 ... 0.833 0.833 0.833]
[0.756 0.801 0.833 ... 0.833 0.833 0.833]]
pvwatts
NOTE: pvwattsv5
and pvwattsv7
are both available from reV.
Compute pvcapacity factors for all resource gids in a Rhode Island:
import os
from reV import TESTDATADIR
from reV.config.project_points import ProjectPoints
from reV.generation.generation import Gen
from rex import init_logger
init_logger('reV', log_level='DEBUG')
regions = {'Rhode Island': 'state'}
res_file = '/nrel/nsrdb/v3/nsrdb_2012.h5' # HSDS 'file' path
sam_file = os.path.join(TESTDATADIR, 'SAM/naris_pv_1axis_inv13.json')
pp = ProjectPoints.regions(regions, res_file, sam_file)
gen = Gen('pvwattsv5', pp, sam_file, res_file,
output_request=('cf_mean', 'cf_profile'))
gen.run(max_workers=1)
print(gen.out['cf_mean'])
[0.183 0.166 0.177 0.175 0.167 0.183 0.176 0.175 0.176 0.177]
Command Line Interface (CLI)
reV-gen can also be run from the command line and will output the results to an .h5 file that can be read with rex.resource.Resource.