Add sodapro.get_cams_radiation #1175

AdamRJensen · 2021-02-22T21:21:59Z

~~Closes #xxxx~~
I am familiar with the contributing guidelines
Tests added
Updates entries to docs/sphinx/source/api.rst for API changes.
Adds description and name entries in the appropriate "what's new" file in docs/sphinx/source/whatsnew for all changes. Includes link to the GitHub Issue with :issue:`num` or this Pull Request with :pull:`num`. Includes contributor name and/or GitHub username (link with :ghuser:`user`).
New code is fully documented. Includes numpydoc compliant docstrings, examples, and comments where necessary.
Pull request is nearly complete and ready for detailed review.
Maintainer: Appropriate GitHub Labels and Milestone are assigned to the Pull Request and linked Issue.

Add a function to retrieve CAMS McClear clear-sky radiation, as previously attempted without success in #271, #274, and #279. This is a new pull request because my previous attempt, #1172, had a long and incorrect commit history.

AdamRJensen · 2021-02-22T21:27:02Z

@mikofski Thanks for the help in making a new pull request. The previous PR #1172 has now been closed.

As previously mentioned: I've asked the development team of the CAMS radiation and McClear services, and have been informed that there is not, and will not be, any demo API email/access. So if tests have to be implemented, we need to register a pvlib email. Only a registered email address is necessary, and no password will ever be public.

The function includes some very convenient features, including the label, integrated, and map_variables arguments, making it very intuitive for Python users. I'd very much like a critical look at the function and feedback on how it can be improved.

AdamRJensen · 2021-02-22T21:34:09Z

@mikofski As far as I can tell, CAMS McClear and the ECMWF MACC reader (pvlib.iotools.ecmwf_macc) return very different parameters.

wholmgren · 2021-02-23T19:09:49Z

I'm not very familiar with the details of CAMS services, but it appears the major differences between this McClear data reader and the existing MACC reader are in spectral components of AOD. McClear returns AOD at 550 nm for a variety of particles, while MACC returns AOD at a variety of wavelengths. So I think they are complementary and both readers can exist in pvlib.

The CAMS data license (available here) allows us to bundle a file for testing.

How about one function for parsing data and one function for fetching data? The fetch function can call the parse function so a user only needs to make one function call. But separating the logic lets users load data in a different manner if desired (e.g. from disk or using threads).

AdamRJensen · 2021-02-23T21:41:32Z

@wholmgren I will separate it into a get and read/parse function. Any preference in calling it read or parse? Is there a difference?

The nice part about having it in one function is that you know if the file has 6 (verbose=False) or 23 columns (verbose=True). I suppose this can be solved by reading in the file, and then checking how many columns there are, and based on this assign the 6 or 23 column name header.
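A minimal sketch of that column-count idea, assuming a semicolon-separated file without a header row. The column names are placeholders and `assign_columns` is a hypothetical helper, not the actual CAMS names or pvlib code.

```python
import io

import pandas as pd

# Placeholder names; the real files define their own column names.
BASIC_COLUMNS = [f"basic_{i}" for i in range(6)]
VERBOSE_COLUMNS = [f"verbose_{i}" for i in range(23)]


def assign_columns(buffer):
    """Read header-less data and name the columns based on their count."""
    data = pd.read_csv(buffer, sep=";", header=None)
    n_columns = data.shape[1]
    if n_columns == len(BASIC_COLUMNS):
        data.columns = BASIC_COLUMNS
    elif n_columns == len(VERBOSE_COLUMNS):
        data.columns = VERBOSE_COLUMNS
    else:
        raise ValueError(f"Unexpected number of columns: {n_columns}")
    return data


# Two rows of six values stand in for a non-verbose file.
sample = io.StringIO("1;2;3;4;5;6\n7;8;9;10;11;12\n")
data = assign_columns(sample)
print(list(data.columns))
```

Raising on any other column count keeps a changed file format from being silently mislabeled.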

The primary output of CAMS McClear is the time series of broadband clear-sky irradiance. This clear-sky model can be used for a number of purposes, e.g. quality checks, and has a major advantage over most clear-sky models: it is based on data for the specific location and time, not just seasonal trends. The ECMWF MACC reader only returns AOD for different wavelengths and water vapor, which has very different uses.

wholmgren · 2021-02-23T22:00:55Z

> Any preference in calling it read or parse? Is there a difference?

I'm guessing parse. Check out psm3.py for an example of a module with a get_psm3 (downloads data from remote source, passes it to parse), parse_psm3 that converts the buffered data into a well-formatted DataFrame, and finally read_psm3 that simplifies usage for local files.
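As a rough sketch of that layering (not the actual psm3 or sodapro code), the three functions could relate as shown below. All names ending in `_example` are hypothetical, and the download is replaced by a stand-in callable so the sketch runs offline.

```python
import io

import pandas as pd


def parse_example(fbuf):
    """Convert buffered CSV text into a DataFrame indexed by time."""
    return pd.read_csv(fbuf, index_col=0, parse_dates=True)


def read_example(filename):
    """Open a local file and hand it to the parser."""
    with open(filename, "r") as fbuf:
        return parse_example(fbuf)


def get_example(download):
    """Fetch text from a remote source and hand it to the parser.

    ``download`` is any callable returning the response body as a
    string; it stands in for the actual HTTP request.
    """
    return parse_example(io.StringIO(download()))


def fake_download():
    # Stand-in for a web request so the sketch runs offline.
    return "time,ghi\n2020-06-01 12:00,800\n2020-06-01 13:00,750\n"


data = get_example(fake_download)
print(data)
```

Because only `parse_example` knows the file format, data loaded from disk or from worker threads can reuse it unchanged.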

#1155 has some recent discussion relevant to the benefits of splitting up fetching and parsing.

> The nice part about having it in one function is that you know if the file has 6 (verbose=False) or 23 columns (verbose=True). I suppose this can be solved by reading in the file, and then checking how many columns there are, and based on this assign the 6 or 23 column name header.

That could work, but you could also keep it simple and add that flag to the read/parse function too. Most people will use the higher level function for direct downloads + parsing and it's not a big deal to make users of the lower level parse function repeat a parameter.

AdamRJensen · 2021-02-23T23:06:22Z

@wholmgren Ok, I will attempt to make a similar structure, with parse, read, and get. Or should we stick to just parse and get?
This function differs from psm3 in that arguments need to be passed from the get function to the read function and then to the parse function, particularly the integrated, label, and map_variables arguments.

wholmgren · 2021-02-23T23:20:49Z

I don't see that it's an issue to pass any of those arguments from one function to another or to expect that a user with local data already knows something about that data. But I don't want to overcomplicate it; iotools is supposed to be relatively easy to contribute to. It's really fine with me if you want to keep it all in one function.

AdamRJensen · 2021-02-24T00:03:08Z

@wholmgren I have now created read and parse sub-functions similar to psm3. It is indeed nice that it can retrieve files, but also work with local files.

Also, I found a way to parse the meta-data nicely and get the column names from the file itself.
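One way this could look, as a sketch only: it assumes the header lines start with `#`, that metadata is written as `key: value`, and that the last header line lists the column names separated by `;`. The sample text and the `parse_with_header` name are made up for illustration.

```python
import io

import pandas as pd


def parse_with_header(fbuf):
    """Split '#'-prefixed header lines from the data that follows."""
    lines = fbuf.read().splitlines()

    # Header lines are assumed to form one contiguous block at the top.
    n_header = 0
    while n_header < len(lines) and lines[n_header].startswith("#"):
        n_header += 1
    header = [line.lstrip("#").strip() for line in lines[:n_header]]

    # The last header line is assumed to hold the column names.
    columns = header[-1].split(";")

    # Remaining "key: value" header lines become metadata entries.
    metadata = {}
    for line in header[:-1]:
        if ": " in line:
            key, value = line.split(": ", 1)
            metadata[key] = value

    body = io.StringIO("\n".join(lines[n_header:]))
    data = pd.read_csv(body, sep=";", header=None, names=columns)
    return data, metadata


sample = (
    "# Title: example file\n"
    "# Latitude: 55.79\n"
    "# time;ghi;dhi\n"
    "2020-01-01T00:00;0.0;0.0\n"
    "2020-01-01T01:00;1.5;1.0\n"
)
data, metadata = parse_with_header(io.StringIO(sample))
print(metadata)
print(list(data.columns))
```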

The documentation for the two new functions is not fully developed, but I will do this within the next few days. I'll also be working on making tests. Though you are welcome to look through / test the function now.

AdamRJensen · 2021-02-27T17:45:11Z

@wholmgren I have renamed the function to 'get_cams_radiation', as I have extended the functions to read/parse/get both CAMS Radiation and McClear. The two services follow the same format (the API input is identical, with the exception of one string specifying whether the user wants Radiation or McClear).

I have been looking into developing tests, but I am very uncertain about which aspects need to be tested. Do we want to register a pvlib email and test the 'get' method by making a real API call?

kandersolar

This is looking pretty good @AdamRJensen! Some notes below, and let's talk about the tests on our call later.

ci/requirements-py36-min.yml

pvlib/iotools/sodapro.py

kandersolar · 2021-06-07T14:22:00Z

pvlib/iotools/sodapro.py

+    if (time_step == '1d') | (time_step == '1M'):
+        data.index = pd.DatetimeIndex(data.index.date)
+    # For monthly data with 'right' label, the index should be the last
+    # date of the month and not the first date of the following month


Either way seems reasonable to me (and I personally prefer left-labeling to avoid questions like this), but I wonder what other people think. As a point of comparison, TMY3 data are right-labeled but without a sub-unit shift like this.

I also wonder if all this labeling code is worth it -- maybe we should just return the start observation time and be done with it.

> maybe we should just return the start observation time and be done with it.

+1

I recognize your concern and would perhaps agree if the service didn't return monthly data.

My main argument is that pandas labels minute and daily data by the left edge of the interval, whereas monthly data is labeled by the right edge (i.e., the last day of the month). Most users of monthly data would otherwise have to change the index labels manually before they could merge it with other data resampled in pandas.
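The pandas defaults referred to here can be checked with a short sketch on synthetic data (two months of hourly values, not data from the service):

```python
import pandas as pd

# Two months of hourly data, every value equal to one.
index = pd.date_range("2019-01-01", periods=59 * 24,
                      freq=pd.Timedelta(hours=1))
hourly = pd.Series(1, index=index)

# Daily bins are labeled by their left edge (start of each day).
daily = hourly.resample("D").sum()

# Month-end bins are labeled by their right edge (last day of the month).
monthly = hourly.resample(pd.offsets.MonthEnd()).sum()

print(daily.index[0], daily.index[-1])
print(list(monthly.index))
```

Passing `pd.offsets.MonthEnd()` instead of a string alias is a choice made for this sketch, since the month-end alias differs between pandas versions.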

Personally, I think it adds functionality that makes the function a lot easier to use, and it would be a shame to nix it.

FWIW here are some one-liners for converting month-begin data to month-end:

    In [20]: s = pd.Series([1,2,3], pd.date_range('2019-01-01', periods=3, freq='MS'))

    In [21]: s.index
    Out[21]: DatetimeIndex(['2019-01-01', '2019-02-01', '2019-03-01'], dtype='datetime64[ns]', freq='MS')

    In [22]: s.resample('M').mean().index
    Out[22]: DatetimeIndex(['2019-01-31', '2019-02-28', '2019-03-31'], dtype='datetime64[ns]', freq='M')

    # or
    In [23]: s.to_period('M').to_timestamp('M').index
    Out[23]: DatetimeIndex(['2019-01-31', '2019-02-28', '2019-03-31'], dtype='datetime64[ns]', freq='M')

How about removing the label parameter but adding a docstring example with (a better version of) the above snippet for the benefit of whatever subset of pvlib users want monthly insolation? E.g. this, which renders as this.

pvlib/iotools/sodapro.py

wholmgren

Nice job with the tests @AdamRJensen.

pvlib/iotools/sodapro.py

wholmgren · 2021-06-07T16:00:16Z

pvlib/iotools/sodapro.py

+    if (time_step == '1d') | (time_step == '1M'):
+        data.index = pd.DatetimeIndex(data.index.date)
+    # For monthly data with 'right' label, the index should be the last
+    # date of the month and not the first date of the following month


> maybe we should just return the start observation time and be done with it.

+1

wholmgren

Thanks @AdamRJensen. @kanderso-nrel I'll let you decide when to merge this and refrain from further review unless requested.

AdamRJensen · 2021-06-08T23:56:54Z

@wholmgren @kanderso-nrel I recommend that this only gets merged once a decision on #1245 has been made.

kandersolar

Sorry for the delay here, this looks great. Thanks @AdamRJensen! One more idea: what do you think about adding the new-to-pvlib bhi acronym to the Variables and Symbols list?

AdamRJensen added 4 commits February 22, 2021 22:14

Add cams.get_cams_radiation function

d7deb80

Revert "Add cams.get_cams_radiation function"

510f08e

This reverts commit d7deb80.

Add cams.get_cams_mcclear

84e820c

Update v0.9.0.rst

0a92f72

AdamRJensen added 2 commits February 23, 2021 23:03

Reference correct pull request in whatsnew

e8d1098

Add test file for monthly data

2092f8b

AdamRJensen added 2 commits February 24, 2021 00:56

Create sub-functions parse and read

ff9cece

Fixed stickler

0c28299

AdamRJensen added 13 commits February 25, 2021 22:40

Update constants names

75e575f

Fixed monthly integration of values

1f2ec30

Improvement to meta-data parsing

641fc97

Update test file to be during the day

527fa1c

Convert to get_cams to add support for CAMS Radiation

673bf3b

Fixed stickler issues

c6764d3

Update function names to just 'cams'

225b0e1

Update return description

924c58a

Rename to cams_radiation

cbc116d

And minor updates to the documentation

Convert print statements to warnings

294c14a

Update function names

d28143a

Fixed stickler

f263184

Fixed stickler

9b26d91

AdamRJensen added 2 commits June 5, 2021 12:30

Add additional tests for full coverage

9814cae

Update stickler

1d28d1a

AdamRJensen requested review from wholmgren and kandersolar June 5, 2021 21:44

AdamRJensen added 4 commits June 6, 2021 19:01

Add timeout option to get_cams

1d115db

Add timeout option to get_cams

8e53165

Extent tests to cover label, integrated, map_variables arguments

3714e62

Fix stickler

cc5347d

kandersolar reviewed Jun 7, 2021

wholmgren reviewed Jun 7, 2021

AdamRJensen and others added 4 commits June 7, 2021 12:28

Updates to documentation

9fc945f

Co-authored-by: Kevin Anderson

Update to documentation

8d85f0b

Co-authored-by: Kevin Anderson

Reformat data inputs, set index name to None

857bd91

Also, includes a few minor changes addressing the review by @kanderso-nrel

Remove rounding when converting from integrated values

15e097c

AdamRJensen changed the title ~~Add cams.get_cams_radiation~~ Add sodapro.get_cams_radiation Jun 7, 2021

AdamRJensen added 2 commits June 7, 2021 13:40

Minor updates to mock tests

e80cbde

Includes match keywords for tests that asserts warning and error messages.

Remove round(4)

6aadd5c

AdamRJensen mentioned this pull request Jun 7, 2021

Inconsistencies in iotools #1245

Closed

wholmgren approved these changes Jun 7, 2021

AdamRJensen added 3 commits June 11, 2021 12:23

Change meta variable name to metadata

f194905

Change start_date/end_date to start/end

128c3f9

Make stickler happy

1e6d262

kandersolar approved these changes Jun 13, 2021

AdamRJensen mentioned this pull request Jun 13, 2021

Add bhi, pressure, and wind_ to list of variables #1247

Merged

1 task

kandersolar merged commit d5d8ffa into pvlib:master Jun 13, 2021

kandersolar added enhancement io labels Jun 13, 2021

AdamRJensen deleted the cams_mcclear branch August 10, 2021 15:01

AdamRJensen mentioned this pull request Aug 23, 2021

Summary of Google Summer of Code 2021 with pvlib #1289

Closed
