Langsupport: Validation refactoring and better language support for autograder test cells. #1173

sigurdurb · 2019-08-15T17:52:46Z

Refactors repeated code. The code could maybe be refactored even more.
Better validation of errors, now supports Gnu Octave.
To get the kernel name I use lang = nb.metadata.get('kernelspec',{}).get("language","python"). Even though it says python it gets the real name of the kernel. So 'octave' is returned for the octave kernel for example. So contributers can now use the lang variable to do custom error handling for their kernel if it does not support the default functionality.
Octave kernel error matching now works, The code for this is in is_validate_error in utils.py, Octave does not have a cell output_type 'error' but we re.match the error message from the "text" output that has the "stream" output_type. That is the only way I found that works for Octave. Just don't put display('error: ') in a your autograder test cell as a teacher, those cells are read only for students anyway so for students that should not be a problem.

…nguage from the kernelspec, refactors get _get_failed_cells and _get_passed_cells into _get_cells_op they call with different parameters

… error

sigurdurb · 2019-08-15T17:55:04Z

Continuation of PR:#1038 and Issue: #1039

jhamrick · 2019-08-21T19:33:40Z

Thanks for the work you put into this!

However, I'm not sure this is something that I want to merge into master as is, as I would really like to keep all of the code in nbgrader as language agnostic as possible, and I would like to avoid any special casing for particular languages.

It seems to me that this is more of an issue with the octave kernel: it should be returning errors using the "error" output type rather than a stream output. I guess though for some kernels it might be challenging to do this (e.g. for bash)---really all you have is the stream output. In that case, I would be ok with merging a PR which checks to see if there is a stream with "name": "stderr" and if so treating that as an error, i.e. something like:

def get_errors(cell):
    errors = []
    for output in cell.outputs:
        if output.output_type == "stream" and output.name == "stderr":
            errors.append(output.text)
        elif output.output_type == 'error':
            errors.append(output.traceback)
    return errors

However, I do not want to merge something that checks for a language type explicitly.

Also, it unfortunately looks like these changes have undone some other stuff surrounding the logic of self.validate_all. The logic of _get_failed_cells and _get_passed_cells is legitimately different so I don't think it makes sense to factor that out into a shared method. Would you mind undoing those changes, and just making changes individually in those two methods for appropriately handling errors?

sigurdurb · 2020-06-24T11:30:19Z

There were two use cases with this PR:

If there is a use case use case to having a variable for language, "lang" being passed to determine_grade. But if nbgrader is language agnostic it would probably not be used in that function unless it is just an option for people to make their own custom changes more easily for kernels like octave.
A general refactoring PR could be useful. The utils.is_validate_error function I added refactored the error checks for:

for output in cell.outputs:
    if output.output_type == "error":
         errors.append("\n".join(output.traceback))

That were in utils.determine_grade and validator._extract_error (That now also have a check for partial credit)

is_validate_error first checked if the language was 'octave'. Also in _extract_error there is a check for 'octave' because there the error is in the cell text but else it is in the cell traceback. But it is not ideal to have octave hardcoded in two places.

But if anyone wants to make language support for weird kernels they could use this code for where to implement changes.

sigurdurb · 2020-06-24T11:45:07Z

This code would need to be separated into several PR's. Going to close this PR and refer it to people who are interest in custom support for languages.

williamstein · 2020-11-18T20:36:32Z

I have no choice but to deal with this same problem in CoCalc (because paying customers expect to be able to use nbgrader + octave). In case you're curious, here's how it works there (like you, I check for error: at the beginning of the line):

https://github.com/sagemathinc/cocalc/blob/master/src/smc-webapp/jupyter/nbgrader/autograde.ts#L219

Note that this is Typescript code, rather than Python.

Also, note that there are subtle similar problems that happen when using the R kernel.

williamstein · 2020-11-18T20:37:21Z

I also agree that this is all working around bugs in these kernels. However, I'm not optimistic that these bugs will be fixed in a timely manner.

sigurdurb added 3 commits August 15, 2019 17:04

adds is_validate_error function, in validate.py we add getting the la…

65e3881

…nguage from the kernelspec, refactors get _get_failed_cells and _get_passed_cells into _get_cells_op they call with different parameters

Removing cutoff since the code anyway stops executing after the first…

4c2fb02

… error

refactoring code

d195b8b

sigurdurb changed the title ~~Langsupport~~ Langsupport: Validation refactoring and better language support for autograder test cells. Aug 15, 2019

re.match to match for more efficiency

ef06d0e

jhamrick added enhancement needs work labels Aug 21, 2019

jhamrick added this to the 0.7.0 milestone Aug 21, 2019

sigurdurb closed this Jun 24, 2020

jld23 mentioned this pull request Aug 10, 2020

add SAS codestub and autograde for metakernel based non-python kernels #1356

Merged

williamstein mentioned this pull request Nov 18, 2020

nbgrader: make triggering failure on a special string (e.g., "error") in stdout be optional and configurable sagemathinc/cocalc#5014

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Langsupport: Validation refactoring and better language support for autograder test cells. #1173

Langsupport: Validation refactoring and better language support for autograder test cells. #1173

Uh oh!

sigurdurb commented Aug 15, 2019

Uh oh!

sigurdurb commented Aug 15, 2019

Uh oh!

jhamrick commented Aug 21, 2019

Uh oh!

sigurdurb commented Jun 24, 2020

Uh oh!

sigurdurb commented Jun 24, 2020

Uh oh!

williamstein commented Nov 18, 2020

Uh oh!

williamstein commented Nov 18, 2020

Uh oh!

Uh oh!

Langsupport: Validation refactoring and better language support for autograder test cells. #1173

Langsupport: Validation refactoring and better language support for autograder test cells. #1173

Uh oh!

Conversation

sigurdurb commented Aug 15, 2019

Uh oh!

sigurdurb commented Aug 15, 2019

Uh oh!

jhamrick commented Aug 21, 2019

Uh oh!

sigurdurb commented Jun 24, 2020

Uh oh!

sigurdurb commented Jun 24, 2020

Uh oh!

williamstein commented Nov 18, 2020

Uh oh!

williamstein commented Nov 18, 2020

Uh oh!

Uh oh!