Make sepEndBy stack safe #95

hjmtql · 2022-08-28T10:46:29Z

I encountered a stack overflow when I used sepEndBy on large inputs.
It seems to be caused by sepEndBy and sepEndBy1 calling each other.
So I added a test and fixed sepEndBy1 by using the stack-safe many.
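
A minimal sketch of the two shapes involved, under stated assumptions. The names `sepEndByRec`, `sepEndBy1Rec`, and `sepEndBy1Loop` are hypothetical. The recursive pair is reconstructed from the lines this PR's diff removes. The trailing `optional sep` in the loop version is a guess at how a final separator might be handled, not the PR's confirmed code.

```purescript
module SepEndBySketch where

import Prelude

import Control.Alt ((<|>))
import Data.List (List(..))
import Data.List.NonEmpty (NonEmptyList, cons')
import Data.List.NonEmpty as NEL
import StringParser.Combinators (many, optional, try)
import StringParser.Parser (Parser)

-- Recursive shape: the two functions call each other once per parsed element,
-- so the nesting depth grows with the length of the input.
sepEndByRec :: forall a sep. Parser a -> Parser sep -> Parser (List a)
sepEndByRec p sep = (NEL.toList <$> sepEndBy1Rec p sep) <|> pure Nil

sepEndBy1Rec :: forall a sep. Parser a -> Parser sep -> Parser (NonEmptyList a)
sepEndBy1Rec p sep = do
  a <- p
  ( do
      _ <- sep
      as <- sepEndByRec p sep
      pure (cons' a as)
  ) <|> pure (NEL.singleton a)

-- Loop shape: the repetition is delegated to `many`, which this PR relies on
-- being stack safe. The `optional sep` line is an assumption (see above).
sepEndBy1Loop :: forall a sep. Parser a -> Parser sep -> Parser (NonEmptyList a)
sepEndBy1Loop p sep = do
  a <- p
  as <- many $ try (sep *> p)
  optional sep
  pure (cons' a as)
```

The loop version is not a drop-in replacement: `try` makes a failed `sep *> p` backtrack, so the two shapes can disagree when `p` fails after consuming input.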

It seems that chainl and chainr should also be fixed.
However, this PR fixes only sepEndBy, because those functions are a little too complicated for me.

Checklist:

Added the change to the changelog's "Unreleased" section with a link to this PR and your username
Linked any existing issues or proposals that this pull request should close
Updated or added relevant documentation in the README and/or documentation directory
Added a test for the contribution (if applicable)

chtenb · 2022-08-28T17:20:04Z

src/StringParser/Combinators.purs

-      as <- sepEndBy p sep
-      pure (cons' a as)
-  ) <|> pure (NEL.singleton a)
+  as <- many $ try (sep *> p)


If I'm not mistaken, this changes the behavior of the function in case of a parser error in p due to the usage of try. PS I'm currently on vacation and may not be very swift to respond.
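
To make the concern concrete, here is a small sketch that isolates the effect of `try` inside `many`. The parsers `item`, `withoutTry`, and `withTry` are hypothetical names chosen for illustration. `item` is deliberately built so that it can fail after having consumed a character.

```purescript
module TrySketch where

import Prelude

import Data.List (List)
import StringParser.CodePoints (char)
import StringParser.Combinators (many, try)
import StringParser.Parser (Parser)

-- Hypothetical element parser: consumes 'a' and only then checks for 'b',
-- so on "ac" it fails after consuming input.
item :: Parser Char
item = char 'a' *> char 'b'

-- Without try: a failure inside `item` after consumption is not rolled back.
withoutTry :: Parser (List Char)
withoutTry = many (char ';' *> item)

-- With try: a failed iteration is reset to where it started.
withTry :: Parser (List Char)
withTry = many (try (char ';' *> item))
```

On an input such as `;ab;ac`, `withoutTry` should report the error from the second element, while `withTry` should stop after the first element and succeed. That difference is the behavior change in question.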

What you pointed out is correct.
Thanks for the quick review.

NOTE

The result is:

> runParser (sepEndBy (char 'a') (char ';')) "a;b"
(Right ('a' : Nil))

It seems that b is silently discarded. I think this is a small problem.

I found that the original sepEndBy gives the same result.
I wonder whether this is the intended behavior.

Hm yeah, I can't think of obvious situations where this would be what you want. In test/BasicSpecs.purs we have some basic specs that say what should be parsable and what shouldn't. Perhaps we should extend these specs to also cover partial successes and assert on the expected remaining unparsed string.
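
One way such an assertion could be set up, as a rough sketch only: `withRest` and `example` are hypothetical names, not existing helpers in test/BasicSpecs.purs. The helper runs a parser and then collects whatever input is left over, so a spec can compare both the parsed value and the unparsed remainder. It assumes inputs whose characters each fit in a single `Char`.

```purescript
module RestSketch where

import Prelude

import Data.Array as Array
import Data.List (List)
import Data.String.CodeUnits (fromCharArray)
import Data.Tuple (Tuple(..))
import StringParser.CodePoints (anyChar, char)
import StringParser.Combinators (many, sepEndBy)
import StringParser.Parser (Parser)

-- Hypothetical helper: pair the result of `p` with the input it left unparsed.
withRest :: forall a. Parser a -> Parser (Tuple a String)
withRest p = do
  a <- p
  rest <- many anyChar
  pure (Tuple a (fromCharArray (Array.fromFoldable rest)))

-- The parser from the example above, now also reporting the remainder.
example :: Parser (Tuple (List Char) String)
example = withRest (sepEndBy (char 'a') (char ';'))
```

Running `runParser example "a;b"` would then show directly whether the `;` before `b` was consumed, which a bare `Right ('a' : Nil)` does not reveal.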

We should also look at what purescript-parsing does in these cases.

I would naively expect sepEndBy to successfully parse a separated sequence and, whenever either sep or p fails, to just stop parsing and yield what has been parsed so far.

hjmtql added 3 commits August 28, 2022 19:08

add overflow test of sepEndBy

095c1c8

fix sepEndBy1 to be stack safe using many

63b245e

add sepEndBy fix to CHANGELOG.md

6d07de4

chtenb reviewed Aug 28, 2022


hjmtql marked this pull request as draft August 29, 2022 00:14

chtenb mentioned this pull request Sep 10, 2022

Add assertions on consumption behavior to parser spec tests #96

Draft

4 tasks


hjmtql commented Aug 28, 2022

chtenb Aug 28, 2022

hjmtql Aug 29, 2022

hjmtql Aug 29, 2022

chtenb Sep 2, 2022

chtenb Sep 2, 2022
