(feat) add semver.org example #300

dselman · 2022-06-18T11:36:15Z

Signed-off-by: Dan Selman danscode@selman.org

Adds an example grammar that describes a SemVer.org v2 semantic version.

Signed-off-by: Dan Selman <danscode@selman.org>

hildjj

This looks really good now.

Minor nits. If you like, I can merge it as-is.

hildjj · 2022-06-19T19:15:32Z

CHANGELOG.md

@@ -6,6 +6,9 @@ This file documents all notable changes to Peggy.
 Unreleased
 ----------

+- [#299](https://github.com/peggyjs/peggy/issues/299) Add example grammar for a


I'll fix this up after merge. Apparently I didn't leave a template in the changelog after 2.0.1, because that was done i n such a rush.

hildjj · 2022-06-19T19:16:23Z

examples/semver.peggy

+  = major:$numericIdentifier '.' minor:$numericIdentifier '.' patch:$numericIdentifier
+  {
+    return {
+      major: parseInt(major, 10),


Refactor parseInt into numericIdentifier.

Moving parseInt to numericIdentifier may be not so good, because that rule is also used in contexts where you do not want a number, but only a match result. I'm fine to leave parsing here.

then an intermediate rule that does the parsing, I think.

Regardless, the $ should go in numericIdentifier.

hildjj · 2022-06-19T19:18:21Z

examples/semver.peggy

+  = [0-9]
+
+positiveDigit
+  = [1-9]


newline at end, please

Could you please create an issue to set up husky and some kind of linter, so that nobody has to enforce code style by hand?

We don't have a linter for .peggy files yet, as far as I know. There should be one.

We lint all of the JS on each GHA run.

hildjj · 2022-06-19T19:20:58Z

examples/semver.peggy

+  = digit* nonDigit identifierChar*
+
+numericIdentifier
+  = '0' / (positiveDigit digit*)


Maybe this?

numericIdentifier = num:$(positiveDigit digit*) { return parseInt(num) } / '0' ![0-9] { return 0 }

TBH I don't understand how this works right now. positiveDigit digit* should return ['1', ['2', '3']], and parseInt(['1', ['2', '3']], 10) = 1.

I'd suggest a bit different code, though:

numericIdentifier = num:$('0' / [1-9][0-9]*) { return parseInt(num); }

IMO you don't have to write a lot of code to make it more readable. "Positive digit" doesn't add any more information to [1-9].

Trying to keep it closer to https://semver.org/spec/v2.0.0.html I think.

TBH I don't understand how this works right now.

The magic in the $ operator that returns the text from the input instead of results of subexpressions.

"Positive digit" doesn't add any more information to [1-9].

Agreed, especially since this rule is used only once

Oh, I missed $ up above.

Anyway, I didn't find a place in the spec that would limit numbers by any limit, so I suspect parseInt doesn't formally comply to the spec :/

reverofevil · 2022-06-19T19:27:38Z

examples/semver.peggy

+preRelease
+  = head:$preReleaseIdentifier tail:('.' @$preReleaseIdentifier)*
+  {
+    return [head, ...tail];


Thank you for this. I automatically kept using return tail.unshift(head), tail;.

It's in the docs now, look for "Parsing Lists".

reverofevil · 2022-06-19T19:59:23Z

examples/semver.peggy

+  = alphanumericIdentifier
+  / digit+
+
+alphanumericIdentifier


According to spec, it should be identifierChar* nonDigit identifierChar*.

If you substitute everything into alphanumericIdentifier, it turns out the only reason why it exists is to avoid parse conflict in preReleaseIdentifier. If there is no nonDigit, it might match same things as numericIdentifier.

In PEG parsers there is no parse conflicts, it can be greatly simplified.

I think this is all you need to comply to the spec:

semver = major:numericId "." minor:numericId "." patch:numericId prerelease:("-" @prerelease)? build:("+" @build)? { return {major, minor, patch, prerelease, build}; } prerelease = head:prereleaseId tail:("." @prereleaseId)* { return [head, ...tail]; } build = head:alnumId tail:("." @alnumId)* { return [head, ...tail]; } prereleaseId = alnumId / $("0" / [1-9][0-9]*) alnumId = $[0-9a-z-]i+

Actually buildId is just alnumId. It will never hit the second branch. The only reason why grammar in spec distinguishes them, is to stress that numeric identifiers are sorted in numeric order. As this is a parser, and not comparator, it shouldn't really care.

I think this is all you need to comply to the spec:

semver = major:numericId "." minor:numericId "." patch:numericId prerelease:("-" @prerelease)? build:("+" @build)? { return {major, minor, patch, prerelease, build}; } prerelease = head:prereleaseId tail:("." @prereleaseId)* { return [head, ...tail]; } build = head:alnumId tail:("." @alnumId)* { return [head, ...tail]; } prereleaseId = alnumId / $("0" / [1-9][0-9]*) alnumId = $[0-9a-z-]i+

rule numericId is missing, maybe like so:

semver = major:numericId "." minor:numericId "." patch:numericId prerelease:("-" @prerelease)? build:("+" @build)? { return {major, minor, patch, prerelease, build}; } prerelease = head:prereleaseId tail:("." @prereleaseId)* { return [head, ...tail]; } build = head:alnumId tail:("." @alnumId)* { return [head, ...tail]; } prereleaseId = alnumId / numericId alnumId = $[0-9a-z-]i+ numericId = $("0" / [1-9][0-9]*)

However, while this will successfully parse all valid examples it will also successfully parse invalid examples (1.2.3-0123, 1.2.3-0123.0123) but "numeric identifiers MUST NOT include leading zeroes" in the pre-release part.

reverofevil · 2022-06-19T20:10:27Z

examples/semver.peggy

+    pre:('-' @preRelease)?
+    build:('+' @build)?
+  {
+    return { versionCore, pre, build };


Don't think that hiding most important information into a deeper object with non-descript name is a good idea. At least {...versionCore, pre, build}, please.

hildjj · 2022-06-19T20:11:54Z

@dselman sorry this is getting so many nitpicks from all of us here. I think it's just because this seems so useful to have as an example!

reverofevil · 2022-06-19T20:14:10Z

@hildjj Say for yourself. I personally just like nitpicking (and very sorry for that).

dselman · 2022-06-20T11:03:40Z

@dselman sorry this is getting so many nitpicks from all of us here. I think it's just because this seems so useful to have as an example!

No problem, at all. I'm enjoying the discussion and learning! I suggest this is merged (functionally correct) and then another PR can be created to refactor it to be more concise and/or elegant. That takes me off the critical path. How does that sound?

hildjj · 2022-06-20T17:00:48Z

another PR can be created to refactor it to be more concise and/or elegant

We can do that. I think the most efficient way would be for me to create a new branch here on the main repo, edit this PR to target that branch, we iterate there until we're done, then another PR to pull that branch into main. I'm starting that process now, with the branch name semver-grammar.

hildjj · 2022-06-20T17:04:00Z

I'm going to go through all of the comments here, expect a PR with my changes, which I'll link here.

(feat) add semver.org example

0a200be

Signed-off-by: Dan Selman <danscode@selman.org>

dselman mentioned this pull request Jun 18, 2022

peggy.js grammar for semver #299

Closed

hildjj approved these changes Jun 19, 2022

View reviewed changes

reverofevil reviewed Jun 19, 2022

View reviewed changes

hildjj mentioned this pull request Jun 19, 2022

Generate SourceNodes for bytecode #240

Merged

hildjj changed the base branch from main to semver-grammar June 20, 2022 17:02

hildjj merged commit 4b98f92 into peggyjs:semver-grammar Jun 20, 2022

hildjj added a commit to hildjj/peggy that referenced this pull request Jun 20, 2022

Code review nits from peggyjs#300

b570309

hildjj mentioned this pull request Jun 20, 2022

Code review nits for semver grammar #303

Merged

dselman deleted the example-semver branch June 22, 2022 12:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(feat) add semver.org example #300

(feat) add semver.org example #300

dselman commented Jun 18, 2022

hildjj left a comment

hildjj Jun 19, 2022

hildjj Jun 19, 2022

Mingun Jun 19, 2022

hildjj Jun 19, 2022

hildjj Jun 19, 2022

hildjj Jun 19, 2022

reverofevil Jun 19, 2022

hildjj Jun 19, 2022

hildjj Jun 19, 2022

reverofevil Jun 19, 2022

reverofevil Jun 19, 2022

hildjj Jun 19, 2022

Mingun Jun 19, 2022

reverofevil Jun 19, 2022

reverofevil Jun 19, 2022

reverofevil Jun 19, 2022 •

edited

Loading

hildjj Jun 19, 2022

reverofevil Jun 19, 2022 •

edited

Loading

reverofevil Jun 19, 2022 •

edited

Loading

reverofevil Jun 19, 2022 •

edited

Loading

MarcelBolten Jun 20, 2022 •

edited

Loading

reverofevil Jun 19, 2022

hildjj commented Jun 19, 2022

reverofevil commented Jun 19, 2022

dselman commented Jun 20, 2022

hildjj commented Jun 20, 2022

hildjj commented Jun 20, 2022

(feat) add semver.org example #300

(feat) add semver.org example #300

Conversation

dselman commented Jun 18, 2022

hildjj left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reverofevil Jun 19, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reverofevil Jun 19, 2022 • edited Loading

Choose a reason for hiding this comment

reverofevil Jun 19, 2022 • edited Loading

Choose a reason for hiding this comment

reverofevil Jun 19, 2022 • edited Loading

Choose a reason for hiding this comment

MarcelBolten Jun 20, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hildjj commented Jun 19, 2022

reverofevil commented Jun 19, 2022

dselman commented Jun 20, 2022

hildjj commented Jun 20, 2022

hildjj commented Jun 20, 2022

reverofevil Jun 19, 2022 •

edited

Loading

reverofevil Jun 19, 2022 •

edited

Loading

reverofevil Jun 19, 2022 •

edited

Loading

reverofevil Jun 19, 2022 •

edited

Loading

MarcelBolten Jun 20, 2022 •

edited

Loading