Error when parse lego.parse('[\S]') or similar regex #35

uynil · 2018-01-25T04:35:03Z

When I parse regex like [\S], it will give error like

>>> lego.parse('[\S\s]')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/yyy/.pyenv/versions/xx/lib/python3.6/site-packages/greenery/lego.py", line 68, in parse
    return pattern.parse(string)
  File "/Users/yy/.pyenv/versions/xx/lib/python3.6/site-packages/greenery/lego.py", line 244, in parse
    raise Exception("Could not parse '" + string + "' beyond index " + str(i))
Exception: Could not parse '[\S\s]' beyond index 0

qntm · 2018-01-25T21:44:05Z

Yeah, negated shorthands \D, \W and \S are not recognised inside a charclass. This would have been fiddly but maybe I'll implement it sometime. Fortunately there is a trivial workaround, instead of writing [\S] just write [^\s] or \S, and instead of writing [\S\s] just write .. If you have something more complex like [\Wdef], render it as \W|[def] and then do what greenery is designed to do:

>>> from greenery import lego
>>> str(lego.parse('\\W|[def]').reduce())
'[^0-9A-Z_abcg-z]'

qntm self-assigned this Jun 18, 2022

qntm added the enhancement label Jun 18, 2022

qntm mentioned this issue Nov 8, 2022

V4, drop greenery.fsm, overhauled API #67

Merged

qntm closed this as completed in #67 Nov 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error when parse lego.parse('[\S]') or similar regex #35

Error when parse lego.parse('[\S]') or similar regex #35

uynil commented Jan 25, 2018 •

edited

Loading

qntm commented Jan 25, 2018

Error when parse lego.parse('[\S]') or similar regex #35

Error when parse lego.parse('[\S]') or similar regex #35

Comments

uynil commented Jan 25, 2018 • edited Loading

qntm commented Jan 25, 2018

uynil commented Jan 25, 2018 •

edited

Loading