PEP 657: use 0-indexed offsets #2022

isidentical · 2021-07-04T08:41:19Z

For the background of this change, see the discussion on: python/cpython#26958 (comment)

isidentical · 2021-07-04T08:41:39Z

isidentical · 2021-07-04T08:44:45Z

pep-0657.rst

-while offsets bigger than these values will be treated as missing (value of 0).
+all AST nodes. The output of the public APIs (``co_positions`` and ``PyCode_Addr2Location``)
+that deal with these attributes use 0-indexed offsets (just like the AST nodes) and designates
+``-1`` as the return code for information not available, but the underlying implementation is


This is also more consistent with the PyCode_Addr2Line, since it also returns -1 on error but for the sake of consistency we used to convert -1 to 0 on PyCode_Addr2Location. Now that cast won't be needed.

pep-0657.rst

ammaraskar

Looks good with Pablo's suggestion incorporated.

pep-0657.rst

terryjreedy · 2021-07-04T18:27:43Z

The merged version looks good to me.

gvanrossum · 2021-07-05T02:10:01Z

Whoa, don’t we use 1-based offsets in SyntaxError attributes?

ammaraskar · 2021-07-05T02:12:51Z

There's a bit of discussion around that here: python/cpython#26958 (comment)

I think the consensus ended up being that SyntaxError is the odd one out here and we'd rather simplify the places we have the 1 to 0 conversion logic and stay consistent with the ast.

gvanrossum · 2021-07-05T02:20:26Z

Hm, possibly, but we chose 1-based because editors like vim and emacs interpret column numbers as 1-based. Where else do we use 0-based offsets te refer to source locations? (In Python code. Of course in C code it’s 0-based all the way down. :-)

pablogsal · 2021-07-05T10:26:52Z

Where else do we use 0-based offsets te refer to source locations?

In the AST, all offsets are 0 based and that's one of the primary motivations as we want to be as close as those as possible:

>>> import ast
>>> ast.dump(ast.parse("x"), include_attributes=True)
"Module(body=[Expr(value=Name(id='x', ctx=Load(), 
    lineno=1, col_offset=0, end_lineno=1, end_col_offset=1
), 
    lineno=1, col_offset=0, end_lineno=1, end_col_offset=1
)], type_ignores=[])"

so one of the advantages of 0-based is that every tool that currently deals with these can be used without modifications to deal with the new positions. We asked around some authors and seems that everyone frefers 0-based positions (so they can also do line[offset:end_offset] directly.

terryjreedy · 2021-07-05T10:32:40Z

I not sure exact what "In Python code." is meant to say, but both Python and tk (and hence tkinter and IDLE) are 0-based for both indexing and slicing.

gvanrossum · 2021-07-05T15:57:36Z

What I meant is when you print column offsets for end-user tools like editors to use, 1-based seems to be the norm. OTOH when you are using this to index into an array of lines, 0-based is more convenient and seems conventional. (Everybody agrees that line numbers start at 1 though. :-)

I'm not yet sure whether PEP 657 is more faced towards end users and their editors, or towards developers of Python libraries that will need to slice and dice the source code.

I do observe that in traceback.py the TraecbackException object has a documented offset field that's 1-based (since it is copied from SyntaxError).

I guess there will be a few traps no matter what we decide...

pablogsal · 2021-07-05T16:21:35Z

What I meant is when you print column offsets for end-user tools like editors to use, 1-based seems to be the norm. OTOH when you are using this to index into an array of lines, 0-based is more convenient and seems conventional. (Everybody agrees that line numbers start at 1 though. :-)

Oh, i see what you mean. Notice that we are not showing the numbers at the moment (and we don't plan to) so users will not see "error in column x". These numbers are mainly focused for tools to use them to grab the correct piece of source code that is associated with an instruction. If we ever show these numbers, we can easily make them 1-based only for the display.

I guess there will be a few traps no matter what we decide...

Right, after asking around seems that there is an opportunity to avoid a slightly bigger subset of tracks for libraries consuming these numbers. So this decision would prioritize reusing existing code and may give some small surprises when developing new code. I think is a good balance but I totally understand if someone sees the other way :)

gvanrossum · 2021-07-05T16:26:37Z

Okay, sounds good!

terryjreedy · 2021-07-05T20:34:49Z

I see the essence of PEP 657 as making the full AST position info available on the code object as iterable co_positions. bpo-43950 translates the position info, within the existing traceback.py traceback construction method, into a new caret line added under the code line.

bpo-44569, just opened, will factor the per-frame formatting code into a separate method that users can replace. Users can then handle the position info as they wish while customizing frame formatting, all without giving up other traceback features (like truncating tracebacks resulting from runaway recursion). IDLE, for instance. will omit the caret line and instead include the position info so that Shell can highlight the error slice within the code line.

isidentical requested a review from pablogsal as a code owner July 4, 2021 08:41

the-knights-who-say-ni added the CLA signed label Jul 4, 2021

isidentical commented Jul 4, 2021

View reviewed changes

PEP 657: use 0-indexed offsets

db72bf3

isidentical force-pushed the pep-657-indexing branch from 7796281 to db72bf3 Compare July 4, 2021 08:53

isidentical mentioned this pull request Jul 4, 2021

bpo-43950: use 0-indexed column offsets for bytecode positions python/cpython#27011

Merged

terryjreedy reviewed Jul 4, 2021

View reviewed changes

pep-0657.rst Outdated Show resolved Hide resolved

pablogsal reviewed Jul 4, 2021

View reviewed changes

pep-0657.rst Outdated Show resolved Hide resolved

ammaraskar approved these changes Jul 4, 2021

View reviewed changes

pablogsal approved these changes Jul 4, 2021

View reviewed changes

apply changes from Ammar

cfdc826

isidentical force-pushed the pep-657-indexing branch from 091adcf to cfdc826 Compare July 4, 2021 17:57

isidentical requested a review from warsaw July 4, 2021 17:57

pablogsal approved these changes Jul 4, 2021

View reviewed changes

pablogsal reviewed Jul 4, 2021

View reviewed changes

pep-0657.rst Outdated Show resolved Hide resolved

Update pep-0657.rst

e15131a

pablogsal merged commit 5fc4119 into python:master Jul 4, 2021

isidentical deleted the pep-657-indexing branch July 5, 2021 16:15

erlend-aasland mentioned this pull request May 13, 2022

pep8/greppable exception messages erlend-aasland/peps#1

Closed

erlend-aasland mentioned this pull request Jun 27, 2022

pep 687/mark as accepted erlend-aasland/peps#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PEP 657: use 0-indexed offsets #2022

PEP 657: use 0-indexed offsets #2022

isidentical commented Jul 4, 2021

isidentical commented Jul 4, 2021

isidentical Jul 4, 2021

ammaraskar left a comment

terryjreedy commented Jul 4, 2021

gvanrossum commented Jul 5, 2021

ammaraskar commented Jul 5, 2021

gvanrossum commented Jul 5, 2021

pablogsal commented Jul 5, 2021 •

edited

Loading

terryjreedy commented Jul 5, 2021

gvanrossum commented Jul 5, 2021

pablogsal commented Jul 5, 2021

gvanrossum commented Jul 5, 2021

terryjreedy commented Jul 5, 2021

PEP 657: use 0-indexed offsets #2022

PEP 657: use 0-indexed offsets #2022

Conversation

isidentical commented Jul 4, 2021

isidentical commented Jul 4, 2021

isidentical Jul 4, 2021

Choose a reason for hiding this comment

ammaraskar left a comment

Choose a reason for hiding this comment

terryjreedy commented Jul 4, 2021

gvanrossum commented Jul 5, 2021

ammaraskar commented Jul 5, 2021

gvanrossum commented Jul 5, 2021

pablogsal commented Jul 5, 2021 • edited Loading

terryjreedy commented Jul 5, 2021

gvanrossum commented Jul 5, 2021

pablogsal commented Jul 5, 2021

gvanrossum commented Jul 5, 2021

terryjreedy commented Jul 5, 2021

pablogsal commented Jul 5, 2021 •

edited

Loading