Hir #860

Y-Nak · 2023-03-16T15:54:58Z

Define HIR, implement lowering from AST, and add helper structs to evaluate span lazily.

Architecture

Hir items

The hir_def module is a collection of HIR node definitions.
All items that correspond to the Rust items are defined as a salsa tracked struct so that they can work as a granularity of analysis.
Below is all item's definition.

pub enum ItemKind {
    TopMod(TopLevelMod),
    Mod(Mod),
    Func(Func),
    ExternFunc(ExternFunc),
    Struct(Struct),
    Contract(Contract),
    Enum(Enum),
    TypeAlias(TypeAlias),
    Impl(Impl),
    Trait(Trait),
    ImplTrait(ImplTrait),
    Const(Const),
    Use(Use),
    /// Body is not an `Item`, but this makes it easier for analyzers to handle
    /// it.
    Body(Body),

TopMod is the top-level module that corresponds to each file.
Mod is an internal module that is defined inside a TopMod by mod my_mod {...}.
ExternFunc is a desugared function that is defined inside extern block. extern block doesn't create a scope, so the HIR item definition doesn't contain extern block explicitly.
Body represents a function body or nameless body that appears in a constant expression context, e.g., 10 in [u32; 10] or LEN in [1; LEN]. Expressions, statements, and patterns are always stored inside the body's arena, not in the salsa database. Thus a body is a minimum granularity for analysis of the expressions, statements, and patterns.

Three different databases

`HirDb`

HirDb is the core database for analysis and integration with LSP implementation.
HirDb works mainly as a container for Hir nodes. But it also contains some private tracked functions for lowering.
HirDb can be considered a completely span-agnostic database from the perspective of external crates. In reality, the tracked structs stored in HirDb do contain span information, but they are designed so that they cannot be accessed through HirDb (all fields dependent on spans have private visibility). Furthermore, since lowering from AST (or source file) to HIR can also be considered span-dependent, the visibility of tracked functions that perform lowering is defined as private. However, functions that depend on HirDb may internally call the lowering function. These calls are unavoidable for HIR construction and do not unnecessarily invalidate the cache. Additionally, from external crates, the fact that these internal functions are being called is completely hidden.
It is possible to embed span information in diagnostics without directly depending on spans by using types that implement the LazySpan trait, which will be described later. This allows for each analysis phase to be span-agnostic.

`LowerHirDb`

LowerHirDb is a marker trait used for lowering source files or ASTs to HIR. All public functions related to lowering take LowerHirDb as an argument. All analysis passes must NOT depend on the db.

`SpannedHieDb`

SpannedHirDb is a marker trait used for evaluating the LazySpan trait lazily, which will be described later. All public functions related to span information take SpannedHirDb as an argument. All analysis passes must NOT depend on the db.

`LazySpan`

Types implementing the LazySpan trait provide the ability to extract span information lazily, but types themselves don't directly depend on it. These types are implemented under hir/span and basically correspond one-to-one with each HIR node. For example, LazyFuncSpan corresponds to the Func item. Please see the tests in span/item.rs for more examples. To construct these types, there is no need to depend on SpannedHirDb, but when "evaluating" the types to extract the actual span, you need to use SpannedHirDb. Types implementing LazySpan internally hold a SpanTransitionChain. This chain has a span-independent starting point of the chain and a transition function from the starting point. To evaluate the chain, SpannedHirDb first converts the starting point to a span-dependent structure, then applies the transition function to extract the specific span information.

… code

…rnal crates

Y-Nak · 2023-04-08T22:24:06Z

A potential issue with types implementing the LazySpan is that they are not coupled with the items defined in hir_def. For example, the following code is clearly redundant:

// Perform name resolution for function parameters
for (i, param) in func.params(..).enumerate() {
    if !resolve_type(param.type()) {
        // Construct the corresponding LazySpan
        let span = func.lazy_span().params().param(i).ty();
        Diagnostics::new(span, ...)
    }
}

To solve this issue, we could create a thin wrapper like the following that includes both the Hir Item and LazySpan:

pub struct SpannedItem<Item, Span> {
    item: Item,
    span: Span,
}
impl SpannedItem {
    pub fn name(db: &dyn HirDb) -> SpanendItem<ItentId, LazySpanAtom> {
        ...
    }

    pub fn  params(db: &dyn HirDb) -> Spanneditem<FnParamListId, LazyFnParamList> {
        ...
    }
}
impl<Item, Span: LazySpan> LazySpan for Spanneditem<Item, LazySpan> {
    fn resolve(&self, db: &dyn SpannedHirDb) -> Span {
        self.span.resolve(db)
    }
}

However, at the moment, it is unclear whether SpannedItem is actually needed, so it has not been implemented. Also, regarding the spans of Expr, Stmt, and Pat, BodySourceMap directly manages them, so this redundancy wouldn't occur. Therefore, I plan to add it once I'm confident that it is a common pattern while writing some analyses.

Y-Nak · 2023-04-17T21:58:12Z

crates/hir/src/hir_def/item_tree.rs

+/// The root node of the tree is the top level module, which corresponds to the
+/// `module_tree::TopLevelModule`.
+#[derive(Debug, Clone, PartialEq, Eq)]
+pub struct ItemTree {


ItemTree will be replaced with ScopeGraph in the next PR.

Y-Nak · 2023-04-19T15:24:45Z

@sbillig
I have rewritten the implementation of LazySpan. The change is that I replaced the implementation of TransitionFn from an Fn trait object to:

#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
pub(crate) struct LazyTransitionFn {
    pub(super) f: fn(ResolvedOrigin, LazyArg) -> ResolvedOrigin,
    pub(super) arg: LazyArg,
}

By explicitly holding a function pointer and captured variables as LazyArg, it becomes possible to implement Clone and Eq. Since the captured variables are very limited in the implementation of LazySpan, the codebase should not become bloated even if we hold them as LazyArg explicitly.

Furthermore, I defined DynLazySpan(SpanTransitionChain) as a type corresponding to dyn LazySpan. Since conversions from all types implementing LazySpan to this type are implemented, it should be able to be integrated with salsa more flexibly as an alternative to dyn LazySpan.

Y-Nak · 2023-04-19T15:32:49Z

Also, I added more precise support to track the node by changing the argument of LazyTransitionFn from SyntaxNode to ResolvedOrigin.
Please see the tests in hir/span/stmt.rs and hir/span/expr.rs as examples of how lowered HIR node can be remapped to the source location.

sbillig

This is nice. I haven't verified that everything is correct, but I suspect that any issues will become apparent as the analyzer is implemented. The lazy span stuff is clever, if perhaps a bit complicated (though I can't think of a way to avoid such complication of course).

Y-Nak added 9 commits February 27, 2023 17:10

Initialize fe-hir

d3535a6

Allow leading mut before pat

2867a88

Remove assert statement from the language

eedb2eb

Add HIR item def

5b0eab9

Add HIR attr` def

501e541

Add HIR body def

61ed75b

Add HIR expr def

98baa51

Add HIR params def

7c416ec

Add HIR pat def

97cbd68

Y-Nak force-pushed the hir branch 2 times, most recently from f61edbd to e11d401 Compare March 21, 2023 02:08

Y-Nak added 6 commits March 21, 2023 03:15

Add HIR stmt def

55896d3

Add HIR type def

080c3bd

Add HIR use_tree def

7ef834a

Add Jar for HIR entities

f124569

Add HirOrigin type to track the HIR definition origin in the source…

932f20e

… code

Add HIR path def

ec4e35c

Y-Nak force-pushed the hir branch from e11d401 to 62d2793 Compare March 21, 2023 02:16

Y-Nak added 12 commits March 21, 2023 03:17

Add HIR lower for Path

ce6bc1e

Define IngotId and FileId

eadd043

Add HIR lower for TypeId

488aa2c

Add HIR lower for params

1decf82

Add HIR lower for attr

d16b1b3

Introduce MaybeInvalid type

0592bb9

Add HIR lower for Item

5e46510

Add syntax to specify generic parameters in impl block

d478b1f

Add HIR lower for Pat

2dd211b

Add HIR lower for Stmt

3d07d8c

Add HIR lower for Expr

9a0a570

Add HIR lower for Body

3feec5b

Y-Nak force-pushed the hir branch from 1b20cb2 to 6dfb646 Compare April 6, 2023 23:02

Y-Nak added 4 commits April 7, 2023 13:39

Add lazy span for stmt

0f755c4

Add TestDb

821702e

Add test for ItemTree

ada4c15

Add test for lazy span

6494f4e

Y-Nak force-pushed the hir branch 2 times, most recently from d168f1f to 689720d Compare April 7, 2023 21:33

Y-Nak marked this pull request as ready for review April 7, 2023 21:34

Add an accumulator for ParseDiagnostic

1b2f5a8

Y-Nak force-pushed the hir branch 2 times, most recently from 0881acf to c983488 Compare April 8, 2023 17:21

Make HirDb completely span-independent from the perspective of exte…

3508787

…rnal crates

Y-Nak force-pushed the hir branch from c983488 to 3508787 Compare April 8, 2023 17:24

Y-Nak force-pushed the hir branch 2 times, most recently from 744195e to c74bb85 Compare April 9, 2023 10:11

Add tests for ModuleTree

bf6e823

Y-Nak force-pushed the hir branch from c74bb85 to bf6e823 Compare April 9, 2023 10:14

Y-Nak commented Apr 17, 2023

View reviewed changes

Y-Nak added 3 commits April 19, 2023 17:14

Make types implementing LazySpan comparabale

05a8107

Allow more precise span origin tracing

44db5e3

Add DynLazySpan

f679cd9

Y-Nak force-pushed the hir branch from 6c70a1f to f679cd9 Compare April 19, 2023 15:14

sbillig mentioned this pull request Apr 19, 2023

poc: remove Rc Cell from parser2 parse scope structs #874

Draft

Y-Nak mentioned this pull request Apr 19, 2023

v2 parser AugAssign stmt doesn't allow x[0] += 1 #875

Closed

sbillig approved these changes Apr 20, 2023

View reviewed changes

Y-Nak merged commit 95b2c99 into ethereum:fe-v2 Apr 20, 2023

Y-Nak deleted the hir branch April 24, 2023 08:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hir #860

Hir #860

Y-Nak commented Mar 16, 2023 •

edited

Loading

Y-Nak commented Apr 8, 2023 •

edited

Loading

Y-Nak Apr 17, 2023

Y-Nak commented Apr 19, 2023

Y-Nak commented Apr 19, 2023

sbillig left a comment

Hir #860

Hir #860

Conversation

Y-Nak commented Mar 16, 2023 • edited Loading

Architecture

Hir items

Three different databases

HirDb

LowerHirDb

SpannedHieDb

LazySpan

Y-Nak commented Apr 8, 2023 • edited Loading

Y-Nak Apr 17, 2023

Choose a reason for hiding this comment

Y-Nak commented Apr 19, 2023

Y-Nak commented Apr 19, 2023

sbillig left a comment

Choose a reason for hiding this comment

Y-Nak commented Mar 16, 2023 •

edited

Loading

`HirDb`

`LowerHirDb`

`SpannedHieDb`

`LazySpan`

Y-Nak commented Apr 8, 2023 •

edited

Loading