How do I implement efficient data fetches for nested objects? #167

brandur · 2018-04-12T20:30:49Z

Hello, great work with the project. I have a question pertaining to how to build an efficient implementations given nested objects.

To demonstrate, take a sample blog schema where you have articles that have comments, and comments that have favorites. Favorites are their own relation with an associated user_id to the user that favorited that comment.

A query to fetch all the data you need to render an article would go two relations deep like this:

article(article_id: 123) {
    comments {
        favorites {
            user_id
        }
    }
}

The system is implemented on a relational store, and a simplified version of your Juniper article object looks like this:

graphql_object!(<'a> &'a Article: Context |&self| {
    field comments(&executor) -> Vec<Comment> {
        // SELECT * FROM comments WHERE article_id = self.article_id
    }
}

Comments are similar, with a nested favorites field:

graphql_object!(<'a> &'a Comment: Context |&self| {
    field favorites(&executor) -> Vec<Favorite> {
        // SELECT * FROM favorites WHERE comment_id = self.comment_id
    }
}

The trouble is that if we execute the query above, it will resolve successfully, but it will do so in a way that's degenerately inefficient. To get all our favorites we'll execute N + 1 total queries (1 to get comments, and then N to get favorites for each comment), and every further level of nesting will multiply the total number of queries by another N.

What we'd do ideally is when fetching favorites, use this naive implementation above if we're only fetching them for a single comment, but when we're fetching them for a set of comments, have a higher level execute only a single query:

SELECT * FROM favorites
WHERE comment_id IN (comment_id11, comment_id2, comment_id3)
ORDER BY comment_id;

And then partition the results and distribute them to the underlying "favorites" objects being resolved.

Do you have any recommendations for how to make this sort of pattern possible in a relatively performant and sustainable (in the sense of code complexity) way? I've found some references to "context switching", and looking the source code suggests that it seems like something that might be what I'm looking for, but the documentation for it is light. Is that what I should be using?

Thanks!

The text was updated successfully, but these errors were encountered:

LegNeato · 2018-04-13T01:57:29Z

Generally Facebook suggests using something like https://github.com/cksac/dataloader-rs, which is what they do internally

brandur · 2018-04-13T16:05:32Z

@LegNeato Ah, thank you. Yeah, I saw that Data Loader seems to be common in GraphQL implementations from other languages. If this is the right way, it might be helpful to have an example of its integration with Juniper — given heavy reliance on futures, etc., it's somewhat non-trivial to integrate.

theduke · 2018-04-22T06:14:03Z

There are essentially two approaches to this.

One is a futures and dataloader style async approach, tracked in #2.
With the large changes in currently happening in the Futures/Tokio ecosystem, I'm afraid this is still a bit farther on the horizon.

The other option is to inspect the requested schema and smartly determining what to fetch in a root resolver.
There is a PR for this, and the tracking issue is #124 .

theduke · 2018-04-22T06:14:41Z

Closing this issue here, feel free to discuss further in one of the two other issues.

brandur · 2018-04-22T13:30:01Z

Thanks @theduke. #16 turns out to be exactly what I'm looking for here.

brokenthorn · 2020-02-03T16:37:55Z

AKA the GraphQL N+1 Problem #387 .

theduke closed this as completed Apr 22, 2018

takeit mentioned this issue Jul 2, 2019

Look ahead and N+1 query problem #387

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How do I implement efficient data fetches for nested objects? #167

How do I implement efficient data fetches for nested objects? #167

brandur commented Apr 12, 2018 •

edited

Loading

LegNeato commented Apr 13, 2018

brandur commented Apr 13, 2018

theduke commented Apr 22, 2018

theduke commented Apr 22, 2018

brandur commented Apr 22, 2018

brokenthorn commented Feb 3, 2020

How do I implement efficient data fetches for nested objects? #167

How do I implement efficient data fetches for nested objects? #167

Comments

brandur commented Apr 12, 2018 • edited Loading

LegNeato commented Apr 13, 2018

brandur commented Apr 13, 2018

theduke commented Apr 22, 2018

theduke commented Apr 22, 2018

brandur commented Apr 22, 2018

brokenthorn commented Feb 3, 2020

brandur commented Apr 12, 2018 •

edited

Loading