Large entities causing memory overhead #2348

Closed
leesiongchan opened this issue Oct 5, 2017 · 5 comments

leesiongchan commented Oct 5, 2017

Recently I tried to create a custom source plugin to fetch data from my API, and everything worked great until I changed it to recursively fetch data from every page. The array keeps growing and memory climbs past 1 GB, and once it's ready to createNode, memory keeps increasing until the app crashes. So my question is: do we really have to preload everything?

If so, how can I improve performance and efficiency? Or is there any way to dynamically fetch only the necessary data based on the request?
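
For reference, the plugin follows roughly this pattern (a minimal sketch only; the API URL, response shape, and Product node type are placeholders, not my actual code):

```js
// gatsby-node.js — minimal sketch of a paginated source plugin (Gatsby v1 API).
// The endpoint, response shape, and node type below are placeholders.
const fetch = require('node-fetch')
const crypto = require('crypto')

exports.sourceNodes = async ({ boundActionCreators }) => {
  const { createNode } = boundActionCreators

  let page = 1
  let items = []

  // Fetch every page up front — this is where the array (and memory) keeps growing.
  while (true) {
    const res = await fetch(`https://api.example.com/products?page=${page}`)
    const data = await res.json()
    if (data.length === 0) break
    items = items.concat(data)
    page += 1
  }

  // Then turn the whole in-memory array into Gatsby nodes.
  items.forEach(item => {
    createNode({
      ...item,
      id: `product-${item.id}`,
      parent: null,
      children: [],
      internal: {
        type: 'Product',
        contentDigest: crypto
          .createHash('md5')
          .update(JSON.stringify(item))
          .digest('hex'),
      },
    })
  })
}
```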

leesiongchan commented Oct 5, 2017

Do you have any timeline in mind for live source fetching, so Gatsby becomes a general application generator instead of static only? Our application is more like an ecommerce site, so I think we might not be able to use Gatsby for this case. But I really love Gatsby's concept; it's really beautiful.

KyleAMathews (Contributor) commented

There's probably some low-hanging fruit for increasing efficiency — improving Gatsby's scalability will be a focus towards the latter part of this year and next year.

Currently though, it sounds like you're just running into Node's built-in memory limits. If you run Gatsby like `node --max_old_space_size=4096 ./node_modules/.bin/gatsby` you'll have a lot more memory to work with.
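
For example, you could wire the flag into your package.json scripts; the 4096 (MB) here is just an example value, use whatever your machine allows:

```json
{
  "scripts": {
    "build": "node --max_old_space_size=4096 ./node_modules/.bin/gatsby build"
  }
}
```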

Preloading is by far the simplest way to do things and arguably the best, as development and builds are much faster when data is local and it's easy for Gatsby to autogenerate the GraphQL schema. There are harder approaches that avoid making the data local, but that's not something that's been explored much.

jasonphillips (Contributor) commented

On a related point, is there no standardized way for a plugin (a source plugin, I suppose) to extend the GraphQL schema / resolvers directly, without simply adding preloaded nodes to the tree?

In other words, a way to provide custom GraphQL resolve logic for part of the schema, but where it would still be executed and cached at build time, not as some kind of live query.

KyleAMathews (Contributor) commented

There is https://www.gatsbyjs.org/docs/node-apis/#setFieldsOnGraphQLNodeType

It's generally suggested you use this only for adding fields that you want to have arguments (e.g. the "excerpt" field on "MarkdownRemark" lets you pass in a pruneLength variable to control the creation of the excerpt) or when you want to do custom processing (e.g. transformer-remark lets you create image thumbnails using GraphQL).
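
For example, something along these lines (a minimal sketch using a hypothetical Product node type with a description field, not a real transformer):

```js
// gatsby-node.js — rough sketch of setFieldsOnGraphQLNodeType.
// Assumes a hypothetical `Product` node type with a `description` field.
const { GraphQLString, GraphQLInt } = require('graphql')

exports.setFieldsOnGraphQLNodeType = ({ type }) => {
  if (type.name !== 'Product') {
    return {}
  }

  return {
    // Adds a `shortDescription(pruneLength: Int)` field to Product nodes,
    // resolved (and cached) at build time like any other field.
    shortDescription: {
      type: GraphQLString,
      args: {
        pruneLength: { type: GraphQLInt, defaultValue: 140 },
      },
      resolve: (node, { pruneLength }) =>
        (node.description || '').slice(0, pruneLength),
    },
  }
}
```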

I think the right solution to this problem of "too much data" is a way to pull data fetching and schema creation into another process with a DB backing the data, instead of everything being in memory. Watch this space :-) we're working on a hosted version of this. That way there's essentially no limit to the amount of data Gatsby can handle.

KyleAMathews (Contributor) commented

Hey, closing out old issues. Please re-open if you have additional questions, thanks!

Also, check out v2! We've vastly reduced memory usage and build times in general.
