-
-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Citation preview incrementally increases memory usage with each preview #2533
Comments
Similar to this:
|
Yes, we cache the preview to gain speed when showing the second time.
@bartsch-dev What is our policy for cache clearance?
|
The mention did not work, so here again @bartsch-dev What is our policy on cache clearance? |
The cached citation gets deleted when a field of the BibEntry changes or the BibEntry itself is removed (or the selected citation style changes). https://github.com/JabRef/jabref/blob/master/src/main/java/org/jabref/logic/citationstyle/CitationStyleCache.java#L59 It gets generated again the next time it's selected by the user and the preview is active. |
This issue will be solved if we remove the cache and finally integrate #2250 in the master branch. |
Sorry, can't confirm with current master 1f893cd under linux with
I opened a 6k entry bibtex database and performed the above described steps. RAM never went beyond 425MB before the Garbage Collection kicked in. |
Thanks for testing @lynyus! @kingwarrick Please give a it a try with the latest version of JabRef from http://builds.jabref.org/master/ Your problem seems to be solved in the most recent dev builds. Please do this with a backup of your bib file since there are a number of breaking changes regarding groups. Let us know if your experiences! |
Hello,
The issue persists on my machine, both linux (openjdk, openjfx) and
Windows 10
JabRef 4.0.0-dev--snapshot--2017-04-05--master--a1f4101df
Windows 10 10.0 amd64
Java 1.8.0_121
Let me know if I can help in troubleshooting further.
Warrick
…On Wed, 05 Apr 2017 00:23:50 -0700 Jörg Lenhard ***@***.***> wrote:
Thanks for testing @lynyus!
@kingwarrick Please give a it a try with the latest version of JabRef
from http://builds.jabref.org/master/ Your problem seems to be solved
in the most recent dev builds. Please do this with a backup of your
bib file since there are a number of breaking changes regarding
groups. Let us know if your experiences!
|
Can reproduce the high RAM and CPU usage if one of the CSL Styles is used for preview. Performing a manual garbage collection using jvisualvm reduces the amount of used heapspace - but this is not freeed for the OS. I assume we cannot really do anything about this - potentially #2250 will solve this but I'm not Sure... |
Yes, it will be solved. However, currently no one has time to work on the build process. Hopefully after the alpha release. Let's see. |
@lynyus has worked on the caching of citation styles and it should get better. @kingwarrick Is there any upper bound of memory usage? - There should be... |
Actually I have not. I implemented the caching in the FileAnnotationCache.
|
I have not experienced an upper bound, but have seen it climb to ~1900MB before I stopped testing.
The increase is not linear, but seems to occur in bursts. Not sure if it is entry-specific; I will try to be more systematic to see if there are specific entry types that use more memory.
I saw some graphs in an issue thread that showed memory use over time (can't find it now, but it dealt with garbage collection). Can you or someone direct me on where to get/how to use that tracing/debugging program? This would be easier for me (and maybe more informative for
you) than relying on task manager and top.
Edit:
This graph: #2247 (comment)
…On 06/04/17 06:32 AM, Oliver Kopp wrote:
@lynyus <https://github.com/lynyus> has worked on the caching of
citation styles and it should get better.
@kingwarrick <https://github.com/kingwarrick> Is there any upper bound
of memory usage? - There should be...
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#2533 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AWd9NoL3O955W9hoxaT_MWbKLNf58jm8ks5rtNtGgaJpZM4L8hHB>.
|
That's |
The PR #3089 should have introduced a limit of the memory consumption. Could you please check again? I hope, that not each preview adds 100 MB of RAM. If it does, we have to really limit the number of cached entry previews. Currently, there are 1024 entries cached. |
JabRef 4.0-dev--snapshot--2017-09-01--master--e46d5ee88 First test (preview one entry at a time, advancing after display; no screenshot): Second test (quickly advance through 3-10 entries, triggers preview for more than one entry; screenshot attached): |
I can confirm that the 4.0 release has severe memory issues under Linux. Just opening my database with Jabref 4.0 makes Java use 984 MB of RAM. The first search makes this go up to 1.2 GB. Stepping through the first 20 shown entries brings this to 2.1 GB. After adding 5 articles this morning and searching some others Jabref was at 3.2 GB. That's not really practicable. How can caching something like a 300 character-string (citation preview) cost several 10s of MB of RAM? JabRef 4.0 |
To be fair on the 4.0 release, this problem was present well before that :) And of course caching a 300 character string does not take 10s of MB of RAM, obviously JabRef stores more than that (and I am in no way saying that's good). Given that the CSL cache seems to be totally non-functionally, can we simply disable/remove it? Or limit the cache to, say, a 1 digit number of entries? I know that this will mean that displaying the previews will take longer, but the memory consumption described here is far beyond anything justifiable. I have no deeper knowledge of how the previews work internally, but maybe one of the developers who contributed to it can clarify? @sbitzer the quickfix (mentioned above) seems to be to not use citation styles for the preview. EDIT: I think we should do something here soon and added it to the 4.1 milestone. |
@halirutan do you have time to look at this issue? Your analysis in other performance related issues was superb. Of course, I can help fixing the code. |
@tobiasdiez I already profiled this. I was working the last weeks on my Mathematica plugin which explains my absence. I have two issues on my list that I want to look at. It's this one and the disastrous behavior of the group-view for moderately nested group-trees. When my memory serves me right, I couldn't come to a definite conclusion with this issue. It is a combination of the several string-builders. I will try to repeat the profiling to find out if there is a point where we can make a worthy impact on the memory consumption |
@tobiasdiez @sbitzer @lenhard @kingwarrick I cannot confirm the high RAM usage with the latest master c51ecbe. I used a large database (several thousand entries) and scrolled through 500 of them, showing the preview each time in IEEE style. The preview is the biggest contributor, but it used less than 90MB of RAM. This comes down to 180kb per preview. See the highlighted line However, one thing that I find highly suspicious is that the simple call to setting the text in the JPane with Can someone on a machine that shows the weird behavior test to comment out the actual setting of the preview string? public void setPreviewLabel(String text) {
if (SwingUtilities.isEventDispatchThread()) {
final Document document = previewPane.getDocument();
// previewPane.setText(text);
// previewPane.revalidate();
} else {
SwingUtilities.invokeLater(() -> {
// previewPane.setText(text);
// previewPane.revalidate();
});
}
this.scrollToTop();
} This leaves the whole machinery of JabRef creating the preview string alive but shoves the JPanel out of the memory picture. Scroll slowly bit by bit through the entries with preview open and see what your memory does. I'm on Ubuntu 16.04 with Oracle JRE. |
@halirutan Thanks for the analysis. 180kb per preview sounds fine. So with the current cache size of 1024, there shouldn't be any problems since the total memory consumption is then around 200 mb. So how does these astronomical reports of > 2 gb come about? It is understandable that the the |
@halirutan @tobiasdiez If I am not mistaken, the memory issue happens only when using citationstyles, not the default preview or the IEEE preview. |
@lenhard That's true.
|
Here comes the revised performance analysis. I am not certain, but when I understand this correctly, then the cache is not the problem. The main drain of resources seems to be the This is the memory footprint: As you can see, 98% of the memory is eaten up by creating the styled bib-entry through the CSL library. Notably, 63% of it only by constructing the CSL instance inside
This is by far no understatement. I understand that using If possible at all, I would advise to hold our own CSL instance and reuse it. If we look at the performance, we get the same picture. Please don't be confused by the overall 25%. I left the application run idle for a moment so a lot of time was spent only waiting for me. The 25% is the time where the hard work was done. What you should take from this is that from the 25% of the time that was needed to preview the item, 18% were spent only constructing the CSL instance. That is tremendous (50 out of 70 seconds under profiling!). The code for public static Bibliography makeAdhocBibliography(String style, String outputFormat,
CSLItemData... items) throws IOException {
ItemDataProvider provider = new ListItemDataProvider(items);
CSL csl = new CSL(provider, style);
csl.setOutputFormat(outputFormat);
String[] ids = new String[items.length];
for (int i = 0; i < items.length; ++i) {
ids[i] = items[i].getId();
}
csl.registerCitationItems(ids);
return csl.makeBibliography();
} There are 3 parts in this method. First, the CSL instance is created. As you can see, the There are 2 other parts. The for loop just copies the My very naive view is: They need to initialize the whole machinery and they need to parse the csl style. I understand this is hard work. But then they have everything set up and we provide Anyway, my suggestion is to leave the cache alone or turn it of. More important is that we initialize the CSL engine and reuse it as much as possible. |
Another quick way to add a workaround is to switch to the V8 engine. See #3180. Especially the screen videos with V8 and without V8. |
This should be fixed in the latest development version. Could you please check the build from http://builds.jabref.org/master/. Thanks! |
I just tried it out. Stepping through the first 50 entries of my database producing APA-style previews for them, increased the memory usage of Jabref from 915 MB to 1.4 GB. That's certainly less than before, but it's still a lot. For comparison: Doing the same with the standard entry preview raises memory usage only from 915 MB to 920 MB. JabRef 4.1-dev--snapshot--2017-12-21--master--7819cb4aa |
How is the rendering speed? Maybe, we can drop the caching of rendered
entries completely?
Am 21.12.2017 12:50 schrieb "Sebastian Bitzer" <notifications@github.com>:
… I just tried it out. Stepping through the first 50 entries of my database
producing APA-style previews for them, increased the memory usage of Jabref
from 915 MB to 1.4 GB. That's certainly less than before, but it's still a
lot. For comparison: Doing the same with the standard entry preview raises
memory usage only from 915 MB to 920 MB.
JabRef 4.1-dev--snapshot--2017-12-21--master--7819cb4aa
Linux 3.16.0-38-generic amd64
Java 1.8.0_151
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#2533 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/ABTafqdDCcUp2nCPcqByGgd1Lwv-Ou-jks5tCkYPgaJpZM4L8hHB>
.
|
The first entry preview takes a while to be generated. The others are almost instantaneous. |
@sbitzer The first entry takes exactly as long as every entry needed before the patch. It initializes the CSL engine before it can render a preview. After that, we reuse it and until you change the style, every preview is fast. On a style change, the first item will trigger a re-initialization again. @koppor I am certain the cache is not the problem and in fact, it saves us from using even more memory. Look here: Simply running through 20 items and showing the preview uses 87% of the memory. 43% is spent in The real pain under the hood is the |
JabRef 3.8.2
windows 10 10.0 amd64
Java 1.8.0_121
RAM is used during preview, but not fully released. With more previews, less is released until run-away usage occurs. After ~4-5 previews (or ~500MB RAM, not sure which is more significant), no appreciable amount of RAM is released, with subsequent previews adding 100-500MB per preview to total RAM usage. Behaviour changes after ~1400MB RAM use, with slower incremental use and more frequent release of small amounts of ram.
Tested with different preview styles (no change); interesting to note how CPU use in IEEE preview generation spikes to ~40%, whereas APA spikes to ~20%.
NOT tested with any release versions.
Steps to reproduce [RAM use in square brackets]:
This shows in the exceptions tab of the error console:
The text was updated successfully, but these errors were encountered: