Preserve user comments in bib file #1471

lenhard · 2016-06-03T14:31:41Z

So I am finally having my take at #1026. The current solution was surprisingly easy so far (despite what I had written before). It looks explicitly for our encoding prefix in the file ("Encoding: ") and kills lines that contain it (but only lines preceding an entry, it will not delete something that is inside an entry). Other user comments are left untouched and are always written out again, even if the entry is reformatted.
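The purge step described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the class and method names are hypothetical, and it assumes the text in front of an entry is available as a single string. Only the prefix `"Encoding: "` is taken from the description.

```java
import java.util.Arrays;
import java.util.stream.Collectors;

public class CommentPurgeSketch {

    // Prefix quoted in the description above; everything else here is hypothetical.
    static final String ENCODING_PREFIX = "Encoding: ";

    /**
     * Drops every line that contains the encoding prefix from the text that
     * precedes an entry. All other lines (the user comments) are kept as they are.
     */
    static String purgeEncodingLine(String textBeforeEntry) {
        return Arrays.stream(textBeforeEntry.split("\n"))
                .filter(line -> !line.contains(ENCODING_PREFIX))
                .collect(Collectors.joining("\n"));
    }

    public static void main(String[] args) {
        String before = "% Encoding: UTF-8\n% my own note";
        System.out.println(purgeEncodingLine(before));
    }
}
```

Since the filter is applied only to the text in front of an entry, a field value inside an entry that happens to contain the prefix would not be touched.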

User comments that are above meta data, bibtex strings, or the preamble will still be removed, though. Changing that would require large scale changes to these items in our model, since we would also need to store the parsed serialization for them, which we do not do so far.

This should receive significant automated, but also manual, testing, since it modifies a quite critical part and has the potential to destroy bib files. I will add a few more tests for this PR.

Change in CHANGELOG.md described
Tests created for changes

lenhard · 2016-06-08T16:00:15Z

Ok, I'd like to put this up for discussion.

User comments are now kept under the following circumstances:

Above any BibEntry
Above any BibtexString
Comments at the end of the file

This holds regardless of whether an entry is changed or reformatted, but not if it is deleted. If an entry is reformatted, the user comment text is located exactly one blank line above the reformatted entry.

User comments are still not kept when:

Above meta data comments
Above the Preamble
Contains JabRef's ENCODING_PREFIX
Above an entry that has been deleted

Is this good enough? It should be sufficient in most scenarios and work if someone opens their non-JabRef bib file with JabRef. If someone starts adding arbitrary text between JabRef's MetaData, though... Preserving such text is hard since we do not store the serialization of MetaData. If we want to, I'd have to make some additions to our MetaData objects. I could also store the comments between meta data and add them as a bunch at the end of the file, but then the relative position gets lost. Before I do more work, I'd like to clarify the following points:

Does the storage of arbitrary text above MetaData really matter?
Is it ok to delete comments if the entry below them is deleted?

tobiasdiez · 2016-06-26T12:49:34Z

CHANGELOG.md

@@ -12,6 +12,7 @@ We refer to [GitHub issues](https://github.com/JabRef/jabref/issues) by using `#
 ## [Unreleased]

 ### Changed
+- JabRef does no longer delete user comments outside of BibTeX entries [#1026]


Change format of changelog entry to match the other ones.

simonharrer · 2016-07-11T13:57:58Z

LGTM. I think integration tests are more valuable. So this would suffice for me.

lenhard · 2016-07-11T14:35:07Z

Great! Let us talk briefly about this during the next devcall and decide if we take it into 3.5 or 3.6.

tobiasdiez · 2016-07-13T09:12:13Z

For 3.6

koppor · 2016-07-13T12:06:59Z

jabref.install4j

@@ -369,7 +369,7 @@ return true;</string>
                </serializedBean>
                <condition>context.getBooleanVariable("addToDockAction")</condition>
              </action>
-              <group name="Registry" id="239" customizedId="" beanClass="com.install4j.runtime.beans.groups.ActionGroup" enabled="true" commentSet="false" comment="" actionElevationType="inherit">
+              <group name="Registry" id="239" customizedId="" beanClass="com.install4j.runtime.beans.groups.ActionGroup" enabled="true" commentSet="false" comment="" actionElevationType="elevated">


Why is an install4j change part of this PR?

I will revert it later, sorry...

tobiasdiez · 2016-07-14T10:01:03Z

src/main/java/net/sf/jabref/model/entry/BibtexString.java

+    /*
+    * Returns user comments (arbitrary text before the string) if there are any. If not returns the empty string
+     */
+    public String getUserComments() {


This is now an exact copy of BibEntry.getUserComments, right? Maybe add a superclass BibItem containing this method and then let BibEntry and BibString derive from this superclass (maybe there is even more common code which could be extracted to the superclass).

Sorry to always slip into the role of your antagonist, but I am against creating a class hierarchy. In most cases, it makes the code harder to understand, and the code duplication savings are just not worth the effort. In addition, one has to adhere to the Liskov substitution principle to create a good class hierarchy, but this is hard to do right.

No problem, having an experienced antagonist is probably the best way to learn something 😄

So now ignoring everything about the actual implementation and only speaking about "business objects": There are different items which can be contained in a bib file. For example @Comments, @Strings and normal bib entries. They all have a similar structure (e.g., they begin with @, followed by some identifier and then braces which surround the actual content) and all of them could have some text comments in front of them. Thus there is also a common behavior when it comes to parsing and writing.
So how would my dear antagonist reflect this common structure and behavior in Java code?

My idea would be: BibString, BibEntry, BibComment all derive from BibItem. BibItem has methods to parse and write at least the form @something { abstract method to parse / write value }. Then the writer just gets a list of BibItems and invokes the write method on them. Similarly, the parser returns a list of BibItems while the parsing of every item is done in the subclass. But again... this idea somehow amounts to moving from a service implementation to a full-blown domain model... so yeah, maybe we should discuss this in the next dev call ;-)
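The hierarchy proposed in this comment could look roughly like the sketch below. It only illustrates the writing half of the idea and is not code from the PR or from JabRef: `BibItem`, `StringItem`, `CommentItem`, and all method names except `getUserComments` are hypothetical, and parsing is left out.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class BibItemSketch {

    /** Hypothetical common base: optional leading user comments plus "@type{value}". */
    abstract static class BibItem {
        private final String userComments;

        BibItem(String userComments) {
            this.userComments = userComments;
        }

        /** Arbitrary text in front of the item, or the empty string. */
        String getUserComments() {
            return userComments;
        }

        abstract String typeName();

        abstract String writeValue();

        /** Shared serialization; subclasses only supply the type and the value. */
        String write() {
            return userComments + "@" + typeName() + "{" + writeValue() + "}";
        }
    }

    static class StringItem extends BibItem {
        private final String name;
        private final String content;

        StringItem(String userComments, String name, String content) {
            super(userComments);
            this.name = name;
            this.content = content;
        }

        @Override
        String typeName() {
            return "String";
        }

        @Override
        String writeValue() {
            return name + " = {" + content + "}";
        }
    }

    static class CommentItem extends BibItem {
        private final String text;

        CommentItem(String userComments, String text) {
            super(userComments);
            this.text = text;
        }

        @Override
        String typeName() {
            return "Comment";
        }

        @Override
        String writeValue() {
            return text;
        }
    }

    /** The writer only sees BibItems and separates them with one blank line. */
    static String writeAll(List<BibItem> items) {
        return items.stream().map(BibItem::write).collect(Collectors.joining("\n\n"));
    }

    public static void main(String[] args) {
        List<BibItem> items = Arrays.asList(
                new StringItem("% journal abbreviations\n", "jgr", "Journal of Research"),
                new CommentItem("", "free text"));
        System.out.println(writeAll(items));
    }
}
```

The objection raised earlier in this thread still applies: the shared `write` method removes the duplicated `getUserComments`, but every subclass must then honor the base class contract.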

stefan-kolb · 2016-07-14T10:04:36Z

@lenhard Please squash the commits via Github when merging this 😄

tobiasdiez · 2016-07-14T10:33:40Z

src/test/java/net/sf/jabref/logic/bibtex/BibEntryWriterTest.java


        BibEntry entry = entries.iterator().next();
-        assertEquals("test", entry.getCiteKey());


Why did you remove this (suboptimal) assertion?

Are you being serious? In this very same PR you asked me to reduce the number of asserts as far as possible, aiming for one assertion per test. This is the reason.

lenhard added the type: feature label Jun 3, 2016

lenhard mentioned this pull request Jun 8, 2016

Jabref removes comments inside the Bibtex code #1026

Closed

lenhard added 12 commits June 8, 2016 15:42

Add failing test with preceding comment

d6d6721

Add failing parser test

cb85443

Improve test naming

b50c3d6

Reuse Globals.ENCODING_PREFIX in test

2c39c89

Check explicitly for encoding line and purge it

b547703

Add changelog entry

ce56e23

Write out user comments also for modified entries

fe79b41

Add test to check preservation of ENCODING_PREFIX inside an entry

ec589e8

Make BibEntryWriter more robust when dealing with faulty parsed seria…

c7f4694

…lizations

Add test with user comment in file

3957680

Add test that changes and reformats an entry with a user comment

399bcb9

Add test with user comment before String

14db4bf

lenhard force-pushed the preserve-comments branch from e00aeae to 14db4bf Compare June 8, 2016 14:05

lenhard added 4 commits June 8, 2016 16:11

Preserve newlines also when used with bibtex strings

392864e

Add test for serialization of epilog

9c23e94

Fix string detection in test

21eac95

In case of change, only remove trailing whitespaces between user comm…

3206b29

…ents and entry

lenhard added the status: ready-for-review Pull Requests that are ready to be reviewed by the maintainers label Jun 8, 2016

lenhard added 3 commits June 14, 2016 09:04

Merge branch 'master' into preserve-comments

a1c98c5

# Conflicts: # CHANGELOG.md

Remove unused variable

678c83e

Remove unused import

7a3522a

tobiasdiez reviewed Jun 26, 2016

lenhard added 4 commits July 8, 2016 15:05

Merge branch 'master' into preserve-comments

dc4ae0f

# Conflicts: # CHANGELOG.md

Remove unnecessary epilog test

ccddef0

Remove redundant test bib file

6d1ac65

Remove redundant asserts

0972b51

lenhard added the status: devcall label Jul 11, 2016

tobiasdiez removed the status: devcall label Jul 13, 2016

tobiasdiez added this to the v3.6 milestone Jul 13, 2016

Elevate registry actions

b4b288a

koppor reviewed Jul 13, 2016

stefan-kolb and others added 5 commits July 13, 2016 14:21

Delete stuff

67a4885

Revert "Elevate registry actions"

9000f1c

This reverts commit b4b288a.

Revert "Delete stuff"

d970a4c

This reverts commit 67a4885.

Merge branch 'master' into preserve-comments

c1db48f

# Conflicts: # CHANGELOG.md # src/main/java/net/sf/jabref/model/entry/BibEntry.java

Merge branch 'preserve-comments' of github.com:JabRef/jabref into pre…

75d6262

…serve-comments

tobiasdiez reviewed Jul 14, 2016

lenhard added 4 commits July 14, 2016 13:33

Use optional in assert

98cbe98

Remove duplicate test

72de4ce

Remove unnecessary asserts

e5e5c44

Remove unused import of Optional

a8a7b20

lenhard merged commit aa42c16 into master Jul 14, 2016

lenhard deleted the preserve-comments branch July 14, 2016 12:14

koppor mentioned this pull request Jun 28, 2017

How to prevent Jabref add Comments to .bib files? #2944

Closed

koppor mentioned this pull request Oct 9, 2019

Describe migration paths 2.9.2 <-> 3.8.2 <-> 4.3.1 <-> 5.0 JabRef/user-documentation#227

Open
