-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
JHOVE Incorrectly reading beyond RIFF 'data' Chunk ID and calling it invalid... #9
Comments
Okay, so the problem seems to be that JHOVE assumes that an 'int' is a big enough variable to offsets etc. Your data chunk is so long it won't fit in an signed Java int, and so 'skipBytes' fails during an attempt to pass over the data chunk. Attempting to set up a pull-request including this test case. |
Hi Andy, Just to clarify JHOVE isn't storing offset values correctly, because we're stuffing 2^32 bit (unsigned) integers into signed 2^31 integers... That is, both the offsets/chunk sizes we see in this breakdown extend beyond 2^31... 2^31 = 2,147,483,648 (possible positive integers can be stored/represented)
Thus causing JHOVE to get confused... Just trying to get a good simple description for the folks digitizing here (and myself too!) Great spot. Thank you for your help! I hope Gary can provide some feedback on the proposed fixes/timescales etc. Cheers, Ross |
The reality is somewhat messier, but yes, that's essentially it. |
Closed by #37 |
I have received a 2GB wav file that I'm having difficulty validating in JHOVE. The tool tells me that I have an invalid character within a CHUNK ID.
Analyzing the file, however, and it seems that JHOVE is reading beyond the CHUNK ID and returning an invalid result.
The error message seems to be returned from this part of the code:
https://github.com/gmcgath/jhove/blob/0dc774d98efa8c7581fe1602c3f6e713f499201d/src/main/java/edu/harvard/hul/ois/jhove/module/iff/ChunkHeader.java#L53
The byte causing the first issue is 0x05 at offset 46, I've starred offset 46 and 47. See also the screenshot.
The screenshot has been generated by looking at the following snippet from the 2GB file:
The analysis shows, that 0x05 is no longer in the CHUNK ID, nor is the preceding byte 0x00, which will also show up in error if one artificially turns 0x05 into a byte greater than 0x32.
Screenshot:
JHOVE Version: 1.11
Java: 1.7
Platform: Windows XP SP3
Creating Application (WAV): Adobe Audition CS6 (Macintosh)
The text was updated successfully, but these errors were encountered: