diff --git a/modules/gzip/index.html b/modules/gzip/index.html new file mode 100644 index 000000000..fbdba1099 --- /dev/null +++ b/modules/gzip/index.html @@ -0,0 +1,108 @@ +--- +title: GZIP-kb Module +--- + + +{% include header.html %} + + +{% include navbar.html nav=site.data.navbar %} +
+

GZip-kb Module

+ +

1 Introduction

+
+

+ The GZIP-kb module recognizes and validates the Gzip (GNU zip) format. + [GZip]. +

+

+ The module is invoked by the: +

+
+
+    jhove ... -m GZIP-kb ...
+  
+
+

+ command line option. +

+

+ The GZIP-kb module recognizes and validates GZip version 4.3. It also supports multiple member GZip files. This module uses the JWAT library for GZip parsing. +

+ +

+This module doesn't have configurable parameters. +

+ + +

2 Coverage

+
+

+ The GZIP-kb module recognizes and validates the following public profiles: +

+ + + +

3 Well-Formedness

+
+

+ The GZip module checks well-formedness. +

+ + +

4 Validity

+

+ The following criteria must be met by a GZip file for JHOVE to consider it valid: +

+ + + +

5 Representation Information

+

+ The MIME type is reported as: application/gzip [RFC 6713]. Application/x-gzip is also supported + +

+

+ In addition to the standard JHOVE + representation information, the following + GZip-specific properties are reported: +

+ + + + +

6 Additional Module Properties

+
+ +
+{% include footer.html %} + + diff --git a/modules/warc/index.html b/modules/warc/index.html new file mode 100644 index 000000000..be687333d --- /dev/null +++ b/modules/warc/index.html @@ -0,0 +1,114 @@ +--- +title: WARC-kb Module +--- + + +{% include header.html %} + + +{% include navbar.html nav=site.data.navbar %} +
+

WARC-kb Module

+ +

1 Introduction

+
+

+ The WARC-kb module recognizes and validates the WARC (Web ARChive) format. + [WARC]. It only validates the WARC file format and WARC headers, not the actual payload of the WARC records. + This module uses the JWAT library for WARC parsing. + For Compressed WARC files the JWAT library is also used to parse compressed WARCs (.warc.gz) +

+

+ The module is invoked by the: +

+
+
+    jhove ... -m WARC-kb ...
+  
+
+

+ command line option. +

+

+ The WARC-kb module recognizes ISO28500:2009. +

+ +

+This module doesn't have configurable parameters. +

+ + +

2 Coverage

+
+

+ The WARC-kb module recognizes and validates the following profiles: +

+ + + +

3 Well-Formedness

+
+

+ The WARC module doesn't check the well-formedness +

+ + +

4 Validity

+

+ The WARC module only validates the WARC file format, WARC headers. It doesn't check the payload of the WARC records. +

+ + +

5 Representation Information

+

+ The MIME type is reported as: application/warc + [application/warc, application/warc-fields]. +

+

+ In addition to the standard JHOVE + representation information, the following + WARC-specific properties are reported: +

+ + +

6 Additional Module Properties

+ + +
+{% include footer.html %} + + diff --git a/references/index.html b/references/index.html index 8218ce10d..93292ee2b 100644 --- a/references/index.html +++ b/references/index.html @@ -10,90 +10,90 @@

JHOVE References

-

+

  1. Adobe Systems, Inc., Adobe PageMaker 6.0 TIFF Technical Notes, September 14, 1995 <http://partners.adobe.com/public/developer/en/tiff/TIFFPM6.pdf>. -

    +

  2. Adobe Systems, Inc., Adobe Photoshop TIFF Technical Notes, March 22, 2002 <https://partners.adobe.com/public/developer/en/tiff/TIFFphotoshop.pdf>. -

    +

  3. Adobe Systems, Inc., Adobe Photoshop 6.0 File Formats Specification, Version 6.0, Release 2, November 2000. <https://www.adobe.com/devnet-apps/photoshop/fileformatashtml/>. -

    +

  4. Adobe Systems, Inc., Digital Negative (DNG) Specification, Version 1.0.0.0, September, 2004 <http://wwwimages.adobe.com/content/dam/Adobe/en/products/photoshop/pdfs/dng_spec_1.4.0.0.pdf>. -

    +

  5. Adobe Systems, Inc., PDF Reference, Version 1.4 (3rd ed.; Boston: Addison-Wesley, 2001) <http://www.adobe.com/content/dam/acom/en/devnet/pdf/pdf_reference_archive/PDFReference.pdf>. -

    +

  6. Adobe Systems, Inc., PDF Reference, Version 1.5 (4th ed.; Boston: Addison-Wesley, 2003) <http://partners.adobe.com/public/developer/en/pdf/PDFReference15_v6.pdf>. -

    +

  7. Adobe Systems, Inc., PDF Reference, Version 1.6 (5th ed.; Boston: Addison-Wesley, 2004) <http://www.adobe.com/content/dam/acom/en/devnet/pdf/pdf_reference_archive/PDFReference16.pdf>. -

    +

  8. Adobe Systems, Inc., TIFF Revision 6.0, Final - June 3, 1992 <https://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf>. -

    +

  9. Adobe Systems, Inc., XMP Specification, January 2004 <https://partners.adobe.com/public/developer/en/xmp/sdk/XMPspecification.pdf>. -

    +

  10. Aldus Corporation, Tag Image File Format Rev 4.0, April 30, 1987. <http://cool.conservation-us.org/bytopic/imaging/std/tiff4.html>. -

    +

  11. Aldus Corporation, TIFF Revision 5.0, August 8, 1988. <http://cool.conservation-us.org/bytopic/imaging/std/tiff5.html>. -

    +

  12. Murray Altheim and Shane McCarron, eds., XHTML 1.1 - Module-based XHTML, W3C Recommendation, 31 May 2001 <https://www.w3.org/TR/xhtml11/>. -

    +

  13. ANSI X3.4-1986, @@ -101,25 +101,25 @@

    JHOVE References

    Standard Code for Information Interchange (7-Bit ASCII), December 30, 1986.
    <https://en.wikipedia.org/wiki/ASCII>. -

    +

  14. ANSI X3.66-1979, Advanced Data Communication Control Procedures (ADCCP). -

    +

  15. Apple Computer, Inc. Audio Interchange File Format: "AIFF", Version 1.3, January 4, 1989 <https://en.wikipedia.org/wiki/Audio_Interchange_File_Format>. -

    +

  16. Apple Computer, Inc. Audio Interchange File Format AIFF-C, Draft, August 26, 1991. -

    +

  17. Mark Baker, Masayasu Ishikawa, Shinichi Matsui, Peter Stark, Ted Wugofski, @@ -127,21 +127,21 @@

    JHOVE References

    XHTML Basic,
    W3C Recommendation, 19 December 2000 <https://www.w3.org/TR/xhtml-basic/>. -

    +

  18. T. Berners-Lee, D. Connolly, Hypertext Markup Language - 2.0, RFC 1866, November 1995 <http://www.ietf.org/rfc/rfc1866.txt>. -

    +

  19. T. Berners-Lee, R. Fielding, and L. Masinter, eds. Uniform Resource Identifiers (URI): Generic Syntax, RFC 2396, August 1998 <http://www.ietf.org/rfc/rfc2396.txt>. -

    +

  20. Paul V. Biron and Ashok Malhotra, eds. @@ -205,7 +205,7 @@

    JHOVE References

    TIFF-F Revised Specifications: The Spirit of TIFF Class F, April 29, 1990 [USENET], newsgroups: alt.fax, comp.graphics.
    -

    +

  21. Digital Library Federation, @@ -213,14 +213,21 @@

    JHOVE References

    Serials, Version 1, December 2002
    <http://www.diglib.org/standards/bmarkfin.htm>. -

    +

    + +
  22. +Peter Deutsch, +RFC 1952: GZIP file format specification version 4.3 +(Aladdin Enterprises, 1996) +<https://tools.ietf.org/html/rfc1952>. +

  23. EBU Technical Specification 3285, BWF - a format for audio data files in broadcasting, Version 2, May 2011 <https://tech.ebu.ch/docs/tech/tech3285.pdf>. -

    +

  24. EBU Technical Specification 3285 - Supplement 1, @@ -228,7 +235,7 @@

    JHOVE References

    Supplement 1 - MPEG audio,
    July 1997 <https://tech.ebu.ch/docs/tech/tech3285s1.pdf>. -

    +

  25. EBU Technical Specification 3285 - Supplement 2, @@ -236,7 +243,7 @@

    JHOVE References

    Supplement 2 - Capturing Report,
    July 2001 <https://tech.ebu.ch/docs/tech/tech3285s2.pdf>. -

    +

  26. EBU Technical Specification 3285 - Supplement 3, @@ -244,7 +251,7 @@

    JHOVE References

    Supplement 3 - Peak Envelope Chunk,
    July 2001 <https://tech.ebu.ch/docs/tech/tech3285s3.pdf>. -

    +

  27. EBU Technical Specification 3285 - Supplement 4, @@ -252,7 +259,7 @@

    JHOVE References

    Supplement 4: <link> Chunk,
    April 2003 <https://tech.ebu.ch/docs/tech/tech3285s4.pdf>. -

    +

  28. EBU Technical Specification 3285 - Supplement 5, @@ -260,7 +267,7 @@

    JHOVE References

    Supplement 5: <axml> Chunk,
    July 2003 <https://tech.ebu.ch/docs/tech/tech3285s5.pdf>. -

    +

  29. EBU Technical Specification 3285 - Supplement 6, @@ -268,42 +275,42 @@

    JHOVE References

    Supplement 6: Dolby Metadata, <dbmd> Chunk,
    October 2009 <https://tech.ebu.ch/docs/tech/tech3285s6.pdf>. -

    +

  30. EBU Technical Specification 3306, MBWF / RF64: An Extended File Format for Audio, July 2009 <https://tech.ebu.ch/docs/tech/tech3306v1_1.pdf>. -

    +

  31. ECMA-6, 7-Bit coded Character Set, 6th ed., December 1991, <http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-006.pdf>. -

    +

  32. E. Fleischman, WAVE and AVI Codec Registries, RFC 2361, June 1998 <http://www.ietf.org/rfc/rfc2361.txt>. -

    +

  33. James Gosling, Bill Joy, Guy L. Steele, Jr., and Gilad Bracha, Java Language Specification (2nd ed.; Addison-Wesley, 2000) <http://titanium.cs.berkeley.edu/doc/java-langspec-2.0.pdf>. -

    +

  34. IBM Corporation and Microsoft Corporation. Multimedia Programming Interface and Data Specifications 1.0, August 1991. <http://www.tactilemedia.com/info/MCI_Control_Info.html>. -

    +

  35. ICC 1:2001-12, @@ -312,6 +319,13 @@

    JHOVE References

    <
    http://www.color.org/newiccspec.pdf>.

    +
  36. +International Internet Preservation Consortium, +WARC Specifications +(s.l., s.d.) +<https://iipc.github.io/warc-specifications/>. +

    +
  37. ISO/IEC 646:1991, Information technology -- ISO 7-bit coded character set for information @@ -339,35 +353,35 @@

    JHOVE References

    Electronic still-picture imaging -- Removable memory -- Part 2: TIFF/EP image data format, October 15, 2001.
    -

    +

  38. ISO/DIS 12639:2003, Graphic technology -- Prepress digital data exchange -- Tag image file format for image technology (TIFF/IT), September 4, 2003. -

    +

  39. ISO/IEC 14495-1:1999, Information technology -- Lossless and near-lossless compression of continuous-tone still images: Baseline, December 1, 1999. -

    +

  40. ISO/IEC 14495-2:2003, Information technology -- Lossless and near-lossless compression of continuous-tone still images: Extensions, April 2, 2003 -

    +

  41. ISO/IEC 14721:2002, Space data and information transfer systems -- Open archival information system -- Reference Model <https://public.ccsds.org/pubs/650x0m2.pdf>. -

    +

  42. ISO/IEC 15444-1:2000, @@ -376,7 +390,7 @@

    JHOVE References

    July 31, 2002, Final Committee Draft (FCD) version available as <
    https://www.iso.org/standard/27687.html>. -

    +

  43. ISO/IEC 15444-2:2004, @@ -384,28 +398,28 @@

    JHOVE References

    May 15, 2004, Final Committee Draft (FCD) version available as <
    https://www.iso.org/standard/33160.html>. -

    +

  44. ISO/IEC 15444-6:2003, Information technology -- JPEG 2000 image coding system -- Part 6: Compound image file format, October 15, 2003. <https://www.iso.org/standard/35458.html>. -

    +

  45. ISO/IEC 15444-12:2004, Information technology -- JPEG 2000 image coding system -- Part 12: ISO base media file format, February 1, 2004. <https://www.iso.org/standard/38612.html>. -

    +

  46. ISO 15929:2002, Graphic technology -- Prepress digital data exchange -- Guidelines and principles for the development of PDF/X standards, March 21, 2002. -

    +

  47. ISO 15930-1:2001, @@ -413,7 +427,7 @@

    JHOVE References

    Part 1: Complete exchange using CMYK data (PDF/X-1 and PDF/X-1a)
    December 6, 2001. <https://www.iso.org/standard/29061.html>. -

    +

  48. ISO 15930-4:2003, @@ -422,7 +436,7 @@

    JHOVE References

    PDF 1.4 (PDF/X-1a) August 4, 2003.
    <https://www.iso.org/standard/39938.html>. -

    +

  49. ISO 15930-5:2003, @@ -430,7 +444,7 @@

    JHOVE References

    Part 5: Partial exchange of printing data using PDF 1.4 (PDF/X-2), August 5, 2003.
    <https://www.iso.org/standard/39939.html>. -

    +

  50. ISO 15930-6:2003, @@ -439,7 +453,7 @@

    JHOVE References

    PDF 1.4 (PDF/X-3), August 6, 2003.
    <https://www.iso.org/standard/39940.html>. -

    +

  51. ISO/DIS 19005-1, @@ -447,7 +461,14 @@

    JHOVE References

    preservation -- Part 1: Use of PDF 1.4 (PDF/A-1),
    December 22, 2004. <https://www.iso.org/standard/38920.html>. -

    +

    + +
  52. +International Organization for Standardization, +ISO 28500:2009 - Information and documentation — WARC file format +(Geneva, 2009) +<https://www.iso.org/standard/44717.html>. +

  53. ITU-T Rec. T.800 (2002), @@ -455,14 +476,14 @@

    JHOVE References

    JPEG 2000 image coding system: Core coding system,
    August 2002. <http://www.itu.int/rec/T-REC-T.800/>. -

    +

  54. A. Katz and D. Cohen, A File Format for the Exchange of Images in the Internet, RFC 1314, April 1992 <http://www.ietf.org/rfc/rfc1314.txt>. -

    +

  55. JEITA JEIDA-49-1998, @@ -471,7 +492,7 @@

    JHOVE References

    Version 2.1, December 1998
    <http://www.exif.org/dcf-exif.PDF>. -

    +

  56. JEITA CP-3451, @@ -479,21 +500,28 @@

    JHOVE References

    Exif Version 2.2, April 2002
    <http://www.exif.org/Exif2-2.PDF>. -

    +

    + +
  57. +J. Levine, +RFC 6713: The 'application/zlib' and 'application/gzip' Media Types +(Taughannock Networks, 2012) +<https://tools.ietf.org/html/rfc6713> +

  58. Tim Lindholm and Frank Yellin, The Java Virtual Machine Specification (2nd ed.; Addison-Wesley, 1999) <http://www.cs.miami.edu/home/burt/reference/java/language_vm_specification.pdf> -

    +

  59. L. McIntyre, S. Zilles, R. Buckley, D. Venable, G. Parsons, and J. Rafferty, File Format for Internet Fax, RFC 2301, March 1998 <http://www.ietf.org/rfc/rfc2301.txt>. -

    +

  60. L. McIntyre, G. Parsons, and J. Rafferty, @@ -501,7 +529,7 @@

    JHOVE References

    Sub-type Registration, RFC 3250, September 2002
    <http://www.ietf.org/rfc/rfc3250.txt>. -

    +

  61. Microsoft Corporation, @@ -511,7 +539,7 @@

    JHOVE References

    (MSDN > MSDN Library > ... > AVI File Format > AVI RIFF File Reference) <
    https://msdn.microsoft.com/en-us/library/windows/desktop/dd318189(v=vs.85).aspx>. -

    +

  62. Microsoft Corporation, @@ -521,7 +549,7 @@

    JHOVE References

    Windows Multimedia > Windows Multimedia > Multimedia Reference > Multimedia Structures > PCMWAVEFORMAT) <
    https://msdn.microsoft.com/en-gb/library/windows/desktop/dd743663(v=vs.85).aspx>. -

    +

  63. Microsoft Corporation, @@ -531,7 +559,7 @@

    JHOVE References

    Windows Multimedia > Windows Multimedia > Multimedia Reference > Multimedia Structures > WAVEFORMAT) <
    https://msdn.microsoft.com/en-gb/library/windows/desktop/dd757712(v=vs.85).aspx>. -

    +

  64. Microsoft Corporation, @@ -541,7 +569,7 @@

    JHOVE References

    Windows Multimedia > Windows Multimedia > Multimedia Reference > Multimedia Structures > WAVEFORMATEX) <
    https://msdn.microsoft.com/en-gb/library/ms913542.aspx>. -

    +

  65. Microsoft Corporation, @@ -551,40 +579,47 @@

    JHOVE References

    Windows Multimedia > Windows Multimedia > Multimedia Reference > Multimedia Structures > WAVEFORMATEXTENSIBLE) <
    https://msdn.microsoft.com/en-us/library/windows/hardware/ff538802(v=vs.85).aspx>. -

    +

  66. MIX: NISO Metadata for Images in XML Schema <http://www.loc.gov/standards/mix/>. -

    +

  67. Jerry Morrison, "EA IFF 85" Standard for Interchange Format Files, January 14, 1985. <http://www.martinreddy.net/gfx/2d/IFF.txt>. -

    +

  68. M. Murata, S. St. Laurent, and D. Kohn, XML Media Types, RFC 3023, January 2001 <http://www.ietf.org/rfc/rfc3023.txt>. -

    +

    + +
  69. +NetarchiveSuite, +Java Web Archive Toolkit (JWAT), +2019. +<https://github.com/netarchivesuite/jwat>. +

  70. NISO Z39.87-2002/AIIM 20-2002, Data Dictionary -- Technical Metadata for Digital Still Images, Draft standard for trial use, June 1, 2002 -- December 31, 2003 <http://xml.coverpages.org/NISO-Z39-87-TrialUse20020601.pdf>. -

    +

  71. G. Parsons and Rafferty, J., Tag Image File Format (TIFF) - F Profile for Facsimile, RFC 2306, March 1998 <http://www.ietf.org/rfc/rfc2306.txt>. -

    +

  72. Dave Ragget, @@ -592,97 +627,97 @@

    JHOVE References

    Internet Draft <draft-ietf-html-specv3-00.txt>, expired 28 September 1995 <
    http://www.w3.org/MarkUp/html3/CoverPage>. -

    +

  73. Dave Ragget, HTML 3.2 Reference Specification, W3C Recommendation, 14 January April 1997 <http://www.w3.org/TR/REC-html32>. -

    +

  74. Dave Ragget, Arnauld Le Hors, and Ian Jacobs, eds. HTML 4.0 Specification, W3C Recommendation, 24 April 1998 <http://www.w3.org/TR/1998/REC-html40-19980424/>. -

    +

  75. Dave Ragget, Arnauld Le Hors, and Ian Jacobs, eds. HTML 4.01 Specification, W3C Recommendation, 24 December 1999 <http://www.w3.org/TR/html4/>. -

    +

  76. Niles Ritter and Mike Ruth, GeoTIFF Format Specification: GeoTIFF Revision 1.0, Version 1.8.2, December 28, 2000 <http://mac.mf3x3.com/GIS/GEOTIFF/geotiff_spec.pdf>. -

    +

  77. R. Rivest, The MD5 Message-Digest Algorithm, RFC 1321, April 1992 <http://www.ietf.org/rfc/rfc1321.txt>. -

    +

  78. Secure Hash Standard, Federal Information Processing Standards Publication 180-2, August 1, 2002 <http://nvlpubs.nist.gov/nistpubs/FIPS/NIST.FIPS.180-4.pdf>. -

    +

  79. D. Singer, R. Clark, and D. Lee, MIME Type Registrations for JPEG 2000 (ISO/IEC 15444), RFC 3745, April 2004 <http://www.ietf.org/rfc/rfc3745.txt>. -

    +

  80. Kevin A. Smith and Doug Kramer. Requirements for Writing Java API Specifications, January 2003 <http://www.oracle.com/technetwork/articles/javase/index-142372.html>. -

    +

  81. Sun Microsystems, Inc., How to Write Doc Comments for the Javadoc Tool, 2000 <http://java.sun.com/j2se/javadoc/writingdoccomments/>. -

    +

  82. Sun Microsystems, Inc., Java 2 Platform, Standard Edition (J2SE), 2003 <http://java.sun.com/j2se/>. -

    +

  83. Sun Microsystems, Inc., Java 2 SDK, Standard Edition, v 1.4.2, 2003 <http://java.sun.com/j2se/1.4.2/index.jsp>. -

    +

  84. Unicode Consortium, Blocks-6.0.0.txt, Correlated with Unicode 6.0, June 4, 2010 <http://www.unicode.org/Public/UNIDATA/Blocks.txt>. -

    +

  85. Unicode Consortium, The Unicode Standard, Version 4.0 (Boston: Addison-Wesley, 2003) <http://www.unicode.org/versions/Unicode4.0.0/>. -

    +

  86. World Wide Web Consortium, @@ -691,12 +726,12 @@

    JHOVE References

    W3C Recommendation, 26 January 2000; revised 1 August 2002 <
    http://www.w3.org/TR/2002/REC-xhtml1-20020801/>. -

    +

  87. textMD: Technical Metadata for Text <https://www.loc.gov/standards/textMD/>. -

    +

{% include footer.html %}