Add proposal "Format API" #127

mickael-menu · 2020-03-23T20:29:39Z

This proposal introduces a dedicated API to easily figure out a file format.

While a Publication is independent of any particular format, knowing the format of a publication file is necessary to:

determine the publication parser to use,
group or search publications by file type in the user's bookshelf.

This API is not tied to Publication, so it can be used as a general purpose tool to guess a file format, e.g. during HTTP requests or in the LCP library.

You can read the formatted proposal here.

proposals/001-file-format-api.md

mickael-menu · 2020-04-01T17:08:45Z

@danielweck @jccr We would like to merge this proposal during the next Readium dev call, so any insights you may have for desktop and web is most welcomed 🙏

proposals/001-format-api.md

mickael-menu · 2020-04-24T09:36:41Z

I'd like to move forward, so I intend to merge this proposal after next week's meeting, if there's no counter-arguments till then.

I made a few changes in the proposal after implementing it in Swift:

Renamed Format.guess into Format.of, because the caller is expecting an accurate Format returned.
Added bitmap formats sniffing, because we need them in the CBZ parser.
Removed inspectingContent parameter to simplify the sniffers. Instead, Format.of() will iterate twice through the sniffers: first with a context containing only file extensions and media types, and the second time with a context containing the content, if there's any.
I grouped the sniffing of several formats in single sniffers, when the logic is shared (e.g. all the Readium WebPub formats).
Added a few more APIs:
- MediaType: encoding, structuredSyntaxSuffix, isZIP, isJSON, isRWPM, isPartOf()
- Link: mediaType,
- Format.SnifferContext: encoding,contentAsRWPM and readFileSignature() to sniff magic numbers.

Change the ZAB media type Fix EPUB heavy sniffing

Add the x. facet for ZAB and W3C WPUB

Renamed Format.guess into Format.of Added bitmap formats, used for the CBZ parser Removed `inspectingContent` parameter to simplify the sniffers Grouped formats in shared sniffers

Add `Link.mediaType` helper Add `Format.SnifferContext` `encoding`, `contentAsRWPM` and `readFileSignature()`

Add MediaType's isAudio and isLCPProtected Add OPDS Authentication Document media type and format Sniff an Audiobook using the reading order types

This was referenced Mar 23, 2020

Submitting and Archiving Proposals #128

Closed

Model for the Publication's format/type #112

Closed

mickael-menu commented Mar 23, 2020

View reviewed changes

proposals/001-file-format-api.md Outdated Show resolved Hide resolved

qnga reviewed Mar 24, 2020

View reviewed changes

proposals/001-file-format-api.md Outdated Show resolved Hide resolved

qnga reviewed Mar 24, 2020

View reviewed changes

proposals/001-file-format-api.md Outdated Show resolved Hide resolved

mickael-menu mentioned this pull request Mar 25, 2020

Media types of Readium publications #121

Closed

mickael-menu changed the title ~~Initial proposal for the File and Format API~~ Add proposal "File and Format API" Mar 30, 2020

mickael-menu changed the title ~~Add proposal "File and Format API"~~ Add proposal "Format API" Apr 1, 2020

mickael-menu requested review from danielweck, HadrienGardeur, llemeurfr, JayPanoz and jccr April 1, 2020 16:16

mickael-menu commented Apr 11, 2020

View reviewed changes

proposals/001-format-api.md Show resolved Hide resolved

mickael-menu mentioned this pull request Apr 16, 2020

Add the Format API readium/r2-shared-swift#88

Merged

HadrienGardeur approved these changes Apr 22, 2020

View reviewed changes

This was referenced Apr 29, 2020

Add MediaType API readium/r2-shared-kotlin#99

Merged

Use the new MediaType API readium/r2-streamer-kotlin#101

Merged

Use the new MediaType API readium/r2-testapp-kotlin#313

Closed

Add the Format API readium/r2-shared-kotlin#100

Merged

mickael-menu added 7 commits May 5, 2020 15:05

Initial proposal for the File and Format API

008cc3c

Add LPF and W3C WPUB formats

037cdb1

Change the ZAB media type Fix EPUB heavy sniffing

Cosmetics, refactor structure, remove File API

676f597

Add the x. facet for ZAB and W3C WPUB

Various changes in Format API after implementing in Swift

a755875

Renamed Format.guess into Format.of Added bitmap formats, used for the CBZ parser Removed `inspectingContent` parameter to simplify the sniffers Grouped formats in shared sniffers

Add MediaType encoding, isPartOf() and isRWPM

fb2d803

Add `Link.mediaType` helper Add `Format.SnifferContext` `encoding`, `contentAsRWPM` and `readFileSignature()`

Add MediaType's structuredSyntaxSuffix, isZIP and isJSON

e9bd3f1

Rename MediaType.isPartOf into matches

4d42752

Add MediaType's isAudio and isLCPProtected Add OPDS Authentication Document media type and format Sniff an Audiobook using the reading order types

mickael-menu merged commit 7b8a1b6 into readium:master May 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add proposal "Format API" #127

Add proposal "Format API" #127

mickael-menu commented Mar 23, 2020 •

edited

Loading

mickael-menu commented Apr 1, 2020

mickael-menu commented Apr 24, 2020 •

edited

Loading

Add proposal "Format API" #127

Add proposal "Format API" #127

Conversation

mickael-menu commented Mar 23, 2020 • edited Loading

mickael-menu commented Apr 1, 2020

mickael-menu commented Apr 24, 2020 • edited Loading

mickael-menu commented Mar 23, 2020 •

edited

Loading

mickael-menu commented Apr 24, 2020 •

edited

Loading