(Pack­aged) Web Pub­li­ca­tions (PWP)

Ivan Her­man, W3C

Se­man­tic Web in Li­braries (SWIB16), Bonn, Ger­many, 2016-11-29

(Packaged) Web Publications

Ivan Her­man, W3C

Se­man­tic Web in Li­braries (SWIB16), Bonn, Ger­many

2016-11-29

These Slides are Avail­able on the Web

See: https://w3c.github.io/dpub/2016/SWIB/

(Slides are in HTML)

A format to represent digital publications: EPUB 3

Cloned Milk­men, Flickr

EPUB 3 is a very mature specification

A wide variety of books have been created

Books with lots of il­lus­tra­tions…

An extract of the 'Petit Prince' with a typical drawing
An­toine de Sain-Exupéry: “Le Petit Prince”, Ebooks li­bres et gra­tu­its
An extract of Winnie the Pooh with a typical drawing
A.A. Milne: “Win­nie-the-Pooh”, Egmont UK Ltd.

Sci­en­tific pre­sen­ta­tions

Slide-like page with lots of mathematical equations
David Mao: “Cal­cu­lus”

Art books

Page with an annotated high quality reproduction of a Dutch painting
In­grid Koe­nen: “Dutch Golden Age”

Man­gas

Page of a typical Japanese manga
“ハルコさんの彼氏”, IDPF EPUB3 Sam­ple

Tech­ni­cal books

Extract of a CSS book, with codes and figures
Lea Verou: “CSS Se­crets”, O'Reilly
Extract of a Javascript book with figure and code
Michael Fogus: “Func­tional Pro­gram­ming”, O'Reilly

Books with different character sets and writing directions

An extract of a text in hindi
“The Ma­hab­harata in Devanāgarī (देवनागरी)”, IDPF EPUB3 Sam­ple
A book with hebrew characters
“Is­rael sail­ing”, IDPF EPUB3 Sam­ple

It is not only for books! It can be…

…conference proceedings

Cover of a Springer Proceedings of an LOD conference
Article from a Springer Proceedings of an LOD conference

…journals or magazines (articles)

Cover of the JEP journal
Table of content of the JEP journal

…of­fi­cial re­ports of all kinds

EU brochure in English
Eu­ro­pean Com­mis­sion — Gen­eral Re­port 2015
EU brochure in Bulgarian
Eu­ro­pean Com­mis­sion — Gen­eral Re­port 2015

In fact, just about anything!

Screendump of gdrive saving a document as EPUB
Screendump of Apple Pages saving a document as EPUB

What is the secret?

James Ar­boghast, flickr

Well, at least one of the secrets…

Rough structure of an EPUB file

EPUB Packaging structure diagram

A good example

Figure with a complex image processing done, in fact, in CSS
Lea Verou: “CSS Se­crets”, O'Reilly

Bottom line: relationships of EPUB with OWP is fundamental

Are we all done?
I.e., are OWP and EPUB a perfect match?

There are two major areas that need work:

  1. bring OWP and Dig­i­tal Pub­lish­ing closer
  2. bring the Web and Dig­i­tal Pub­lish­ing closer

Bring OWP and Digital Publishing closer

Missing OWP features

Bring other OWP features to publishing

Bring­ing these to pub­lish­ing should (and will…) hap­pen

Bring the Web and Digital Publishing closer

What is, in fact, a (dig­i­tal) book?

What we get today…

EU report page with signs for downloads
Dump from EU pub­li­ca­tions' page

What we get today…

EU report in EPUB
Dump from EU pub­li­ca­tions' page

What we get today…

EU report page with nice outlook, TOC, etc
Dump from EU pub­li­ca­tions' page

“This should not be the case!”
what does this mean?

Portable Web Publication at a glance

Sep­a­ra­tion be­tween pub­lish­ing as Web sites and in an of­fline pack­age should be di­min­ished to zero

ibta ara­bia

For example: book in a browser

Joseph Reagle's book as a web page
Joseph Rea­gle: “Good Faith Col­lab­o­ra­tion”, PhD The­sis, MIT Press

For example: book in a browser

Joseph Reagle's book as an ebook in reader
Joseph Rea­gle: “Good Faith Col­lab­o­ra­tion”, PhD The­sis, MIT Press

For example: I may not be online…

Person sitting in a station with a mobile in hand
Bryan Ong, Flickr

For example: scholarly publishing

Screen dump of an article on F1000
Jul­lien Colomb et. al: “Sub-strains of Drosophila Can­ton-S…”, F1000Re­search

But… why not simply rely only on the current Web?
(with some facilities for offline)

The web already provides all we need!

Not quite…
(even when considering the Web only, i.e., no packaging)

Need for the concept of a “publication” of many resources

Why do we need the WP concept?

Why do we need the WP concept?

How does that translate to the Web?

a collection of resources with different URL pointer

How does that translate to the Web?

a collection of resources in a 'blob' with one URL pointer

An additional concept:
a “WP Processor”

An internal representation may also be needed

Architectural challenge: handling online/offline

Envisioned architecture:
online

Document consumed through the Web in a traditional way

Envisioned architecture:
offline

Document consumed through a Service Worker, possibly cached

Is this approach at all feasible?

Advances in modern browsers: Web and Service Workers

Advances in modern browsers: Web and Service Workers

Work in progress

A WP Processor can be implemented using Service Workers

Service Workers are coming…

Screen dump of the service workers' draft spec

An ex­am­ple of on­line/of­fline book with Ser­vice Work­ers

Screen dump the book “High Performance Browser Networking”

Manifests

Packaged Web Publications

Pack­aged Web Pub­li­ca­tions (PWP)

A layer “on top” of WP-s

a collection of resources in a 'blob' in a rectangle with one URL pointer

Structure of an EPUB3 file

EPUB Packaging structure diagram

A Packaging of a Web Publication

PWP packaging structure

PWP Packaging structure diagram with admin file in JSON

A PWP Proces­sor

A PWP Processor

Document consumed through a Service Worker, possibly unpacked

Technical challenge: addressing, identification

Is it “addressing” or “identification”?

Is it “addressing” or “identification”?

What does a Web request return for a locator?

Ergonomy differences

Book read­ing needs a dif­fer­ent ap­proach to er­gon­omy

Front page of the War and Peace ebook
Lev Tol­stoy: “War and Peace”, feed­books

Per­son­al­iza­tion

But what about
EPUB???

PWP vs. EPUB3.1

Most things are the same!

De­vel­op­ment process

Other synergy effects of convergence

Advantage for the publishers‘ community

Photo of a bookshelf with lots of technical books
Jef­frey Zeld­man, Flickr

Advantage for the publishers‘ community

Photo of a bookshelf with lots of technical books
Jef­frey Zeld­man, Flickr

Advantage for the Web community

image of a medieval manuscript
Oliver Byrne's edi­tion of Eu­clid, Uni­ver­sity of British Co­lum­bia

To conclude:
Let us create real publications on the Web!

Some references

Lat­est PWP Use Cases and Re­quire­ments draft:
https://w3c.github.io/dpub-pwp-ucr/
Lat­est PWP Ed­i­tors’ draft:
https://w3c.github.io/dpub-pwp/
PWP Issue list:
https://github.com/w3c/dpub-pwp/issues

Some references

Lat­est PWP Use Cases and Re­quire­ments draft:
https://w3c.github.io/dpub-pwp-ucr/
Lat­est PWP Ed­i­tors’ draft:
https://w3c.github.io/dpub-pwp/
PWP Issue list:
https://github.com/w3c/dpub-pwp/issues

Con­stantly evolv­ing…

One more thing…

This is not how Web development works at W3C…

Ed Rit­ger, Flickr

…it is more like that!

Paul Downey, Flickr

I.e., join W3C to help things moving forward!

Paul Downey, Flickr

Thank you for your at­ten­tion!

This pre­sen­ta­tion:
https://w3c.github.io/dpub/2016/SWIB/
(PDF is also avail­able for down­load)
My con­tact:
ivan@​w3.​org