Publishing Maintenance Working Group Telco

Meeting minutes

Overview of TPAC

Wendy Reid: we had really good sessions on annotations and a lot of the ground work is done so now we need to start a draft proposal
… expect to see that in the next few weeks
… a lot of time was spent on fixed layouts and comics
… made good progress on what to do with the problematic rendition fxl properties
… we are not going to add the html syntax - I am working on a blog post on this
… we're still working on what to do about submissions to iso
… got an update on Japanese publishing
… we have some info on the iso transition but the issue is the european commission and iso coordination

Ivan Herman: I had some disagreement about submission at the meeting but it seems that iso requires a document to be done as iso
… the wcag group tried to convince them that this is not useful to do for wcag and iso accepted it
… iso accepted their argument and wcag was submitted
… I have asked for the argument on how to make this case for epub

Ivan Herman: we also discussed images in the spine at tpac

Wendy Reid: we intend to leave the spec requirements for this as they are

Parallel Content - https://github.com/w3c/epub-specs/discussions/2829

Wendy Reid: https://github.com/w3c/epub-specs/issues/2670, https://github.com/w3c/epub-specs/issues/2825, https://github.com/w3c/epub-specs/issues/2773, https://github.com/w3c/epub-specs/issues/2806

Wendy Reid: we've had use cases for multiple versions of the content and tried to solve with multiple renditions
… parallel content is where there are two or more parts that appear in conjunction with each other
… content in translation is an example where you might have a poem in two different languages on facing pages so you can review each line in each language
… it also comes up in accessibility issues

Gautier Chomel: currently there are different problematics like translated content, fixed and reflow, audio and text

<Wendy Reid> https://www.irccloud.com/pastebin/HohO22uS/

Gautier Chomel: up until now there have been separate discussions and proposals for each but we should try to find one way for all
… I don't have a preferred option for how to address this technically

Ivan Herman: it is an important use case and multiple renditions covers at least some of it
… we don't have a technical solution yet so this group probably can't do it - it likely needs to be incubated in the community group
… I expect the solution will be complicated and needs incubation to figure out what will work

Hadrien Gardeur: multiple renditions does have a mapping document to jump between the renditions - it's a lot like a navigation document
… we need to be careful not to force everything together if they don't work well together
… I think if we modify multiple renditions we could do some interesting use cases like magazines with fixed and reflowable layouts
… we shouldn't reject all of multiple renditions but I agree we do need to look into it more
… the way multiple renditions was defined was not ideal

Wendy Reid: I agree we need incubation - we have bits and pieces of what we need
… if I'm reading a book where one page is one language and another in another language you wouldn't want two separate documents but we don't have multiple reading orders
… same is true for audio where the transcript might be in the same file
… but reflow and fxl probably are always going to require separate documents

Hadrien Gardeur: I think another option for multiple reading orders than multiple renditions and all it entails - a separate publication - we also have collections for alternate reading order
… we wouldn't need to introduce anything new - is worth exploring

<Murata Makoto> Unfortunately, multiple renditions were never really used. Right?

Hadrien Gardeur: I had this in mind at tpac but we didn't have time to discuss - once we're done with comics I'd like to explore it
… we should consider a task force since we're in a revision cycle right now

Ivan Herman: I have practical problems, namely whatever we do if we want to put it in the epub standard we need two implementations
… I get that it's not multiple renditions and it has not been picked up - that's why we took it out and made it a note

<Murata Makoto> If my memory is correct, I think that it is too difficult to use multiple publication documents per zip file.

Ivan Herman: I'm worried about putting something into the standard again before we know that there will be adoption

Wendy Reid: if we are to undertake this a big question is what stopped adoption of multiple renditions - there are some implementations but they are really platform base
… I have a feeling I know why - people didn't know what to do with what we gave them

Hadrien Gardeur: what if we create a task force that does not attempt to introduce new things in the spec
… based on the result of that task force we see whether there is any interest and then decide where to go
… if that fails then we can move it over to the CG to look into further

Gregorio Pellegrino: we shouldn't work in the past but whether it is needed now - EAA is requiring publishers to now produce both fxl and reflow in the same package

Shinya Takami: I wonder if there is enough interest or demand for this - I think it needs more incubation

Ivan Herman: the working group is getting thin and we have existing task forces with a lot of work to do - we should be getting to CR within half a year - so I'm worried about this taking more time away from the revision

Hadrien Gardeur: there is demand for textbooks and it is common for magazines - they are lacking the format to do it
… there is scale now that didn't exist when we did multiple renditions
… the interest is specialized though
… and I agree with Ivan Herman that we shouldn't do too many things at the same time - there is still work for comics but not as much as for annotations
… I can't see doing both in parallel but if we can close comics soon and we get interest in this from other participants then we should at least explore what we can do

<Ivan Herman> +1 to Hadrien Gardeur

Hadrien Gardeur: and if we can't add anything to the spec than it still leaves people with some ideas to work on

Gautier Chomel: scholarly publishing does do multiple renditions but in a closed environment - they need to provide interoperable content
… I understand the time and resource issues but this problem has been around for a long time and should be a priority to finally address
… other solutions are being implemented and it makes it harder to standardize

Wendy Reid: we need to do some outreach to find out who is interested before we get to a task force
… it's easy to start a task force but we don't want to find out later that the work won't be implemented

Gautier Chomel: I think it would be good to have this in the CG because we have more people with interest in this area

<Avneesh Singh> +1, anyone can join CG

Hadrien Gardeur: we also have interest in EDRLab but they are not represented in this group

Wendy Reid: for a next step we should start the conversation in the CG
… nothing stops us from promoting the work into the WG later
… doesn't require more work from this group to get it started

Gautier Chomel: I can try by mid-January to have a call in the CG to start a task force

Scheduling - https://github.com/w3c/epub-specs/discussions/2834

Wendy Reid: only other thing is meeting scheduling

Wendy Reid: I've already seen some responses, but we had a question about our meeting scheduling because our one time slot is not ideal for the west coast of US, too early, or too late for those in Japan
… wanted to find out if we could go back to a rotating schedule
… no interest so far in starting even earlier but nothing negative about a rotating meeting time

Ivan Herman: I've done the rotating schedule here and in other groups and it never works well - it divides the community
… it's always been abandoned after a while

Avneesh Singh: do we have enough agenda items to fill up four or five meetings a month or should we consider dropping the schedule from every week

Wendy Reid: if you haven't had a chance to respond please think about it and add a comment

test suite changes - w3c/epub-tests#309

Ivan Herman: I made some changes to the test suite after the tpac meeting because we deprecated a property
… (I still need a full list of what we'll deprecated, as an aside)
… there is a pull request for the tests to separate the tests that are deprecated - new labelling like required vs. recommended
… the other thing is more controversial - if someone currently creates a test they are expected to create a new zipped epub file
… this forces the tester to test the book they've created
… but this has become a pain when you need to go back and make minor changes to existing tests
… the PR adds a github action to auto-generate all the epubs
… is this okay or should we keep people generating the epubs manually?

Hadrien Gardeur: I will get back to you soon about the other properties to deprecate

Gautier Chomel: if all the epub files are updated then you lose some information about when they were last changed

Ivan Herman: yes, but the metadata in the epub file will still be the same about the last modification date

Gautier Chomel: might be a problem for sorting, but not sure how big

Ivan Herman: for reporting the date of the packaged epub is not used
… only the unpacked files are used

Wendy Reid: the last modification date only changes if you actually modify the test

Ivan Herman: if you have a lot of tests to work on then generating each one becomes a pain to do
… I considered adding epubcheck validation but I'm not sure how to do that

Gautier Chomel: I can send you an example of how to call it into your workflow

Wendy Reid: could have epubcheck as an action

Ivan Herman: but a lot of tests that epubcheck would have to be started up and run on
… is it okay if I merge as it is now?

Wendy Reid: yes

Gautier Chomel: yes

Ways to identify inferred metadata - w3c/epub-specs/discussions/2830

<Wendy Reid> w3c/epub-specs#2833

Gautier Chomel: I have questions about ways to infer metadata
… people are starting to update their epubs and that comes with a lot of AI generated content

Gautier Chomel: if we cannot differentiate what has been auto-answered it will become problematic

Hadrien Gardeur: I raised this during tpac - there could be legal reasons for identifying what is AI generated
… might not be in our scope to do this for html
… if there is work going on in this area it would be helpful to know where

Ivan Herman: w3c is still trying to find out what can be done about AI - there is a new interest group but not answer at this time
… even if we are talking metadata this is not just an epub issue
… coming up with our own metadata in isolation is not a good idea

Wendy Reid: we might be overthinking the question - this is only epub metadata and what is generated automatically

<Hadrien Gardeur> https://w3c.github.io/pm-wg/minutes/2025-11-11-f2f.html#2ee0

Wendy Reid: how can you tell if a book contains AI-generated metadata
… could we use something like the refines attribute to make statements about resources
… all we are doing is putting an asterisk on resouce essentially

Hadrien Gardeur: I think Ivan Herman's point still stands that we don't have the metadata to say anything - we're lacking the vocabulary and it's not in our scope to define that
… could be schema.org or dublin core or someone else to define the metadata

Wendy Reid: if I have a million epubs in my database and some are quite old and I want to add accessibility metadata then I can try to infer things like whether they support some textual reading
… but there should be a way to say that this metadata was automated so it is not compeletely trustworthy

Gautier Chomel: the problem is bigger, but yes, that is what we need to capture first