<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: New proposal for captions and other timed text for HTML5</title>
	<atom:link href="http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/</link>
	<description>Silvia&#039;s blog</description>
	<lastBuildDate>Sun, 28 Feb 2010 17:51:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1909</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Thu, 29 Oct 2009 22:05:57 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1909</guid>
		<description>@Doris

There are use cases for both, text inside a video/audio file, and text outside, but related. For most Web developers, outside is in fact a lot more sensible since it&#039;s easier to update such files.

I am neither trying to avoid a patent reality nor trying to hack around issues. Even if there existed only a single format in which we encoded and encapsulated audio and video, I would still propose to use both: in-stream and external (out-of-band) time-aligned text.

For Web pages we already have the reality that it consists of many files that together create a consistent presentation. It&#039;s not a problem - we have zip/tar files and many other means to solve this issues.

In fact, I think it will be even less of a problem for video, since a server can provide the additional service of embedding a text files (or, in fact, text from a database) inside a video file upon download, should that be desirable. Also, it is possible to download the video and the associated text files as package. If the text &quot;mysteriously disappears&quot;, it&#039;s a feature/bug of the Website rather than a fundamental design issue.</description>
		<content:encoded><![CDATA[<p>@Doris</p>
<p>There are use cases for both, text inside a video/audio file, and text outside, but related. For most Web developers, outside is in fact a lot more sensible since it&#8217;s easier to update such files.</p>
<p>I am neither trying to avoid a patent reality nor trying to hack around issues. Even if there existed only a single format in which we encoded and encapsulated audio and video, I would still propose to use both: in-stream and external (out-of-band) time-aligned text.</p>
<p>For Web pages we already have the reality that it consists of many files that together create a consistent presentation. It&#8217;s not a problem &#8211; we have zip/tar files and many other means to solve this issues.</p>
<p>In fact, I think it will be even less of a problem for video, since a server can provide the additional service of embedding a text files (or, in fact, text from a database) inside a video file upon download, should that be desirable. Also, it is possible to download the video and the associated text files as package. If the text &#8220;mysteriously disappears&#8221;, it&#8217;s a feature/bug of the Website rather than a fundamental design issue.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doris</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1897</link>
		<dc:creator>Doris</dc:creator>
		<pubDate>Thu, 29 Oct 2009 17:07:22 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1897</guid>
		<description>It would be fantastic if there was a commen video standard to embed text in a video file that every browser could just access the same way as the video and audio streams inside.

Your idea is a hack around the stupid patent driven reality.

Your idea is well designed but I am afraid it also reenforces the reality. Why should people care to use a open standard for video with text embeding abilities if they can just use your hack?

People saving the video will end up with a useless file not containing the text they might need. And no indication of that before they save it. The text will mysteriously disappear.</description>
		<content:encoded><![CDATA[<p>It would be fantastic if there was a commen video standard to embed text in a video file that every browser could just access the same way as the video and audio streams inside.</p>
<p>Your idea is a hack around the stupid patent driven reality.</p>
<p>Your idea is well designed but I am afraid it also reenforces the reality. Why should people care to use a open standard for video with text embeding abilities if they can just use your hack?</p>
<p>People saving the video will end up with a useless file not containing the text they might need. And no indication of that before they save it. The text will mysteriously disappear.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1657</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Tue, 20 Oct 2009 12:55:49 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1657</guid>
		<description>@Sam
Yes, addition of enter and leave events make sense and make it complete.</description>
		<content:encoded><![CDATA[<p>@Sam<br />
Yes, addition of enter and leave events make sense and make it complete.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sam Dutton</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1656</link>
		<dc:creator>Sam Dutton</dc:creator>
		<pubDate>Tue, 20 Oct 2009 10:15:35 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1656</guid>
		<description>&gt;&gt; Re timed events: I assume you are saying that you like the callback methods that were introduced on the timed text segments, since they allow you to do such timed updates? &lt;&lt;
 
That&#039;s right. I was also trying to say (probably not very clearly!)  that it would be good to be able to listen for enter and leave events as well as being able to to attach event handler callbacks to onenter and onleave -- if only to encourage coders to move JavaScript out of HTML.</description>
		<content:encoded><![CDATA[<p>&gt;&gt; Re timed events: I assume you are saying that you like the callback methods that were introduced on the timed text segments, since they allow you to do such timed updates? &lt;&lt;</p>
<p>That&#039;s right. I was also trying to say (probably not very clearly!)  that it would be good to be able to listen for enter and leave events as well as being able to to attach event handler callbacks to onenter and onleave &#8212; if only to encourage coders to move JavaScript out of HTML.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1631</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Sat, 17 Oct 2009 10:13:15 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1631</guid>
		<description>@Sam

Thanks for the extensive feedback - I&#039;ve added it to https://wiki.mozilla.org/Accessibility/Experiment2_feedback .

Re SMIL: yes, it is oriented towards multi-media presentations where the timeline is in control - that is not how Web pages work, which are essentially static text content enriched with interactive and media elements. Thus the poor fit.

Re live timed events: I think it is possible to point the video src url to a live broadcast, which then gets updated continuously. It might make sense to turn off the controls for such an element. I also don&#039;t see a problem in attaching a subtitle file that is continuously updated in an itext element to such a live video source. The text could continue to be pushed/pulled. With javascript, it would also be possible to continue pulling other content, such as images or other text.

Re timed events: I assume you are saying that you like the callback methods that were introduced on the timed text segments, since they allow you to do such timed updates? I can see how that would be possible - nice ideas!</description>
		<content:encoded><![CDATA[<p>@Sam</p>
<p>Thanks for the extensive feedback &#8211; I&#8217;ve added it to <a href="https://wiki.mozilla.org/Accessibility/Experiment2_feedback" rel="nofollow">https://wiki.mozilla.org/Accessibility/Experiment2_feedback</a> .</p>
<p>Re SMIL: yes, it is oriented towards multi-media presentations where the timeline is in control &#8211; that is not how Web pages work, which are essentially static text content enriched with interactive and media elements. Thus the poor fit.</p>
<p>Re live timed events: I think it is possible to point the video src url to a live broadcast, which then gets updated continuously. It might make sense to turn off the controls for such an element. I also don&#8217;t see a problem in attaching a subtitle file that is continuously updated in an itext element to such a live video source. The text could continue to be pushed/pulled. With javascript, it would also be possible to continue pulling other content, such as images or other text.</p>
<p>Re timed events: I assume you are saying that you like the callback methods that were introduced on the timed text segments, since they allow you to do such timed updates? I can see how that would be possible &#8211; nice ideas!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sam Dutton</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1625</link>
		<dc:creator>Sam Dutton</dc:creator>
		<pubDate>Fri, 16 Oct 2009 08:54:25 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1625</guid>
		<description>&gt;&gt; Callbacks on timed text segments &lt;&gt; It is possible to achieve this effect simply through adding a timeupdate event listener, but proper callbacks like these are much more efficient. &lt;&lt;I&gt;&gt; I am also consciously refraining from re-implementing SMIL. I do not want the full complexity of the “seq” and “par” elements.  &lt;&lt;
 
I&#039;m also interested in how timed-event &#039;fragments&#039; might be handled -- SMIL seems oriented to complete presentations -- and the possibility of live timed events: for example, subtitling of live broadcasts, or content pushed or pulled in addition to live streaming, such as commentary and additional content broadcast during a concert.

We&#039;ve also been looking at how to implement custom timed &#039;events&#039; in a flexible way (though states might be a better word). For example, a carousel widget could listen for chapter events emitted by a video:

 {
   &quot;start&quot;: 20.00,
   &quot;end&quot;: 30.00,
   &quot;sender&quot;: &quot;video#myVideo&quot;,
   &quot;type&quot;: &quot;chapter&quot;,
   &quot;title&quot;: &quot;Single-celled organisms&quot;, 
   &quot;description&quot;: &quot;Single-celled microorganisms began to develop 3-4 billion years ago.&quot;, 
   &quot;src&quot;: &quot;single_cell.jpg&quot;,
   &quot;href&quot;: &quot;en.wikipedia.org/wiki/Microorganism&quot;
 } 

(Data here is shown as an JavaScript object literal, but could be in other formats. The main thing is that the event object properties can have any name and any value type.)

Alternatively, an element could emit timed CSS, HTML (or even JavaScript) events:

 { 
   &quot;start&quot;: 5.00,  
   &quot;end&quot;: 10.00,  
   &quot;sender&quot;: &quot;video#myVideo&quot;
   &quot;event&quot;: &quot;timeupdate&quot;
   &quot;receiver&quot;: &quot;div#subtitle&quot;
   &quot;type&quot;: &quot;HTML&quot;,
   &quot;value&quot;: &quot;It is half past nine and we&#039;ve only just passed &lt;a href=&#039;www.sheffield.com&#039; rel=&quot;nofollow&quot;&gt;Sheffield&lt;/a&gt;&quot; 
 }

Subtitles could be made bold for a few seconds like this:

 {
   &quot;start&quot;: 62.13,
   &quot;end&quot;: 65.29,
   &quot;sender&quot;: &quot;video#myVideo&quot;
   &quot;event&quot;: &quot;timeupdate&quot;
   &quot;receiver&quot;: &quot;div#subtitle&quot;
   &quot;type&quot;: &quot;CSS&quot;,
   &quot;value&quot;: {&quot;font-weight&quot;: &quot;bold&quot;}
 }</description>
		<content:encoded><![CDATA[<p>&gt;&gt; Callbacks on timed text segments &lt;&gt; It is possible to achieve this effect simply through adding a timeupdate event listener, but proper callbacks like these are much more efficient. &lt;<i>&gt; I am also consciously refraining from re-implementing SMIL. I do not want the full complexity of the “seq” and “par” elements.  &lt;&lt;</p>
<p>I&#039;m also interested in how timed-event &#039;fragments&#039; might be handled &#8212; SMIL seems oriented to complete presentations &#8212; and the possibility of live timed events: for example, subtitling of live broadcasts, or content pushed or pulled in addition to live streaming, such as commentary and additional content broadcast during a concert.</p>
<p>We&#039;ve also been looking at how to implement custom timed &#039;events&#039; in a flexible way (though states might be a better word). For example, a carousel widget could listen for chapter events emitted by a video:</p>
<p> {<br />
   &quot;start&quot;: 20.00,<br />
   &quot;end&quot;: 30.00,<br />
   &quot;sender&quot;: &quot;video#myVideo&quot;,<br />
   &quot;type&quot;: &quot;chapter&quot;,<br />
   &quot;title&quot;: &quot;Single-celled organisms&quot;,<br />
   &quot;description&quot;: &quot;Single-celled microorganisms began to develop 3-4 billion years ago.&quot;,<br />
   &quot;src&quot;: &quot;single_cell.jpg&quot;,<br />
   &quot;href&quot;: &quot;en.wikipedia.org/wiki/Microorganism&quot;<br />
 } </p>
<p>(Data here is shown as an JavaScript object literal, but could be in other formats. The main thing is that the event object properties can have any name and any value type.)</p>
<p>Alternatively, an element could emit timed CSS, HTML (or even JavaScript) events:</p>
<p> {<br />
   &quot;start&quot;: 5.00,<br />
   &quot;end&quot;: 10.00,<br />
   &quot;sender&quot;: &quot;video#myVideo&quot;<br />
   &quot;event&quot;: &quot;timeupdate&quot;<br />
   &quot;receiver&quot;: &quot;div#subtitle&quot;<br />
   &quot;type&quot;: &quot;HTML&quot;,<br />
   &quot;value&quot;: &quot;It is half past nine and we&#039;ve only just passed <a href='www.sheffield.com' rel="nofollow">Sheffield</a>&#8221;<br />
 }</p>
<p>Subtitles could be made bold for a few seconds like this:</p>
<p> {<br />
   &#8220;start&#8221;: 62.13,<br />
   &#8220;end&#8221;: 65.29,<br />
   &#8220;sender&#8221;: &#8220;video#myVideo&#8221;<br />
   &#8220;event&#8221;: &#8220;timeupdate&#8221;<br />
   &#8220;receiver&#8221;: &#8220;div#subtitle&#8221;<br />
   &#8220;type&#8221;: &#8220;CSS&#8221;,<br />
   &#8220;value&#8221;: {&#8220;font-weight&#8221;: &#8220;bold&#8221;}<br />
 }</i></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1545</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Tue, 06 Oct 2009 23:42:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1545</guid>
		<description>@blizzard The &quot;name&quot; attribute could indeed be localized before the UA sees it, since it will get displayed in the menu. The idea is however to allow the page author to influence the name of the menu. This may be a bad idea, I don&#039;t know - I&#039;m happy for suggestions there.</description>
		<content:encoded><![CDATA[<p>@blizzard The &#8220;name&#8221; attribute could indeed be localized before the UA sees it, since it will get displayed in the menu. The idea is however to allow the page author to influence the name of the menu. This may be a bad idea, I don&#8217;t know &#8211; I&#8217;m happy for suggestions there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christopher Blizzard</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1544</link>
		<dc:creator>Christopher Blizzard</dc:creator>
		<pubDate>Tue, 06 Oct 2009 23:18:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1544</guid>
		<description>Is the intent here that the name attribute of the itextlist element be localized as its sent down the wire?  Just wondering if the idea is that is a well-defined string that the browser localizes or it&#039;s something that should be localized before the UA sees it.</description>
		<content:encoded><![CDATA[<p>Is the intent here that the name attribute of the itextlist element be localized as its sent down the wire?  Just wondering if the idea is that is a well-defined string that the browser localizes or it&#8217;s something that should be localized before the UA sees it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1543</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Tue, 06 Oct 2009 22:24:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1543</guid>
		<description>@kunter There is no need to merge the markup of subtitles (or other time-aligned text for that matter) with HTML directly, just as there is no need to base64 encode images and include them directly in HTML. The itext proposal replicates for subtitles what we do for images and thus it follows completely along the HTML philosophy. DFXP is more than enough mark-up for subtitles, and so is srt or any of the other millions of formats that people have come up with over the years.

As for linking to subtitles in a HTML head element: that won&#039;t work when you have multiple video elements on the web page. You really do need a solution that clearly associates with a particular video element.</description>
		<content:encoded><![CDATA[<p>@kunter There is no need to merge the markup of subtitles (or other time-aligned text for that matter) with HTML directly, just as there is no need to base64 encode images and include them directly in HTML. The itext proposal replicates for subtitles what we do for images and thus it follows completely along the HTML philosophy. DFXP is more than enough mark-up for subtitles, and so is srt or any of the other millions of formats that people have come up with over the years.</p>
<p>As for linking to subtitles in a HTML head element: that won&#8217;t work when you have multiple video elements on the web page. You really do need a solution that clearly associates with a particular video element.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/10/06/new-proposal-for-captions-and-other-timed-text-for-html5/comment-page-1/#comment-1542</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Tue, 06 Oct 2009 22:09:34 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=591#comment-1542</guid>
		<description>@Jack thanks for the comments - and you are right: there are plenty of existing syntax elements in other specifications that could be tweaked, adapted and possibly re-used. However, none of them really fit.

&quot;category&quot; is very different to &quot;role&quot; - it is the category of time-aligned text we are talking about and there is a limited list part of the spec.

&quot;name&quot; could be replaced by &quot;title&quot; or something else - I am not particularly fussed about this though I needed it as an attribute rather than as content model, which would have been more obvious.

I am also consciously refraining from re-implementing SMIL. I do not want the full complexity of the &quot;seq&quot; and &quot;par&quot; elements. Also, the &quot;text&quot; or &quot;ref&quot; elements do not compare to &quot;itext&quot; which references a particular type of interactive text files similar to how &quot;img&quot; references particular types of image files.

Further, HTML doesn&#039;t do namespaces, so every adoption from another standard would need to be replicated into HTML anyway. And since there is not an exact match between the needs that itext and itextlist express and those provided by other specs, I&#039;d rather avoid that complexity.

The important thing here is though that we have looked at existing syntaxes and have learnt from them, so even through there is no direct re-use, there is indeed conceptual re-use and learnings.</description>
		<content:encoded><![CDATA[<p>@Jack thanks for the comments &#8211; and you are right: there are plenty of existing syntax elements in other specifications that could be tweaked, adapted and possibly re-used. However, none of them really fit.</p>
<p>&#8220;category&#8221; is very different to &#8220;role&#8221; &#8211; it is the category of time-aligned text we are talking about and there is a limited list part of the spec.</p>
<p>&#8220;name&#8221; could be replaced by &#8220;title&#8221; or something else &#8211; I am not particularly fussed about this though I needed it as an attribute rather than as content model, which would have been more obvious.</p>
<p>I am also consciously refraining from re-implementing SMIL. I do not want the full complexity of the &#8220;seq&#8221; and &#8220;par&#8221; elements. Also, the &#8220;text&#8221; or &#8220;ref&#8221; elements do not compare to &#8220;itext&#8221; which references a particular type of interactive text files similar to how &#8220;img&#8221; references particular types of image files.</p>
<p>Further, HTML doesn&#8217;t do namespaces, so every adoption from another standard would need to be replicated into HTML anyway. And since there is not an exact match between the needs that itext and itextlist express and those provided by other specs, I&#8217;d rather avoid that complexity.</p>
<p>The important thing here is though that we have looked at existing syntaxes and have learnt from them, so even through there is no direct re-use, there is indeed conceptual re-use and learnings.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
