<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The model of a time-linear media resource for HTML5</title>
	<atom:link href="http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/</link>
	<description>Silvia&#039;s blog</description>
	<lastBuildDate>Sun, 28 Feb 2010 17:51:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/comment-page-1/#comment-2652</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Tue, 24 Nov 2009 03:29:45 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=710#comment-2652</guid>
		<description>@Daniel

Thanks for your thoughts!

In the picture, I would think that everything is &quot;continuous&quot; media, since everything has a relationship to the main video timeline. I agree that DFXP as aligned to a video is also &quot;continuous&quot; media, even if it might have gaps. It belongs into the &quot;time-aligned text&quot; category. &quot;Discrete&quot; media would only be media that has no relationship to time and just exist. BTW: video and in particular audio can also have non-constant frame rate, so it&#039;s not that different after all.

I do believe that DAISY - just like SMIL - consists of presentations with multiple timelines and it is what DAISY needs. It would be interesting to see if the DAISY specification - just like the SMIL specification - could be implemented using HTML5 and JavaScript. With all the modern extensions of HTML5, it may well be possible to do so. I am curious as to what the limitations would be. Hierarchical navigation will certainly be implementable with Media Fragment URIs, HTML and JavaScript. Just look at the linking that is done in http://oggify.com/videos/7 - an experimental site of ours.

We&#039;re not fully there with the standard and not fully there with implementations, but we are certainly getting closer and I hope it will all be resolvable within the next 6 months.</description>
		<content:encoded><![CDATA[<p>@Daniel</p>
<p>Thanks for your thoughts!</p>
<p>In the picture, I would think that everything is &#8220;continuous&#8221; media, since everything has a relationship to the main video timeline. I agree that DFXP as aligned to a video is also &#8220;continuous&#8221; media, even if it might have gaps. It belongs into the &#8220;time-aligned text&#8221; category. &#8220;Discrete&#8221; media would only be media that has no relationship to time and just exist. BTW: video and in particular audio can also have non-constant frame rate, so it&#8217;s not that different after all.</p>
<p>I do believe that DAISY &#8211; just like SMIL &#8211; consists of presentations with multiple timelines and it is what DAISY needs. It would be interesting to see if the DAISY specification &#8211; just like the SMIL specification &#8211; could be implemented using HTML5 and JavaScript. With all the modern extensions of HTML5, it may well be possible to do so. I am curious as to what the limitations would be. Hierarchical navigation will certainly be implementable with Media Fragment URIs, HTML and JavaScript. Just look at the linking that is done in <a href="http://oggify.com/videos/7" rel="nofollow">http://oggify.com/videos/7</a> &#8211; an experimental site of ours.</p>
<p>We&#8217;re not fully there with the standard and not fully there with implementations, but we are certainly getting closer and I hope it will all be resolvable within the next 6 months.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel</title>
		<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/comment-page-1/#comment-2596</link>
		<dc:creator>Daniel</dc:creator>
		<pubDate>Mon, 23 Nov 2009 14:09:49 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=710#comment-2596</guid>
		<description>Hi Silvia, thanks for this blog post.

I always find it interesting to compare terminologies. In the Media Fragment diagram, there is a type of track composed of &quot;discrete&quot; media, as per the SMIL definition. By contrast, some tracks are based on &quot;continuous&quot; Media.

http://www.w3.org/TR/SMIL3/smil-timing.html#Timing-DiscreteContinuousMedia

http://www.w3.org/TR/SMIL3/smil-extended-media-object.html#smilMediaNS-Definitions

Time-aligned text, in particular W3C-TimedText-AFXP-DFXP-TTML (or whatever it ends-up being known as) is an interesting case. The simple internal timing model means that such media track can be seen as a linear (i.e. from a defined &quot;begin&quot; to a known &quot;end&quot; without intrinsic discontinuity) time-based resource (i.e. not frame-based) that returns a visual depending on essentially 2 input parameters: a time offset and the coordinates+dimensions of a host rendering area. Both parameters get resolved against particular reference points, such as the wallclock for time or a video rectangle for the space domain. Generally-speaking, this kind of &quot;text media&quot; can actually be thought of as a motion-pictures track, with similar content addressing behaviors (i.e. &quot;give me a time offset and I will return a frame to render, in a deterministic/reproducible manner&quot;). Contiguous &quot;frames&quot; of static text do not necessarily have identical durations (i.e. non-constant frame rate), so the analogy with video ends here (in SMIL, the &quot;textstream&quot; element is recommended instead of &quot;text&quot;). However, this model is also compatible with transitions (visual animation from one internal frame to the next, with no impact on the execution timeline), although DFXP only offers roll-up captioning, whereas SMIL offers much more with its library of transitions.

In my line of work (DAISY Digital Talking Books), text is primarily a static resource that is hosted within a SMIL presentation so that an audio narration can be finely-synchronized with its corresponding textual source (paragraph, sentence, word, etc.). This model also works with the reverse scenario where the text is a transcript of an original audio track, so this may also be suitable for the video+captions use-case. DAISY offers an additional structure on top of the timing and synchronization layer: hierarchical navigation, slightly richer to that of an ePUB book (table of content, page numbers, footnotes, sidebars, etc.). In this context, cross-publication navigation (i.e. outwards hyper-linking) is not a top-priority, however bookmarks and annotations are must-haves (inwards references to points or ranges within a given distribution unit).

Web-browsers offer off-the-shelf implementations of many crucial components (text rendering, navigation, etc.), so we&#039;re keeping a close eye on the developments of HTML5 and Media Fragments. Let&#039;s hope it doesn&#039;t take another decade for these to be ratified by standardization bodies :)

Cheers, Daniel</description>
		<content:encoded><![CDATA[<p>Hi Silvia, thanks for this blog post.</p>
<p>I always find it interesting to compare terminologies. In the Media Fragment diagram, there is a type of track composed of &#8220;discrete&#8221; media, as per the SMIL definition. By contrast, some tracks are based on &#8220;continuous&#8221; Media.</p>
<p><a href="http://www.w3.org/TR/SMIL3/smil-timing.html#Timing-DiscreteContinuousMedia" rel="nofollow">http://www.w3.org/TR/SMIL3/smil-timing.html#Timing-DiscreteContinuousMedia</a></p>
<p><a href="http://www.w3.org/TR/SMIL3/smil-extended-media-object.html#smilMediaNS-Definitions" rel="nofollow">http://www.w3.org/TR/SMIL3/smil-extended-media-object.html#smilMediaNS-Definitions</a></p>
<p>Time-aligned text, in particular W3C-TimedText-AFXP-DFXP-TTML (or whatever it ends-up being known as) is an interesting case. The simple internal timing model means that such media track can be seen as a linear (i.e. from a defined &#8220;begin&#8221; to a known &#8220;end&#8221; without intrinsic discontinuity) time-based resource (i.e. not frame-based) that returns a visual depending on essentially 2 input parameters: a time offset and the coordinates+dimensions of a host rendering area. Both parameters get resolved against particular reference points, such as the wallclock for time or a video rectangle for the space domain. Generally-speaking, this kind of &#8220;text media&#8221; can actually be thought of as a motion-pictures track, with similar content addressing behaviors (i.e. &#8220;give me a time offset and I will return a frame to render, in a deterministic/reproducible manner&#8221;). Contiguous &#8220;frames&#8221; of static text do not necessarily have identical durations (i.e. non-constant frame rate), so the analogy with video ends here (in SMIL, the &#8220;textstream&#8221; element is recommended instead of &#8220;text&#8221;). However, this model is also compatible with transitions (visual animation from one internal frame to the next, with no impact on the execution timeline), although DFXP only offers roll-up captioning, whereas SMIL offers much more with its library of transitions.</p>
<p>In my line of work (DAISY Digital Talking Books), text is primarily a static resource that is hosted within a SMIL presentation so that an audio narration can be finely-synchronized with its corresponding textual source (paragraph, sentence, word, etc.). This model also works with the reverse scenario where the text is a transcript of an original audio track, so this may also be suitable for the video+captions use-case. DAISY offers an additional structure on top of the timing and synchronization layer: hierarchical navigation, slightly richer to that of an ePUB book (table of content, page numbers, footnotes, sidebars, etc.). In this context, cross-publication navigation (i.e. outwards hyper-linking) is not a top-priority, however bookmarks and annotations are must-haves (inwards references to points or ranges within a given distribution unit).</p>
<p>Web-browsers offer off-the-shelf implementations of many crucial components (text rendering, navigation, etc.), so we&#8217;re keeping a close eye on the developments of HTML5 and Media Fragments. Let&#8217;s hope it doesn&#8217;t take another decade for these to be ratified by standardization bodies <img src='http://blog.gingertech.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Cheers, Daniel</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/comment-page-1/#comment-2582</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Mon, 23 Nov 2009 11:09:56 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=710#comment-2582</guid>
		<description>@Raphael

Yes, DFXP is one such format that supports outgoing hyperlinks. Also, DFXP has been mapped into FLV and MPEG4 (and with it QuickTime and 3GPP), so it does in fact have quite some support for the binary formats, too. It is definitely one format that turns media resources into hypermedia. SRT doesn&#039;t provide for such. I would, however, not yet go as far as calling DFXP the one format in use by everyone - the spread just isn&#039;t there yet and the open source tools support.</description>
		<content:encoded><![CDATA[<p>@Raphael</p>
<p>Yes, DFXP is one such format that supports outgoing hyperlinks. Also, DFXP has been mapped into FLV and MPEG4 (and with it QuickTime and 3GPP), so it does in fact have quite some support for the binary formats, too. It is definitely one format that turns media resources into hypermedia. SRT doesn&#8217;t provide for such. I would, however, not yet go as far as calling DFXP the one format in use by everyone &#8211; the spread just isn&#8217;t there yet and the open source tools support.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/comment-page-1/#comment-2580</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Mon, 23 Nov 2009 10:17:05 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=710#comment-2580</guid>
		<description>@Philip ups! I&#039;ll remove &quot;stop()&quot; :-) .

When I talk about &quot;separating the media resource concept from actual files&quot; I mean what we are already talking about with captions and the source element: the tracks of a media resource may actually be in several files. The actual markup inside HTML already has an indirection, if you so want: it points to the actual files that contribute to the resource.

If you really want to compare it to SMIL, then yes, we already have similarities in HTML5 to SMIL, where we reference the contributing resources. But other than that, we need to be careful about comparing to SMIL, since SMIL allows for so many complexities that go far beyond a single linear timeline and my post is trying to clarify that this is not what we want or can work with in HTML5.</description>
		<content:encoded><![CDATA[<p>@Philip ups! I&#8217;ll remove &#8220;stop()&#8221; <img src='http://blog.gingertech.net/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  .</p>
<p>When I talk about &#8220;separating the media resource concept from actual files&#8221; I mean what we are already talking about with captions and the source element: the tracks of a media resource may actually be in several files. The actual markup inside HTML already has an indirection, if you so want: it points to the actual files that contribute to the resource.</p>
<p>If you really want to compare it to SMIL, then yes, we already have similarities in HTML5 to SMIL, where we reference the contributing resources. But other than that, we need to be careful about comparing to SMIL, since SMIL allows for so many complexities that go far beyond a single linear timeline and my post is trying to clarify that this is not what we want or can work with in HTML5.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Raphaël Troncy</title>
		<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/comment-page-1/#comment-2579</link>
		<dc:creator>Raphaël Troncy</dc:creator>
		<pubDate>Mon, 23 Nov 2009 09:59:36 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=710#comment-2579</guid>
		<description>Silvia,

Naive question: You wrote:
&quot;It seems to be clear that hyperlinks out of media files will come from some type of textual track. But a standard format for such time-aligned text tracks doesn’t yet exist. This is a challenge to be addressed in the near future.&quot;

Does DFXP not allow already to have outbound links in the caption? My understanding was that DFXP provides the ability to have rich (stylish) text instead of plain text a la SRT. Is it restricted to style information (color, font) or can you also include hyperlinks in DFXP?</description>
		<content:encoded><![CDATA[<p>Silvia,</p>
<p>Naive question: You wrote:<br />
&#8220;It seems to be clear that hyperlinks out of media files will come from some type of textual track. But a standard format for such time-aligned text tracks doesn’t yet exist. This is a challenge to be addressed in the near future.&#8221;</p>
<p>Does DFXP not allow already to have outbound links in the caption? My understanding was that DFXP provides the ability to have rich (stylish) text instead of plain text a la SRT. Is it restricted to style information (color, font) or can you also include hyperlinks in DFXP?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Philip Jägenstedt</title>
		<link>http://blog.gingertech.net/2009/11/23/model-of-a-time-linear-media-resource/comment-page-1/#comment-2575</link>
		<dc:creator>Philip Jägenstedt</dc:creator>
		<pubDate>Mon, 23 Nov 2009 09:17:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=710#comment-2575</guid>
		<description>Not to nitpick, but there is no stop() method, just play() and pause().

What does it mean, concretely, to &quot;separate the media resource concept from actual files&quot;? Adding a level of indirection? SMIL?</description>
		<content:encoded><![CDATA[<p>Not to nitpick, but there is no stop() method, just play() and pause().</p>
<p>What does it mean, concretely, to &#8220;separate the media resource concept from actual files&#8221;? Adding a level of indirection? SMIL?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
