<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Embedding time-aligned text into Ogg</title>
	<atom:link href="http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/</link>
	<description>Silvia&#039;s blog</description>
	<lastBuildDate>Sun, 28 Feb 2010 17:51:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-739</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Sat, 22 Nov 2008 12:45:21 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-739</guid>
		<description>Hi Martin,

Hmm... there seems to be a lot going under NB. The director&#039;s comments that Paul suggested would be displayed on screen, whereas these more metadata like comments would probably only be machine-readable. It may well be better to create another type for this. How about META?</description>
		<content:encoded><![CDATA[<p>Hi Martin,</p>
<p>Hmm&#8230; there seems to be a lot going under NB. The director&#8217;s comments that Paul suggested would be displayed on screen, whereas these more metadata like comments would probably only be machine-readable. It may well be better to create another type for this. How about META?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Martin Visser</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-738</link>
		<dc:creator>Martin Visser</dc:creator>
		<pubDate>Sat, 22 Nov 2008 12:10:42 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-738</guid>
		<description>Not sure if this is an addition, or if it is already considered to be included in say the NB metadata codec, but about camera position either relative (some sort of studio XY coord) or absolute (long/lat/altitude) and in fact pan, tilt, zoom or other camera related data. This could even apply to audio I guess. Either one would hope/expect this metadata would have a standard format that could be included.</description>
		<content:encoded><![CDATA[<p>Not sure if this is an addition, or if it is already considered to be included in say the NB metadata codec, but about camera position either relative (some sort of studio XY coord) or absolute (long/lat/altitude) and in fact pan, tilt, zoom or other camera related data. This could even apply to audio I guess. Either one would hope/expect this metadata would have a standard format that could be included.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-736</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Wed, 19 Nov 2008 23:13:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-736</guid>
		<description>Hi Jack,

The codec types are what a user in e.g. a Web browser would choose to be displayed, if it is so available and if it understand the format of content type, i.e. the format of the description.

For example, DFXP, smilText and realText can be used to describe any of these. But one instance document should only represent one text codec type. I.e. one instance DFXP document should be either a caption or a tickertext or a karaoke file.

When encapsulating this into Ogg, you want to know which it is. You want to know this at the level of encapsulation because the server might need to do content adaptation and throw away some of the tracks. Also, it is easier for the client to know this in a quick way and at a defined location in the bitstream, such that it can present to its user the ones that the user prefers.

So, it doesn&#039;t matter if you have a format like DFXP which is able to specify multiple of these types, or a format like srt which is only for subtitles (and maybe captions). The author will specify which format it is and which codec type as it gets multiplexed into Ogg and you&#039;re sorted.

As for your comparison with video codec types: you are right that video codec types, in particular the &quot;codecs&quot; parameter in a MIME type, has a very different meaning. I may have chosen a bad name to describe what I mean. If anyone has a better suggestion for what to call them - I think one person suggested codec category - let me know.

BTW: the actual MIME type of the text codec is also encapsulated into OGG, so you don&#039;t need to worry about that missing.</description>
		<content:encoded><![CDATA[<p>Hi Jack,</p>
<p>The codec types are what a user in e.g. a Web browser would choose to be displayed, if it is so available and if it understand the format of content type, i.e. the format of the description.</p>
<p>For example, DFXP, smilText and realText can be used to describe any of these. But one instance document should only represent one text codec type. I.e. one instance DFXP document should be either a caption or a tickertext or a karaoke file.</p>
<p>When encapsulating this into Ogg, you want to know which it is. You want to know this at the level of encapsulation because the server might need to do content adaptation and throw away some of the tracks. Also, it is easier for the client to know this in a quick way and at a defined location in the bitstream, such that it can present to its user the ones that the user prefers.</p>
<p>So, it doesn&#8217;t matter if you have a format like DFXP which is able to specify multiple of these types, or a format like srt which is only for subtitles (and maybe captions). The author will specify which format it is and which codec type as it gets multiplexed into Ogg and you&#8217;re sorted.</p>
<p>As for your comparison with video codec types: you are right that video codec types, in particular the &#8220;codecs&#8221; parameter in a MIME type, has a very different meaning. I may have chosen a bad name to describe what I mean. If anyone has a better suggestion for what to call them &#8211; I think one person suggested codec category &#8211; let me know.</p>
<p>BTW: the actual MIME type of the text codec is also encapsulated into OGG, so you don&#8217;t need to worry about that missing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jack Jansen</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-735</link>
		<dc:creator>Jack Jansen</dc:creator>
		<pubDate>Wed, 19 Nov 2008 13:51:07 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-735</guid>
		<description>I&#039;m not sure I understand how the codec types would be used, or, at least, they don&#039;t seem like codec types to me, more like a semantic annotation of what the track is about.

There doesn&#039;t seem to be a straight mapping between your codec types and encodings (neither 1-to-many nor many-to-1), for example, DFXP or smilText or realText could be used for any of subtitles, captions, karaoke and tickertext. And probably some of the others as well. So, only the document author would be able to set the codec type, based on the content of the track.

This make the use of these codec types completely different (from a client application point of view) from other codec types like for video. For the latter, the codec type signals (to the app) whether it&#039;ll be able to decode and display the track, nothing about its semantics. For these text codecs it&#039;s exactly the reverse: the client app only gets semantic info from the codec type, and cannot determine whether it&#039;ll be able to decode the stream...</description>
		<content:encoded><![CDATA[<p>I&#8217;m not sure I understand how the codec types would be used, or, at least, they don&#8217;t seem like codec types to me, more like a semantic annotation of what the track is about.</p>
<p>There doesn&#8217;t seem to be a straight mapping between your codec types and encodings (neither 1-to-many nor many-to-1), for example, DFXP or smilText or realText could be used for any of subtitles, captions, karaoke and tickertext. And probably some of the others as well. So, only the document author would be able to set the codec type, based on the content of the track.</p>
<p>This make the use of these codec types completely different (from a client application point of view) from other codec types like for video. For the latter, the codec type signals (to the app) whether it&#8217;ll be able to decode and display the track, nothing about its semantics. For these text codecs it&#8217;s exactly the reverse: the client app only gets semantic info from the codec type, and cannot determine whether it&#8217;ll be able to decode the stream&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-734</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Wed, 19 Nov 2008 07:04:22 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-734</guid>
		<description>TBBle,

I have just read it up on http://en.wikipedia.org/wiki/Fansub and apparently you are right. Strangely enough, this relates to &quot;closed&quot; and &quot;open&quot; captions - open captions are hardcoded into the imagery and cannot be removed, while closed captions are in a separate data stream that can be turned off.

Thanks for noticing.</description>
		<content:encoded><![CDATA[<p>TBBle,</p>
<p>I have just read it up on <a href="http://en.wikipedia.org/wiki/Fansub" rel="nofollow">http://en.wikipedia.org/wiki/Fansub</a> and apparently you are right. Strangely enough, this relates to &#8220;closed&#8221; and &#8220;open&#8221; captions &#8211; open captions are hardcoded into the imagery and cannot be removed, while closed captions are in a separate data stream that can be turned off.</p>
<p>Thanks for noticing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul "TBBle" Hampson</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-733</link>
		<dc:creator>Paul "TBBle" Hampson</dc:creator>
		<pubDate>Wed, 19 Nov 2008 06:11:05 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-733</guid>
		<description>I suspect that your distinction between hard and soft subtitles isn&#039;t accurate. As I understand it, hard subtitles are subtitles that are rendered into the images that make up the video during encoding, while soft subtitles are actually rendered at playback-time based on text input and timing tags.

DVD overlay subtitles are I guess a corner-case, since they&#039;re pre-rendered but those prerenders are triggered by timing values. But I&#039;m pretty sure they&#039;re hard subs because they&#039;re not in a textual format.

On the other hand, I believe the MKV format has a way of including a soft sub stream in the actual container (rather than as an external file), but I don&#039;t know how standard that is. I think mplayer supports it...

Anyway, as I understand it, what you&#039;re talking about here are all soft subs. Hard subs are handled by the encoder processing them into text at encoding time.</description>
		<content:encoded><![CDATA[<p>I suspect that your distinction between hard and soft subtitles isn&#8217;t accurate. As I understand it, hard subtitles are subtitles that are rendered into the images that make up the video during encoding, while soft subtitles are actually rendered at playback-time based on text input and timing tags.</p>
<p>DVD overlay subtitles are I guess a corner-case, since they&#8217;re pre-rendered but those prerenders are triggered by timing values. But I&#8217;m pretty sure they&#8217;re hard subs because they&#8217;re not in a textual format.</p>
<p>On the other hand, I believe the MKV format has a way of including a soft sub stream in the actual container (rather than as an external file), but I don&#8217;t know how standard that is. I think mplayer supports it&#8230;</p>
<p>Anyway, as I understand it, what you&#8217;re talking about here are all soft subs. Hard subs are handled by the encoder processing them into text at encoding time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: silvia</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-732</link>
		<dc:creator>silvia</dc:creator>
		<pubDate>Wed, 19 Nov 2008 06:10:47 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-732</guid>
		<description>I would agree with it being a NB category. NB is not about file information, but about semantic comments and things. This includes movie information and trivia IMHO.

In the end it really depends on how it&#039;s being presented, because there will be default displays for the different types of text annotations. For example, captions and subtitles will be at the bottom of the screen in a certain area, probably in black with a white boundary. However, we need input by designers, GUI experts and the like for what defaults make sense. It&#039;s not a trivial exercise overall. :-)</description>
		<content:encoded><![CDATA[<p>I would agree with it being a NB category. NB is not about file information, but about semantic comments and things. This includes movie information and trivia IMHO.</p>
<p>In the end it really depends on how it&#8217;s being presented, because there will be default displays for the different types of text annotations. For example, captions and subtitles will be at the bottom of the screen in a certain area, probably in black with a white boundary. However, we need input by designers, GUI experts and the like for what defaults make sense. It&#8217;s not a trivial exercise overall. <img src='http://blog.gingertech.net/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul Wayper</title>
		<link>http://blog.gingertech.net/2008/11/19/embedding-time-aligned-text-into-ogg/comment-page-1/#comment-731</link>
		<dc:creator>Paul Wayper</dc:creator>
		<pubDate>Wed, 19 Nov 2008 03:53:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.gingertech.net/?p=245#comment-731</guid>
		<description>Hi Silvia,

The other type of text annotation I&#039;ve seen recently is a director&#039;s commentary or behind-the-scenes information.  In &quot;Who Framed Roger Rabbit&quot; they have a &#039;Toontown Confidential&#039; version of the playback where text gets superimposed on the movie at given points to describe extra information about the scene - such as the number of different effects that a particular scene used or information about particular characters.  I think this might fit into your &#039;NB&#039; category but it&#039;s not strictly metadata about the movie _file_ - I think it fits more into a &#039;movie information&#039; (e.g. &#039;NFO&#039;) or &#039;trivia&#039; (&#039;TRV&#039;) tag.

Hope this helps,

Paul</description>
		<content:encoded><![CDATA[<p>Hi Silvia,</p>
<p>The other type of text annotation I&#8217;ve seen recently is a director&#8217;s commentary or behind-the-scenes information.  In &#8220;Who Framed Roger Rabbit&#8221; they have a &#8216;Toontown Confidential&#8217; version of the playback where text gets superimposed on the movie at given points to describe extra information about the scene &#8211; such as the number of different effects that a particular scene used or information about particular characters.  I think this might fit into your &#8216;NB&#8217; category but it&#8217;s not strictly metadata about the movie _file_ &#8211; I think it fits more into a &#8216;movie information&#8217; (e.g. &#8216;NFO&#8217;) or &#8216;trivia&#8217; (&#8216;TRV&#8217;) tag.</p>
<p>Hope this helps,</p>
<p>Paul</p>
]]></content:encoded>
	</item>
</channel>
</rss>
