[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [AVT] RE: <draft-ietf-avt-rtp-vmr-wb-03.txt>: sampling rate



Hi Sassan and Colin,

I think we have two issues:

A. Is there any benefit to indicate or request that the sampling frequency used at the sender.

B. Is it necessary to use the sampling frequency as RTP timestamp rate.

I will start with A that I think is easier to explain and also can provide some information for issue B. If you find any of my assumptions and statements are incorrect, please correct me.

To my understanding of the VMR-WB after a conversation with Jonas Svedberg is that the VMR-WB will provide a somewhat better encoding of 8kHz material if it is indicated that the input is 8kHz. However there is no need due to compatibility or decoder operation to signal the case where the 8kHz is used as input into the encoder. These would then result that the only case needed to be signaled between encoder and decoder is cases where the decoder will use output at 8kHz. Because if the decoder can request that the encoder uses 8kHz input some improvement of the 8kHz material is achieved. In the other cases where the receiver is capable of 16kHz it doesn't matter for the receiver if the original audio was 8 or 16kHz from a decoding point of view.

Colin, if one looks at issue B. Is it really needed to use the RTP timestamp frequency equal to the sampling rate used? I would say NO to that question. My reasoning is the following.

- Many audio input is sampled from a source at a higher rate then the encoder may handle. Thus a resampling and pre-processing stage is employed based on the encoders input frequency rather then producing that rate initially from the hardware. Some of the reason is that the pre-processing may actually yield better results than what the hardware at given input rate can gain. Another reason may be that one like to avoid switching the hardware between rate if changing the encoding.

- The frame based decoders does not need to know the encoders input rate. The encoder may anyway resample this into other rates for internal processing and band limited signals. I would claim that VMR-WB, AMR-WB+ and AAC are all example of codecs that perform this kind of tricks. On the receiver side they produce a output signal that has any sampling frequency the receiver finds most useful. Either causing clipping of the higher frequencies, but more commonly to a higher clock rate, despite that no more information is provided simply for ease of use.

- The frame based codecs do only need a RTP timestamp that allows the receiver to correctly reconstruct the time line when the encoding is done with the most audio bandwidth. In the VMR-WB case this is 16kHz. AMR-WB+ is even more strange, as we have selected an RTP timestamp rate that results in that all internal sampling frequencies will result in integer timestamp ticks. Thus actually allowing one to correctly calculate frame alignment when the internal sampling frequency changes. That the frequency also is possible to recalculate into several common sampling frequencies with few partial sample alignments was also considered.

Thus I would use this to argue that indicating the actual sampling frequency is not necessarily as long as the receiver is capable of correctly reconstruct the media stream with its timing information in full resolution.

In the VMR-WB case I would think that having only one timestamp rate of 16kHz does not effect codec operation and would simplify the handling when one has some senders that do use 8kHz, especially when gateways need to encoded sometime 8kHz material from pre-recorded responses and in other cases WB channel data. This do avoid the need to perform RTP timestamp rate switches.

If desired to have this possibility to request by a receiver that the sender do use 8kHz input then one should introduce a MIME parameter for this. However I would like to avoid using the "rate" parameter as it results in unnecessary barriers in form of signalling and RTP timestamp rate switching.


Cheers

Magnus Westerlund

Multimedia Technologies, Ericsson Research EAB/TVA/A
----------------------------------------------------------------------
Ericsson AB                | Phone +46 8 4048287
Torshamsgatan 23           | Fax   +46 8 7575550
S-164 80 Stockholm, Sweden | mailto: magnus.westerlund at ericsson.com

_______________________________________________
Audio/Video Transport Working Group
avt at ietf.org
https://www1.ietf.org/mailman/listinfo/avt