US 7,613,605 B2
Audio signal encoding apparatus and method
Masanobu Funakoshi, Kawasaki (Japan)
Assigned to Canon Kabushiki Kaisha, Tokyo (Japan)
Filed on May 16, 2007, as Appl. No. 11/749,563.
Application No. 11/749,563 is a continuation of application No. PCT/JP2005/021014, filed on Nov. 16, 2005.
Claims priority of application No. 2004-335005 (JP), filed on Nov. 18, 2004; and application No. 2005-328945 (JP), filed on Nov. 14, 2005.
Prior Publication US 2007/0265836 A1, Nov. 15, 2007
Int. Cl. G10L 19/00 (2006.01)
U.S. Cl. 704/219 [704/222; 704/229; 375/240.22]
16 Claims
 
1. An audio signal encoding apparatus comprising:
a frame dividing unit configured to divide an audio input signal into processing unit frames for respective channels;
a psychoacoustic arithmetic unit configured to analyze the audio input signal to determine a transform block length and to make an auditory masking calculation;
a filter bank unit configured to decompose a frame to be processed into blocks in accordance with the transform block length determined by said psychoacoustic arithmetic unit to transform time domain signals in the frame into one or more sets of frequency spectrum;
a scale factor calculation unit configured to divide the frequency spectrum output from said filter bank unit into a plurality of frequency bands, and calculate scale factors for weighting the spectrum in the respective frequency bands based on an arithmetic result of said psychoacoustic arithmetic unit;
a quantization step determination unit configured to determine a quantization step of the entire frame prior to spectrum quantization by subtracting an information amount of all quantized spectrum from a perceptual information amount of all the spectrum before quantization, which are weighted by the scale factors calculated by said scale factor calculation unit, and multiplying the difference by a coefficient obtained from a step width of a quantization coarseness;
a spectrum quantization unit configured to quantize the frequency spectrum using the scale factors and the quantization step; and
a bit shaping unit configured to form and output a bitstream obtained by shaping quantized spectrum output from said spectrum quantization unit in accordance with a predetermined format,
wherein said quantization step determination unit includes a quantized spectral information amount prediction unit configured to predict the information amount of all the quantized spectrum based on a bit size assigned to a frame to be encoded.
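The quantization step determination recited above can be illustrated with a minimal sketch. It is not the patented implementation, and everything beyond the claim wording is an assumption made for illustration: the information amount is taken as the sum of log2 magnitudes, the quantizer is assumed to scale each value by 2 ** (-STEP_WIDTH) per step unit, STEP_WIDTH is set to 3/16, and the predictor predict_quantized_information is a hypothetical linear mapping from the frame bit size. All function and parameter names are hypothetical.

```python
import math

# Assumption: one unit of quantization step scales a value by 2 ** (-STEP_WIDTH).
# The value 3/16 is illustrative only and is not taken from the claim.
STEP_WIDTH = 3.0 / 16.0


def information_amount(values):
    """Illustrative information measure: sum of log2 magnitudes of nonzero values."""
    return sum(math.log2(abs(v)) for v in values if v != 0)


def predict_quantized_information(frame_bits, info_per_bit=1.0):
    """Hypothetical predictor: quantized information is proportional to the frame bit size."""
    return info_per_bit * frame_bits


def determine_quantization_step(weighted_spectrum, frame_bits):
    """Return one step for the whole frame, computed before any quantization.

    step = (perceptual information - predicted quantized information) * coefficient,
    where the coefficient is derived from the assumed step width.
    """
    nonzero_lines = sum(1 for v in weighted_spectrum if v != 0)
    if nonzero_lines == 0:
        return 0.0
    perceptual = information_amount(weighted_spectrum)
    predicted = predict_quantized_information(frame_bits)
    coefficient = 1.0 / (STEP_WIDTH * nonzero_lines)
    return (perceptual - predicted) * coefficient


def apply_step(weighted_spectrum, step):
    """Scale the spectrum by the assumed quantizer law (rounding omitted for clarity)."""
    scale = 2.0 ** (-STEP_WIDTH * step)
    return [v * scale for v in weighted_spectrum]


if __name__ == "__main__":
    spectrum = [16.0, 16.0, 16.0, 16.0]  # already weighted by scale factors
    step = determine_quantization_step(spectrum, frame_bits=4)
    scaled = apply_step(spectrum, step)
    print(step, information_amount(scaled))
```

Under these assumptions the step is obtained in closed form, so no iterative search over candidate steps is needed: applying the returned step brings the information amount of the scaled spectrum to the predicted target.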