Home
Patent Search
IMT Blog
REGISTER
|
SIGN IN
United States Patent Application
20020106022
Kind Code
A1
Satoh, Kazushi ; et al.
August 8, 2002
Image information conversion apparatus and image information conversion method
Abstract
The invention provides an image information conversion apparatus and method by which picture quality deterioration caused by setting of an initial value can be prevented when code amount control in MPEG4 image coding is performed based on information extracted from MPEG2 image compression information. An initial reference quantization scale determination section determines an initial value for a reference quantization scale from predetermined MPEG2 image compression information, the number of macro blocks to be included in an MPEG4 bit stream, a code amount allocated to the first I picture of the MPEG2 image compression information stored in an information buffer, an average quantization scale and a target code amount for the first I-VOP of the MPEG4 bit stream calculated by an MPEG4 image information coding section, and then calculates an initial value for a virtual buffer occupation amount based on the determined initial value for the reference quantization scale.
Inventors:
Satoh; Kazushi
(Kanagawa, JP)
, Takahashi; Kuniaki
(Kanagawa, JP
)
, Suzuki; Teruhiko
(Chiba, JP
)
, Yagasaki; Yoichi
(Tokyo, JP
)
Correspondence Name and Address:
Suite 501 1233 20th Street, NW
RADER, FISHMAN & GRAUER, P.L.L.C
Washington
DC
20036
US
Series Code:
986436
Filed:
November 8, 2001
U.S. Current Class:
375/240.03;
348/448; 375/240.13
U.S. Class at Publication:
375/240.03;
348/448; 375/240.13
Intern'l Class:
H04N 007/12
Claims
What is claimed is:
1. An image information conversion apparatus which receives first image compression information as an input thereto and outputs second image compression information, each of the first image compression information and the second image compression information including at least intra-image coded pictures and inter-image prediction coded pictures, comprising: quantization scale determination means for using information extracted from the first image compression information to determine an initial value for a reference quantization scale to be used for production of an intra-image coded picture of the second image compression information and determining an initial value for a virtual buffer occupation amount for an intra-image coded picture based on the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information.
2. An image information conversion apparatus according to claim 1, wherein the information extracted from the first image compression information is an average quantization scale of the first intra-image coded picture of the first image compression information.
3. An image information conversion apparatus according to claim 2, wherein the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information is determined by operation of the product of a ratio of a code amount of the first image compression information to a code amount of the second image compression information, a ratio of a frame rate of the second image compression information to a frame rate of the first image compression information, and the average quantization scale of the first intra-image coded picture of the first image compression information.
4. An image information conversion apparatus according to claim 3, wherein an integer nearest to the value obtained by the arithmetic operation from among integers representative of the quantization scale used for coding of the second image compression information is used as the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information.
5. An image information conversion apparatus according to claim 3, wherein the initial value for the virtual buffer occupation amount for the intra-image coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate.
6. An image information conversion apparatus according to claim 5, wherein the inter-image prediction coded pictures include a forward prediction coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward prediction coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for the intra-image coded picture and a first constant whereas the initial value for the virtual buffer occupation amount for the bi-directionally predicted coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for the forward prediction coded picture and a second constant.
7. An image information conversion apparatus according to claim 2, wherein the initial value for the reference quantization scale for the first intra-image coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount allocated to the first intra-image coded picture of the first image compression information to a target code amount for the first intra-image coded picture of the second image compression information, a ratio of the number of predetermined coding units included in one frame of the second image compression information to the number of predetermined coding units included in one frame of the first image compression information, and an average quantization scale of the first intra-image coded pictures of the first image compression information.
8. An image information conversion apparatus according to claim 7, wherein an integer nearest to the value obtained by the arithmetic operation from among integers representative of the quantization scale used for coding of the second image compression information is used as the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information.
9. An image information conversion apparatus according to claim 7, wherein the initial value for the virtual buffer occupation amount for the intra-image coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate.
10. An image information conversion apparatus according to claim 9, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for the intra-image coded picture and a first constant whereas the initial value for the virtual buffer occupation amount for the bi-directionally predicted coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for the forward predictive coded picture and a second constant.
11. An image information conversion apparatus according to claim 3, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the reference quantization scale to be used for production of the first forward predictive coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount of the first image compression information to the code amount of the second image compression information, a ratio of the frame rate of the second image compression information to the frame rate of the first image compression information, and an average quantization scale of the first inter-image predictive coded picture of the second image compression information, whereafter the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount of the first image compression information to the code amount of the second image compression information, a ratio of the frame rate of the second image compression information to the frame rate of the first image compression information, and an average quantization scale of the first bi-directionally predicted coded picture of the second image compression information.
12. An image information conversion apparatus according to claim 11, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first inter-image predictive coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate, whereafter the initial value for the virtual buffer occupation amount for the bi-directionally predicted coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to the variable based on the ratio between the bit rate and the display rate.
13. An image information conversion apparatus according to claim 3, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the reference quantization scale to be used for production of the first forward predictive coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount allocated to the first inter-image predictive coded picture of the first image compression information to a target code amount for the first inter-image predictive coded picture of the second image compression information, a ratio of the number of predetermined coding units included in one frame of the second image compression information to the number of predetermined coding units included in one frame of the first image compression information, and an average quantization scale of the forward predictive coded picture of the first image compression information, whereafter the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount allocated to the first bi-directionally predicted coded picture of the first image compression information to a target code amount for the first bi-directionally predicted coded picture of the second image compression information, a ratio of the number of predetermined coding units included in one frame of the second image compression information to the number of predetermined coding units included in one frame of the first image compression information, and an average quantization scale of the bi-directionally predicted coded picture.
14. An image information conversion apparatus according to claim 13, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined based on a ratio of the product of the initial value for the reference quantizations scale to be used for production of the first inter-image predictive coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate, whereafter the initial value for the virtual buffer occupation amount for a bi-directionally predicted coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to the variable based on the ratio between the bit rate and the display rate.
15. An image information conversion apparatus according to claim 1, wherein the first image compression information is MPEG2 image compression information standardized by the Moving Picture Experts Group, and the second image compression information is MPEG4 image compression information.
16. An image information conversion method for receiving first image compression information as an input thereto and outputting second image compression information, each of the first image compression information and the second image compression information including at least intra-image coded pictures and inter-image predictive coded pictures, said method comprising the steps of: using information extracted from the first image compression information to determine an initial value for a reference quantization scale to be used for production of an intra-image coded picture of the second image compression information; and determining an initial value for a virtual buffer occupation amount for an intra-image coded picture based on the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information.
17. An image information conversion method according to claim 16, wherein the information extracted from the first image compression information is an average quantization scale of the first intra-image coded picture of the first image compression information.
18. An image information conversion method according to claim 17, wherein the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information is determined by operation of the product of a ratio of a code amount of the first image compression information to a code amount of the second image compression information, a ratio of a frame rate of the second image compression information to a frame rate of the first image compression information, and the average quantization scale of the first intra-image coded picture of the first image compression information.
19. An image information conversion method according to claim 18, wherein an integer nearest to the value obtained by the arithmetic operation from among integers representative of the quantization scale used for coding of the second image compression information is used as the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information.
20. An image information conversion method according to claim 18, wherein the initial value for the virtual buffer occupation amount for an intra-image coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate.
21. An image information conversion method according to claim 20, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for an intra-image coded picture and a first constant whereas the initial value for the virtual buffer occupation amount for a bi-directionally predicted coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for the forward predictive coded picture and a second constant.
22. An image information conversion method according to claim 17, wherein the initial value for the reference quantization scale for the first intra-image coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount allocated to the first intra-image coded picture of the first image compression information to a target code amount for the first intra-image coded picture of the second image compression information, a ratio of the number of predetermined coding units included in one frame of the second image compression information to the number of predetermined coding units included in one frame of the first image compression information, and an average quantization scale of the first intra-image coded pictures of the first image compression information.
23. An image information conversion method according to claim 22, wherein an integer nearest to the value obtained by the arithmetic operation from among integers representative of the quantization scale used for coding of the second image compression information is used as the initial value for the reference quantization scale to be used for production of the first intra-image coded picture of the second image compression information.
24. An image information conversion method according to claim 22, wherein the initial value for the virtual buffer occupation amount for an intra-image coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate.
25. An image information conversion method according to claim 24, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for an intra-image coded picture and a first constant whereas the initial value for the virtual buffer occupation amount for the bi-directionally predicted coded picture is determined by operation of the product of the initial value for the virtual buffer occupation amount for the forward predictive coded picture and a second constant.
26. An image information conversion method according to claim 18, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the reference quantization scale to be used for production of the first forward predictive coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount of the first image compression information to the code amount of the second image compression information, a ratio of the frame rate of the second image compression information to the frame rate of the first image compression information, and an average quantization scale of the first inter-image predictive coded picture of the second image compression information, whereafter the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount of the first image compression information to the code amount of the second image compression information, a ratio of the frame rate of the second image compression information to the frame rate of the first image compression information, and an average quantization scale of the first bi-directionally predicted coded picture of the second image compression information.
27. An image information conversion method according to claim 26, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first inter-image predictive coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate, whereafter the initial value for the virtual buffer occupation amount for a bi-directionally predicted coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to the variable based on the ratio between the bit rate and the display rate.
28. An image information conversion method according to claim 18, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the reference quantization scale to be used for production of the first forward predictive coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount allocated to the first inter-image predictive coded picture of the first image compression information to a target code amount for the first inter-image predictive coded picture of the second image compression information, a ratio of the number of predetermined coding units included in one frame of the second image compression information to the number of predetermined coding units included in one frame of the first image compression information, and an average quantization scale of the forward predictive coded picture, whereafter the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information is determined by operation of the product of a ratio of the code amount. allocated to the first bi-directionally predicted coded picture of the first image compression information to a target code amount for the first bi-directionally predicted coded picture of the second image compression information, a ratio of the number of predetermined coding units included in one frame of the second image compression information to the number of predetermined coding units included in one frame of the first image compression information, and an average quantization scale of the bi-directionally predicted coded picture.
29. An image information conversion method according to claim 28, wherein the inter-image predictive coded pictures include a forward predictive coded picture and a bi-directionally predicted coded picture, and the initial value for the virtual buffer occupation amount for the forward predictive coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first inter-image predictive coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to a variable based on a ratio between a bit rate and a display rate, whereafter the initial value for the virtual buffer occupation amount for the bi-directionally predicted coded picture is determined based on a ratio of the product of the initial value for the reference quantization scale to be used for production of the first bi-directionally predicted coded picture of the second image compression information and the highest value of integers representative of the quantization scale used for coding of the second image compression information to the variable based on the ratio between the bit rate and the display rate.
30. An image information conversion method according to claim 16, wherein the first image compression information is MPEG2 image compression information standardized by the Moving Picture Experts Group, and the second image compression information is MPEG4 image compression information.
Description
BACKGROUND OF THE INVENTION
[0001] This invention relates to an image information conversion apparatus and an image information conversion method, and more particularly to an image information conversion apparatus and an image information conversion method which are used to receive, through network media such as a satellite broadcast, a cable television broadcast or the Internet or process, on a recording medium such as an optical disk or a magneto-optical disk, image information in the form of a bit stream compressed by orthogonal transform such as discrete cosine transform and motion compensation.
[0002] In recent years, an apparatus which complies with a method wherein image information is handled as digital data and the redundancy unique to image information is utilized to compress image information by orthogonal transform such as, for example, discrete cosine transform and motion compensation in order to allow transmission and storage of information with a high efficiency has been and is being popularized in both of information distribution from a broadcasting station or the like and information reception by general homes.
[0003] Particularly, MPEG2 standardized by the MPEG (Moving Picture Experts Group) is defined as a general purpose image coding system in the ISO/IEC 13818-2 and covers both of interleaved scan images and progressive scan images as well as standard resolution images and high resolution images. Therefore, it is expected that the MPEG2 be used by wide varieties of applications from professional applications to consumer applications in the future.
[0004] Where such an MPEG2 compression system as described above is used, realization of a high compression ratio and a good picture quality can be anticipated by allocating, to interleaved scan images of a standard resolution having, for example, 720.times.480 pixels, a code amount (hereinafter referred to as bit rate) of 4 to 8 Mbps or by allocating, to interleaved scan images of a high resolution having, for example, 1,920.times.1,088 pixels, a bit rate of 18 to 22 Mbps.
[0005] The MPEG2 is directed to high picture quality coding suitable principally for broadcasting, but is not ready for a coding system of a bit rate lower than, that is, of a compression ratio higher than, that of the MPEG1. However, from popularization of portable terminals, it has been expected that the need for a coding system of a higher compression ratio increase in the future. Therefore, the MPEG4 coding system has been standardized, and the image coding system of the MPEG4 was approved as international standards of the ISO/IEC 14496-2 in December 1998.
[0006] In order to process MPEG2 image compression information (hereinafter referred to as MPEG2 bit stream) coded once so as to be suitable for digital broadcasting on a portable terminal or the like, it is demanded to convert the MPEG2 bit stream into MPEG4 image compression information (hereinafter referred to as MPEG4 bit stream) of a lower bit rate.
[0007] An image information conversion apparatus (transcoder) which satisfies the demand is disclosed in Susie J. Wee, John G. Apostlopoulos and Nick Feamster, "Field-to-Frame Transcoding with Spatial and Temporal Downsampling", ICIP '99 (hereinafter referred to as document 1). The image information conversion apparatus mentioned is shown in FIG. 4.
[0008] Referring to FIG. 4, the image information conversion apparatus 100
shown includes a picture type discrimination section 101, an MPEG2 image information (I/P picture) decoding section 102, a reduction section 103, a video memory 104, an MPEG4 image information (I/P-VOP) coding section 105, a motion vector synthesis section 106, and a motion vector detection section 107. It is to be noted that the VOP (Video Object Plane) in the MPEG4 corresponds to the frame in the MPEG2.
[0009] The picture type discrimination section 101 receives data of frames of MPEG2 image compression information (hereinafter referred to as MPEG2
bit stream) of an interleaved scan as an input thereto and discriminates whether data of each frame is of MPEG2 image information (hereinafter referred to as I/P picture which signifies an intra-image coded picture/forward predictive coded picture) or of a B picture (bi-directionally predicted picture) The picture type discrimination section 101 outputs only the former data to the MPEG2 image information decoding section 102 of the following stage.
[0010] The MPEG2 image information decoding section 102 executes processing similar to that of an ordinary MPEG2 image information decoding section. However, since data regarding B pictures are discarded by the picture type discrimination section 101, only it is required for the MPEG2 image information decoding section 102 to have a function of decoding only I/P pictures.
[0011] The reduction section 103 receives pixel values from the MPEG2
image information decoding section 102 and performs processing of reducing the pixel values to 1/2 in the horizontal direction and discarding data of one of the first and second fields in the vertical direction while leaving data of the other field to produce a progressive scan image having a size of 1/4 that of the inputted image information.
[0012] If the MPEG2 bit stream inputted from the MPEG2 image information decoding section 102 represents images compliant with the standards of the NTSC (National Television System Committee), that is, interleaved scan images of 720.times.480 pixels and 30 Hz, then the images after the reduction by the reduction section 103 have a size of 360.times.240
pixels. However, in order to allow the processing in a unit of a macro block when the MPEG4 image information coding section 105 in a succeeding stage performs coding, the pixel numbers both in the horizontal and vertical directions must be multiples of 16. Accordingly, the reduction section 103 further performs supplementation or discarding of pixels for satisfying the requirement. In particular, in the specific case described above, eight lines, for example, at the right end or the left end in the horizontal direction are discarded so that the image has a size of 352.times.240 pixels.
[0013] The progressive scan image produced by the reduction section 103 is stored into the video memory 104 and then undergoes coding processing by the MPEG4 image information coding section 105, and is outputted as an MPEG4 bit stream.
[0014] Motion vector information in the inputted MPEG2 bit stream is supplied to the motion vector synthesis section 106, by which it is mapped to motion vectors for the image information after the reduction.
[0015] The motion vector detection section 107 detects motion vectors of a high degree of accuracy based on the motion vector values synthesized by the motion vector synthesis section 106.
[0016] The image information conversion apparatus 100 disclosed in document 1 produces an MPEG4 bit stream of progressive scan images having a size of 1/2.times.1/2 that of an inputted MPEG2 bit. stream. For example, where the inputted MPEG2 bit stream complies with the NTSC standards, the MPEG4 bit. stream to be outputted has the SIF size (352.times.240 pixels). The image information conversion apparatus 100
can convert the inputted MPEG2 bit stream also into an image of any other image size, for example, the QSIF (176.times.112 pixels) size which is a size of approximately 1/4.times.1/4 in the example described above, by modifying the operation of the reduction section 103.
[0017] Further, the image information conversion apparatus 100 performs, as a process by the MPEG2 image information decoding section 102, a decoding process using all of eighth-order discrete cosine transform coefficients in the inputted MPEG2 bit stream for the horizontal and vertical directions or EL decoding process using only low-frequency components from among eighth-order discrete cosine transform coefficients only for the horizontal direction or for both of the horizontal and vertical directions thereby to reduce the arithmetic operation amount for the decoding process and the video memory capacity while suppressing the picture quality deterioration to the minimum.
[0018] In the image information conversion apparatus 100 shown in FIG. 4, the code amount control of the MPEG4 image information coding section 105
makes a significant factor of determination of the picture quality of an MPEG4 bit stream In the ISO/IEC 14496-2, the system for code amount control is not specifically prescribed, and each vendor can use a system which is considered optimum from the point of view of the arithmetic operation amount and the output picture quality in accordance with an application to be used. In the following, a system prescribed in the MPEG2 Test Mode 15 (ISO/IEC JTC1/SC29/WG11 N0400) as a representative code amount control system is described.
[0019] For the code amount control, bit distribution to each picture is performed as a first step using a target code amount (target bit rate) and a GOP (Group Of Pictures) configuration as input variables, and then rate control is performed using a virtual buffer, whereafter adaptive quantization for each macro block is performed finally taking a visual characteristic into consideration. The operation of the code amount control is illustrated in FIG. 5.
[0020] Referring to FIG. 5, first in step S101, the MPEG4 image information coding section 105 distributes an allocation bit amount for each picture in a GOP in accordance with a bit amount (hereinafter represented by R) to be allocated to those pictures which are not decoded as yet including allocation object pictures. This distribution is repeated in order of coded pictures in the GOP. In this instance, the code amount allocation to each picture is performed based on the following two assumptions.
[0021] First, it is assumed that the product of an average quantization scale code to be used for coding of each picture and the generated code amount is fixed for each picture type unless the screen does not change. Therefore, after each picture is coded, variables X.sub.i, X.sub.p and X.sub.b (global complexity measures) each representative of the complexity of the screen are updated in accordance with the following expressions (1) to (3) for individual picture types:
X.sub.i=S.sub.i.multidot.Q.sub.i (1)
X.sub.p=S.sub.p.multidot.Q.sub.p (2)
X.sub.b=S.sub.b.multidot.Q.sub.b (3)
[0022] where S.sub.i, S.sub.p and S.sub.b are the generated code bit amounts upon picture coding, and Q.sub.i, Q.sub.p and Q.sub.b are average quantization scale codes upon picture coding. The variables X.sub.i, X.sub.p and X.sub.b have initial values represented by the following expressions (4) to (6), respectively, using the target code amount (target bit rate) bit_rate [bits/sec]:
X.sub.i=160.times.bit_rate/115 (4)
X.sub.i=60.times.bit_rate/115 (5)
X.sub.i=42.times.bit_rate/115 (6)
[0023] Secondly, it is assumed that the picture quality of the entire image is always optimized when the ratios K.sub.p and K.sub.b of the quantization scale code of P and B pictures with reference to the quantization scale code of an I picture have values defined by the following expression (7):
K.sub.p=1.0;K.sub.b=1.4 (7)
[0024] In particular, the quantization scale code of a B picture is always 1.4 times that of the quantization scale codes of I and P pictures. Here, it is supposed that, by coding a B picture rather roughly than I and P pictures, if the code amount saved with a B picture is added to that of an I or P picture, then the picture quality of the I or P picture is improved, and also the picture of a B picture which refers to the I or P picture is improved.
[0025] From the two assumptions specified as above, the allocation bit amounts (T.sub.i, T.sub.p, T.sub.b) to the different pictures of the GOP have values given by the following expressions (8) to (10), respectively: 1 T i = max { R 1 + N p X p X i K p + N b X b X i K b , bit_rate 8 .times. picture_rate } ( 8 ) T p = max { R N p + N b K p X b K b X p , bit_rate 8 .times. picture_rate } ( 9 ) T b = max { R N b + N p K b X p K p X b , bit_rate 8 .times. picture_rate } ( 10 )
[0026] where N.sub.p and N.sub.b are the numbers of P and B pictures which are not coded in the GOP as yet.
[0027] Based on the allocation code amounts determined in this manner, each time a. picture is coded in steps S101 and S102, the bit amount R to be allocated to a non-coded picture in the GOP is updated in accordance with the following expression (11):
R=R-S.sub.i,p,b (11)
[0028] On the other hand, when the first picture in the GOP is to be coded, the bit amount R is updated in accordance with the following expression (12): 2 R = bit_rate .times. N picture_rate + R ( 12 )
[0029] where N is the number of pictures in the GOP. The initial value of the bit amount R at the start of a sequence is 0.
[0030] In step S102, in order to make the allocation bit amounts (T.sub.i, T.sub.p, T.sub.b) to the pictures determined in accordance with the expressions (8) to (10) in step S101 and actual generation code amounts coincide with each other, quantization scale codes are determined based on capacities of three different virtual buffers set independently of each other for the individual pictures by feedback control in a unit of a macro block. First, prior to code of a jth macro block, the occupation amounts of the virtual buffers are determined in accordance with the following expressions (13) to (15): 3 d j i = d o i + B j - 1
- T i .times. ( j - 1 ) MB_cnt ( 13 ) d j p = d o p + B j - 1 - T p .times. ( j - 1 ) MB_cnt ( 14 ) d j b = d o b + B j - 1 - T b .times. ( j - 1 ) MB_cnt ( 15 )
[0031] where d.sub.o.sup.i, d.sub.o.sup.p and d.sub.o.sup.b are the initial occupation amounts of the virtual buffers, B.sub.j is the generation bit amount from the top of the picture to the jth macro block, and MB_cnt is the number of macro blocks in one picture. The occupation amounts (d.sub.MB.sub..sub.--.sub.cnt.sup.i, d.sub.MB.sub..sub.--.sub.cnt- .sup.p, d.sub.MB.sub..sub.--.sub.cnt.sup.b) of the virtual buffers upon ending of coding of the individual pictures are used as initial values (d.sub.o.sup.i, d.sub.o.sup.P, d.sub.o.sup.b) for the virtual buffer occupations for the next pictures.
[0032] Then, the quantization scale code for the jth macro block is calculated in accordance with the following expression (16): 4 Q j = d j .times. 31 r ( 16 )
[0033] where r is a variable called reaction parameter used to control the response of a feedback loop and given by the following expression (17): 5 r = 2 .times. bit_rate picture_rate ( 17 )
[0034] The initial values of the virtual buffers at the start of coding are given by the following expressions (18) to (20): 6 d o i = 10
.times. r 31 ( 18 ) d o p = K p d o i ( 19 ) d o b = K b d o i ( 20 )
[0035] In step S103, the quantization scale codes determined in step S102
are modified with a variable called activity for each macro block so that they may be quantized finely at a flat portion at which deterioration can be visually observed comparatively conspicuously but may be quantized roughly at a complicated pattern portion at which deterioration can be visually observed comparatively less conspicuously.
[0036] The activity is given by the following expression (21) using pixel values of totaling 8 blocks including 4 blocks of a frame discrete cosine transform mode and 4 blocks of a field discrete cosine transform mode using brightness signal pixel values of the original picture: 7 act j = 1 + min sblk = 1 , 8 ( var_sblk ) var_sblk = 1
64 k = 1 64 ( P k - P
Quick Search
patentmonkey
UpgradeAccount
IMTBlog
BestLegalBids