close

Вход

Забыли?

вход по аккаунту

?

JP2012161073

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2012161073
An audio output calibration system and an audio output calibration method are provided. An
audio output calibration system according to the present invention comprises a setting module
which constructs a coordinate system and records position coordinates of a camera and a
speaker, and a detection which determines a coordinate position of a listener when the camera
detects a listener. Calculate the distance between the module and the speaker and the listener,
specify the speaker farthest from the listener, and calculate the ratio of the strength of the audio
signal of the designated and non-designated speakers and the time difference of the output time
of the audio signal Based on the time difference, the output time of the audio signal of the nondesignated speaker is delayed based on the difference between the calculation module and the
non-designated speaker is output simultaneously with the designated speaker, and the voice
signal of the non-designated speaker Adjust the intensity of the non-designated speakers and
output an audio signal of the same strength as the designated speakers Includes a calibration
module that, a. [Selected figure] Figure 1
Audio output calibration system and audio output calibration method
[0001]
The present invention relates to audio calibration techniques, and more particularly to a system
for calibrating the audio output of multiple speakers and a calibration method thereof.
[0002]
Audio equipment installed in a meeting room etc. usually outputs voice through a plurality of
09-05-2019
1
speakers.
At this time, the listener's perception of the voice differs depending on the installation position of
the speaker and the position of the listener himself. For example, if the speaker is moving or the
listener is always moving during a conference, the distance from the speaker to the listener may
change, resulting in differences in the time and intensity of the audio output. As a result,
differences also occur in the listener's auditory perception.
[0003]
Therefore, in order to provide each listener with a uniform sound effect, it is necessary to
determine in advance the mounting position of each of the plurality of speakers and to confirm
the sound effect. However, since the mounting position of the speaker and the position of the
listener change depending on the actual situation, the listener can not obtain a comfortable
acoustic effect.
[0004]
In view of the above problems, according to the present invention, an audio output calibration
system and an audio output calibration method capable of obtaining a uniform acoustic effect
regardless of the position of the listener by calibrating the audio output from a plurality of
speakers. Intended to provide.
[0005]
In order to achieve the above object, the audio output calibration system according to the present
invention is used in an electronic device connected to a plurality of speakers and at least two
cameras having a function of identifying the presence or absence of a person.
The audio output calibration system constructs a coordinate system based on the position of the
camera and / or the speaker, and a setting module for recording position coordinates of each
camera and each speaker, and when the camera senses a listener. , The detection module which
determines the coordinate position of the listener, the distance between each speaker and the
listener is calculated based on the listener and the position coordinates of each speaker, and the
speaker farthest from the listener is specified, and the specified speaker The ratio of the strength
09-05-2019
2
of the output audio signal to the output audio signal of each non-designated speaker is
calculated, and the time difference between the output time of the audio signal of the designated
speaker and the output time of the audio signal of each non-designated speaker is each
Calculating module and calculating signal output time of each non-designated speaker based on
the time difference Each non-designated speaker is made to output an audio signal at the same
time as the designated speaker by delaying each non-designated speaker, and the intensity of the
audio signal outputted by each non-designated speaker is adjusted based on the ratio of the
strengths. And a calibration module for outputting an audio signal of the same strength as the
designated speaker.
[0006]
Also, to achieve the above object, the audio output calibration method according to the present
invention is used in an electronic device connected to at least two cameras having a function of
identifying the presence or absence of a plurality of speakers and a person. The audio output
calibration method comprises the steps of constructing a coordinate system based on the
position of the camera and / or the speaker, and recording position coordinates of the camera
and each speaker, when the camera senses a listener. Determining the position coordinates of the
listener; calculating the distance between each speaker and the listener based on the position
coordinates of the listener and each speaker; and specifying the speaker farthest from the
listener; Calculating the ratio of the strength of the audio signal output by each of the nondesignated speakers and the time difference between the output time of the voice signal of the
designated speaker and the output time of the voice signal of each non-designated speaker; Each
non-designated speaker is delayed by delaying the signal output time of each non-designated
speaker. Outputting an audio signal at the same time as the designated speaker, and adjusting
the strength of the audio signal of each non-designated speaker based on the ratio of the
strength to output an audio signal of the same strength as the designated speaker And.
[0007]
Compared to the prior art, the audio output calibration system and audio output calibration
method of the present invention can be calibrated to the audio output by multiple speakers. This
makes it possible to obtain a uniform sound effect regardless of the position of the listener.
[0008]
09-05-2019
3
FIG. 1 illustrates an audio output calibration system according to an embodiment of the present
invention. FIG. 1 is a functional block diagram of an audio output calibration system according to
an embodiment of the present invention. FIG. 5 illustrates the structure of the coordinate system
of the audio output calibration system according to an embodiment of the present invention. 3 is
a flow chart of an audio output calibration method according to an embodiment of the present
invention.
[0009]
As shown in FIG. 1, an audio output calibration system 2 according to an embodiment of the
present invention is used in an electronic device 1. At least two cameras 30 and 32 and a
plurality of speakers 40 and 42 are connected to the electronic device 1. To simplify the
description, embodiments of the present invention will be described with two cameras 30, 32
and two speakers 40, 42 as an example. However, in practical applications, the number of the
camera and the speaker is not limited to two.
[0010]
The electronic device 1 is an independent electronic device connected to a device such as a
sound facility or a sound facility. The audio output calibration system 2 is used to determine the
position of the listener, and by performing calibration on the audio output from the plurality of
speakers 40, 42, the same intensity for the plurality of speakers 40, 42 Simultaneously output
the voice signal of
[0011]
The cameras 30 and 32 have a function of identifying the presence or absence of a person by
sensing the face of the person. After being activated, the cameras 30 and 32 rotate and shoot to
identify the presence or absence of a listener. In another embodiment of the present invention,
the cameras 30, 32 may be general cameras, and the electronic device 1 is provided with
software capable of identifying the face of a person to analyze and process a photographed
image. Check for the presence of listeners.
09-05-2019
4
[0012]
As shown in FIG. 2, the electronic device 1 includes a processor 10 and a storage device 12. The
processor 10 executes various software installed in the electronic device 1. For example,
application software such as the voice output calibration system 2 or the operation system is
executed.
[0013]
The storage unit 12 is used to store various data such as a photographed image, data obtained by
installation and calculation using the sound output calibration system 2 and the like. The storage
device 12 is an internal storage device of the electronic device 1 and is a portable memory card
or a flash memory.
[0014]
The audio output calibration system 2 comprises a setting module 20, a detection module 22, a
calculation module 24 and a calibration module 26. The setting module 20 constructs a
coordinate system based on the positions of the cameras 30, 32 and / or the speakers 40, 42,
and records position coordinates of the cameras 30, 32 and the speakers 40, 42.
[0015]
For example, in the coordinate system shown in FIG. 3, the camera 30 is a point A1, the camera
32 is a point A2, and the midpoint of the shortest distance between the cameras 30 and 32 is an
origin O. The speaker 40 is B1 and the speaker 42 is B2. The distance between the cameras and
the distance between each camera and the speaker can obtain data by actual measurement. Thus,
in the calculations described below, the distance L between the camera 30 and the camera 32,
the distance E between the camera A1 and the speaker B1, and the distance F between the
camera A2 and the speaker B2 are known distances Since the positions of the cameras 30, 32
and the speakers 40, 42 are also fixed in the coordinate system, the position coordinates of the
cameras 30, 32 and the speakers 40, 42 are also known.
09-05-2019
5
[0016]
The method of constructing the coordinate system and the method of calculation described
below are given as examples, and the present invention is not limited thereto. Depending on the
actual needs, a coordinate system may be established by determining the coordinate system
based on the position of the speaker or determining the coordinate system based on the
positional relationship between the camera and the speaker, or other types such as a spherical
coordinate system It is possible to construct a coordinate system of Moreover, the method of
calculating the relative position of each said apparatus by a different system using the
information of the known position of this coordinate system is not limited to this.
[0017]
In actual use, the coordinate system is a virtual coordinate system, and the user simply inputs
data indicating the distance between the cameras, the distance between the speakers and the
distance between the cameras and the speakers into the voice output calibration system 2
Automatically calculate and get the result.
[0018]
The detection module 22 determines the presence of a listener when the face of a person is
detected from the shadow images captured by the cameras 30 and 32.
For example, if the face of a person shown in the image captured by the camera 30, 32 is
positioned at the middle of the wide angle of the image, the detection module 22 determines the
presence of the listener. The detection module 22 is also used to determine position coordinates
of the listener when the listener is detected. For example, position coordinates of the listener are
calculated based on the rotation angle of the camera and the distance between the cameras.
[0019]
The cameras 30, 32 capture a rotating shadow image. When a human face in a shadow image is
sensed, the detection module 22 obtains the rotation angle of the cameras 30, 32. Taking the
coordinate system shown in FIG. 3 as an example, one vertical line (indicated by a broken line)
passes through points A1 and A2 in the coordinate system of the cameras 30 and 32,
09-05-2019
6
respectively. The angles θ1 and θ2 by which the cameras 30 and 32 rotate with respect to the
two vertical lines are known angles. When the camera 30 rotates by an angle θ1 and the camera
32 rotates by an angle θ2, the detection module 22 detects the listener P. Here, it is assumed
that the coordinates of the listener P are (P1, P2).
[0020]
The α angle and the β angle can be calculated by the above-mentioned angle θ1 and angle
θ2. For example, the value of the α angle is θ1 + 90 degrees, and the value of the β angle is
θ2 + 90 degrees. Next, the distance a from the camera 30 to the listener P and the distance b
from the camera 32 to the listener P are calculated according to the following formula.
[0021]
[0022]
The coordinates (P1, P2) of the listener P can be determined by obtaining the numerical values a
and b.
That is, P1 = L ÷ 2 + a × cos (180 ° −α); P2 = a × sin (180 ° −α). It is also possible to
determine the coordinate position of the listener P by another mathematical calculation method.
[0023]
The calculation module 24 is responsible for calculating the distance from the speakers 40 and
42 to the listener based on the position coordinates of the listener and the speakers 40 and 42,
respectively. For example, the distance from the speaker 40 to the listener is dn, and the distance
from the speaker 42 to the listener is df.
[0024]
09-05-2019
7
The calculation module 24 is also used to specify the speaker farthest from the listener. For
example, as shown in FIG. 3, when df> dn, the speaker 42 is a designated speaker, and the
speaker 40 is a non-designated speaker.
[0025]
Furthermore, the calculation module 24 calculates the ratio between the strength of the audio
signal output from the designated speaker and the strength of the audio signal output from the
non-designated speaker, and the time and the time for the designated speaker to output the
audio signal The time difference between the time when the non-designated speaker outputs the
audio signal is calculated.
[0026]
For example, in the embodiment of the present invention, it is assumed that the intensity of the
signal output by the non-designated speaker (for example, the speaker 40) is Sn and the intensity
of the signal output by the designated speaker (for example, the speaker 42) is Sf. The calculation
module 24 calculates an intensity ratio of audio signals of the designated speaker and each nondesignated speaker based on the formula Sn = Sf × (dn ÷ df) <2>.
[0027]
Since the distance from the non-designated speaker (for example, the speaker 40) to the listener
is shorter than the distance from the designated speaker (for example, the speaker 42) to the
listener, the audio signal output from the non-designated speaker (speaker 40) is Get into the
listener's ear earlier than the designated speaker (speaker 42).
Therefore, it is necessary to delay the signal output time of the non-designated speaker (speaker
40) to match the signal output time of the non-designated speaker with the signal output time of
the designated speaker.
Therefore, in the embodiment of the present invention, it is assumed that the time point at which
the non-designated speaker (speaker 40) outputs the audio signal is Tn, and the time point at
which the designated speaker (speaker 42) outputs the audio signal is Tf. And set as Tn = Tf +
(df−dn) ÷ c. In this formula c is the speed of sound. The sound speed c can be changed
according to the actual situation. For example, the speed of sound when propagated at 15 ° C. in
air is about 340 m / s, and the speed of sound when propagated at 28 ° C. in air is about 348.5
09-05-2019
8
m / s.
[0028]
As can be seen from the formula Tn = Tf + (df−dn) ÷ c, Tn> Tf. That is, according to the present
invention, the output time point of the audio signal of the non-designated speaker (speaker 40) is
output time of the audio signal of the designated speaker (speaker 42) through the formula Tn =
Tf + (df−dn) ÷ c. I am making it slower than the point. The calculation module 24 calculates the
time difference of the audio signal output time between the designated speaker and each nondesignated speaker based on this formula Tn = Tf + (df−dn) ÷ c.
[0029]
The calibration module 26 delays each audio signal output time of each non-designated speaker
(for example, the speaker 40) based on the time difference obtained by the above calculation,
thereby each non-designated speaker and the designated speaker (for example, the speaker 42)
simultaneously output an audio signal. For example, if the time difference is 2 seconds, the
calibration module 26 delays the output time of the audio signal of the non-designated speaker
by 2 seconds to output the voice signal simultaneously with the designated speaker. Let's do it.
[0030]
In addition, the calibration module 26 adjusts the strength of the audio signal output from each
non-designated speaker based on the strength ratio of the audio signal obtained by the
calculation to set each non-designated speaker (for example, the speaker 40) as the designated
speaker An audio signal of the same strength as (for example, the speaker 42) is output. For
example, when the intensity ratio of the audio signal between the speaker 40 and the speaker 42
is 1/2, the intensity of the audio signal of the speaker 40 is increased or the intensity of the
audio signal of the speaker 42 is reduced. The speaker 40 and the speaker 42 output audio
signals of the same strength.
[0031]
However, the above-mentioned calculation method is given as an example, and is not limited to
these in actual use, and the above data can also be calculated based on different mathematical
09-05-2019
9
methods.
[0032]
As shown in FIG. 4, the audio output calibration method according to the embodiment of the
present invention comprises the following steps.
[0033]
In step 1, the setting module 20 constructs a coordinate system based on the positions of the
cameras 30, 32 and / or the speakers 40, 42, and position coordinates of the cameras 30, 32 and
the speakers 40, 42. Record.
[0034]
In step 2, the detection module 22 determines whether a listener is sensed using the cameras 30
and 32.
For example, when the face of a person shown in a shadow image photographed by the cameras
30 and 32 is located in the middle of the wide angle of the shadow image, the detection module
22 determines the presence of a listener and moves to step 3.
However, if no listener is detected, then return to step 2 to continue detection.
[0035]
In step 3, the detection module 22 determines the position coordinates of the listener.
For example, position coordinates of the listener are calculated based on the rotation angle of the
camera and the distance between the cameras.
[0036]
09-05-2019
10
In step 4, the calculation module 24 calculates the distance between the speakers 40, 42 and the
listener based on the position coordinates of the listeners and speakers 40, 42. For example, the
distance between the speaker 40 and the listener is dn, and the distance between the speaker 42
and the listener is df.
[0037]
In step 5, the calculation module 24 specifies the speaker farthest from the listener. For example,
as shown in FIG. 3, when df> dn, the speaker 42 is a designated speaker, and the speaker 40 is a
non-designated speaker.
[0038]
In step 6, the calculation module 24 calculates a ratio between the strength of the audio signal
output from the designated speaker 42 and the strength of the audio signal output from the nondesignated speaker 40.
[0039]
In step 7, the calculation module 24 calculates a time difference between the time when the
designated speaker 42 outputs an audio signal and the time when each non-designated speaker
40 outputs an audio signal. .
[0040]
In step 8, the calibration module 26 delays the signal output time of each non-designated
speaker 40 based on the time difference obtained by calculation to simultaneously make each
non-designated speaker 40 and the designated speaker 42 an audio signal simultaneously. Make
it output.
Also, the intensity of the audio signal output from each non-designated speaker 40 is adjusted
based on the intensity ratio of the audio signal, and each non-designated speaker 40 outputs an
audio signal having the same intensity as the designated speaker 42.
09-05-2019
11
[0041]
As mentioned above, although the preferred embodiment of the present invention was described
in detail, the present invention is not limited to the above-mentioned embodiment, and various
modification or correction is possible within the scope of the present invention. Needless to say,
it is also included within the scope of the claims of the present invention.
[0042]
REFERENCE SIGNS LIST 1 electronic device 2 audio output calibration system 10 processor 12
storage device 20 setting module 22 detection module 24 calculation module 26 calibration
module 30, 32 camera 40, 42 speaker
09-05-2019
12
Документ
Категория
Без категории
Просмотров
0
Размер файла
20 Кб
Теги
jp2012161073
1/--страниц
Пожаловаться на содержимое документа