close

Вход

Забыли?

вход по аккаунту

?

JP2003163908

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2003163908
[0001]
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a
voice positioning method in a conference recording system for recording using a plurality of
microphones, and more specifically, records a conference, a lecture, a conference presentation, a
business presentation, etc. In order to tag and structure the video recording with the content of
the speaker's remark, etc., the voice recording can be made simple and high quality, and in the
conference recording system, the location information of the conference participants etc. can be
collected. The present invention relates to a voice positioning method.
[0002]
2. Description of the Related Art Conventionally, in order to record a conference, a lecture, a
conference presentation, a business presentation, etc. as video and audio, and to tag or structure
the video recording with the content of the speech of the speaker, the direction of the speaker is
Methods of detection have been considered. For example, the "camera control method in a video
conference system" described in Japanese Patent Application Laid-Open No. Hei 6-276514 is
provided with an electromagnetic wave source at a microphone so that the camera is directed in
the emission direction of the electromagnetic wave. Also, the "TV conference person camera
turning system" described in Japanese Patent Application Laid-Open No. 5-153582 is a method
of specifying a speaker using a plurality of directional microphones arranged radially. The
"television conference screen switching control device" described in the publication is a voice /
non-voice discrimination method using a plurality of microphones and a priority detection
method of a plurality of speakers.
04-05-2019
1
[0003]
The above-described method of detecting the direction of the speaker involves the use of a large
number of microphones and the problem that the installation of these devices is difficult and is
used only in large-scale meetings. And hinder the spread and use of such devices. In addition, in
the case of a device that applies a device to a sound collection device to detect the direction of
the speaker, distance information of the speaker can not be obtained. Similarly, microphones
provided with electromagnetic wave (including light) sources and using a directional antenna or
a video camera to specify the position can not obtain distance information between microphones.
In this case, the reproduction of the recorded sound field and the distance information between
the microphones necessary for removing the noise are lost, and it is difficult to realize such a
function.
[0004]
Therefore, the present invention facilitates the wiring operation between the devices having the
microphones which takes the most time in the installation operation of the conference recording
system, and automatically forms the network between the devices and specifies the relative
position of the installed microphones. It is an object of the present invention to provide a voice
positioning system in a conference recording system capable of performing the re-generation of
a recorded sound field and the removal of noise more effectively.
[0005]
SUMMARY OF THE INVENTION The present invention has been made in view of the above
situation, and the invention of claim 1 relates to a plurality of voice positioning methods in a
conference recording system for recording using a plurality of microphones. Among the
microphones, at least one is a master, and a plurality of microphones other than the master are
slaves, and the master and the slave each have a speaker, a network communication device, and a
central processing unit, and the master and the slave It is characterized in that the formation of a
network between each other and the identification of the positions of the parent device and the
child device by sound waves using the speaker are automatically performed.
[0006]
The invention of claim 2 is characterized in that, in the voice localization system in the
conference recording system of the invention of claim 1, the network between the master unit
and the slave units is formed using a wireless LAN.
04-05-2019
2
[0007]
According to the third aspect of the present invention, in the audio localization method in the
conference recording system according to the first or second aspect of the present invention, the
network is periodically reformed in order to cope with addition or movement of a slave
microphone during recording. Repositioning of the master unit and the slave unit by sound
waves is characterized.
[0008]
The invention of claim 4 is characterized in that, in the voice recording system according to the
conference recording system of the invention of claims 1 to 3, the positions of the master unit
and the slave unit by the sound wave are specified using a plurality of frequencies. Do.
[0009]
The invention of claim 5 is characterized in that, in the voice localization system in the
conference recording system of the invention of claim 4, all or any one of the plurality of
frequencies is outside an audio frequency band.
[0010]
DESCRIPTION OF THE PREFERRED EMBODIMENTS The preferred embodiments of the present
invention will be described below based on the examples shown in FIGS.
FIG. 1 is a diagram showing the overall configuration of the voice position identification system
of the present invention.
In the voice position specifying method in the conference recording system of the embodiment, it
comprises at least one parent device 100 and a plurality of arbitrary slave devices 200a to 200n
(in the example shown in FIG. 1, six of the slave devices 200a to 200f ), Each has a microphone,
and the master and slave have components as shown in FIG.
[0011]
04-05-2019
3
2A and 2B are diagrams showing the configuration of a master unit and a slave unit. FIG. 2A is a
block diagram showing the configuration of the master unit 100, and FIG. 2B is a block diagram
showing the configuration of each slave unit 200a to 200n. It is.
In addition, since the child devices 200a to 200n have the same configuration, the reference
numerals a to n are omitted.
The main unit 100 includes a CPU 101, a clock 102, a RAM 103, a ROM 104, an HDD 105, an
expansion bus 106, a wireless network I / F 107, an audio input / output I / F 108, a display card
109, a keyboard I / F 110, a mouse I / F 111, etc. The wireless network I / F 107, the audio input
/ output I / F 108, the display card 109, the keyboard I / F 110, and the mouse I / F 111 are
connected to the wireless antenna 113, the microphone 114, the speaker 115, the display 116,
the keyboard 117, A mouse 118 is connected.
[0012]
The base unit 100 is provided with a wireless antenna 113 necessary for automatically
configuring a wireless network in addition to the microphone 114, and a speaker 115 required
for emitting a position detection sound signal.
These are connected to the CPU 101 via the wireless network I / F 107, the audio input / output
I / F 108, and the bus 112.
The CPU 101 uses the clock 102 as a reference signal for operation control and a reference
signal for time control to perform the control operation necessary for the present invention, and
has a function of exchanging start and end of recording, system configuration, etc. with the
operator. .
[0013]
The other components constituting the parent device 100 are devices generally necessary to
operate the CPU 101. That is, the RAM 103 for storing the program, the ROM 104, the HDD 105,
04-05-2019
4
the display 116 necessary for the operator to command or monitor the operation of the
apparatus, the display card 109, the keyboard 117, the keyboard I / F 110, the mouse 118, the
mouse It is I / F111 grade | etc.,. Also, an expansion bus 106 is provided for future expansion of
the device.
[0014]
In the slave unit 200 (200a to 200n), the CPU 201, clock 202, RAM 203, ROM 204, wireless
network I / F 205, audio input / output I / F 206, etc. are connected to the bus 207, and wireless
network I / F 205, audio input / output I / F A wireless antenna 208, a microphone 209, and a
speaker 210 are connected to F206, respectively. The slave unit 200 requires no operator except
for turning on the power, so that part of the slave unit 200 is omitted from the master unit 100.
[0015]
The operation of the voice localization method in the conference recording system of the
embodiment will be described. 3 and 4 are diagrams showing the process of the voice position
specifying method in the conference recording system of the embodiment, and FIG. 3 is a flow
chart showing the process of the parent machine, and FIG. 4 is a flow chart showing the process
of the slave machine . The outline of the process performed by the master and slave is as follows.
A1. A network is formed between the master and slave using a wireless LAN. (Initialization
process 300 of FIG. 3, initialization process 400 of FIG. 4) A2. Calculation of the distance
between the master and slave (Mother-child distance calculation 310 in FIG. 3 and the masterchild distance calculation 410 in FIG. 4), distance calculation between the slave and child (Gee in
FIG. 3) The relative position between the master unit and the slave unit and between the slave
units is determined by the slave unit calculation 320 and the slave unit-child unit calculation
420) in FIG. A3. Start recording. (Recording process 340 in FIG. 3, recording process 440 in
FIG. 4)
[0016]
The operation of A1 will first be described based on FIG. Power is turned on by the operation of
the operator (S301). A broadcast that releases a master ID, which can be recognized as a master,
to a slave as minimum information, after performing initialization generally required as a CPU
(S302) (Broadcast: all devices connected to a network (S303) and waits for a response from the
04-05-2019
5
slave unit (S304). If the response from the slave unit is within the preset timeout time, the ID of
the slave unit is registered (S305, S306). After the time-out elapses (S306-YES), it is considered
that there has been a response from all possible slaves at this point, and the process of A1 is
completed, and the process proceeds to the process of A2.
[0017]
In the slave unit, at the time of the operation of the master unit (S303), if the initialization
processing of the initialization processing 400 of FIG. 4 has progressed to Listen (S403), after
receiving a broadcast from the master unit (S403) (S404-YES), ID transmission of the slave unit is
performed (S405). After that, confirmation from the returning master device as confirmation of
the registration operation (S305) of the slave device by the parent device is received (S406), the
processing of A1 is completed, and the processing shifts to processing of A2.
[0018]
The operation of A2 will be described. From the parent device, a broadcast (Broadcast) for
notifying that the distance detection processing between the parent device and the child device is
started flows (S311). Immediately after the completion of this processing, timer count processing
is performed (S312), and the timer is operated from the count value 0.
[0019]
The slave receives the broadcast from the master (S411), and similarly performs timer count
processing (S412), and operates the timer from the count value 0. As a result, the timer count is
started from the count value 0 at almost the same time (preferably several microseconds) in all
the slaves.
[0020]
Here, after the CPU 101 of the master unit actually issues a notification command, until the
notification (broadcast) is issued by the wireless antenna 113, the CPU 201 of the slave unit
starts timer count from the reception of the wireless antenna 208 of the slave unit (S412) Since
04-05-2019
6
the time lag until the execution of the) can be theoretically calculated by the number of
instructions etc., the timer count value of the master and slave should be appropriately corrected.
Also, the delay between the wireless antenna 113 of the master unit and the wireless antenna
208 of the slave unit is ignored because it is much faster than the speed of the sound wave to be
used for distance calculation.
[0021]
If it is confirmed in the master unit that all slave units have responded that the notification has
been accepted (S313-YES), the master unit outputs a distance detection sound signal (S314).
Next, the time at which the sound signal output is detected from the slave unit (timer count value
at the slave unit) is received (S315), and the distance between the master unit and all the slave
units is calculated based on this data (S316). In this distance calculation, the distance T is
obtained by multiplying the sound speed S by the value T obtained by subtracting the timer
count value T2 of the master unit from the timer count value T1 at which the slave received the
sound wave for distance detection. = S x (T1-T2) / 2).
[0022]
Next, the master transmits a distance detection command to each slave ID registered in (S305)
(S321). On the other hand, the slave unit confirms the instructed slave unit ID and its own ID
(S421), and when they match (S422-YES), emits a sound wave for distance detection by itself
(S423). If they do not match (S422-NO), a sound wave for distance detection from another slave
is detected (S424), and a timer count at the time of detection is transmitted to the master (S425).
[0023]
The master receives it and calculates the distance between the slaves (S324). This is performed
for all slaves (S325). When it is completed, the master unit gathers information on the distance
between the master unit and each slave unit. From these pieces of information, consistent
positional relationships and distances are obtained based on the triangle construction theorem
(S326). Then, the end of the position detection process is notified from the master to all the
slaves (S327), and the processing of A2 is completed (S328).
04-05-2019
7
[0024]
Finally, the recording start of A3 will be described. When the master receives a recording
command from the operator (S341), it issues a recording start command to all the slaves (S342).
The slave receives this (S441), transmits the recording data to the master (S442), and the master
receives and records the recording data (S344).
[0025]
At this time, the processing of A1 and A2 is performed again each time the set time t elapses so
as to be able to cope with the extension of the slave during recording and the movement of the
slave (S346), the set time counter Is set (S343). If it is detected that the set time t has elapsed
(S346-YES), an update process is performed (S347). Regarding this, it is always checked whether
there is a notification in (S 346) of the base unit (S 443), and if there is a notification (S 443-YES),
corresponding processing is performed (S 444).
[0026]
The recording process is continued until the recording stop is input to the master by the operator
(S345). When recording stop is input to the base unit (S345-YES), the recording end notification
is notified to the base unit (S332), and the base unit receives it (S445), and the base unit and the
base unit finish the processing. (S331) (S431).
[0027]
As apparent from the above description, according to the present invention, the most
troublesome wiring and adjustment processing of the installation operation of the conference
recording system is automatically performed, and the relative position of the installed
microphone Thus, it is possible to more effectively reproduce the recorded sound field and
remove noise, and to improve human dictation, speech recognition, quality, and efficiency.
[0028]
In particular, according to the invention of claim 1, the master unit and the slave unit each
04-05-2019
8
having a microphone have a speaker, a network communication device, and a central processing
unit, and form a network between the master unit and the slave unit, and between the slave units.
Since the process of specifying the positions of the parent device and the child device by sound
waves using the speaker is automatically performed, setting processing of a plurality of
microphones can be reduced.
According to the second aspect of the present invention, the installation process of the plurality
of microphones can be further reduced by using the wireless LAN. The invention of claim 3 can
cope with the addition and movement of a slave microphone during recording at any time. The
invention of claim 4 can provide information necessary for more accurate sound field
regeneration and noise removal by having distance information of sound waves of a plurality of
frequencies. According to the fifth aspect of the present invention, the processing can be
performed without letting a person hear the sound generated by the position detection
processing during recording.
04-05-2019
9
Документ
Категория
Без категории
Просмотров
0
Размер файла
18 Кб
Теги
jp2003163908
1/--страниц
Пожаловаться на содержимое документа