close

Вход

Забыли?

вход по аккаунту

?

DESCRIPTION JPWO2013118204

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JPWO2013118204
A waveguide (200) for guiding an incident sound wave, a microphone (203) for converting the
sound wave guided by the waveguide (200) into a sound signal of electricity, and a microphone
(203). And a signal processing unit (210) for processing the sound signal by using an acoustic
characteristic that the waveguide unit (200) gives to the sound signal, and the waveguide unit
(200) includes the waveguide unit (200). A direct sound that reaches the microphone portion
(203) directly without being reflected by the inner side surface and an indirect sound that is
reflected by the inner side surface and reaches the microphone portion (203) has different
acoustic characteristics. The signal processing unit (210) performs direct sound detection
processing to detect whether or not direct sound is input, based on the difference in acoustic
characteristics between the direct sound and the indirect sound.
Voice input device and display device
[0001]
The present disclosure relates to a voice input device that performs predetermined control
according to input voice, and a display device that switches a display state according to the input
voice.
[0002]
Patent Document 1 discloses a device that performs control by voice.
16-04-2019
1
This device includes a display unit for displaying a score and a microphone (hereinafter referred
to as a microphone) built in the device. The device discriminates a pitch such as a voice input to a
microphone or a sound emitted by a musical instrument, determines a playing place on a musical
score displayed on a display unit, and automatically performs a turn-over. By using the device,
the player does not have to take his hand off the instrument to perform the turn-over.
[0003]
However, in the case of such a device, the player does not have to perform a manual turn-over at
the time of normal performance, but at the time of practice, it is necessary to manually operate it
by the update switch.
[0004]
On the other hand, for example, it is conceivable to perform the turn-off by the voice emitted by
the performer.
In this case, the turn-over can be performed at any timing of the player.
[0005]
JP 11-153991 A
[0006]
However, in the above-described conventional apparatus, for example, when the apparatus is
played on the piano music stand while the piano is being played, the voice is further transmitted
to the piano while the piano sound is input to the microphone. It will be input superimposed on
the sound.
In the above-described conventional apparatus, it is very difficult to distinguish between voice
and piano sound, so there is a problem that the accuracy of the turn-off is not sufficient.
[0007]
16-04-2019
2
An object of the present disclosure is to provide a voice input device capable of detecting an
input of voice (direct sound) with high accuracy. Another object of the present invention is to
provide a display device capable of accurately detecting an input of voice (direct sound) and
performing turning-up accurately.
[0008]
The voice input device according to the present disclosure includes a waveguide unit for guiding
an incident sound wave, a microphone unit for converting a sound wave passing through the
inside of the waveguide unit into a sound signal of electricity, and a sound converted by the
microphone unit. And a signal processing unit that processes signals using the acoustic
characteristics given by the waveguide unit to the sound wave, and the waveguide unit passes
through the inside of the waveguide unit and enters the microphone unit. Among the above, a
structure giving different acoustic characteristics by a direct sound that reaches the microphone
without being reflected by the inner side of the waveguide and an indirect sound that is reflected
by the inner side and reaches the microphone The signal processing unit performs a direct sound
detection process that detects whether the direct sound is input or not, using the difference in
the acoustic characteristics of the direct sound and the indirect sound.
[0009]
The apparatus according to the present disclosure can not only be realized as an apparatus, but
also realized as a method having processing means constituting the apparatus as steps, or
realized as a program for causing a computer to execute those steps, or recording the program
The present invention can be realized as a recording medium such as a computer readable CDROM, or as information, data or a signal indicating the program.
And these programs, information, data, and signals may be distributed via a communication
network such as the Internet.
[0010]
The voice input device in the present disclosure is effective for detecting voice (direct sound)
input with high accuracy.
16-04-2019
3
[0011]
FIG. 1 is an external view showing an appearance of a musical score display device provided with
a voice input device according to a first embodiment.
FIG. 2 is a cross-sectional view showing a cross section of the A-A 'portion of the musical score
display device shown in FIG. FIG. 3 is a block diagram showing configurations of a signal
processing unit and a display control unit in the first embodiment. FIG. 4A is a perspective view
showing a configuration example of a waveguide of the voice input device according to the first
embodiment. FIG. 4B is a perspective view showing a modified example of the waveguide of the
voice input device. FIG. 4C is a perspective view showing a modified example of the waveguide of
the voice input device. FIG. 5A is a diagram showing an example of indirect sound in the sound
signal incident on the voice input device. FIG. 5B is a diagram showing an example of direct
sound in the sound signal incident on the voice input device. FIG. 6A is a circuit diagram showing
an equivalent acoustic characteristic circuit in the case where an indirect sound is input to the
waveguide of the first embodiment. FIG. 6B is a circuit diagram showing an equivalent acoustic
characteristic circuit when direct sound is input to the waveguide of the first embodiment. FIG. 7
is a graph showing the transfer characteristics of the acoustic equivalent circuits shown in FIGS.
6A and 6B. FIG. 8 is a diagram showing an example of physical quantities exemplified in the first
embodiment. FIG. 9 is a flowchart showing an example of the operation of the display control
unit of the first embodiment. FIG. 10A is a diagram showing an example of a display state of the
display unit of the musical score display device according to Embodiment 1. FIG. 10B is a
diagram showing an example of a display state in the display unit of the musical score display
device according to Embodiment 1. FIG. 10C is a diagram showing an example of a display state
of the display unit of the musical score display device according to Embodiment 1. FIG. 11A is a
view showing a modified example of the display state in the display unit of the score display
device. FIG. 11B is a view showing a modified example of the display state in the display unit of
the score display device. FIG. 11C is a view showing a modified example of the display state in
the display unit of the score display device. FIG. 12 is a diagram showing an example of a display
state in the display unit of the voice input device.
[0012]
Hereinafter, embodiments will be described in detail with reference to the drawings as
appropriate. However, the detailed description may be omitted if necessary. For example, detailed
description of already well-known matters and redundant description of substantially the same
configuration may be omitted. This is to avoid unnecessary redundancy in the following
16-04-2019
4
description and to facilitate understanding by those skilled in the art.
[0013]
It is noted that the inventors provide the attached drawings and the following description so that
those skilled in the art can fully understand the present disclosure, and intend to limit the
claimed subject matter by these Absent.
[0014]
Embodiment 1 Hereinafter, a display device provided with a voice input device according to
Embodiment 1 will be described with reference to FIGS. 1 to 10C.
[0015]
[1.
Device Configuration] First, the configuration of the display device will be described with
reference to FIGS.
[0016]
In the present embodiment, as an example of the display device, the case of a score display
device that displays a score will be described as an example.
FIG. 1 is an external view of a surface provided with a display panel 101 of a musical score
display device 100 incorporating a voice input device according to the present embodiment. FIG.
2 is a cross-sectional view of the AA 'portion of the musical score display device 100 shown in
FIG. FIG. 3 is a block diagram showing the configuration of each processing unit of the musical
score display device 100 shown in FIG.
[0017]
16-04-2019
5
The music score display device 100 is a device that performs a “note-over” process of
switching a displayed music score page to the next page when a player's voice is detected.
Furthermore, in the present embodiment, the musical score display device 100 will be described
by way of example in which a musical score of a piano is displayed. The music score display
device is used by being placed on a piano music stand in the present embodiment. Here, the case
where the music score display device 100 is placed on the music stand so that the longitudinal
direction (X direction in FIG. 1) is the horizontal direction of the music stand will be described as
an example.
[0018]
In the present embodiment, the score display device 100 is a tablet terminal equipped with a
touch panel as an input interface, and as shown in FIGS. 1 to 3, the display panel 101, the voice
input device 102, and the display control unit 103. , Music score DB 104 (storage unit). In order
to make the description easy to understand, in the following description, the longitudinal
direction of the display panel 101 in FIG. 1 will be described as the X axis, the lateral direction as
the Y axis, and the display direction of the display panel 101 as the Z axis.
[0019]
The musical score display device 100 is a plate-like device in the present embodiment. As shown
in FIG. 1, on the surface of the score display apparatus 100, a display panel 101 and an opening
through which sound is input are arranged. In the present embodiment, the opening of the
musical score display device 100 is integrally formed with the opening of the waveguide 200 of
the voice input device 102 described later.
[0020]
The display panel 101 displays the musical score of the music to be played. The display panel
101 can be realized using a general panel. In the present embodiment, the display panel 101 is a
display panel of a tablet terminal. In addition, when the score display apparatus 100 is other
apparatuses, such as a smart phone, it is preferable that the display panel 101 is a display panel
with which the said apparatus was equipped.
16-04-2019
6
[0021]
In the present embodiment, the voice input device 102 receives the voice (direct sound) of the
performer who is playing the music and the sound other than the voice of the performer, for
example, (indirect sound) of the sound of the musical instrument etc. It is possible to detect direct
sound input. In the present embodiment, the voice input device 102, as will be described later, is
an indirect sound that is a sound other than a voice including a sound of a musical instrument or
the like played by the performer, and the performer turns off the score display device 100. To
identify the direct sound to indicate.
[0022]
The voice input device 102 has a waveguide unit 200, a microphone unit 203, and a signal
processing unit 210, as shown in FIGS.
[0023]
The waveguide unit 200 is a hollow member having an opening through which sound is input,
and transmits (guides) sound through the hollow portion.
FIG. 4A is a perspective view showing the shape (the shape of the hollow portion) of the
waveguide 200 according to the present embodiment. The waveguide unit 200 has a waveguide
upper portion 201 and a waveguide lower portion 202, as shown in FIG. 4A. In the present
embodiment, in order to simplify the description, the case where the shape of the hollow portion
of the waveguide upper portion 201 and the waveguide lower portion 202 is cylindrical will be
described as an example.
[0024]
The waveguide upper portion 201 is a portion of the waveguide 200 located on the sound input
side, as shown in FIG. 4A. The upper surface of the upper waveguide portion 201 is an opening
through which sound is input, and the bottom surface is in contact with the upper surface of the
lower waveguide portion 202 described later. The diameter of the bottom of the hollow portion
(diameter of the hollow portion in a plane parallel to the XY plane in FIG. 4A) of the waveguide
upper portion 201 is set to several mm to several cm. Further, the height of the hollow portion
16-04-2019
7
(the height of the cylinder) of the waveguide upper portion 201 is several mm to several cm. The
shape including the hollow portion of the waveguide upper portion 201 is set in consideration of
the size and shape of the waveguide lower portion 202, the size and shape of the musical score
display device 100, and the like.
[0025]
The waveguide lower portion 202 is a portion of the waveguide 200 located on the sound output
side (microphone unit 203 side) as shown in FIG. 4A. The upper part of the lower part 202 of the
waveguide is in contact with the bottom of the upper part 201 of the waveguide, and a
microphone is installed on the bottom. One space is formed by the upper waveguide portion 201
and the lower waveguide portion 202. The lower part 202 of the waveguide has a diameter of
the bottom of the hollow part (diameter of the hollow part in a plane parallel to the XY plane of
FIG. 4A) larger than that of the upper part 201 of the waveguide and set to several cm. Also, the
height of the hollow portion (the height of the cylinder) of the lower waveguide portion 202 is
several mm to several cm.
[0026]
The characteristic point of the shape of the waveguide 200 is that the size (opening area) of the
opening of the lower part 202 of the waveguide is larger than the size (opening area) of the
opening of the upper part 201 of the waveguide I assume. This is for causing Herzholm
resonance in the waveguide 200 as described later. The waveguide may be made of any material
such as plastic, metal and wood.
[0027]
The microphone unit 203 is disposed at the bottom of the waveguide unit 200 (the lower end of
the waveguide unit 200 in the Z-axis direction). The microphone unit 203 converts sound waves
(sound signals) including voice (direct sound) by human voice input from the waveguide unit 200
and sound of a musical instrument such as a piano (indirect sound) into an electric signal. The
sound signal converted into the electrical signal is output to the signal processing unit 210.
[0028]
16-04-2019
8
The signal processing unit 210 electrically processes the electrical signal output from the signal
processing unit 210 to perform direct sound detection processing for detecting direct sound
input, and outputs the detection result to the display control unit 103. Specific processing
content and the like will be described later.
[0029]
The display control unit 103 updates the display page of the score displayed on the display panel
101 based on the output from the signal processing unit 210.
[0030]
The musical score DB 104 is a DB storing musical scores to be displayed on the display panel,
and in the present embodiment, is configured of, for example, a non-volatile memory.
[0031]
[2.
Characteristics of Waveguide Section 200 for Direct Sound and Indirect Sound] Next, the direct
sound detection processing executed by the signal processing section 210 of the musical score
display device 100 will be described based on FIGS. 5A to 8.
[0032]
FIG. 5A is a view showing a cross section of the waveguide section 200 in the XZ plane and an
example of a path until the indirect sound reaches the microphone section 203. FIG.
FIG. 5B is a view showing a cross section of the waveguide section 200 in the XZ plane and an
example of a path until the direct sound reaches the microphone section 203. As shown in FIG.
5A and 5B, for the sake of explanation, the dimensional ratio of the diameter and the height is
different from that of the actual waveguide 200. As shown in FIG. 5A, the indirect sound incident
on the waveguide 200 is reflected by the side wall of the waveguide 200 and reaches the
16-04-2019
9
microphone 203. On the other hand, as shown in FIG. 5B, the direct sound that has entered the
waveguide unit 200 directly reaches the microphone unit 203 without being reflected by the side
surface of the waveguide unit 200. Although FIG. 5B shows direct sounds 1, 2, and 3, these do
not indicate that they occur simultaneously, but indicate the types of possible path patterns.
[0033]
Here, in the present embodiment, as described above, since the score display device 100 is
placed on the piano music stand, the voice generated by the performer reaches the microphone
unit 203 as a direct sound. On the other hand, the piano sound reaches the microphone unit 203
as an indirect sound. The piano sound may be reflected on the wall of the room, but it does not
reach the microphone unit 203 as a direct sound because the user is a shielding object, or it is
sufficient not to affect the direct sound detection processing. It is considered to reach the
microphone unit 203 in a state of being attenuated.
[0034]
Here, the inventor of the present application describes the relationship between the sound
pressure V1 of the indirect sound that has passed through the waveguide unit 200 and the sound
pressure Vmic of the microphone unit 203 and the sound pressure of direct sound that has
passed through the waveguide unit 200. The relationship between V1 and the sound pressure
Vmic of the microphone unit 203 was found to be different. FIG. 6A is an acoustic equivalent
circuit corresponding to an indirect sound (sound pressure V1 of piano sound). FIG. 6B is an
acoustic equivalent circuit corresponding to the direct sound (sound pressure V2 of the voice
emitted by the player). For the indirect sound, the inventor of the present application has
appropriately described the acoustic equivalent circuit shown in FIG. 6A for indirect sound and
the acoustic equivalent circuit shown in FIG. 6B for direct sound. I found it to be.
[0035]
More specifically, the waveguide unit 200 acts as so-called Helmholtz resonance for indirect
sound (piano sound). That is, the waveguide upper portion 201 of the waveguide 200 can be
expressed as an electrical circuit in which the acoustic inertance L (401) and the acoustic
resistance R (400) are connected in series. The lower waveguide portion 202 can be expressed as
an electrical circuit in which the acoustic compliance C (402) is connected in parallel. As a result,
16-04-2019
10
as shown in FIG. 6A, as for the entire waveguide portion 200, as shown in FIG. 6A, the acoustic
resistance R whose one end is connected to the terminal a1 and the other end is connected to
one end of the acoustic inertance L and the other end is It can be expressed as an electric circuit
including an acoustic inertance L connected to the terminal b1 and an acoustic compliance C
connected at one end to the terminal b1 and at the other end to the terminal a0 and the terminal
b0. The voltage V1 of the terminal a1 with respect to the terminal a0 is represented as the sound
pressure of the piano sound. The voltage Vmic of the terminal b1 with respect to the terminal b0
is the voltage detected by the microphone unit 203. This configuration is a circuit configuration
called a so-called resonant circuit.
[0036]
On the other hand, for direct sound (voice of the performer), the waveguide unit 200 acts like
Helmholtz resonance like indirect sound, but for direct sound by setting a predetermined number
of parameters Can be expressed as an electric circuit shown in FIG. 6B. The acoustic equivalent
circuit shown in FIG. 6B has a configuration in which a variable resistor Rx (403) is connected in
parallel with a circuit in which the acoustic resistance R and the acoustic inertance L are
connected in series, in addition to each configuration of the acoustic equivalent circuit shown in
FIG. The configuration is such that Rx is connected between the terminal a1 and the terminal b1.
In this case, the variable resistor Rx has a substantially infinite value for low frequencies, and acts
as a variable resistor that approaches 0 as the frequency increases.
[0037]
From the characteristics of the acoustic equivalent circuit shown in FIG. 6B, in the waveguide unit
200, the shorter the wavelength (the higher the frequency) of the input direct sound (such as the
voice of the performer), the easier it is to arrive at the microphone unit 203. I understand.
[0038]
FIG. 7 is a graph showing the transfer characteristic of the acoustic equivalent circuit
corresponding to the indirect sound shown in FIG. 6A and the transfer characteristic of the
acoustic equivalent circuit corresponding to the direct sound shown in FIG. 6B.
The vertical axis is the sound pressure Vmic (voltage of the electric signal) of the sound signal
collected by the microphone unit 203, and the horizontal axis is the frequency of the electric
16-04-2019
11
sound signal collected. In FIG. 7, a graph indicated by a broken line shows the amplitude
frequency characteristic of the acoustic equivalent circuit shown in FIG. 6A. Further, in FIG. 7, a
graph indicated by a solid line shows the amplitude frequency characteristic of the acoustic
equivalent circuit shown in FIG. 6B.
[0039]
The acoustic equivalent circuit shown in FIG. 6B has characteristics close to the acoustic
equivalent circuit shown in FIG. 6A because the variable resistance Rx approaches infinity in the
low frequency band of the direct sound input. On the other hand, in the acoustic equivalent
circuit shown in FIG. 6B, since the variable resistance Rx approaches 0 in the high frequency
band, the volume flow velocity (current) flows to the variable resistance Rx side than the series
circuit of the acoustic resistance R and the acoustic inertance L. Equivalent) becomes large.
Therefore, in the high frequency band of the direct sound, the attenuation per octave is smaller
than in the case of the indirect sound. In the example of the graph shown in FIG. 7, it can be seen
that the direct sound shown in FIG. 6B indicated by the solid line has smaller attenuation in the
high frequency band as compared with the indirect sound shown in FIG.
[0040]
FIG. 8 is a diagram showing a specific numerical example for determining the characteristics of
the waveguide unit 200 described above. In the following description, as shown in FIG. 8, the
radius r of the waveguide upper portion 201 = 0.5 cm, the opening area S = 0.79 cm <2>, the
height l = 0.5 cm, and the air density = 0. = 0. It is assumed that the volume V of the lower
portion 202 of the waveguide portion is 25.13 cm <3>.
[0041]
It is known that L, R and C of the above-mentioned equivalent circuit are given by the following
formulas (1) to (3).
[0042]
[0043]
16-04-2019
12
[0044]
[0045]
When the numerical values shown in FIG. 8 are applied to the equations (1) to (3), as shown in
FIG. 8, the acoustic inertance L = 7.2 × 10 <−4> g · cm <4>, the acoustic resistance R = 0 The
acoustic compliance C = 1.8 × 10 <−5> s <2> · cm <4> / g is calculated.
[0046]
In this case, the resonance frequency fq in the characteristics shown in FIG. 7 is given by
equation (4).
[0047]
[0048]
Using the specific values shown in FIG. 8, the resonance frequency fq is about 1.4 kHz.
The sound pressure detected by the microphone unit 203 is attenuated at a rate of 12 dB per
octave in the equivalent circuit (dotted line) of the indirect sound shown in FIG. 6A on the higher
side of the resonance frequency fq.
Further, on the high frequency side of the resonance frequency fq, the sound pressure of the
direct sound equivalent circuit (solid line) shown in FIG. 6B is attenuated at a rate of 6 dB for
each octave.
In the present invention, this characteristic is used to detect direct sound input.
[0049]
In FIG. 7, a region where the attenuation difference between the outputs of the two acoustic
16-04-2019
13
equivalent circuits is large, for example, a band of 12 kHz or more is set as a judgment frequency
band (denoted as a judgment band in FIG. 7).
In this case, the octave number Noct from the resonance frequency fq to the lower limit value
fmin of the determination frequency band is expressed by the following equation 5.
[0050]
[0051]
In the example shown in FIGS. 7 and 8, assuming that the lower limit value fmin of the judgment
frequency band is 12 kHz, from equation (5), Noct = LOG 2 (12 / 1.4) becomes approximately 3,
and for the resonance frequency fq It can be seen that the frequency is higher by about 3
octaves.
[0052]
Further, assuming that the sound pressure V2 of the direct sound (speech, solid line in FIG. 7) in
the frequency band equal to or higher than the resonance frequency fq is the attenuation factor
A2 (absolute value), the initial value is V 20 (0 dB in FIG. 7) It is represented by the following
equation 6.
[0053]
[0054]
In the example shown in FIGS. 7 and 8, the sound pressure V2 of the sound at 12 kHz, which is
the lower limit value of the determination frequency band, that is, the sound pressure V2 of the
solid line in FIG. It will be 18 dB.
[0055]
Also, assuming that the sound pressure V1 of the indirect sound (piano sound, broken line in FIG.
7) in the frequency band higher than the resonance frequency fq is the attenuation factor A1
(absolute value), the initial value is V 10 (0 dB in FIG. 7) , Is expressed by the following equation
7.
16-04-2019
14
[0056]
[0057]
In the examples shown in FIGS. 7 and 8, the sound pressure V1 of the piano sound at 12 kHz,
which is the lower limit value of the determination frequency band, that is, the sound pressure
V1 of the broken line in FIG. It becomes -36 dB.
[0058]
However, the voice to be uttered needs to be a voice including a level substantially equal to the
component of 12 kHz or more included in the piano sound.
For that purpose, for example, it is preferable that the sound be a transient sound with a sharp
rise or a consonant containing a large amount of high frequency components.
[0059]
As described above, under the conditions shown in FIGS. 7 and 8, the sound pressure V2 of the
direct sound at 12 kHz is -18 dB, and the sound pressure V1 of the indirect sound is -36 dB.
If the sound pressure Vmic input to the microphone unit 203 is equal to or higher than a
threshold, it can be determined that a direct sound has been input.
FIGS. 7 and 8 illustrate the case where the direct sound and the indirect sound input to the
microphone unit 203 have the same sound pressure in the low frequency band (the case where V
1 0 = V 2 0 = 0 dB). However, when the sound pressure of the piano sound (indirect sound) is
high (when the initial value is larger than 0 dB), the broken line graph is shifted upward.
Even in this case, it is possible to detect the input of the direct sound by setting the
determination frequency band in a frequency band in which the difference between the sound
pressure V2 of the direct sound and the sound pressure V1 of the indirect sound is sufficient.
16-04-2019
15
[0060]
[3.
Direct Sound Detection Processing] Next, details of the direct sound detection processing in the
signal processing unit 210 will be described.
[0061]
The signal processing unit 210 performs direct sound detection processing for detecting the
input of the direct sound by using the difference in acoustic characteristics between the direct
sound and the indirect sound as illustrated in FIG. 7.
When the direct sound is detected by the direct sound detection process, the signal processing
unit 210 outputs a control signal, in the present embodiment, a display switching flag Fsd, to the
display control unit 103 in the subsequent stage.
[0062]
As shown in FIG. 3, the signal processing unit 210 includes a low pass cutoff filter (HPF) 211, a
level detector 212, and a comparator 213.
[0063]
The HPF 211 removes or suppresses signals in a specific area, that is, a band other than the
determination frequency band.
The HPF 211 sets the frequency to be removed or suppressed according to the resonance
frequency fq derived from the shape of the waveguide unit 200 or the like.
For example, in the examples shown in FIG. 7 and FIG. 8, the HPF 211 is preferably a high-order
16-04-2019
16
low-pass cutoff filter that cuts signals of 12 kHz or less sharply because the region of 12 kHz or
more is the determination frequency band.
[0064]
The level detector 212 detects the level of the sound signal output from the HPF 211.
[0065]
The comparator 213 compares the level value detected by the level detector 212 with a preset
threshold value.
As a result of comparison, when the level value detected by the level detector 212 is larger, a
control signal (display switching flag Fsd) instructing switching of the display content is output to
the display control unit in the subsequent stage.
As the predetermined threshold, in the example shown in FIGS. 7 and 8, the sound pressure of
sound is -18 dB and the sound pressure of piano sound is -36 dB at a frequency of 12 kHz. For
example, -25 dB is selected. .
By setting the threshold in this way, when only the piano sound is input, the control signal is not
output because the level value -36 db detected by the level detector 212 is smaller than the
threshold -25 db.
On the other hand, when the direct sound is input, the level value −18 db detected by the level
detector 212 is larger than the threshold −25 db, so the control signal is output.
Therefore, if the threshold value is set to a value between the above-mentioned equation (6) and
equation (7), it becomes possible to output the display switching flag in response to only the
voice.
[0066]
From the above, direct sound and indirect sound can be obtained by the waveguide unit 200, the
16-04-2019
17
microphone unit 203 for collecting the sound passing through the waveguide unit 200, and the
signal processing unit 210 for processing the signal from the microphone unit 203. Even in an
environment in which only one or both are mixed, it is possible to accurately detect that a direct
sound has been input.
[0067]
More specifically, by using the waveguide unit 200 in which the acoustic characteristics of the
sound passing through the waveguide unit 200 and collected by the microphone unit 203 are
different between the direct sound and the indirect sound. The distinction between sounds and
indirect sounds is easier, and it is possible to extract only direct sounds.
The signal processing unit can select only the signal to be extracted using this different acoustic
characteristic. If the cross-sectional area of the light incident side (waveguide upper portion 201)
is smaller than the cross-sectional area of the sound collection side (waveguide lower portion
202), then the waveguide unit 200 The difference in acoustic characteristics between the direct
sound and the indirect sound according to the principle becomes large, and the direct sound can
be more easily detected. Here, the cross-sectional area is a cross-sectional area in a plane
perpendicular to the path of the sound that is perpendicularly incident on the microphone unit
203. The cross-sectional area is, for example, a cross-sectional area in a plane perpendicular to
the Z axis in the case of FIG. 4A.
[0068]
In the present embodiment, the acoustic characteristic is an attenuation amount in a frequency
band higher than the resonance frequency. The direct sound and the indirect sound have
different attenuation amounts for each octave in a frequency band higher than the resonance
frequency. Therefore, in the frequency band higher than the resonance frequency, the
attenuation amount of the direct sound signal level is smaller than the attenuation amount of the
indirect sound signal level, so the direct sound signal level becomes larger than the indirect
sound signal level. This enables the signal processing unit 210 to distinguish between direct
sound and indirect sound at the signal level in a frequency band higher than the resonance
frequency.
[0069]
16-04-2019
18
[4. Next, the details of the display switching process in the display control unit 103 will be
described based on FIGS. 9 to 10C. FIG. 9 is a flowchart of the display switching process. 10A to
10C are diagrams showing display states of the display panel 101 in each step of FIG.
[0070]
When the application program for displaying the score is activated and the displayed score and
page are designated, the display control unit 103 acquires display data of the designated page of
the score from the score DB 104 (step S11). Alternatively, all data of the musical score may be
read into a cache memory configured by a RAM (Randam Access Memory) or the like, and only
the corresponding page may be acquired from the cache memory.
[0071]
The display control unit 103 causes the display panel 101 to display a score using the acquired
display data (step S12). In the example shown in FIG. 10A, the display panel 101 exemplifies a
case where a score of two pages is displayed, and page 1 and page 2 are displayed.
[0072]
As shown in FIG. 10A, when the display switching flag Fsd is output from the signal processing
unit 210 (Yes in step S13), the display control unit 103 determines that the page of the currently
displayed score does not include the final page. (No in step S14), the next page of the page of the
currently displayed score is acquired from the score DB (step S15).
[0073]
The display control unit 103 causes the display panel 101 to display the page of the acquired
score (step S16).
Here, in the present embodiment, the display control unit 103 scrolls the page of the score. FIG.
10B shows the display state of the display panel 101 during page switching of the score. FIG.
16-04-2019
19
10C shows the display state of the display panel 101 after page switching of the score. Note that,
in FIGS. 10B and 10C, although the case of scrolling in the horizontal direction has been
described as an example, the present invention is not limited to this. The display control unit 103
may switch the display so as to scroll in the vertical direction, or may switch the display not to
scroll but to switch instantaneously, or may switch the display by another method.
[0074]
If the page of the currently displayed score includes the final page in step S14, the display
control unit 103 proceeds to step S13 without executing steps S15 and S16.
[0075]
Although not shown in FIG. 9, the display control unit 103 ends the display of the score on the
display panel 101 when the display end signal is input at an arbitrary timing.
Further, in the present embodiment, the case where the display is switched only in one direction
has been described as an example, but, for example, the switching direction of the page of the
score is changed according to the number of times direct sound is detected within a
predetermined time. It does not matter.
[0076]
[5. Effects, Etc.] As described above, in the present embodiment, the signal processing unit of
the voice input device includes direct sound that directly reaches the microphone unit without
being reflected by the inner side surface of the waveguide unit, and The signal processing is
performed using the difference in acoustic characteristics from the indirect sound that is
reflected by the inner side surface (inner wall) and indirectly reaches the microphone unit. Here,
the difference between the acoustic characteristics of the direct sound and the indirect sound
means that, as described above, in the frequency band above a predetermined frequency, for
example, in the frequency band above the resonance frequency, the attenuation of the sound
pressure of the direct sound is more indirect It shows that the amount of attenuation of the
sound pressure of the sound increases. As a result, in the determination frequency band set to
the frequency band higher than the resonance frequency, the difference between the sound
pressure of the direct sound and the sound pressure of the indirect sound becomes large.
16-04-2019
20
[0077]
As described above, the signal processing unit according to the present embodiment is different
from the sound pressure and the indirect sound attenuation amount due to the difference
between the direct sound attenuation amount and the judgment frequency band set in the
frequency band higher than the resonance frequency. Since the difference between the sound
pressure and the sound pressure is used, it is possible to detect the direct sound input with high
accuracy.
[0078]
More specifically, for example, a threshold is set between the lower limit value of the sound
pressure of the direct sound and the upper limit value of the sound pressure of the indirect
sound, and the sound pressure of the sound detected by the microphone unit is not less than the
threshold If it is determined whether or not it is possible, the direct sound input can be detected
with high accuracy.
[0079]
The waveguide of this embodiment is divided into two parts, an inlet part and an outlet part, in
order to bring out the difference in acoustic characteristics more clearly between direct sound
and indirect sound.
And the cross-sectional area of the waveguide upper part which is an entrance part of a
waveguide part is smaller than the cross-sectional area of the waveguide lower part which is an
exit part of a waveguide.
Due to the waveguide having such a shape, the difference between the acoustic characteristics of
the direct sound and the indirect sound is increased by the principle of the Herzholm resonance.
[0080]
The display device according to the present embodiment can detect the voice uttered by the user
with high accuracy by detecting the direct sound input using the above-described voice input
device. This makes it possible to switch the display with high accuracy in accordance with the
16-04-2019
21
voice of the user. It is possible to prevent an erroneous operation such as switching the display
when the user is not emitting voice, and power consumption can be reduced.
[0081]
(Modifications of the Embodiment, etc.) (1) In the above embodiment, the music score display
device 100 has been described by way of example in which the music score DB is constructed in
the memory in the device. Absent. The score display apparatus 100 may be configured to obtain
a score from another device such as a pocket server via, for example, a network.
[0082]
Moreover, although the case where the score display apparatus 100 is an apparatus which
displays the score of a piano was demonstrated to the example in the said embodiment, it does
not restrict to this. The musical score display device 100 is useful as a display device that
displays musical score of an instrument such as an organ or the like in which the sound of an
instrument is input as an indirect sound and the voice emitted by the player is input as a direct
sound. Furthermore, the score display apparatus 100 may be configured to display scores of a
plurality of types of musical instruments.
[0083]
(2) In the above embodiment, the score display device 100 is described as an example of a tablet
terminal. However, the present invention is not limited to this. The score display device 100 may
be realized by using a smartphone or the like, or may be realized by a dedicated device.
[0084]
Further, the musical score display device 100 does not necessarily have to realize the display
panel 101 and the voice input device 102 with the same device. For example, a tablet terminal, a
smartphone, or the like may be used as the voice input device 102, and a display panel of
another device or a dedicated display panel may be configured as the display panel 101.
16-04-2019
22
[0085]
(3) In the above embodiment, the display panel 101 is installed so that the longitudinal direction
(X direction in FIG. 1) is the lateral direction of the music stand, and the example of displaying
the score of two pages is illustrated. It is not limited to this. It may be installed so that the short
side direction (Y direction in FIG. 1) is the horizontal direction of the music stand, and one page
of score may be displayed. 11A to 11C show display states of the display panel 101 in the case of
displaying a score for one page. In this case, as shown in FIGS. 11A to 11C, the installation
position of the voice input device 102 may be disposed below the display panel 101 in the
installation state.
[0086]
In the above embodiment, in the case where the score for two pages is displayed on the display
panel 101, the case where the display is switched from page 1 and page 2 to page 3 and page 4
will be described as an example. Although explained, it is not limited to this. For example, the
state in which page 1 and page 2 are displayed may be switched to the state in which page 2 and
page 3 are displayed.
[0087]
(4) In the above embodiment, the voice input device 102 has been described as an example
incorporated in the score display device 100 for displaying a score, but the present invention is
not limited to this.
[0088]
The voice input device 102 may be incorporated into another display device used under an
environment in which direct sound and indirect sound are mixed, such as a photo frame with a
music reproduction function, for example.
In the case of such a display device, for example, when a direct sound is detected, control to
switch the display is performed.
16-04-2019
23
[0089]
When the voice input device 102 is used in a device other than the score display device 100, not
only display control but also other operation control may be performed using the direct sound
detection function of the voice input device 102. I do not mind.
[0090]
For example, it can be incorporated into a steering wheel of an automobile or the like to be used
as a device for detecting a driver's voice (direct sound).
In this case, when the driver's voice is detected to the in-vehicle device, a voice detection signal is
output, whereby the in-vehicle device performs processing according to the driver's voice (for
example, car navigation or in-vehicle AV device It becomes possible to execute on-off processing
etc.).
[0091]
FIG. 12 is a view showing an example of a steering wheel 500 of a car incorporating the voice
input device 102. As shown in FIG. As shown in FIG. 12, the voice input device 102 is
incorporated in the central portion of the handle 500. With this configuration, the driver's voice
is input as a direct sound to the voice input device 102, and another sound such as the
passenger's voice is input as an indirect sound.
[0092]
(5) In the above embodiment, the voice input device 102 only detects direct sound, and does not
execute analysis of what kind of voice the direct sound is, but it is configured to perform voice
analysis. It does not matter.
[0093]
(6) Although the waveguide unit 200 has a shape in which two cylinders are combined in the
above embodiment, the present invention is not limited to this.
16-04-2019
24
For example, as shown to FIG. 4B, it may be the shape which combined two square pillars from
which cross-sectional area differs, and as shown to FIG. 4C, cross-sectional area may be the same.
Note that the dimensions of the upper waveguide portion 201 and the dimensions of the lower
waveguide portion 202 depend on the installation space of the voice input device 102 in the
musical score display device 100 and the type of indirect sound, for example, indirect sound due
to difference in type of musical instrument. Make appropriate settings in consideration of
differences in frequency characteristics.
[0094]
(7) In the above embodiment, the signal processing unit 210 and the display control unit 103 are
typically realized as an LSI which is an integrated circuit. These may be individually made into
one chip, or may be made into one chip so as to include some or all. Further, although an LSI is
used here, it may be called an IC, a system LSI, a super LSI, or an ultra LSI depending on the
degree of integration. Further, the method of circuit integration is not limited to LSI's, and
implementation using dedicated circuitry or general purpose processors is also possible. After
the LSI is manufactured, a field programmable gate array (FPGA) that can be programmed or a
reconfigurable processor that can reconfigure connection and setting of circuit cells in the LSI
may be used.
[0095]
Furthermore, if integrated circuit technology comes out to replace LSI's as a result of the
advancement of semiconductor technology or a derivative other technology, it is naturally also
possible to carry out function block integration using this technology. Adaptation of
biotechnology etc. may be possible.
[0096]
Further, the signal processing unit 210 and the display control unit 103 may be realized as a
computer program (software) for causing a computer to execute the processing executed by the
signal processing unit 210 and the display control unit 103.
[0097]
16-04-2019
25
In this case, the computer program or the recording medium capable of reading the digital signal
from the computer, such as a flexible disk, hard disk, CD-ROM, MO, DVD, DVD-ROM, DVD-RAM,
BD (Blu-ray Disc), semiconductor It may be realized by being recorded in a memory or the like.
The present invention may also be realized as the digital signal recorded on these recording
media.
[0098]
Further, the computer program or the digital signal may be transmitted via a telecommunication
line, a wireless or wired communication line, a network represented by the Internet, data
broadcasting, or the like.
[0099]
In addition, it is implemented by another independent computer system by recording and
transferring the program or the digital signal on the recording medium, or transferring the
program or the digital signal via the network or the like. It may be
[0100]
As described above, the embodiment has been described as an example of the voice input device
and the display device in the present disclosure.
For that purpose, the attached drawings and the detailed description are provided.
[0101]
Therefore, among the components described in the attached drawings and the detailed
description, not only components essential for solving the problem but also components not
essential for solving the problem in order to exemplify the above-mentioned technology May also
be included.
16-04-2019
26
Therefore, the fact that those non-essential components are described in the attached drawings
and the detailed description should not immediately mean that those non-essential components
are essential.
[0102]
Moreover, since the above-mentioned embodiment is for illustrating the technique in this
indication, various change, substitution, addition, omission, etc. can be performed within the
range of a claim or its equivalent.
[0103]
The present disclosure is applicable to an apparatus that performs control by voice.
Specifically, the present invention is applicable to an electronic score using a display device such
as a tablet, a personal computer, and an in-vehicle device.
[0104]
DESCRIPTION OF SYMBOLS 100 score display apparatus 101 display panel 102 audio | voice
input apparatus 103 display control part 104 score DB200 waveguide part 201 waveguide part
upper part 202 waveguide part lower part 203 microphone part 210 signal processing part 211
HPF212 level detector 213 comparator 400, R acoustic Resistance 401, L acoustic inertance
402, C acoustic compliance 403, Rx variable resistance 500 handle
16-04-2019
27
Документ
Категория
Без категории
Просмотров
0
Размер файла
41 Кб
Теги
description, jpwo2013118204
1/--страниц
Пожаловаться на содержимое документа