DESCRIPTION JP2014179983

Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
Abstract: The present invention provides a video conference system that outputs a talker's voice
as if it came from near the talker's image on a display. A plurality of displays are arranged side
by side, a plurality of upper speakers are disposed near the top of the respective displays, and a
plurality of lower speakers are disposed near the bottom of the respective displays. A control unit
110 operatively connected to the displays and the speakers routes video to each of the displays
and routes the audio corresponding to each display to the upper and lower speakers located near
that display. [Selected figure] Figure 1
Speaker arrangement with on-screen voice positioning function for telepresence system
[0001]
The present invention relates to a speaker arrangement with on-screen voice positioning function
for telepresence systems. This application claims the benefit of US Provisional Application No.
61/779,591, filed March 13, 2013, which is incorporated herein by reference in its entirety.
[0002]
Organizations use video conferencing systems when holding meetings. The basic video
conferencing system includes one or two screens and a video conferencing device. However,
organizations are constantly looking for ways to make video conferencing more interactive and
more effective.
11-04-2019
1
[0003]
One way to achieve interactive video conferencing is to create a conference room with specially
designed displays, cameras, microphones, and tables. This type of video conferencing system
requires extensive space and capital investment to build. An example of this type of video
conferencing system is the Polycom® RealPresence™ Experience High Definition (RPX™ HD)
system.
[0004]
For example, in the RPX™ HD system, a special room is set up to accommodate 4 to 28
participants. The specially designed room includes special furniture, chairs, a soundproof ceiling,
studio lighting, acoustic panel walls, video conferencing equipment, ceiling microphones, and
full-screen video displays (8' x 42" or 16' x 42"). Another telepresence system is the
Polycom® Open Telepresence Experience™ (Polycom OTX™) solution. This solution
consists of an integrated display and a set of cameras placed directly on the participants' table.
[0005]
Current video conferencing systems are optimized for stereo audio using stereo speakers.
Because the system uses microphones near the speakers, the local microphones can pick up the
far-end audio from the speakers, so the system uses echo cancellation to prevent echoed audio
from being sent back to the far end. Because such echoes are distracting, current video
conferencing systems are optimized primarily for echo cancellation. They can perform echo
cancellation well in stereo environments, but cannot be applied to more complex environments.
[0006]
Stereo speakers are placed on the left and right of the displays of the telepresence system.
However, because the seating area of a telepresence system is large, stereo speakers in these
locations do not work very well. Most of the participants sit outside the "sweet spot" of the
stereo speakers and hear the talker's voice only from the closest speaker (on the left or right
side of the display set).
[0007]
In previous telepresence systems, instead of placing speakers on both sides of the displays, the
speakers were placed above or below the displays. For example, when an LCD screen or the
like is used, the speaker is placed at the edge of the display (or further beyond). As a result,
there is a distance between the location of the speaker output and the talker's video image,
which not only seems unnatural but also weakens the interactive feeling of the multi-display
telepresence system. For these reasons, it is difficult to create the illusion that the sound is
coming from the center of an electronic display (where speakers cannot be placed).
[0008]
If the display used is small, the distance between the speaker and the center of the display screen
is not large, and the mismatch between the speaker output and the talker's video image is not
bothersome. The use of taller screens in telepresence systems, however, results in greater vertical
misalignment between the on-screen talker's image and the apparent position of the talker's
voice, resulting in an unnatural and unpleasant experience.
[0009]
The subject matter of the present disclosure relates to overcoming, or at least reducing the
effects of, one or more of the above problems.
[0023] FIG. 1 illustrates an integrated video conferencing system according to the present
disclosure.
[0024] FIGS. 2A-2C show an integrated video conferencing system embedded in a credenza or
other enclosure.
[0025] FIG. 3 schematically illustrates one arrangement of displays and speakers in the disclosed
system.
[0026] FIGS. 4A, 4B, and 4C each show an exemplary configuration of a speaker in the disclosed
system.
[0027] FIG. 5 schematically illustrates another arrangement of displays and speakers in the
disclosed system, including a content display.
[0028] FIG. 6A schematically illustrates another arrangement of displays and speakers in the
disclosed system that does not have an upper center speaker.
[0029] FIG. 6B schematically illustrates another arrangement of displays and speakers in the
disclosed system that does not have a lower center speaker.
[0030] FIG. 7 schematically illustrates an arrangement of the disclosed system having two
displays and vertical pairs of speakers.
[0031] FIG. 8A schematically illustrates an arrangement of the disclosed system having five
displays and vertical pairs of speakers.
[0032] FIG. 8B schematically illustrates an arrangement of the disclosed system with five displays
and vertical pairs of speakers, without an upper center speaker.
[0033] FIG. 8C schematically illustrates an arrangement of the disclosed system with five displays
and vertical pairs of speakers, but without two of the upper speakers.
[0034] FIGS. 9A-9C schematically illustrate arrangements of the disclosed system having a
display in the form of one or more projector screens.
[0035] FIGS. 10A-10C schematically illustrate arrangements of the disclosed system used for
multipoint video conferencing calls.
[0036] An integrated video conferencing system 100 (also known as a telepresence system),
schematically illustrated in FIG. 1, includes a control unit 110, a plurality of camera units
130-LCR, and a plurality of displays 140-LCR. In the present embodiment, the system 100 has three
camera units 130-LCR and three displays 140-LCR, but the number of camera units and displays
can be changed. The system 100 also includes additional audio/visual (A/V) devices and near-end
components, including microphones 150 and vertical speaker pairs 160-LCR. The microphones 150
can be attached to the ceiling or to a table.
[0037] The control unit 110 may be an integral component or may have a plurality of
operatively coupled components. As shown, the control unit 110 includes a controller 115 and
three codecs 120-LCR (left (L), center (C), and right (R)). The codecs 120-LCR process the video
streams for the video conference. One of the codecs, 120-C, is the primary codec and interfaces
with the microphones 150 and the vertical speaker pairs 160-LCR of the system via audio lines to
process the video conference audio. The primary codec 120-C can also be connected to a content
display (not shown) and to content input via an XGA or audio line. To process video, each codec
120-LCR is connected to one camera unit 130-LCR via a component line. Each codec 120-LCR is
also connected to one display 140-LCR via a DVI line.
[0038] The controller 115 controls the operation of the system 100 in conjunction with each
codec 120-LCR and display 140-LCR via control lines. The controller 115 can include a
computer that coordinates the A/V system and networking details. The controller 115 and the
codecs 120-LCR, which include conventional video conferencing components, conduct video
conferences between near-end participants and one or more far-end units 10 via the
communication connection 112. These video conferencing components are known and used
in the art and are not described in detail here.
[0039] The control unit 110 receives video conferencing information from one or more far-end
units 10. The far-end unit may be a single comparable video conferencing system in a
point-to-point call, or multiple separate far-end endpoints in a multipoint call. In either case,
multiple video feeds are provided to the control unit 110 along with multiple associated audio
channels and are processed by the system 100. The control unit 110 also receives multiple video
feeds from the cameras 130-LCR and creates a comparable number (e.g., three) of audio channels
from the microphones 150. Each channel can use a separate microphone 150 or a combination or
group of microphones 150. The required channels can also be created using beam steering
across multiple microphones 150, or other audio isolation techniques. As can be appreciated from
other embodiments disclosed herein, the number of channels in the system 100 can be increased
or decreased, but the number of audio channels is preferably matched to the number of video
feeds captured.
[0040] Because the system 100 has multiple speakers 162 and microphones 150, the control unit
110 preferably uses echo cancellation for each audio channel generated. In the present example,
where the system 100 outputs three audio channels through the speaker pairs 160-LCR and
captures three audio channels from the microphones 150, the disclosed system 100 preferably has
the ability to perform echo cancellation on three correlated audio channels. Because the number
of channels of the system 100 disclosed herein may be higher or lower, the echo cancellation
function preferably has the ability to handle the required number of channels.
[0041] As shown in the embodiment of FIG. 2A, the control unit 110, the codecs 120-LCR, the
camera units 130-LCR, the displays 140-LCR, and the vertical speaker pairs 160-LCR can all be
integrated in a housing, or credenza, 170. Thus, with the use of the credenza 170, the
system 100 can be installed in office spaces or rooms that are not otherwise prepared for these
video conferencing components. The system 100 with such an integrated credenza 170 can also be
installed in a room or space specifically designed to house the components.
[0042] As shown in the plan view of FIG. 2B, the displays 140-LCR can be arranged substantially
flat relative to each other along a plane or wall, so that the corresponding speaker pairs 160-LCR
can likewise be arranged in a plane. In FIG. 2C, however, the displays 140-LCR are arranged at
an angle relative to one another, and the speaker pairs 160-LCR are arranged likewise. In either
case, the participants sit facing the displays 140-LCR, in seats arranged opposite them. The seats
are preferably preset at a specified distance and orientation relative to the displays 140-LCR and
other components of the disclosed system 100. In fact, as shown in FIG. 2B, an integrated desk
175 can be used to arrange the seating.
[0043] As illustrated in FIG. 1 and shown more clearly in the schematic arrangement of the
system in FIG. 3, each display 140-LCR of the system 100 has a vertical speaker pair 160-LCR.
Each vertical speaker pair 160-LCR has an upper speaker 162-A located at the top of its display
140-LCR and a lower speaker 162-B located at the bottom of that display 140-LCR, along the
display's vertical centerline.
[0044] As shown in FIG. 3, in a video conference using the system 100, the voice of a talker
displayed on a given display 140-LCR is reproduced from the vertical speaker pair 160-LCR
corresponding to that display. The routing of audio to a specific speaker pair 160-LCR, based on
which participant is displayed on a specific display 140-LCR, is handled by the control unit 110
and/or the codec 120-LCR corresponding to the display. For example, when the video of far-end
participants is routed to the left display 140-L, the audio associated with those far-end
participants is routed to that display's vertical speaker pair 160-L for output.
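The routing rule of paragraph [0044] can be pictured as a lookup from display to speaker pair. The following is only a minimal sketch under stated assumptions: the table `DISPLAY_TO_PAIR`, the function `route_audio`, and the stream labels are hypothetical names introduced for illustration and do not appear in the disclosure, which leaves the routing mechanism to the control unit 110 and the codecs.

```python
from typing import Dict

# Hypothetical lookup: each display is associated with the vertical speaker
# pair mounted above and below it (reference numerals follow the description).
DISPLAY_TO_PAIR: Dict[str, str] = {
    "140-L": "160-L",
    "140-C": "160-C",
    "140-R": "160-R",
}

def route_audio(video_routing: Dict[str, str]) -> Dict[str, str]:
    """Given stream -> display, return stream -> speaker pair of that display."""
    return {stream: DISPLAY_TO_PAIR[display]
            for stream, display in video_routing.items()}

# Illustrative stream labels (assumed): audio follows the video of each stream.
routes = route_audio({"far_end_left": "140-L", "far_end_center": "140-C"})
print(routes)
```

In this sketch the audio route is derived entirely from the video route, so moving a far-end participant to another display moves the voice with it.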
[0045] The upper and lower speakers 162-A, 162-B of each pair 160-LCR are preferably angled,
and the balance of their outputs is preferably adjusted, so that the voice of a participant
displayed on a given display 140-LCR is perceived as coming approximately from the center of
that display. Overall, the angle and balance of the speakers 162-A, 162-B depend on the expected
number and location of the near-end participants who will sit in front of the displays 140-LCR,
and the details of angle and balance can be configured for a particular implementation.
[0046] Furthermore, the speakers 162 are preferably arranged so as not to take up much space.
For example, FIGS. 4A-4C illustrate various types of speaker configurations for use in the
disclosed system. As shown in FIG. 4A, one speaker configuration 164 has a tweeter 166 arranged
concentrically with the woofer 168. This configuration 164 is more compact and is easier to place
above and below the display 140. As shown in FIG. 4B, another speaker configuration 164'
includes a satellite tweeter 166 mounted vertically relative to the woofer 168. This is a
conventional configuration and is aligned with the vertical speaker pair 160 of the system. As
shown in FIG. 4C, the satellite tweeter 166 can instead be mounted horizontally relative to the
woofer 168, but this configuration 164'' is not preferred for placement above or below the display
140, because horizontal mounting is off-axis with respect to the vertical speaker pair 160.
[0047] Even with such considerations, the near-end participants of the system 100 in FIG. 3
tend to hear the low-frequency output mainly from the lower speaker 162-B of each pair 160-LCR
and the high-frequency output mainly from the upper speaker 162-A. To compensate for this, the
control unit 110 includes a shelving equalizer (not shown) that boosts the low frequencies of the
upper speaker 162-A of each pair 160-LCR and boosts the high frequencies of the lower speaker
162-B. In one example, a low shelf at 700 Hz (+/− 3 dB) and a high shelf at 4 kHz (−/+ 3 dB)
can be applied with opposite gains to the upper and lower speakers 162-A, 162-B. This has the
effect of sharpening the perceived position, so that the output seems more clearly to come from
the center of each display 140-LCR. Other equalization techniques can also be used,
depending on the implementation.
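As an illustration of the shelving equalization in paragraph [0047], the sketch below builds low-shelf and high-shelf biquad filters with the widely used Audio EQ Cookbook formulas (shelf slope S = 1) and checks their gain at the band edges. It rests on several assumptions not stated in the disclosure: the 48 kHz sample rate, the biquad filter structure, the function names, and the reading of "+/− 3 dB" as +3 dB on the upper speaker and −3 dB on the lower speaker for the low shelf (and the reverse for the high shelf).

```python
import cmath
import math

FS = 48000.0  # assumed sample rate; the disclosure does not specify one

def shelf_biquad(kind, f0, gain_db, fs=FS):
    """Audio EQ Cookbook shelving biquad (slope S = 1); returns normalized (b, a)."""
    A = 10.0 ** (gain_db / 40.0)
    w0 = 2.0 * math.pi * f0 / fs
    c = math.cos(w0)
    alpha = math.sin(w0) / 2.0 * math.sqrt(2.0)  # value of alpha when S = 1
    k = 2.0 * math.sqrt(A) * alpha
    if kind == "low":
        b = [A * ((A + 1) - (A - 1) * c + k),
             2 * A * ((A - 1) - (A + 1) * c),
             A * ((A + 1) - (A - 1) * c - k)]
        a = [(A + 1) + (A - 1) * c + k,
             -2 * ((A - 1) + (A + 1) * c),
             (A + 1) + (A - 1) * c - k]
    elif kind == "high":
        b = [A * ((A + 1) + (A - 1) * c + k),
             -2 * A * ((A - 1) + (A + 1) * c),
             A * ((A + 1) + (A - 1) * c - k)]
        a = [(A + 1) - (A - 1) * c + k,
             2 * ((A - 1) - (A + 1) * c),
             (A + 1) - (A - 1) * c - k]
    else:
        raise ValueError("kind must be 'low' or 'high'")
    return [x / a[0] for x in b], [x / a[0] for x in a]

def gain_db_at(b, a, f, fs=FS):
    """Magnitude response of one biquad at frequency f, in dB."""
    z1 = cmath.exp(-1j * 2.0 * math.pi * f / fs)  # z^-1 on the unit circle
    h = (b[0] + b[1] * z1 + b[2] * z1 * z1) / (a[0] + a[1] * z1 + a[2] * z1 * z1)
    return 20.0 * math.log10(abs(h))

# Assumed reading of the example: opposite shelf gains for upper and lower speakers.
UPPER_EQ = [("low", 700.0, +3.0), ("high", 4000.0, -3.0)]
LOWER_EQ = [("low", 700.0, -3.0), ("high", 4000.0, +3.0)]

def chain_gain_db(chain, f):
    """Total gain in dB of cascaded shelves at frequency f."""
    return sum(gain_db_at(*shelf_biquad(kind, f0, g), f) for kind, f0, g in chain)

for name, chain in (("upper 162-A", UPPER_EQ), ("lower 162-B", LOWER_EQ)):
    print(name, round(chain_gain_db(chain, 0.0), 2), round(chain_gain_db(chain, FS / 2), 2))
```

With these assumed settings the two speakers of a pair receive mirror-image tilts, which is one way to read "opposite gains" in the example.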
[0048] During a video conference, the horizontal (left-to-right) perceived position of each
talker's voice is clearly at the center of the display 140-LCR on which that talker's video image
is shown. This position is perceived very accurately by participants sitting anywhere in the
viewing area (which may be extensive). In addition, vertically, the sound is perceived as coming
from the height of the talker's head, i.e., the center of each display 140-LCR. This creates a very
interactive sensation in the room, which is a comfortable and natural experience for the end user.
[0049] Preferably, all the displays 140-LCR of the system 100 have vertical pairs 160-LCR. In
some implementations, an additional content display 142 may be used to display content during
a video conference, with the content display 142 placed above the center display 140-C, as shown
in FIG. 5. In this arrangement, the upper speaker 162-A of the center pair 160-C is placed
further up, above the content display 142. The system 100 is configured to accomplish the same
purpose as the other systems disclosed herein. As can be seen from the above description, the
delay, equalization, gain, or other settings of the speaker pairs 160-LCR (especially those for the
offset upper speaker of the center pair 160-C) differ from those of the previous arrangement in
order to compensate for the offset distance.
[0050] In some implementations, it may not be possible to place a speaker directly above a
display. For example, FIG. 6A illustrates another arrangement of the displays 140-LCR and
speaker pairs 160-LCR of the disclosed system 100, in which the content display 142 is placed
above the center display 140-C. Because of the placement height and the size of the room, it may
not be possible to place the upper center speaker of the center pair 160-C directly above the
center display 140-C. Alternatively, an upper center speaker may be impossible to add to the
system 100 because of the location of a camera (not shown) or obstructions other than the
content display 142.
[0051] In either case, a combination of gain settings and input channel mixing is used to
compensate for the unavailable upper center speaker. Table 1 below shows an example of mixing
gain values that compensate for the unavailable upper center speaker using the five speakers of
the system 100 of FIG. 6A.
[0052] The gain and mixing settings shown here and elsewhere in this disclosure are exemplary.
The values, and the differences between them, may vary with the specific details of the
implementation, such as the size of the room, the size of the displays, the vertical and
horizontal distances between speakers, and other factors. In other arrangements, the high
frequencies of the lower speaker 162-B can be amplified and the low frequencies of the upper
speaker 162-A can be amplified. Of course, the equalization settings in this and other
arrangements of the present disclosure may also differ because of differences in gain and mixing
settings.
[0053] The gain and mixing settings virtualize the unavailable upper center speaker. In fact,
experiments have shown that a virtual upper center speaker can be created by mixing the audio
designated for the unavailable upper center speaker, at reduced gain, into the upper left and
upper right speakers 162-A and by amplifying the output of the lower center speaker 162-B. As a
result, as desired, the center channel information is perceived as being output from the center
display 140-C.
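The virtualization described in paragraph [0053] amounts to a mixing matrix from input channels to the five available speakers. The sketch below shows the structure only. The gain values are placeholders chosen for illustration, because the values of Table 1 are not reproduced in this text; the speaker labels, `MIX`, and `mix_frame` are likewise hypothetical names.

```python
def db_to_linear(gain_db: float) -> float:
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)

# Placeholder gains (NOT the Table 1 values, which are unavailable here):
# the center channel is mixed at reduced gain into the upper left and upper
# right speakers and boosted in the lower center speaker.
MIX = {
    "upper_L": {"L": 1.0, "C": db_to_linear(-6.0)},
    "upper_R": {"R": 1.0, "C": db_to_linear(-6.0)},
    "lower_L": {"L": 1.0},
    "lower_C": {"C": db_to_linear(+3.0)},
    "lower_R": {"R": 1.0},
}

def mix_frame(frame: dict) -> dict:
    """Map one sample per input channel (L, C, R) to one sample per speaker."""
    return {
        speaker: sum(gain * frame.get(channel, 0.0) for channel, gain in gains.items())
        for speaker, gains in MIX.items()
    }

# A center-only signal spreads over the two upper side speakers and the lower center.
out = mix_frame({"L": 0.0, "C": 1.0, "R": 0.0})
print(out)
```

Reversing the roles of the upper and lower rows in such a matrix gives the corresponding structure for a missing lower center speaker.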
[0054] With gain and mixing settings comparable to those used to virtualize an unavailable upper
center speaker, an unavailable lower center speaker can likewise be virtualized in the disclosed
system 100. For example, FIG. 6B schematically illustrates another arrangement of the displays
140-LCR and speaker pairs 160-LCR in the disclosed system 100 without a lower center speaker.
In this case, the unavailable lower center speaker can be virtualized by reversing the gain and
mixing settings described above.
[0055] Table 2 below shows an example of mixing gain values for the five speakers of the system
100 of FIG. 6B.
[0056] In the previous embodiments, the disclosed system 100 had three displays 140-LCR
arranged side by side. In general, the teachings of the present disclosure apply to a disclosed
system 100 in which multiple displays (e.g., two or more) are arranged side by side. For example,
FIG. 7 schematically illustrates one arrangement of the disclosed system having two displays
140-LR and two vertical speaker pairs 160-LR. The control unit 110 routes video and audio
to each display 140-LR and vertical speaker pair 160-LR and achieves the same purpose
as disclosed herein for the other systems.
[0057] In yet another embodiment, FIG. 8A schematically illustrates an arrangement of the
disclosed system 100 having five displays 140 (L2, L1, C, R1, R2), each with one of five vertical
speaker pairs 160 (L2, L1, C, R1, R2). The control unit 110 routes video and audio to
each display 140 (L2, L1, C, R1, R2) and vertical speaker pair 160 (L2, L1, C, R1, R2) and
achieves the same purpose as disclosed herein for the other systems. In particular, the system 100
requires a five-channel acoustic echo canceller to address possible acoustic coupling between the
speaker pairs 160 and the microphones (not shown).
[0058] As before, an arrangement of the disclosed system 100 may lack one or more upper or
lower speakers in a given vertical speaker pair 160. For example, FIG. 8B schematically
illustrates an arrangement of the disclosed system 100 that has five displays 140 (L2, L1, C, R1,
R2) and vertical speaker pairs 160 (L2, L1, C, R1, R2) but does not have an upper center speaker
directly above the center display 140-C. With gain and mixing settings comparable to those of
Table 1 above, the lower speaker of the center pair 160-C and the upper speakers 162-A of the
inner left pair 160-L1 and the inner right pair 160-R1 can be used to virtualize the unavailable
upper center speaker.
[0059] In another embodiment, FIG. 8C schematically illustrates an arrangement of the disclosed
system 100 that has five displays 140 (L2, L1, C, R1, R2) and vertical speaker pairs 160 (L2, L1,
C, R1, R2) but does not have upper speakers 162-A in the inner pairs 160-L1 and 160-R1. With
gain and mixing settings comparable to those disclosed above, the lower speaker of the inner left
pair 160-L1, the upper speaker 162-A of the outer left pair 160-L2, and the center pair 160-C can
be used to virtualize the unavailable upper inner left speaker. Similarly, with comparable gain and
mixing settings, the lower speaker of the inner right pair 160-R1, the upper speaker 162-A of the
outer right pair 160-R2, and the center pair 160-C can be used to virtualize the unavailable upper
inner right speaker.
[0060] Table 3 below shows an example of mixing gain values for the system 100 of FIG. 8C.
[0061] The system 100 disclosed above addresses the playback of audio from the far end 10
at each display 140 showing the video of the participants to whom the audio corresponds. Thus,
the voice of a participant displayed on a particular display 140 is routed to the speaker pair 160
associated with that display 140. The system 100 disclosed herein is also configured to associate
the near-end audio captured by the microphones 150 with the video captured by the cameras
130-LCR, so that corresponding audio and video are transmitted to the far end and can be routed
similarly by the far-end system. To this end, the system 100 disclosed herein can rely on one or
more dedicated microphones 150 for each of the cameras 130-LCR. Alternatively, as mentioned
above, beam steering, talker detection, and similar techniques known in the art can also be used.
[0062] The system 100 disclosed above has matching numbers of speaker pairs 160 and
displays 140, but this is not strictly necessary. For example, the system 100 disclosed herein can
have five displays and four speaker pairs 160. Furthermore, the description above presumes that
the display 140 is an electronic display such as an LCD screen. This too is not strictly necessary,
as the display 140 may be a projector screen onto which high-resolution video is projected. In
fact, FIG. 9A shows an arrangement of the disclosed system 100 with a display 200 in the form
of a wide screen (comprising one or more separate projector screens). Wide-angle views are
projected onto this wide screen by one or more remote projectors that are video components of
the system.
[0063] Various participants are projected on the screen 200. As mentioned above, the displayed
participants may be located at the same far end 10 or may be participants from multiple far ends
of the multipoint connection.
[0064] In any case, the system 100 is configured to have a plurality of vertical speaker pairs 160,
with upper speakers 162-A located along the top of the screen 200 and lower speakers 162-B
located along the bottom of the screen 200. Although five pairs 160 (L2, L1, C, R1, R2) are
shown, the number of pairs 160 can be greater or smaller. Similarly, the number of participant
video images displayed on the screen 200 can be greater or smaller than illustrated. In fact, since
the screen 200 is intended to be continuous, there are no distinct displays as in the previous
arrangements. Thus, participants can be projected onto different parts of the wide screen 200,
and the number of available speaker pairs 160 may differ from the number of distinct
participants, video feeds, participant groups, portions of the screen, etc.
[0065] In this arrangement, the system 100 operates each speaker pair 160 as described above
to output the voice of the participant projected onto the corresponding portion of the screen 200,
in a manner similar to the previous embodiments. Because the number of wide screens 200 and
the number of speaker pairs 160 may differ, the system 100 can selectively adjust the number
and arrangement of the speaker pairs 160 used, based on the number of screens 200 or on which
portions of the screen 200 are used to display the various participants' video.
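One simple way to picture the selection described in paragraph [0065] is to choose, for each portion of the wide screen, the speaker pair whose horizontal position is closest to the center of that portion. This is only a sketch of one possible rule: the normalized positions in `PAIR_POSITIONS` and the function `pair_for_region` are assumptions for illustration, and the disclosure does not state how the selection is made.

```python
# Assumed horizontal positions of the five speaker pairs, normalized so that
# 0.0 is the left edge and 1.0 is the right edge of the wide screen 200.
PAIR_POSITIONS = {
    "160-L2": 0.1,
    "160-L1": 0.3,
    "160-C": 0.5,
    "160-R1": 0.7,
    "160-R2": 0.9,
}

def pair_for_region(x_start: float, x_end: float) -> str:
    """Return the speaker pair nearest the center of a screen region."""
    center = (x_start + x_end) / 2.0
    return min(PAIR_POSITIONS, key=lambda pair: abs(PAIR_POSITIONS[pair] - center))

# A participant projected on the left quarter of the screen.
print(pair_for_region(0.0, 0.25))
# A participant projected around the middle of the screen.
print(pair_for_region(0.4, 0.6))
```

A region that falls between two pairs could instead be shared between them with suitable gains, in the spirit of the mixing settings discussed earlier.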
[0066] In the system 100 of FIG. 9A, the speakers 162 are located outside the screen 200. This is
not necessary if the screen is acoustically transparent. In the case shown in FIG. 9B, the vertical
speaker pairs 160 are positioned relative to an acoustically transparent screen 210 so that one or
both of the upper and lower speakers 162 are located behind the acoustically transparent screen
210 (e.g., within the area of the screen).
[0067] In another arrangement, as shown in FIG. 9C, instead of using pairs, the system 100 uses
speakers 220 located behind the acoustically transparent screen 210, each at the center of a
horizontal portion of the screen 210, and outputs the voice of the talker shown in each portion
from the corresponding speaker. This would seem to be the most intuitively acceptable
arrangement, provided the screen 210 being used passes audio without undesired muffling or the
like. Experiments, however, have produced the surprising result that most viewers cannot tell the
difference between the centrally placed speakers 220 of FIG. 9C and the vertical speaker pairs
160 used in FIGS. 9A-9B. Thus, placing the speaker pairs 160 at or near the periphery of the
screen 200 or 210, as in FIGS. 9A-9B, may be the more desirable choice in certain
implementations.
[0068] In fact, placing the multiple speakers of the pairs 160 toward the top and bottom of the
screen 200 or 210 allows the system 100 to be used more flexibly for point-to-point connections
or multipoint video conferencing calls. For example, FIG. 10A shows another system 100
according to the present disclosure having at least one display 200 and a plurality of upper and
lower speakers 162-A and 162-B. Here, the at least one display 200 is a wide screen or video
wall, but it may also be a plurality of separate screens or electronic displays. In this system 100,
the upper and lower speakers 162-A and 162-B can be configured and operated by the control
unit 110 as vertical speaker pairs, as in the arrangements described above. In particular, this may
be done when the system 100 is used for a point-to-point video conference call with a far end
(not shown).
[0069] If the system 100 is part of a multipoint video conference call with multiple far-end sites,
the system 100 can operate the upper and lower speakers 162-A and 162-B independently. That
is, the upper speakers 164-LCR installed toward the top of the screen 200 can have three separate
channels, and the lower speakers 166-LCR installed toward the bottom of the screen 200 can also
have three separate channels. Thus, the system 100 has six independent audio channels feeding
six speakers (three arranged toward the top of the display 200 and three arranged toward the
bottom). In this system, any of the speakers 164-LCR and 166-LCR can be active. As can also be
seen from the above description, the system 100 can have any number of speakers.
[0070] Video from each of three sites 201, 202, and 203 is displayed on the screen 200 in an
arrangement suited to the multipoint connection. In this example, the video from each site 201,
202, and 203 consists of two video feeds, but this need not be the case, because multipoint video
conferencing calls may be held between various systems with different video capabilities.
[0071] In rendering the video conference audio, the system 100 renders any source through
whichever of the six channels of the speakers 164-LCR and 166-LCR best matches the location at
which the video of the sites 201, 202, and 203 is displayed on the screen 200, creating a
two-dimensional spatial audio experience. For example, the system 100 renders the audio of the
first site 201 through the upper center speaker 164-C to match where the video of the first site
201 is displayed on the screen 200. If two channels of input audio originate from this first site
201, the audio is combined into one channel for the upper center speaker 164-C. Similarly, the
system 100 can render the audio of the second site 202 from the lower left speaker 166-L and the
audio of the third site 203 from the lower right speaker 166-R, matching the video arrangement
of the sites 202 and 203.
[0072] Inter-site audio mixing can also be performed. For example, if the second and third sites
202 and 203 both have left and right input channels, the audio can be arranged to match the video
layout by rendering the left audio from the second site 202 to the lower left speaker 166-L and
the right audio from the third site 203 to the lower right speaker 166-R. The right audio from the
second site 202 and the left audio from the third site 203 can then be combined to share the lower
center speaker 166-C, because the location of this speaker matches the location where those
participants are displayed. As can also be seen from the above description, the system 100 can be
used to implement these and other arrangements.
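The rendering and inter-site mixing of paragraphs [0071] and [0072] can be summarized as a table from (site, input channel) to speaker, with channels that share a speaker summed. The sketch below uses hypothetical names (`ROUTES`, `render`) and a plain unweighted sum, which is an assumption: the disclosure says the channels are combined but gives no gains.

```python
# (site, input channel) -> speaker, following the example layout in the text.
ROUTES = {
    ("201", "L"): "164-C",  # both channels of site 201 share the upper center
    ("201", "R"): "164-C",
    ("202", "L"): "166-L",
    ("202", "R"): "166-C",  # shared with the left channel of site 203
    ("203", "L"): "166-C",
    ("203", "R"): "166-R",
}

def render(frame: dict) -> dict:
    """Sum one sample per (site, channel) into the speaker feeds it is routed to."""
    feeds = {}
    for key, sample in frame.items():
        speaker = ROUTES[key]
        feeds[speaker] = feeds.get(speaker, 0.0) + sample
    return feeds

# The right channel of site 202 and the left channel of site 203 land on 166-C.
feeds = render({("202", "R"): 0.25, ("203", "L"): 0.5, ("203", "R"): 0.125})
print(feeds)
```

A practical mixer would probably scale shared feeds to avoid clipping; that detail is outside what the text specifies.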
[0073] The control unit 110 has a spatial audio echo canceller (not shown) for multiple channels
(e.g., three, four, or more channels), which it uses to perform echo cancellation in the
multi-channel system 100. The echo canceller allows the system 100 to capture audio with live
microphones even while multiple audio channels are rendered by the speakers 164-LCR and
166-LCR. As can also be seen from the above description, the system 100 of FIG. 10A requires a
six-channel acoustic echo canceller for the six independent speaker paths.
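The disclosure does not describe the internals of its echo canceller. For orientation only, the sketch below shows a single-channel normalized LMS (NLMS) adaptive filter, a common building block of acoustic echo cancellers; it is not the multi-channel spatial canceller of the text, which must additionally handle several correlated speaker paths. The function name, tap count, step size, and the synthetic echo path are all assumptions for illustration.

```python
import random

def nlms_echo_cancel(far, mic, taps=4, mu=0.5, eps=1e-8):
    """Single-channel NLMS: adapt weights w so that w * far approximates the echo in mic."""
    w = [0.0] * taps
    history = [0.0] * taps  # most recent far-end samples, newest first
    residual = []
    for x, d in zip(far, mic):
        history = [x] + history[:-1]
        estimate = sum(wi * xi for wi, xi in zip(w, history))
        error = d - estimate  # what remains after removing the estimated echo
        norm = sum(xi * xi for xi in history) + eps
        w = [wi + mu * error * xi / norm for wi, xi in zip(w, history)]
        residual.append(error)
    return w, residual

# Synthetic test signal: the microphone hears only an echo of the far-end audio
# through an assumed 4-tap room response (no near-end talker, no noise).
random.seed(0)
echo_path = [0.6, -0.3, 0.1, 0.05]
far = [random.uniform(-1.0, 1.0) for _ in range(2000)]
mic = [sum(h * far[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
       for n in range(len(far))]

weights, residual = nlms_echo_cancel(far, mic)
print([round(wi, 3) for wi in weights])
```

In a six-speaker system such as that of FIG. 10A, one such adaptive path per speaker and microphone combination would be a starting point, although correlated channels generally call for more elaborate methods.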
[0074] In FIG. 10A, system 100 has a large video wall or screen 200 with speakers 164-LCR and
166-LCR configured as separate channels. The same applies to a system 100 with multiple
screens or electronic displays. For example, FIG. 10B shows a comparable system 100 having
first and second electronic displays 140-LR displaying three sites 201, 202, and 203 in a
multipoint video conference call. As shown, the video of the first site 201 is split between the two
displays 140-LR according to performance tolerances. The audio of the first site 201 is rendered
to either the upper left speaker 164-L or the upper right speaker 164-R. Alternatively, the
original stereo sound of the first site 201 is rendered to the two upper speakers 164-LR so as to
match the location of the participants of the site 201.
[0075] As in the other arrangements, the sound of the second site 202 is rendered from the lower
left speaker 166-L and the sound of the third site 203 is rendered from the lower right speaker
166-R. Other configurations may also be used, depending on the systems connected to the
multipoint video conference call. For example, the videos of the second and third sites 202 and
203 illustrated in FIG. 10B may actually be composed of video from a single four-camera system,
which the system 100 displays in the four lower panels of the two displays 140-LR. Stereo sound
from such a single (four-camera) system is rendered from the two lower speakers 166-LR. Since
this system renders audio from four channels, the control unit 110 preferably has the function
of a four-channel echo canceller.
[0076] Finally, FIG. 10C shows another example in which five multipoint sites 201-205 are
displayed on three displays 140-LCR. According to the normal configuration, the sound from
sites 201-204 is rendered by the upper left speaker 164-L, lower left speaker 166-L, upper
center speaker 164-C, and lower center speaker 166-C, respectively. The audio from the fifth
site 205 is rendered by the upper right and lower right speakers 164-R and 166-R as a vertical
speaker pair, as in the other embodiments disclosed herein.
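The site-to-speaker assignment of FIG. 10C can be written as a small routing table, sketched below in Python. The table entries follow paragraph [0076]. The names ROUTING_10C and speaker_feeds, the use of one mono sample list per site, and the unity gain applied to both speakers of the vertical pair are assumptions made for illustration.

```python
# Site-to-speaker routing for the five-site example of FIG. 10C.
# The mapping follows paragraph [0076]; all identifiers are hypothetical.
ROUTING_10C = {
    201: ("164-L",),
    202: ("166-L",),
    203: ("164-C",),
    204: ("166-C",),
    205: ("164-R", "166-R"),  # vertical speaker pair
}

def speaker_feeds(routing, site_audio):
    """Sum each site's audio into every speaker assigned to that site.

    site_audio maps a site number to a list of mono samples.
    Sites without audio are skipped. Unity gain is an assumed choice.
    """
    feeds = {}
    for site, speakers in routing.items():
        samples = site_audio.get(site)
        if samples is None:
            continue
        for speaker in speakers:
            current = feeds.setdefault(speaker, [0.0] * len(samples))
            feeds[speaker] = [a + b for a, b in zip(current, samples)]
    return feeds
```

With this table, audio from site 205 appears on both 164-R and 166-R, while each of the other sites drives a single speaker.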
[0077] As can also be seen from the above description, the teachings of the present disclosure
can be implemented in digital electronic circuitry, computer hardware, computer firmware,
computer software, or any combination thereof. The teachings of the present disclosure can be
implemented in a computer program tangibly embodied in a machine-readable storage device for
execution by a programmable processor, so that the programmable processor executing the program
instructions performs the functions of the present disclosure. The teachings of the present
disclosure can be implemented in one or more computer programs that are executable on a
programmable system including at least one programmable processor coupled to receive data and
instructions from, and to transmit data and instructions to, a data storage system, at least one
input device, and at least one output device. Storage devices suitable for tangibly embodying
computer program instructions and data include all forms of non-volatile memory, including, by
way of example, semiconductor memory devices such as EPROMs, EEPROMs, and flash memory
devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks;
and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated into, an ASIC
(application-specific integrated circuit).
[0078] The above description of the preferred and other embodiments is not intended to limit or
restrict the scope or applicability of the inventive concepts of the present application. With
the benefit of the present disclosure, it is understood that features described above in
accordance with any embodiment or aspect of the disclosed subject matter can be utilized, alone
or in combination with any other described feature, in any other embodiment or aspect of the
disclosed subject matter.
[0079] In exchange for disclosing the inventive concepts contained herein, the applicant claims
all patent rights afforded by the appended claims. Accordingly, the appended claims are intended
to be construed to include all modifications and alterations that come within the scope of the
following claims, or their equivalents.
[0080] 100 integrated video conferencing system, 110 control unit, 130-LCR camera units,
140-LCR displays, 150 microphones, 160-LCR vertical speaker pairs.