US20150030172A1 - Inter-Channel Coherence Reduction for Stereophonic and Multichannel Acoustic Echo Cancellation - Google Patents
Inter-Channel Coherence Reduction for Stereophonic and Multichannel Acoustic Echo Cancellation Download PDFInfo
- Publication number
- US20150030172A1 US20150030172A1 US14/334,915 US201414334915A US2015030172A1 US 20150030172 A1 US20150030172 A1 US 20150030172A1 US 201414334915 A US201414334915 A US 201414334915A US 2015030172 A1 US2015030172 A1 US 2015030172A1
- Authority
- US
- United States
- Prior art keywords
- audio signals
- modulation
- frequency
- domain
- amplitude modulation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers
- H04R3/02—Circuits for transducers for preventing acoustic reaction, i.e. acoustic oscillatory feedback
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
- H04M9/082—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers
- H04R3/005—Circuits for transducers for combining the signals of two or more microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers
- H04R3/12—Circuits for transducers for distributing signals to two or more loudspeakers
Definitions
- the present invention relates to audio signal processing and, more specifically but not exclusively, to stereophonic acoustic echo cancellation (AEC).
- AEC stereophonic acoustic echo cancellation
- undesirable acoustic echo can occur when sounds (i.e., acoustic signals) corresponding to (electronic) audio signals transmitted from a first side of the two-way communication and rendered by loudspeakers at the second side are picked up by microphones at the second side and included in the audio signals transmitted back to and rendered by loudspeakers at the first side as acoustic echo to individuals located at the first side.
- Acoustic echo cancellation refers to signal processing that attempts to estimate the audio signals corresponding to acoustic echo occurring at the second side of the two-way audio communication and appropriately compensate the audio signals to be transmitted back to the first side to reduce or even eliminate the contribution of that acoustic echo in those transmitted audio signals.
- the term “loudspeaker” refers to any suitable transducer for converting electronic audio signals into acoustic signals (including headphones), while the term “microphone” refers to any suitable transducer for converting acoustic signals into electronic audio signals.
- each side In a monophonic audio system, each side has only one microphone and only one loudspeaker. In a stereophonic audio system, each side has two microphones that generate left and right outgoing audio channels and two loudspeakers that render left and right incoming audio signals. In a multichannel audio system, each side has more than two microphones and more than two loudspeakers. Acoustic echo cancellation can be applied with varying sophistication and corresponding variable success in each of these different audio architectures.
- SAEC stereophonic AEC
- MAEC multichannel AEC
- FIG. 1 shows a block diagram of an exemplary stereophonic audio system
- FIG. 2 shows a block diagram of the coherence reduction module of FIG. 1 ;
- FIG. 3 graphically shows maximum phase excursion in degrees versus frequency introduced by a phase modulator.
- This disclosure describes a novel principle and implementation of an inter-channel amplitude coherence reduction (CR) method for stereophonic acoustic echo cancellation (SAEC) and multichannel acoustic echo cancellation (MAEC).
- CR inter-channel amplitude coherence reduction
- SAEC stereophonic acoustic echo cancellation
- MAEC multichannel acoustic echo cancellation
- Effective coherence reduction, or decorrelation, of the transmission channels solves the non-uniqueness problem in SAEC applications as well as in MAEC applications.
- this disclosure describes a novel improved version of the principle discussed in Ref [6].
- the concept of amplitude modulation aka magnitude modulation
- the low-frequency range is introduced to effectively enhance the reduction in amplitude coherence and thereby improve the effectiveness of acoustic echo cancellation.
- FIG. 1 shows a block diagram of an exemplary stereophonic audio system 100 .
- audio system 100 has (i) a transmission room 110 having left and right stereo microphones 112 and left and right stereo loudspeakers 114 and (ii) a receiving room 120 having left and right stereo microphones 122 and left and right stereo loudspeakers 124 .
- Audio system 100 has a transport layer 130 that handles the transmission and reception of audio signals transmitted between the transmission room and the receiving room.
- audio system 100 Located near (or in) receiving room 120 , audio system 100 also has an adaptive AEC module 140 and a coherence reduction (CR) module 150 .
- audio system 100 may also have an adaptive AEC module and a CR module analogous to modules 140 and 150 that are located near (or in) transmission room 110 .
- AEC module 140 comprises four adaptive echo models 142 corresponding to the four acoustic echo paths 126 from each different loudspeaker 124 in receiving room 120 to each different microphone 122 in receiving room 120 .
- Each adaptive echo model 142 adaptively generates an estimate 143 of the audio signal corresponding to echo in the associated acoustic echo path 126 based on the corresponding incoming audio signal 151 received at receiving room 120 from transmission room 110 .
- Those estimated echo audio signals 143 are subtracted at subtraction nodes 146 from the outgoing audio signals 121 transmitted from receiving room 120 to generate echo-cancelled audio signals 141 towards transmission room 110 .
- L r for the receiving room
- L echo path model
- the nominal realistic case usually has L ⁇ L r ⁇ L t .
- Coherence reduction module 150 processes the two outgoing audio signals 111 generated by microphones 112 in transmission room 110 to reduce the coherence between (i.e., decorrelate) those two signals. If the amount and type of decorrelation is appropriate, then the effectiveness of AEC module 140 in reducing echo can be significantly enhanced.
- FIG. 2 shows a block diagram of coherence reduction module 150 of FIG. 1 .
- CR module 150 has an analysis filterbank 210 that converts the time-domain outgoing audio signals 111 received from receiving room 120 into a plurality of frequency-domain audio signals 211 .
- CR module 150 has a pair of summation nodes 220 and a pair of multiplication nodes 230 that respectively and selectively add an uncorrelated noise signal 213 to the corresponding frequency-domain audio signal 211 and multiply the frequency-domain audio signal by a phase- and amplitude-modulation signal 215 to generate two decorrelated frequency-domain audio signals 231 .
- CR module 150 also has a synthesis filterbank 240 that converts the two sets of decorrelated frequency-domain audio signals 231 into the two decorrelated time-domain incoming audio signals 151 that are rendered by the two loudspeakers 124 in receiving room 120 .
- the summation nodes 220 may be downstream of the multiplication nodes 230 or may even be omitted.
- CR module 150 has only one set of summation nodes and only one set of multiplication nodes, such that uncorrelated noise, phase modulation, and amplitude modulation are applied to only one set of frequency-domain audio signals 211 in order to generate the two decorrelated time-domain audio signals 151 .
- the other time-domain audio signal 111 does not even have to be converted into the frequency domain, although a time delay may need to be added to that audio path to keep the two stereophonic signals synchronized.
- a frequency-domain, sub-band implementation of CR module 150 has been described, time-domain implementations are also possible, including those that use parallel bandpass filters to process different signal sub-bands.
- FIG. 2 shows a preferred architecture for a stereo (two channels) coherence reduction (CR) module 150 .
- X 1 p ( k,m ) ( X 1 ( k,m )+ ⁇ 1 ( k,m ))* ⁇ 1 ( k,m ) e i ⁇ (k,m)
- X 2 p ( k,m ) ( X 2 ( k,m )+ ⁇ 2 ( k,m ))* ⁇ 2 ( k,m ) e ⁇ i ⁇ (k,m)
- ⁇ i (k,m) is the uncorrelated noise signal (signal 213 in FIG. 2 )
- ⁇ i (k,m) is the amplitude modulation (part of signal 215 of FIG. 2 )
- ⁇ (k,m) is the phase function (part of signal 215 of FIG. 2 )
- k is the frequency bin (i.e., sub-band) number
- m is the frequency-domain sample number.
- Ref [6] A variant of this architecture was initially proposed in Ref [6], which architecture can also be seen as a frequency-domain generalization of the method described in Ref [5]. Moreover, the method proposed in Ref [7] can be re-interpreted to be a variant of the architecture of Ref [6] as well.
- the specific improvements of the architecture in FIG. 2 over the one in Ref [6] are the additions of amplitude modulation [ ⁇ i (k,m)] (i.e., part of signal 215 of FIG. 2 ), signal-dependent adaptivity, and, if desired, the additive noise components ⁇ i (k,m) (i.e., signal 213 of FIG. 2 ).
- Ref [5] and Ref [7] are very effective at reducing coherence for frequencies above 1 kHz or so without introducing significant audible distortions (spectral or image), they reduce coherence poorly for lower frequencies.
- the method in Ref [6] is still quite effective at reducing coherence below 1 kHz but image instability is just on the verge of being noticeable for loudspeaker playback. For headphone playback, image distortion is noticeable for the method in Ref [6].
- the present proposal of introducing amplitude modulation and optional additive components alongside phase modulation enables decorrelation more effectively below 1 kHz and with less noticeable distortion than any previous method. This turns out to be very important since speech has most of its power in this frequency range.
- Coherence reduction may be achieved by time variation and the addition of independent noise performed in a manner so as to be just noticeable to the human ear in the assumed playback scenario. That is, parameters can be set so that distortion may be perceived over headphones but not necessarily over loudspeaker playback.
- the significant decorrelating components are described in the following.
- the use of additive noise components is described in the Appendix.
- phase function ⁇ (k,m) ⁇ (k,m)
- K is the FFT size
- m is the sample number in each frequency bin
- T ⁇ is the phase-modulation period
- F s sample rate
- the frame size M is the total number of samples in each frequency bin
- F ⁇ is the phase-modulation rate.
- an amplitude-modulation function is added in coherence reduction module 150 of FIG. 1 .
- the motivation for this approach is the fact that the human auditory system has somewhat lower sensitivity to amplitude variation at lower frequencies ( ⁇ 1.5 kHz), although it is very sensitive to amplitude variations at higher frequencies.
- T ⁇ amplitude-modulation period
- F ⁇ is the amplitude-modulation rate
- amplitude modulation is effectively applied in only the 125 Hz to 1500 Hz range.
- the amplitude-modulation rate is chosen to be 2 Hz. It is desirable to have a high modulation rate to ensure a stable stereo image. However, too high a rate causes audible spectral distortion. In 0, an amplitude-modulation rate of less than 1 Hz is preferred for minimizing audible distortion. A little higher rate can be chosen to make the image stability better. Also, it is advantageous to use the same rate for phase and amplitude modulation since image shift due to phase can be counteracted with the amplitude image shift (hence, these two image effects are “out of phase”). The resulting stabilization is small but still very useful.
- amplitude modulation adaptive if certain conditions are met.
- the main problem of amplitude modulation is that, when the modulation rate is low, e.g., ⁇ 8 Hz, the image of the background noise can still be perceived as “unstable”.
- it is advantageous to reduce the level of amplitude modulation if the signals contain mainly background noise. Distinguishing this condition can, for example, be done by:
- a combined statistic ⁇ can be formed as:
- S x i is the average frequency-bin signal level of signal x i over the frequency range from frequency bin k 0 to frequency bin k 1
- E ⁇ denotes ensemble expectation
- ⁇ denotes Fourier transform
- * denotes complex conjugate
- ⁇ is the complex coherence function between the two channels given by:
- ⁇ ⁇ ( k ) S x 1 ⁇ x 2 ⁇ ( k ) S x 1 ⁇ x 1 ⁇ ( k ) ⁇ S x 2 ⁇ x 2 ⁇ ( k )
- S T , ⁇ T are appropriately chosen level and coherence thresholds, respectively. Examples of values for these are:
- X i (k,m) a complex filterbank sample at frequency index k and block index m such that:
- X′ p ( k,m ) X p ( k,m )+ ⁇ ( RE ⁇ X p ( k,m ) ⁇ r +j ⁇ IM ⁇ X p ( k,m ) ⁇ i )
- Embodiments of the invention may be implemented as (analog, digital, or a hybrid of both analog and digital) circuit-based processes, including possible implementation as a single integrated circuit (such as an ASIC or an FPGA), a multi-chip module, a single card, or a multi-card circuit pack.
- various functions of circuit elements may also be implemented as processing blocks in a software program.
- Such software may be employed in, for example, a digital signal processor, micro-controller, general-purpose computer, or other processor.
- Embodiments of the invention can be manifest in the form of methods and apparatuses for practicing those methods.
- Embodiments of the invention can also be manifest in the form of program code embodied in tangible media, such as magnetic recording media, optical recording media, solid state memory, floppy diskettes, CD-ROMs, hard drives, or any other non-transitory machine-readable storage medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention.
- Embodiments of the invention can also be manifest in the form of program code, for example, stored in a non-transitory machine-readable storage medium including being loaded into and/or executed by a machine, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention.
- program code segments When implemented on a general-purpose processor, the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits
- the storage medium may be (without limitation) an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device.
- the storage medium may be (without limitation) an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device.
- a more-specific, non-exhaustive list of possible storage media include a magnetic tape, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM) or Flash memory, a portable compact disc read-only memory (CD-ROM), an optical storage device, and a magnetic storage device.
- the storage medium could even be paper or another suitable medium upon which the program is printed, since the program can be electronically captured via, for instance, optical scanning of the printing, then compiled, interpreted, or otherwise processed in a suitable manner including but not limited to optical character recognition, if necessary, and then stored in a processor or computer memory.
- a suitable storage medium may be any medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
- processors may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software.
- the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared.
- explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, network processor, application specific integrated circuit (ASIC), field programmable gate array (FPGA), read only memory (ROM) for storing software, random access memory (RAM), and non volatile storage.
- DSP digital signal processor
- ASIC application specific integrated circuit
- FPGA field programmable gate array
- ROM read only memory
- RAM random access memory
- any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
- any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention.
- any flow charts, flow diagrams, state transition diagrams, pseudo code, and the like represent various processes which may be substantially represented in computer readable medium and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
- each may be used to refer to one or more specified characteristics of a plurality of previously recited elements or steps.
- the open-ended term “comprising” the recitation of the term “each” does not exclude additional, unrecited elements or steps.
- an apparatus may have additional, unrecited elements and a method may have additional, unrecited steps, where the additional, unrecited elements or steps do not have the one or more specified characteristics.
- figure numbers and/or figure reference labels in the claims is intended to identify one or more possible embodiments of the claimed subject matter in order to facilitate the interpretation of the claims. Such use is not to be construed as necessarily limiting the scope of those claims to the embodiments shown in the corresponding figures.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Stereophonic System (AREA)
Abstract
Description
- This application claims the benefit of the filing date of U.S. provisional application No. 61/857,840, filed on Jul. 24, 2013, the teachings of which are incorporated herein by reference in their entirety.
- 1. Field of the Invention
- The present invention relates to audio signal processing and, more specifically but not exclusively, to stereophonic acoustic echo cancellation (AEC).
- 2. Description of the Related Art
- This section introduces aspects that may help facilitate a better understanding of the invention. Accordingly, the statements of this section are to be read in this light and are not to be understood as admissions about what is prior art or what is not prior art.
- In a two-way audio communication, undesirable acoustic echo can occur when sounds (i.e., acoustic signals) corresponding to (electronic) audio signals transmitted from a first side of the two-way communication and rendered by loudspeakers at the second side are picked up by microphones at the second side and included in the audio signals transmitted back to and rendered by loudspeakers at the first side as acoustic echo to individuals located at the first side. Acoustic echo cancellation (AEC) refers to signal processing that attempts to estimate the audio signals corresponding to acoustic echo occurring at the second side of the two-way audio communication and appropriately compensate the audio signals to be transmitted back to the first side to reduce or even eliminate the contribution of that acoustic echo in those transmitted audio signals.
- Note that, as used in this disclosure, the term “loudspeaker” refers to any suitable transducer for converting electronic audio signals into acoustic signals (including headphones), while the term “microphone” refers to any suitable transducer for converting acoustic signals into electronic audio signals.
- In a monophonic audio system, each side has only one microphone and only one loudspeaker. In a stereophonic audio system, each side has two microphones that generate left and right outgoing audio channels and two loudspeakers that render left and right incoming audio signals. In a multichannel audio system, each side has more than two microphones and more than two loudspeakers. Acoustic echo cancellation can be applied with varying sophistication and corresponding variable success in each of these different audio architectures.
- Both the stereophonic AEC (SAEC) and multichannel AEC (MAEC) problems differ from the straightforward monophonic AEC application because of the non-uniqueness problem, i.e., the underlying equations to be solved by the echo canceller system can be singular or ill conditioned. The major effect this has is that, if no precaution is taken, the AEC has to re-converge as soon as there is any acoustic change in the transmission room (aka the far-end room or the first side of a two-way audio communication referred to previously). In the monophonic AEC case, it is not necessary to reconverge following transmission-room changes since the solution is independent of this variation. Still, as in the monophonic AEC case, SAEC and MAEC modules have to manage normal echo-path changes at the receiving room (aka the near-end room or the second side).
- The seriousness of transmission-room tracking is that these changes can be very abrupt, e.g., one talker in the far-end room stops talking when another person in the same room starts talking. Considering all practical issues there are to control monophonic AECs, neglecting this additional fundamental problem can cause significant performance issues for any stereo or multichannel AEC implementation. The net effect of acoustic-path changes in both the transmission room and the receiving room can be seen in terms of the inter-channel cross-correlation of the receive channels. The acoustic paths in the transmission room also determine the inter-channel cross-correlation of the downlink (far-end) channels.
- It is desirable to control and limit the inter-channel cross-correlation of transmission channels (downlink), without causing objectionable stereo image distortion or spectral artifacts.
- In the literature, there are various proposals for achieving inter-channel decorrelation. For details, see Refs [1], [2], [3], [4], [5], [6], and [7]. In fact, the methods presented in the latter four references belong to the same category, i.e., they all introduce a time-varying phase shift in the stereo channels to achieve decorrelation. This approach is very effective, especially for higher frequencies (>1.5 kHz), while it has to be carefully applied at lower frequencies to avoid noticeable distortion. Frequencies less than 1 kHz are more difficult than higher frequencies to decorrelate without introducing audible distortion, because human hearing is more sensitive to phase shifts at the lower frequencies (Ref [8]).
- Embodiments of the invention will become more fully apparent from the following detailed description, the appended claims, and the accompanying drawings in which like reference numerals identify similar or identical elements.
-
FIG. 1 shows a block diagram of an exemplary stereophonic audio system; -
FIG. 2 shows a block diagram of the coherence reduction module ofFIG. 1 ; and -
FIG. 3 graphically shows maximum phase excursion in degrees versus frequency introduced by a phase modulator. - This disclosure describes a novel principle and implementation of an inter-channel amplitude coherence reduction (CR) method for stereophonic acoustic echo cancellation (SAEC) and multichannel acoustic echo cancellation (MAEC). Effective coherence reduction, or decorrelation, of the transmission channels solves the non-uniqueness problem in SAEC applications as well as in MAEC applications. In particular, this disclosure describes a novel improved version of the principle discussed in Ref [6]. In the present approach, the concept of amplitude modulation (aka magnitude modulation) in the low-frequency range is introduced to effectively enhance the reduction in amplitude coherence and thereby improve the effectiveness of acoustic echo cancellation.
-
FIG. 1 shows a block diagram of an exemplarystereophonic audio system 100. As represented inFIG. 1 ,audio system 100 has (i) atransmission room 110 having left andright stereo microphones 112 and left andright stereo loudspeakers 114 and (ii) areceiving room 120 having left andright stereo microphones 122 and left andright stereo loudspeakers 124.Audio system 100 has atransport layer 130 that handles the transmission and reception of audio signals transmitted between the transmission room and the receiving room. Located near (or in)receiving room 120,audio system 100 also has anadaptive AEC module 140 and a coherence reduction (CR)module 150. Although not shown inFIG. 1 ,audio system 100 may also have an adaptive AEC module and a CR module analogous to 140 and 150 that are located near (or in)modules transmission room 110. -
AEC module 140 comprises fouradaptive echo models 142 corresponding to the fouracoustic echo paths 126 from eachdifferent loudspeaker 124 inreceiving room 120 to eachdifferent microphone 122 inreceiving room 120. Eachadaptive echo model 142 adaptively generates anestimate 143 of the audio signal corresponding to echo in the associatedacoustic echo path 126 based on the correspondingincoming audio signal 151 received atreceiving room 120 fromtransmission room 110. Those estimatedecho audio signals 143 are subtracted atsubtraction nodes 146 from theoutgoing audio signals 121 transmitted fromreceiving room 120 to generate echo-cancelledaudio signals 141 towardstransmission room 110. -
Acoustic echo paths 126 are normally modeled as FIR filters of lengths Lr (for the receiving room, hi,i=1, 2), L (echo path model, ĥi,i=1, 2), and Lt (for the transmission room gi,i=1, 2). The nominal realistic case usually has L<Lr≈Lt. However, there are many use cases when this is not true. For example, if the stereo signal is synthesized, then the equivalent Lt<<L can exist. The length and the “strength” (reverberation) of the tail of the response influence convergence of the stereo AEC in more-complicated ways than what is normally experienced in monophonic AEC scenarios. -
Coherence reduction module 150 processes the twooutgoing audio signals 111 generated bymicrophones 112 intransmission room 110 to reduce the coherence between (i.e., decorrelate) those two signals. If the amount and type of decorrelation is appropriate, then the effectiveness ofAEC module 140 in reducing echo can be significantly enhanced. -
FIG. 2 shows a block diagram ofcoherence reduction module 150 ofFIG. 1 .CR module 150 has ananalysis filterbank 210 that converts the time-domainoutgoing audio signals 111 received from receivingroom 120 into a plurality of frequency-domain audio signals 211. For each frequency band,CR module 150 has a pair ofsummation nodes 220 and a pair ofmultiplication nodes 230 that respectively and selectively add anuncorrelated noise signal 213 to the corresponding frequency-domain audio signal 211 and multiply the frequency-domain audio signal by a phase- and amplitude-modulation signal 215 to generate two decorrelated frequency-domain audio signals 231.CR module 150 also has asynthesis filterbank 240 that converts the two sets of decorrelated frequency-domain audio signals 231 into the two decorrelated time-domain incomingaudio signals 151 that are rendered by the twoloudspeakers 124 inreceiving room 120. - Note that, in alternative implementations, the
summation nodes 220 may be downstream of themultiplication nodes 230 or may even be omitted. Note further that, in some implementations,CR module 150 has only one set of summation nodes and only one set of multiplication nodes, such that uncorrelated noise, phase modulation, and amplitude modulation are applied to only one set of frequency-domain audio signals 211 in order to generate the two decorrelated time-domain audio signals 151. In that case, the other time-domain audio signal 111 does not even have to be converted into the frequency domain, although a time delay may need to be added to that audio path to keep the two stereophonic signals synchronized. Although a frequency-domain, sub-band implementation ofCR module 150 has been described, time-domain implementations are also possible, including those that use parallel bandpass filters to process different signal sub-bands. -
FIG. 2 shows a preferred architecture for a stereo (two channels) coherence reduction (CR)module 150. As represented inFIG. 2 , the decorrelated, frequency-domain audio signals Xi p(k,m), i=1, 2, (signal 231 inFIG. 2 ) are generated from the original, frequency-domain audio signals Xi(k,m) (signal 211 inFIG. 2 ) according to the following: -
X 1 p(k,m)=(X 1(k,m)+ω1(k,m))*ρ1(k,m)e iφ(k,m) -
X 2 p(k,m)=(X 2(k,m)+ω2(k,m))*ρ2(k,m)e −iφ(k,m) - where ωi(k,m) is the uncorrelated noise signal (signal 213 in
FIG. 2 ), ρi(k,m) is the amplitude modulation (part ofsignal 215 ofFIG. 2 ), φ(k,m) is the phase function (part ofsignal 215 ofFIG. 2 ), k is the frequency bin (i.e., sub-band) number, and m is the frequency-domain sample number. Note that, in the previous equations, complementary phase functions having the same magnitude, but different sign are applied. Alternative implementations may apply different phase functions. For example, an equivalent solution would be to apply the phase function 2*φ(k,m) to one signal and no phase function to the other signal. - A variant of this architecture was initially proposed in Ref [6], which architecture can also be seen as a frequency-domain generalization of the method described in Ref [5]. Moreover, the method proposed in Ref [7] can be re-interpreted to be a variant of the architecture of Ref [6] as well. The specific improvements of the architecture in
FIG. 2 over the one in Ref [6] are the additions of amplitude modulation [ρi(k,m)] (i.e., part ofsignal 215 ofFIG. 2 ), signal-dependent adaptivity, and, if desired, the additive noise components ωi(k,m) (i.e., signal 213 ofFIG. 2 ). Although the methods described in Ref [5] and Ref [7] are very effective at reducing coherence for frequencies above 1 kHz or so without introducing significant audible distortions (spectral or image), they reduce coherence poorly for lower frequencies. The method in Ref [6] is still quite effective at reducing coherence below 1 kHz but image instability is just on the verge of being noticeable for loudspeaker playback. For headphone playback, image distortion is noticeable for the method in Ref [6]. The present proposal of introducing amplitude modulation and optional additive components alongside phase modulation enables decorrelation more effectively below 1 kHz and with less noticeable distortion than any previous method. This turns out to be very important since speech has most of its power in this frequency range. - Coherence reduction may be achieved by time variation and the addition of independent noise performed in a manner so as to be just noticeable to the human ear in the assumed playback scenario. That is, parameters can be set so that distortion may be perceived over headphones but not necessarily over loudspeaker playback. For an exemplary implementation at a 16000 Hz sample rate, an exemplary filterbank FFT (fast Fourier transform) size is K=128 with a frame size M=64. The significant decorrelating components are described in the following. The use of additive noise components is described in the Appendix.
- For signal decorrelation using phase modification, time-periodic variation of the phase is particularly effective and, if so, one period of the present phase function φ(k,m) is described by:
-
φ(k,m)=A φ(k)sin(2πm/T φ);m=0, . . . ,T φ−1 -
T φ =└F s/(M·F φ)┘, - where the maximum phase excursion, Aφ(k), is preferably dependent on frequency bin (i.e., sub-band) index k=0, . . . , K/2, where K is the FFT size, m is the sample number in each frequency bin, Tφ is the phase-modulation period, Fs is sample rate, the frame size M is the total number of samples in each frequency bin, and Fφ is the phase-modulation rate. If the phase excursion is chosen according to 0, i.e., below the blue curve shown in
FIG. 3 , the introduced phase distortion is not noticeable. One exemplary implementation employs the excursion of 0 with a phase-modulation rate Fφ of 2 Hz. These values are well within the perceptual bounds suggested in 0 but the present phase-modulation rate is slightly higher than what they recommend. The reason for this will be motivated in the next section. - To achieve a more-effective decorrelation for low frequencies, an amplitude-modulation function is added in
coherence reduction module 150 ofFIG. 1 . The motivation for this approach is the fact that the human auditory system has somewhat lower sensitivity to amplitude variation at lower frequencies (<1.5 kHz), although it is very sensitive to amplitude variations at higher frequencies. - As with phase modulation, amplitude modulation can also be periodic and, if so, one period of the amplitude function ρi(k), i=1, 2, is described by:
-
ρ1(k)=1+A ρ(k)·sin(2πm/T ρ),m=0, . . . ,T ρ−1, -
ρ2(k)=1−A ρ(k)·sin(2πm/T ρ);m=0, . . . ,T ρ−1, - where the amplitude excursion, AdB(k), is defined as:
-
- and the amplitude-modulation period, Tρ, is defined as:
-
T ρ =└F s/(M·F ρ)┘, - where Fρ is the amplitude-modulation rate.
- The maximum amplitude excursion, AdB(k), can depend on frequency bin index k=0, . . . , K/2. It is important to tailor the amplitude excursion as a function of frequency to avoid perceptible distortion. Assuming a filterbank with a 125 Hz bin-width (Fs=16000, K=128, M=64), an exemplary set of parameter values for amplitude modulation are:
-
A dB(k)=0,k=0, -
A dB(k)=5,k=1, . . . ,9, -
A dB(k)=4,k=10,11,12, -
A dB(k)=0,k≧13, -
F ρ=2. - Hence, amplitude modulation is effectively applied in only the 125 Hz to 1500 Hz range. The amplitude-modulation rate is chosen to be 2 Hz. It is desirable to have a high modulation rate to ensure a stable stereo image. However, too high a rate causes audible spectral distortion. In 0, an amplitude-modulation rate of less than 1 Hz is preferred for minimizing audible distortion. A little higher rate can be chosen to make the image stability better. Also, it is advantageous to use the same rate for phase and amplitude modulation since image shift due to phase can be counteracted with the amplitude image shift (hence, these two image effects are “out of phase”). The resulting stabilization is small but still very useful.
- To control dynamic artifacts at low frequencies, it is advantageous to make the amplitude modulation adaptive if certain conditions are met. The main problem of amplitude modulation is that, when the modulation rate is low, e.g., <8 Hz, the image of the background noise can still be perceived as “unstable”. To mitigate this distortion, it is advantageous to reduce the level of amplitude modulation if the signals contain mainly background noise. Distinguishing this condition can, for example, be done by:
-
- 1. Detecting a low signal level compared to normal levels.
- 2. Detecting low inter-channel coherence in some frequency range.
- The above can be combined as a single detection statistic for modulation level. For example, a combined statistic α can be formed as:
-
- where
S xi is the average frequency-bin signal level of signal xi over the frequency range from frequency bin k0 to frequency bin k1, Sxi xi (k) is the energy of signal xi in frequency bin k, which is based on the cross-spectrum Sxi xj (k) of frequency-domain signals Xi(k)={xi}; i=1, 2; k=0, . . . , K/2 as: -
S xi xj (k)=E{X i(k)X* j(k)} -
-
S T, γT are appropriately chosen level and coherence thresholds, respectively. Examples of values for these are: -
S T=10−50/10, -
γ T=0.70 - Then, the amplitude-modulation function Aρ(k) above is multiplied by a to implement the adaptive amplitude modulation aggressiveness.
- For the multichannel case, the previously described equations for phase and amplitude modulations can be generalized as follows:
-
φi(k,m)=A φ(k)sin(2πm/T φ+θφ,i);m=0, . . . ,T φ−1 -
ρi(k)=1+A ρ(k)·sin(2πm/T ρ+θρ,i);m=0, . . . ,T ρ−1, - where and θφ,i and θρ,i, i=0, . . . , N−1, are modulation-rate phase-shifts for the phase and the amplitude modulation functions, respectively. In the stereo case above, θφ,1=0, θφ,2=π, and θρ,1=0, θρ,2=π.
- For a specific 5.1 surround channel case, the above could be used for the front left and right channels, leaving the center channel (subwoofer) unprocessed, and letting the rear surround channels (denoted channel 3 and 4) have phase shifts θφ,3=π/2, θφ,4=3π/2 and θρ,3=π/2, θρ,4=3π/2. This choice provides decorrelation of all channels and also the “out-of-phase” stabilization described above. However, other choices of phase-shifts will also work.
- An alternative approach is to apply the “stereo-pair” phase-shifts as described above to the channel pair that has the highest correlation at a specific time. As the channel-correlation varies, the decorrelation will follow and focus on the most-important pair at a certain time.
- In this disclosure, an effective decorrelation method has been described that provides a significant and predetermined coherence reduction, even at very low frequencies. The method has been shown by informal listening tests to preserve the original spatial image while not introducing perceivable spectral distortion. Exemplary unique features include:
-
- Combination of known phase modulation with a novel amplitude modulation in the low-frequency region.
- Balancing the image distortion by using the same modulation rate “out of phase” for phase and amplitude.
- Adaptively controlling the level of amplitude modulation to minimize distortion.
The proposed CR module solves the non-uniqueness/ill-conditioned problem inherent to stereo and multichannel echo cancellation since it guarantees that the transmission signals are not highly correlated. The maximum coherence at low frequencies is in the neighborhood of 0.90 and thus stabilizes the solution significantly. The coherence reduction for speech is substantially the same as in the stationary noise case, and significant coherence reduction is achieved below 1 kHz where it is most important for the SAEC application.
- Another option to control coherence is the addition of incoherent (i.e., uncorrelated) noise. This idea was proposed in the very beginning stereophonic
echo cancellation work 0. Although this technique on its own does not provide satisfactory coherence reduction, it can be applied to achieve some extra performance over what is already achieved by using, e.g., the phase- and amplitude-modulation methods described above. In a filterbank architecture, it is straightforward to add noise at a level which, in theory, is below the level of perception. That is, let: - be independent, identically distributed Gaussian samples of unit variance, and define Xi(k,m) to be a complex filterbank sample at frequency index k and block index m such that:
-
X′ p(k,m)=X p(k,m)+δ(RE{X p(k,m)}ωr +j·IM{X p(k,m)}ωi) -
δ=10−δdB /20. - It has been described in literature, under certain conditions, that, if δdB is larger than 13.6 dB, then the noise is not noticeable (Ref [11]). Hence, the threshold can be conservatively set to something larger than 13.6 dB. Consequently, where, in frequency, the filterbank bandwidth is close to that of the critical bands, this added noise should not be perceivable.
-
- [1] J. Benesty et al., “Advances in Network and Acoustic Echo Cancellation,” Springer, 2001.
- [2] M. M. Sondhi, D. R. Morgan, and J. L. Hall, “Stereophonic Acoustic. Echo Cancellation—An Overview of the Fundamental Problem,” IEEE Signal Processing Letters, Vol. 2, No. 8, August 1995.
- [3] J. Benesty, D. R. Morgan, and M. M. Sondhi, “A Better Understanding and an Improved Solution to the Specific Problems of Stereophonic Acoustic Echo Cancellation,” IEEE Trans. Speech and Audio Proc., Vol. 6, pp. 156-165, March 1998.
- [4] M. Ali, “Stereophonic Echo Cancellation System using Time-varying all-pass Filtering for Signal Decorrelation,” in Proc. IEEE ICASSP, 1998, pp. 3689-3692.
- [5] Y. Joncour and A. Sugiyama, “A Stereo Echo Canceller with Pre-processing for Correct Echo-path Identification,” in Proc. IEEE ICASSP, 1998.
- [6] J. Herre, H. Buchner, and W. Kellermann, “Acoustic echo cancellation for surround sound using perceptually motivated convergence enhancement,” in Proc. IEEE ICASSP, 2007.
- [7] T. S. Wada, and B. H. Juang, “Multi-Channel Acoustic Echo Cancellation based on Enhancement with Effective Decorrelation via Resampling,” in Proc. IWAENC, 2010.
- [8] N. I. Durlach and H. S. Colburn, “Binaural Phenomena,” Handbook of Perception, Vol. 4 (Hearing), edited by E. C. Carterette and M. P. Friedman, Academic, NY, 1978, pp. 365-466.
- [9] M. M. Sondhi and D. R. Morgan, “Acoustic Stereophonic Teleconferencing,” in Proc. WASPAA, 1991.
- [10] T. Gaensler and J. Benesty, “New Insights Into the Stereophonic Acoustic Echo Cancellation Problem and an Adaptive Nonlinearity Solution,” IEEE Trans. Speech and Audio Proc., Vol. 10, No. 5, July 2002.
- [11] K. Brandenburg and J. D. Johnston, “Second generation perceptual audio coding: The hybrid coder,” AES 88th Conv. Preprint, March 1990.
- Embodiments of the invention may be implemented as (analog, digital, or a hybrid of both analog and digital) circuit-based processes, including possible implementation as a single integrated circuit (such as an ASIC or an FPGA), a multi-chip module, a single card, or a multi-card circuit pack. As would be apparent to one skilled in the art, various functions of circuit elements may also be implemented as processing blocks in a software program. Such software may be employed in, for example, a digital signal processor, micro-controller, general-purpose computer, or other processor.
- Embodiments of the invention can be manifest in the form of methods and apparatuses for practicing those methods. Embodiments of the invention can also be manifest in the form of program code embodied in tangible media, such as magnetic recording media, optical recording media, solid state memory, floppy diskettes, CD-ROMs, hard drives, or any other non-transitory machine-readable storage medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention. Embodiments of the invention can also be manifest in the form of program code, for example, stored in a non-transitory machine-readable storage medium including being loaded into and/or executed by a machine, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention. When implemented on a general-purpose processor, the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits
- Any suitable processor-usable/readable or computer-usable/readable storage medium may be utilized. The storage medium may be (without limitation) an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device. A more-specific, non-exhaustive list of possible storage media include a magnetic tape, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM) or Flash memory, a portable compact disc read-only memory (CD-ROM), an optical storage device, and a magnetic storage device. Note that the storage medium could even be paper or another suitable medium upon which the program is printed, since the program can be electronically captured via, for instance, optical scanning of the printing, then compiled, interpreted, or otherwise processed in a suitable manner including but not limited to optical character recognition, if necessary, and then stored in a processor or computer memory. In the context of this disclosure, a suitable storage medium may be any medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
- The functions of the various elements shown in the figures, including any functional blocks labeled as “processors,” may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, network processor, application specific integrated circuit (ASIC), field programmable gate array (FPGA), read only memory (ROM) for storing software, random access memory (RAM), and non volatile storage. Other hardware, conventional and/or custom, may also be included. Similarly, any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
- It should be appreciated by those of ordinary skill in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudo code, and the like represent various processes which may be substantially represented in computer readable medium and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
- Unless explicitly stated otherwise, each numerical value and range should be interpreted as being approximate as if the word “about” or “approximately” preceded the value or range.
- It will be further understood that various changes in the details, materials, and arrangements of the parts which have been described and illustrated in order to explain embodiments of this invention may be made by those skilled in the art without departing from embodiments of the invention encompassed by the following claims.
- In this specification including any claims, the term “each” may be used to refer to one or more specified characteristics of a plurality of previously recited elements or steps. When used with the open-ended term “comprising,” the recitation of the term “each” does not exclude additional, unrecited elements or steps. Thus, it will be understood that an apparatus may have additional, unrecited elements and a method may have additional, unrecited steps, where the additional, unrecited elements or steps do not have the one or more specified characteristics.
- The use of figure numbers and/or figure reference labels in the claims is intended to identify one or more possible embodiments of the claimed subject matter in order to facilitate the interpretation of the claims. Such use is not to be construed as necessarily limiting the scope of those claims to the embodiments shown in the corresponding figures.
- It should be understood that the steps of the exemplary methods set forth herein are not necessarily required to be performed in the order described, and the order of the steps of such methods should be understood to be merely exemplary. Likewise, additional steps may be included in such methods, and certain steps may be omitted or combined, in methods consistent with various embodiments of the invention.
- Although the elements in the following method claims, if any, are recited in a particular sequence with corresponding labeling, unless the claim recitations otherwise imply a particular sequence for implementing some or all of those elements, those elements are not necessarily intended to be limited to being implemented in that particular sequence.
- Reference herein to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments necessarily mutually exclusive of other embodiments. The same applies to the term “implementation.”
- The embodiments covered by the claims in this application are limited to embodiments that (1) are enabled by this specification and (2) correspond to statutory subject matter. Non-enabled embodiments and embodiments that correspond to non-statutory subject matter are explicitly disclaimed even if they fall within the scope of the claims.
Claims (19)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/334,915 US9445196B2 (en) | 2013-07-24 | 2014-07-18 | Inter-channel coherence reduction for stereophonic and multichannel acoustic echo cancellation |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361857840P | 2013-07-24 | 2013-07-24 | |
| US14/334,915 US9445196B2 (en) | 2013-07-24 | 2014-07-18 | Inter-channel coherence reduction for stereophonic and multichannel acoustic echo cancellation |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20150030172A1 true US20150030172A1 (en) | 2015-01-29 |
| US9445196B2 US9445196B2 (en) | 2016-09-13 |
Family
ID=52390557
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/334,915 Active 2035-02-19 US9445196B2 (en) | 2013-07-24 | 2014-07-18 | Inter-channel coherence reduction for stereophonic and multichannel acoustic echo cancellation |
Country Status (1)
| Country | Link |
|---|---|
| US (1) | US9445196B2 (en) |
Cited By (85)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160065743A1 (en) * | 2014-08-27 | 2016-03-03 | Oki Electric Industry Co., Ltd. | Stereo echo suppressing device, echo suppressing device, stereo echo suppressing method, and non transitory computer-readable recording medium storing stereo echo suppressing program |
| US20190141195A1 (en) * | 2017-08-03 | 2019-05-09 | Bose Corporation | Efficient reutilization of acoustic echo canceler channels |
| US10367948B2 (en) | 2017-01-13 | 2019-07-30 | Shure Acquisition Holdings, Inc. | Post-mixing acoustic echo cancellation systems and methods |
| USD865723S1 (en) | 2015-04-30 | 2019-11-05 | Shure Acquisition Holdings, Inc | Array microphone assembly |
| US20190362733A1 (en) * | 2017-06-15 | 2019-11-28 | Goertek Inc. | Multichannel echo cancellation circuit and method and smart device |
| DE102018127071B3 (en) * | 2018-10-30 | 2020-01-09 | Harman Becker Automotive Systems Gmbh | Audio signal processing with acoustic echo cancellation |
| US11259116B2 (en) * | 2017-07-07 | 2022-02-22 | Yamaha Corporation | Sound processing method, remote conversation method, sound processing device, remote conversation device, headset, and remote conversation system |
| USD944776S1 (en) | 2020-05-05 | 2022-03-01 | Shure Acquisition Holdings, Inc. | Audio device |
| US11297423B2 (en) | 2018-06-15 | 2022-04-05 | Shure Acquisition Holdings, Inc. | Endfire linear array microphone |
| US11297426B2 (en) | 2019-08-23 | 2022-04-05 | Shure Acquisition Holdings, Inc. | One-dimensional array microphone with improved directivity |
| US11302347B2 (en) | 2019-05-31 | 2022-04-12 | Shure Acquisition Holdings, Inc. | Low latency automixer integrated with voice and noise activity detection |
| US11303981B2 (en) | 2019-03-21 | 2022-04-12 | Shure Acquisition Holdings, Inc. | Housings and associated design features for ceiling array microphones |
| US11310596B2 (en) | 2018-09-20 | 2022-04-19 | Shure Acquisition Holdings, Inc. | Adjustable lobe shape for array microphones |
| US11405430B2 (en) | 2016-02-22 | 2022-08-02 | Sonos, Inc. | Networked microphone device control |
| US11438691B2 (en) | 2019-03-21 | 2022-09-06 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality |
| US11445294B2 (en) | 2019-05-23 | 2022-09-13 | Shure Acquisition Holdings, Inc. | Steerable speaker array, system, and method for the same |
| US11482978B2 (en) | 2018-08-28 | 2022-10-25 | Sonos, Inc. | Audio notifications |
| US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
| US11501773B2 (en) | 2019-06-12 | 2022-11-15 | Sonos, Inc. | Network microphone device with command keyword conditioning |
| US11514898B2 (en) | 2016-02-22 | 2022-11-29 | Sonos, Inc. | Voice control of a media playback system |
| US11523212B2 (en) | 2018-06-01 | 2022-12-06 | Shure Acquisition Holdings, Inc. | Pattern-forming microphone array |
| US11531520B2 (en) | 2016-08-05 | 2022-12-20 | Sonos, Inc. | Playback device supporting concurrent voice assistants |
| US11538451B2 (en) * | 2017-09-28 | 2022-12-27 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
| US11552611B2 (en) | 2020-02-07 | 2023-01-10 | Shure Acquisition Holdings, Inc. | System and method for automatic adjustment of reference gain |
| US11557294B2 (en) | 2018-12-07 | 2023-01-17 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
| US11558693B2 (en) | 2019-03-21 | 2023-01-17 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition and voice activity detection functionality |
| US11556306B2 (en) | 2016-02-22 | 2023-01-17 | Sonos, Inc. | Voice controlled media playback system |
| US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
| US11563842B2 (en) | 2018-08-28 | 2023-01-24 | Sonos, Inc. | Do not disturb feature for audio notifications |
| US11641559B2 (en) | 2016-09-27 | 2023-05-02 | Sonos, Inc. | Audio playback settings for voice interaction |
| US11646045B2 (en) | 2017-09-27 | 2023-05-09 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
| US11646023B2 (en) | 2019-02-08 | 2023-05-09 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
| US11678109B2 (en) | 2015-04-30 | 2023-06-13 | Shure Acquisition Holdings, Inc. | Offset cartridge microphones |
| US11689858B2 (en) | 2018-01-31 | 2023-06-27 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
| US11694689B2 (en) | 2020-05-20 | 2023-07-04 | Sonos, Inc. | Input detection windowing |
| US11706562B2 (en) | 2020-05-29 | 2023-07-18 | Shure Acquisition Holdings, Inc. | Transducer steering and configuration systems and methods using a local positioning system |
| US11714600B2 (en) | 2019-07-31 | 2023-08-01 | Sonos, Inc. | Noise classification for event detection |
| US11727933B2 (en) | 2016-10-19 | 2023-08-15 | Sonos, Inc. | Arbitration-based voice recognition |
| US11736860B2 (en) | 2016-02-22 | 2023-08-22 | Sonos, Inc. | Voice control of a media playback system |
| US11741948B2 (en) | 2018-11-15 | 2023-08-29 | Sonos Vox France Sas | Dilated convolutions and gating for efficient keyword spotting |
| US11769505B2 (en) | 2017-09-28 | 2023-09-26 | Sonos, Inc. | Echo of tone interferance cancellation using two acoustic echo cancellers |
| US11778259B2 (en) | 2018-09-14 | 2023-10-03 | Sonos, Inc. | Networked devices, systems and methods for associating playback devices based on sound codes |
| US11785380B2 (en) | 2021-01-28 | 2023-10-10 | Shure Acquisition Holdings, Inc. | Hybrid audio beamforming system |
| US11790911B2 (en) | 2018-09-28 | 2023-10-17 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
| US11790937B2 (en) | 2018-09-21 | 2023-10-17 | Sonos, Inc. | Voice detection optimization using sound metadata |
| US11792590B2 (en) | 2018-05-25 | 2023-10-17 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
| US11798553B2 (en) | 2019-05-03 | 2023-10-24 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
| US11797263B2 (en) | 2018-05-10 | 2023-10-24 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
| US11817083B2 (en) | 2018-12-13 | 2023-11-14 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
| US11816393B2 (en) | 2017-09-08 | 2023-11-14 | Sonos, Inc. | Dynamic computation of system response volume |
| US11854547B2 (en) | 2019-06-12 | 2023-12-26 | Sonos, Inc. | Network microphone device with command keyword eventing |
| US11862161B2 (en) | 2019-10-22 | 2024-01-02 | Sonos, Inc. | VAS toggle based on device orientation |
| US11869503B2 (en) | 2019-12-20 | 2024-01-09 | Sonos, Inc. | Offline voice control |
| US11893308B2 (en) | 2017-09-29 | 2024-02-06 | Sonos, Inc. | Media playback system with concurrent voice assistance |
| US11900937B2 (en) | 2017-08-07 | 2024-02-13 | Sonos, Inc. | Wake-word detection suppression |
| US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
| US11947870B2 (en) | 2016-02-22 | 2024-04-02 | Sonos, Inc. | Audio response playback |
| US11961519B2 (en) | 2020-02-07 | 2024-04-16 | Sonos, Inc. | Localized wakeword verification |
| US11979960B2 (en) | 2016-07-15 | 2024-05-07 | Sonos, Inc. | Contextualization of voice inputs |
| US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
| US11983463B2 (en) | 2016-02-22 | 2024-05-14 | Sonos, Inc. | Metadata exchange involving a networked playback system and a networked microphone system |
| US12028678B2 (en) | 2019-11-01 | 2024-07-02 | Shure Acquisition Holdings, Inc. | Proximity microphone |
| US12047753B1 (en) | 2017-09-28 | 2024-07-23 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
| US12062383B2 (en) | 2018-09-29 | 2024-08-13 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
| US12063486B2 (en) | 2018-12-20 | 2024-08-13 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
| US12080314B2 (en) | 2016-06-09 | 2024-09-03 | Sonos, Inc. | Dynamic player selection for audio signal processing |
| US12118273B2 (en) | 2020-01-31 | 2024-10-15 | Sonos, Inc. | Local voice data processing |
| US12154569B2 (en) | 2017-12-11 | 2024-11-26 | Sonos, Inc. | Home graph |
| US12159085B2 (en) | 2020-08-25 | 2024-12-03 | Sonos, Inc. | Vocal guidance engines for playback devices |
| US12165651B2 (en) | 2018-09-25 | 2024-12-10 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
| US12212945B2 (en) | 2017-12-10 | 2025-01-28 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
| US12211490B2 (en) | 2019-07-31 | 2025-01-28 | Sonos, Inc. | Locally distributed keyword detection |
| US12217748B2 (en) | 2017-03-27 | 2025-02-04 | Sonos, Inc. | Systems and methods of multiple voice services |
| US12250526B2 (en) | 2022-01-07 | 2025-03-11 | Shure Acquisition Holdings, Inc. | Audio beamforming with nulling control system and methods |
| US12279096B2 (en) | 2018-06-28 | 2025-04-15 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
| US12283269B2 (en) | 2020-10-16 | 2025-04-22 | Sonos, Inc. | Intent inference in audiovisual communication sessions |
| US12289584B2 (en) | 2021-10-04 | 2025-04-29 | Shure Acquisition Holdings, Inc. | Networked automixer systems and methods |
| US12327556B2 (en) | 2021-09-30 | 2025-06-10 | Sonos, Inc. | Enabling and disabling microphones and voice assistants |
| US12327549B2 (en) | 2022-02-09 | 2025-06-10 | Sonos, Inc. | Gatekeeping for voice intent processing |
| US12387716B2 (en) | 2020-06-08 | 2025-08-12 | Sonos, Inc. | Wakewordless voice quickstarts |
| US12452584B2 (en) | 2021-01-29 | 2025-10-21 | Shure Acquisition Holdings, Inc. | Scalable conferencing systems and methods |
| US12525083B2 (en) | 2021-11-05 | 2026-01-13 | Shure Acquisition Holdings, Inc. | Distributed algorithm for automixing speech over wireless networks |
| US12542123B2 (en) | 2021-08-31 | 2026-02-03 | Shure Acquisition Holdings, Inc. | Mask non-linear processor for acoustic echo cancellation |
| US12579978B2 (en) | 2018-09-14 | 2026-03-17 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
| US12598261B2 (en) | 2022-09-28 | 2026-04-07 | Shure Acquisition Holdings, Inc. | Wideband doubletalk detection for optimization of acoustic echo cancellation |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11405735B2 (en) * | 2020-06-16 | 2022-08-02 | Fujifilm Business Innovation Corp. | System and method for dynamically adjusting settings of audio output devices to reduce noise in adjacent spaces |
| TWI802108B (en) * | 2021-05-08 | 2023-05-11 | 英屬開曼群島商意騰科技股份有限公司 | Speech processing apparatus and method for acoustic echo reduction |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6895093B1 (en) * | 1998-03-03 | 2005-05-17 | Texas Instruments Incorporated | Acoustic echo-cancellation system |
| US7403609B2 (en) * | 2001-07-11 | 2008-07-22 | Yamaha Corporation | Multi-channel echo cancel method, multi-channel sound transfer method, stereo echo canceller, stereo sound transfer apparatus and transfer function calculation apparatus |
| US20090304198A1 (en) * | 2006-04-13 | 2009-12-10 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio signal decorrelator, multi channel audio signal processor, audio signal processor, method for deriving an output audio signal from an input audio signal and computer program |
| US9269343B2 (en) * | 2012-11-27 | 2016-02-23 | Oticon A/S | Method of controlling an update algorithm of an adaptive feedback estimation system and a decorrelation unit |
-
2014
- 2014-07-18 US US14/334,915 patent/US9445196B2/en active Active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6895093B1 (en) * | 1998-03-03 | 2005-05-17 | Texas Instruments Incorporated | Acoustic echo-cancellation system |
| US7403609B2 (en) * | 2001-07-11 | 2008-07-22 | Yamaha Corporation | Multi-channel echo cancel method, multi-channel sound transfer method, stereo echo canceller, stereo sound transfer apparatus and transfer function calculation apparatus |
| US20090304198A1 (en) * | 2006-04-13 | 2009-12-10 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio signal decorrelator, multi channel audio signal processor, audio signal processor, method for deriving an output audio signal from an input audio signal and computer program |
| US9269343B2 (en) * | 2012-11-27 | 2016-02-23 | Oticon A/S | Method of controlling an update algorithm of an adaptive feedback estimation system and a decorrelation unit |
Cited By (122)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160065743A1 (en) * | 2014-08-27 | 2016-03-03 | Oki Electric Industry Co., Ltd. | Stereo echo suppressing device, echo suppressing device, stereo echo suppressing method, and non transitory computer-readable recording medium storing stereo echo suppressing program |
| US9531884B2 (en) * | 2014-08-27 | 2016-12-27 | Oki Electric Industry Co., Ltd. | Stereo echo suppressing device, echo suppressing device, stereo echo suppressing method, and non-transitory computer-readable recording medium storing stereo echo suppressing program |
| USD940116S1 (en) | 2015-04-30 | 2022-01-04 | Shure Acquisition Holdings, Inc. | Array microphone assembly |
| US11832053B2 (en) | 2015-04-30 | 2023-11-28 | Shure Acquisition Holdings, Inc. | Array microphone system and method of assembling the same |
| USD865723S1 (en) | 2015-04-30 | 2019-11-05 | Shure Acquisition Holdings, Inc | Array microphone assembly |
| US11310592B2 (en) | 2015-04-30 | 2022-04-19 | Shure Acquisition Holdings, Inc. | Array microphone system and method of assembling the same |
| US11678109B2 (en) | 2015-04-30 | 2023-06-13 | Shure Acquisition Holdings, Inc. | Offset cartridge microphones |
| US12262174B2 (en) | 2015-04-30 | 2025-03-25 | Shure Acquisition Holdings, Inc. | Array microphone system and method of assembling the same |
| US11832068B2 (en) | 2016-02-22 | 2023-11-28 | Sonos, Inc. | Music service selection |
| US11405430B2 (en) | 2016-02-22 | 2022-08-02 | Sonos, Inc. | Networked microphone device control |
| US11556306B2 (en) | 2016-02-22 | 2023-01-17 | Sonos, Inc. | Voice controlled media playback system |
| US11750969B2 (en) | 2016-02-22 | 2023-09-05 | Sonos, Inc. | Default playback device designation |
| US12047752B2 (en) | 2016-02-22 | 2024-07-23 | Sonos, Inc. | Content mixing |
| US11983463B2 (en) | 2016-02-22 | 2024-05-14 | Sonos, Inc. | Metadata exchange involving a networked playback system and a networked microphone system |
| US12505832B2 (en) | 2016-02-22 | 2025-12-23 | Sonos, Inc. | Voice control of a media playback system |
| US12277368B2 (en) | 2016-02-22 | 2025-04-15 | Sonos, Inc. | Handling of loss of pairing between networked devices |
| US11863593B2 (en) | 2016-02-22 | 2024-01-02 | Sonos, Inc. | Networked microphone device control |
| US11736860B2 (en) | 2016-02-22 | 2023-08-22 | Sonos, Inc. | Voice control of a media playback system |
| US11947870B2 (en) | 2016-02-22 | 2024-04-02 | Sonos, Inc. | Audio response playback |
| US11514898B2 (en) | 2016-02-22 | 2022-11-29 | Sonos, Inc. | Voice control of a media playback system |
| US12080314B2 (en) | 2016-06-09 | 2024-09-03 | Sonos, Inc. | Dynamic player selection for audio signal processing |
| US11979960B2 (en) | 2016-07-15 | 2024-05-07 | Sonos, Inc. | Contextualization of voice inputs |
| US11531520B2 (en) | 2016-08-05 | 2022-12-20 | Sonos, Inc. | Playback device supporting concurrent voice assistants |
| US11641559B2 (en) | 2016-09-27 | 2023-05-02 | Sonos, Inc. | Audio playback settings for voice interaction |
| US11727933B2 (en) | 2016-10-19 | 2023-08-15 | Sonos, Inc. | Arbitration-based voice recognition |
| US12309326B2 (en) | 2017-01-13 | 2025-05-20 | Shure Acquisition Holdings, Inc. | Post-mixing acoustic echo cancellation systems and methods |
| US11477327B2 (en) | 2017-01-13 | 2022-10-18 | Shure Acquisition Holdings, Inc. | Post-mixing acoustic echo cancellation systems and methods |
| US10367948B2 (en) | 2017-01-13 | 2019-07-30 | Shure Acquisition Holdings, Inc. | Post-mixing acoustic echo cancellation systems and methods |
| US12217748B2 (en) | 2017-03-27 | 2025-02-04 | Sonos, Inc. | Systems and methods of multiple voice services |
| US20190362733A1 (en) * | 2017-06-15 | 2019-11-28 | Goertek Inc. | Multichannel echo cancellation circuit and method and smart device |
| US10643634B2 (en) * | 2017-06-15 | 2020-05-05 | Goertek Inc. | Multichannel echo cancellation circuit and method and smart device |
| US11259116B2 (en) * | 2017-07-07 | 2022-02-22 | Yamaha Corporation | Sound processing method, remote conversation method, sound processing device, remote conversation device, headset, and remote conversation system |
| US20190141195A1 (en) * | 2017-08-03 | 2019-05-09 | Bose Corporation | Efficient reutilization of acoustic echo canceler channels |
| US10601998B2 (en) * | 2017-08-03 | 2020-03-24 | Bose Corporation | Efficient reutilization of acoustic echo canceler channels |
| US11900937B2 (en) | 2017-08-07 | 2024-02-13 | Sonos, Inc. | Wake-word detection suppression |
| US11816393B2 (en) | 2017-09-08 | 2023-11-14 | Sonos, Inc. | Dynamic computation of system response volume |
| US11646045B2 (en) | 2017-09-27 | 2023-05-09 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
| US11769505B2 (en) | 2017-09-28 | 2023-09-26 | Sonos, Inc. | Echo of tone interferance cancellation using two acoustic echo cancellers |
| US12047753B1 (en) | 2017-09-28 | 2024-07-23 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
| US11817076B2 (en) | 2017-09-28 | 2023-11-14 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
| US12236932B2 (en) | 2017-09-28 | 2025-02-25 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
| US11538451B2 (en) * | 2017-09-28 | 2022-12-27 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
| US11893308B2 (en) | 2017-09-29 | 2024-02-06 | Sonos, Inc. | Media playback system with concurrent voice assistance |
| US12212945B2 (en) | 2017-12-10 | 2025-01-28 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
| US12154569B2 (en) | 2017-12-11 | 2024-11-26 | Sonos, Inc. | Home graph |
| US11689858B2 (en) | 2018-01-31 | 2023-06-27 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
| US11797263B2 (en) | 2018-05-10 | 2023-10-24 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
| US12360734B2 (en) | 2018-05-10 | 2025-07-15 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
| US11792590B2 (en) | 2018-05-25 | 2023-10-17 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
| US12513479B2 (en) | 2018-05-25 | 2025-12-30 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
| US11800281B2 (en) | 2018-06-01 | 2023-10-24 | Shure Acquisition Holdings, Inc. | Pattern-forming microphone array |
| US11523212B2 (en) | 2018-06-01 | 2022-12-06 | Shure Acquisition Holdings, Inc. | Pattern-forming microphone array |
| US11770650B2 (en) | 2018-06-15 | 2023-09-26 | Shure Acquisition Holdings, Inc. | Endfire linear array microphone |
| US11297423B2 (en) | 2018-06-15 | 2022-04-05 | Shure Acquisition Holdings, Inc. | Endfire linear array microphone |
| US12279096B2 (en) | 2018-06-28 | 2025-04-15 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
| US11482978B2 (en) | 2018-08-28 | 2022-10-25 | Sonos, Inc. | Audio notifications |
| US11563842B2 (en) | 2018-08-28 | 2023-01-24 | Sonos, Inc. | Do not disturb feature for audio notifications |
| US11778259B2 (en) | 2018-09-14 | 2023-10-03 | Sonos, Inc. | Networked devices, systems and methods for associating playback devices based on sound codes |
| US12579978B2 (en) | 2018-09-14 | 2026-03-17 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
| US12490023B2 (en) | 2018-09-20 | 2025-12-02 | Shure Acquisition Holdings, Inc. | Adjustable lobe shape for array microphones |
| US11310596B2 (en) | 2018-09-20 | 2022-04-19 | Shure Acquisition Holdings, Inc. | Adjustable lobe shape for array microphones |
| US11790937B2 (en) | 2018-09-21 | 2023-10-17 | Sonos, Inc. | Voice detection optimization using sound metadata |
| US12230291B2 (en) | 2018-09-21 | 2025-02-18 | Sonos, Inc. | Voice detection optimization using sound metadata |
| US12165651B2 (en) | 2018-09-25 | 2024-12-10 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
| US11790911B2 (en) | 2018-09-28 | 2023-10-17 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
| US12165644B2 (en) | 2018-09-28 | 2024-12-10 | Sonos, Inc. | Systems and methods for selective wake word detection |
| US12062383B2 (en) | 2018-09-29 | 2024-08-13 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
| US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
| US10979100B2 (en) * | 2018-10-30 | 2021-04-13 | Harman Becker Automotive Systems Gmbh | Audio signal processing with acoustic echo cancellation |
| CN111128210A (en) * | 2018-10-30 | 2020-05-08 | 哈曼贝克自动系统股份有限公司 | Audio Signal Processing with Acoustic Echo Cancellation |
| US20200136675A1 (en) * | 2018-10-30 | 2020-04-30 | Harman Becker Automotive Systems Gmbh | Audio signal processing with acoustic echo cancellation |
| DE102018127071B3 (en) * | 2018-10-30 | 2020-01-09 | Harman Becker Automotive Systems Gmbh | Audio signal processing with acoustic echo cancellation |
| US11741948B2 (en) | 2018-11-15 | 2023-08-29 | Sonos Vox France Sas | Dilated convolutions and gating for efficient keyword spotting |
| US11557294B2 (en) | 2018-12-07 | 2023-01-17 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
| US11817083B2 (en) | 2018-12-13 | 2023-11-14 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
| US12063486B2 (en) | 2018-12-20 | 2024-08-13 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
| US11646023B2 (en) | 2019-02-08 | 2023-05-09 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
| US12284479B2 (en) | 2019-03-21 | 2025-04-22 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality |
| US11778368B2 (en) | 2019-03-21 | 2023-10-03 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality |
| US11303981B2 (en) | 2019-03-21 | 2022-04-12 | Shure Acquisition Holdings, Inc. | Housings and associated design features for ceiling array microphones |
| US11558693B2 (en) | 2019-03-21 | 2023-01-17 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition and voice activity detection functionality |
| US11438691B2 (en) | 2019-03-21 | 2022-09-06 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality |
| US12425766B2 (en) | 2019-03-21 | 2025-09-23 | Shure Acquisition Holdings, Inc. | Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition and voice activity detection functionality |
| US11798553B2 (en) | 2019-05-03 | 2023-10-24 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
| US12518756B2 (en) | 2019-05-03 | 2026-01-06 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
| US11800280B2 (en) | 2019-05-23 | 2023-10-24 | Shure Acquisition Holdings, Inc. | Steerable speaker array, system and method for the same |
| US11445294B2 (en) | 2019-05-23 | 2022-09-13 | Shure Acquisition Holdings, Inc. | Steerable speaker array, system, and method for the same |
| US11688418B2 (en) | 2019-05-31 | 2023-06-27 | Shure Acquisition Holdings, Inc. | Low latency automixer integrated with voice and noise activity detection |
| US11302347B2 (en) | 2019-05-31 | 2022-04-12 | Shure Acquisition Holdings, Inc. | Low latency automixer integrated with voice and noise activity detection |
| US11501773B2 (en) | 2019-06-12 | 2022-11-15 | Sonos, Inc. | Network microphone device with command keyword conditioning |
| US11854547B2 (en) | 2019-06-12 | 2023-12-26 | Sonos, Inc. | Network microphone device with command keyword eventing |
| US11714600B2 (en) | 2019-07-31 | 2023-08-01 | Sonos, Inc. | Noise classification for event detection |
| US12211490B2 (en) | 2019-07-31 | 2025-01-28 | Sonos, Inc. | Locally distributed keyword detection |
| US11750972B2 (en) | 2019-08-23 | 2023-09-05 | Shure Acquisition Holdings, Inc. | One-dimensional array microphone with improved directivity |
| US11297426B2 (en) | 2019-08-23 | 2022-04-05 | Shure Acquisition Holdings, Inc. | One-dimensional array microphone with improved directivity |
| US11862161B2 (en) | 2019-10-22 | 2024-01-02 | Sonos, Inc. | VAS toggle based on device orientation |
| US12028678B2 (en) | 2019-11-01 | 2024-07-02 | Shure Acquisition Holdings, Inc. | Proximity microphone |
| US12501207B2 (en) | 2019-11-01 | 2025-12-16 | Shure Acquisition Holdings, Inc. | Proximity microphone |
| US11869503B2 (en) | 2019-12-20 | 2024-01-09 | Sonos, Inc. | Offline voice control |
| US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
| US12118273B2 (en) | 2020-01-31 | 2024-10-15 | Sonos, Inc. | Local voice data processing |
| US11961519B2 (en) | 2020-02-07 | 2024-04-16 | Sonos, Inc. | Localized wakeword verification |
| US11552611B2 (en) | 2020-02-07 | 2023-01-10 | Shure Acquisition Holdings, Inc. | System and method for automatic adjustment of reference gain |
| USD944776S1 (en) | 2020-05-05 | 2022-03-01 | Shure Acquisition Holdings, Inc. | Audio device |
| US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
| US11694689B2 (en) | 2020-05-20 | 2023-07-04 | Sonos, Inc. | Input detection windowing |
| US11706562B2 (en) | 2020-05-29 | 2023-07-18 | Shure Acquisition Holdings, Inc. | Transducer steering and configuration systems and methods using a local positioning system |
| US12149886B2 (en) | 2020-05-29 | 2024-11-19 | Shure Acquisition Holdings, Inc. | Transducer steering and configuration systems and methods using a local positioning system |
| US12387716B2 (en) | 2020-06-08 | 2025-08-12 | Sonos, Inc. | Wakewordless voice quickstarts |
| US12159085B2 (en) | 2020-08-25 | 2024-12-03 | Sonos, Inc. | Vocal guidance engines for playback devices |
| US12283269B2 (en) | 2020-10-16 | 2025-04-22 | Sonos, Inc. | Intent inference in audiovisual communication sessions |
| US12424220B2 (en) | 2020-11-12 | 2025-09-23 | Sonos, Inc. | Network device interaction by range |
| US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
| US11785380B2 (en) | 2021-01-28 | 2023-10-10 | Shure Acquisition Holdings, Inc. | Hybrid audio beamforming system |
| US12452584B2 (en) | 2021-01-29 | 2025-10-21 | Shure Acquisition Holdings, Inc. | Scalable conferencing systems and methods |
| US12542123B2 (en) | 2021-08-31 | 2026-02-03 | Shure Acquisition Holdings, Inc. | Mask non-linear processor for acoustic echo cancellation |
| US12327556B2 (en) | 2021-09-30 | 2025-06-10 | Sonos, Inc. | Enabling and disabling microphones and voice assistants |
| US12289584B2 (en) | 2021-10-04 | 2025-04-29 | Shure Acquisition Holdings, Inc. | Networked automixer systems and methods |
| US12525083B2 (en) | 2021-11-05 | 2026-01-13 | Shure Acquisition Holdings, Inc. | Distributed algorithm for automixing speech over wireless networks |
| US12250526B2 (en) | 2022-01-07 | 2025-03-11 | Shure Acquisition Holdings, Inc. | Audio beamforming with nulling control system and methods |
| US12327549B2 (en) | 2022-02-09 | 2025-06-10 | Sonos, Inc. | Gatekeeping for voice intent processing |
| US12598261B2 (en) | 2022-09-28 | 2026-04-07 | Shure Acquisition Holdings, Inc. | Wideband doubletalk detection for optimization of acoustic echo cancellation |
Also Published As
| Publication number | Publication date |
|---|---|
| US9445196B2 (en) | 2016-09-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9445196B2 (en) | Inter-channel coherence reduction for stereophonic and multichannel acoustic echo cancellation | |
| CN105144674B (en) | Multi-channel echo is eliminated and noise suppressed | |
| US8538037B2 (en) | Audio signal decorrelator, multi channel audio signal processor, audio signal processor, method for deriving an output audio signal from an input audio signal and computer program | |
| US9870783B2 (en) | Audio signal processing | |
| US9711131B2 (en) | Sound zone arrangement with zonewise speech suppression | |
| CN101133633B (en) | Audio system and method for acoustic echo cancellation | |
| US10979100B2 (en) | Audio signal processing with acoustic echo cancellation | |
| US8682006B1 (en) | Noise suppression based on null coherence | |
| KR101250124B1 (en) | Apparatus and Method for Computing Control Information for an Echo Suppression Filter and Apparatus and Method for Computing a Delay Value | |
| US8761410B1 (en) | Systems and methods for multi-channel dereverberation | |
| Schmidt et al. | Signal processing for in-car communication systems | |
| US20160066088A1 (en) | Utilizing level differences for speech enhancement | |
| US9743215B2 (en) | Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio | |
| US8259926B1 (en) | System and method for 2-channel and 3-channel acoustic echo cancellation | |
| TW201207845A (en) | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system | |
| WO2009117084A2 (en) | System and method for envelope-based acoustic echo cancellation | |
| US20180047408A1 (en) | System and method for addressing acoustic signal reverberation | |
| Bispo et al. | Hybrid pre-processor based on frequency shifting for stereophonic acoustic echo cancellation | |
| Valero et al. | Insight into a phase modulation technique for signal decorrelation in multi-channel acoustic echo cancellation | |
| Wada et al. | Multi-channel acoustic echo cancellation based on residual echo enhancement with effective channel decorrelation via resampling | |
| Guo | Analysis, design, and evaluation of acoustic feedback cancellation systems for hearing aids | |
| Arun | Efficient and robust acoustic feedback cancellation algorithm for in-car communication system | |
| Laska | Transform domain model-based wideband speech enhancement with hearing aid applications |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: MH ACOUSTICS LLC, NEW JERSEY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GAENSLER, TOMAS F.;DIETHORN, ERIC J.;REEL/FRAME:033440/0925 Effective date: 20140722 |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2551); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY Year of fee payment: 4 |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2552); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY Year of fee payment: 8 |