...A Planet of Partners™

  • Increase font size
  • Default font size
  • Decrease font size

Patents

 
User-selected media content blocking
Tue, 18 Jun 2013 08:00:00 EDT
Presented herein is a method of blocking user-selected media content, such as, for example, audio and/or video content. In the method, at least one media content stream is presented to a user, wherein the at least one media content stream includes multiple showings of an identifiable contiguous segment of media content. One example of such a segment may be a commercial or advertisement. While presenting a current showing of the segment of media content to the user, a command is received from a user to block the segment of media content from presentation to the user. In response to receiving the command, information identifying the segment of media content is generated. The at least one media content stream is analyzed using the information to detect subsequent showings of the segment of media content. Presentation of at least one of the subsequent showings of the segment of media content is prevented when detected in the at least one media content stream.
Digital data distribution system with switching unit, online acquisition unit, and conversion unit for converting from first to second format
Tue, 18 Jun 2013 08:00:00 EDT
A CD on which only music information specified by the CD-DA is recorded, or a CD on which both music information specified by the CD-DA and music information to be recorded on a CD-ROM are recorded is mounted upon an information processing terminal. When the CD on which only music information specified by the CD-DA is recorded is mounted, the information processing terminal acquires, from a directory server, an ISRC number that identifies the music information recorded on the CD, and distribution server location information that identifies a content distribution server. The information processing terminal acquires content that is the music information compressed according to the MP3 and encrypted, from the content distribution server identified by the acquired distribution server location information, and the decryption key. The information processing terminal then decrypts the acquired content using the acquired decryption key and reproduces music.
Audio decoding using variable-length codebook application ranges
Tue, 18 Jun 2013 08:00:00 EDT
Provided are, among other things, systems, methods and techniques for decoding an audio signal from a frame-based bit stream. At least one frame includes processing information pertaining to the frame and entropy-encoded quantization indexes representing audio data within the frame. The processing information includes: (i) code book indexes, and (ii) code book application information specifying ranges of entropy-encoded quantization indexes to which the code books are to be applied. The entropy-encoded quantization indexes are decoded by applying the identified code books to the corresponding ranges of entropy-encoded quantization indexes.
Method and apparatus for processing signal
Tue, 18 Jun 2013 08:00:00 EDT
A method and an apparatus for processing a signal are provided. The method includes: obtaining an energy average value of each sub-band for a current frame frequency-domain signal; obtaining a current frame modification coefficient of each sub-band for the current frame frequency-domain signal according to a spectral envelope and the energy average value of each sub-band; obtaining a weighted modification coefficient of each sub-band for the current frame frequency-domain signal by using the current frame modification coefficient and a relevant frame modification coefficient; and modifying the spectral envelope of each sub-band for the current frame frequency-domain signal by using the weighted modification coefficient.
Generating a frame of audio data
Tue, 18 Jun 2013 08:00:00 EDT
A method of generating a frame of audio data for an audio signal from preceding audio data for the audio signal that precede the frame of audio data, the method comprising the steps of: predicting a predetermined number of data samples for the frame of audio data based on the preceding audio data, to form predicted data samples; identifying a section of the preceding audio data for use in generating the frame of audio data; and forming the audio data of the frame of audio data as a repetition (602) of at least part of the identified section to span the frame of audio data, wherein the beginning of the frame of audio data comprises a combination of a subset of the repetition (602) of the at least part of the identified section and the predicted data samples.
Handsfree device with countinuous keyword recognition
Tue, 18 Jun 2013 08:00:00 EDT
A handsfree device, which is coupled to a data processing device, may be operable to monitor at least one audio stream for occurrence of at least one keyword. Upon recognition of the at least one keyword, the handsfree device may establish a first connection between the handsfree device and the data processing device for launching a voice interface in the data processing device. The handsfree device may send audio data received after the recognition of the at least one keyword to the data processing device, via the first connection for responding to the audio data via the voice interface. During a keyword configuration operation, the handsfree device may send at least one inputted keyword to the data processing device for recording. The handsfree device may receive, via a second connection, the recorded at least one keyword from the data processing device for keyword configuration of the handsfree device.
Voice control for asynchronous notifications
Tue, 18 Jun 2013 08:00:00 EDT
A computing device may receive an incoming communication and, in response, generate a notification that indicates that the incoming communication can be accessed using a particular application on the communication device. The computing device may further provide an audio signal indicative of the notification and automatically activate a listening mode. The computing device may receive a voice input during the listening mode, and an input text may be obtained based on speech recognition performed upon the voice input. A command may be detected in the input text. In response to the command, the computing device may generate an output text that is based on at least the notification and provide a voice output that is generated from the output text via speech synthesis. The voice output identifies at least the particular application.
System and method for writing digits in words and pronunciation of numbers, fractions, and units
Tue, 18 Jun 2013 08:00:00 EDT
Disclosed is a system and method for converting a digital number to text and for pronouncing the digital number. The system includes a filtration system for determining whether the digital number has nonnumeric symbols and for generating a filtrated number, an analyzing system for analyzing the filtrated number, a composition system configured to collect words associated with ternary units of the filtrated number, a linking system configured to link the words, and a pronouncing system for pronouncing the linked words.
Speech synthesis apparatus and method wherein more than one speech unit is acquired from continuous memory region by one access
Tue, 18 Jun 2013 08:00:00 EDT
An apparatus for synthesizing a speech including a waveform memory that stores a plurality of speech unit waveforms, an information memory that correspondingly stores speech unit information and an address of each of the speech unit waveforms, a selector that selects a speech unit sequence corresponding to the input phoneme sequence by referring to the speech unit information, a speech unit waveform acquisition unit that acquires a speech unit waveform corresponding to each speech unit of the speech unit sequence from the waveform memory by referring to the address, a speech unit concatenation unit that generates the speech by concatenating the speech unit waveform acquired.
Adaptive noise modeling speech recognition system
Tue, 18 Jun 2013 08:00:00 EDT
An adaptive noise modeling speech recognition system improves speech recognition by modifying an activation of the system's grammar rules or models based on detected noise characteristics. An adaptive noise modeling speech recognition system includes a sensor that receives acoustic data having a speech component and a noise component. A processor analyzes the acoustic data and generates a noise indicator that identifies a characteristic of the noise component. An integrating decision logic processes the noise indicator and generates a noise model activation data structure that includes data that may be used by a speech recognition engine to adjust the activation of associated grammar rules or models.
Apparatus and method for canceling noise of voice signal in electronic apparatus
Tue, 18 Jun 2013 08:00:00 EDT
An apparatus and a method for canceling noise in a voice signal in an electronic apparatus are provided. The apparatus includes a Generalized Sidelobe Canceller (GSC) and a decision unit. The GSC cancels noise components from signals with different phases input via a plurality of microphones. The decision unit estimates a Signal-to-Noise Ratio (SNR) of an input signal to determine a step-size of a filter included in the GSC.
Multi-stage quantization method and device
Tue, 18 Jun 2013 08:00:00 EDT
The invention discloses a multi-stage quantization method, which includes the following steps: obtaining a reference codebook according to a previous stage codebook; obtaining a current stage codebook according to the reference codebook and a scaling factor; and quantizing an input vector by using the current stage codebook. The invention also discloses a multi-stage quantization device. With the invention, the current stage codebook may be obtained according to the previous stage codebook, by using the correlation between the current stage codebook and the previous stage codebook. As a result, it does not require an independent codebook space for the current stage codebook, which saves the storage space and improves the resource usage efficiency.
Speech feature extraction apparatus, speech feature extraction method, and speech feature extraction program
Tue, 18 Jun 2013 08:00:00 EDT
A speech feature extraction apparatus, speech feature extraction method, and speech feature extraction program. A speech feature extraction apparatus includes: first difference calculation module to: (i) receive, as an input, a spectrum of a speech signal segmented into frames for each frequency bin; and (ii) calculate a delta spectrum for each of the frame, where the delta spectrum is a difference of the spectrum within continuous frames for the frequency bin; and first normalization module to normalize the delta spectrum of the frame for the frequency bin by dividing the delta spectrum by a function of an average spectrum; where the average spectrum is an average of spectra through all frames that are overall speech for the frequency bin; and where an output of the first normalization module is defined as a first delta feature.
Parameter decoding device, parameter encoding device, and parameter decoding method
Tue, 18 Jun 2013 08:00:00 EDT
A parameter decoding device performs a parameter compensation process so as to suppress degradation of a main observation quality in a prediction quantization. The parameter decoding device includes first amplifiers which multiply inputted quantization prediction residual vectors by a weighting coefficient. A further amplifier multiplies the preceding frame decoding LSF vector yn−1 by the weighting coefficient. An additional amplifier multiplies the code vector xn+1 outputted from a codebook by the weighting coefficient β0. An adder calculates the total of the vectors outputted from the amplifiers, the further amplifier, and the additional amplifier. A selector switch selects the vector outputted from the adder if the frame erasure coding Bn of the current frame indicates that ‘the n-th frame is an erased frame’ and the frame erasure coding Bn+1 of the next frame indicates that ‘the n+1-th frame is a normal frame’.
Voicing detection modules in a system for automatic transcription of sung or hummed melodies
Tue, 18 Jun 2013 08:00:00 EDT
The technology disclosed relates to audio signal processing. It includes a series of modules that individually are useful to solve audio signal processing problems. Among the problems addressed are buzz removal, selecting a pitch candidate among pitch candidates based on local continuity of pitch and regional octave consistency, making small adjustments in pitch, ensuring that a selected pitch is consistent with harmonic peaks, determining whether a given frame or region of frames includes harmonic, voiced signal, extracting harmonics from voice signals and detecting vibrato. One environment in which these modules are useful is transcribing singing or humming into a symbolic melody. Another environment that would usefully employ some of these modules is speech processing. Some of the modules, such as buzz removal, are useful in many other environments as well.
Method, system and computer readable recording medium for correcting OCR result
Tue, 18 Jun 2013 08:00:00 EDT
Disclosed is a method, system and computer readable recording medium for correcting an OCR result. According to an exemplary embodiment of the present invention, there is provided a method for correcting an OCR result, the method including performing character recognition on content including character information using an OCR technique, removing extra carriage return information from the content, outputting the character recognition result, and correcting word spacing on the outputted result.
Acoustic model adaptation using geographic information
Tue, 18 Jun 2013 08:00:00 EDT
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving an audio signal that corresponds to an utterance recorded by a mobile device, determining a geographic location associated with the mobile device, adapting one or more acoustic models for the geographic location, and performing speech recognition on the audio signal using the one or more acoustic models model that are adapted for the geographic location.
Detecting writing systems and languages
Tue, 18 Jun 2013 08:00:00 EDT
Methods, systems, and apparatus, including computer program products, for detecting writing systems and languages are disclosed. In one implementation, a method is provided. The method includes receiving text; identifying portions of the text as being non-repetitive, the identifying including: compressing underlying data of a first portion of the text, identifying a data compression ratio based on the amount of compression of the underlying data, and determining whether the first portion of the text is non-repetitive based on the data compression ratio; and identifying the first portion of the text as candidate text for use in language detection based on the portions of the text that are determined to be non-repetitive.
System and method for language translation in a hybrid peer-to-peer environment
Tue, 18 Jun 2013 08:00:00 EDT
An improved system and method are disclosed for peer-to-peer communications. In one example, the method enables an endpoint to send and/or receive audio speech translations to facilitate communications between users who speak different languages.
Microphone and voice activity detection (VAD) configurations for use with communication systems
Tue, 18 Jun 2013 08:00:00 EDT
Communication systems are described, including both portable handset and headset devices, which use a number of microphone configurations to receive acoustic signals of an environment. The microphone configurations include, for example, a two-microphone array including two unidirectional microphones, and a two-microphone array including one unidirectional microphone and one omnidirectional microphone. The communication systems also include Voice Activity Detection (VAD) devices to provide information of human voicing activity. Components of the communications systems receive the acoustic signals and voice activity signals and, in response, automatically generate control signals from data of the voice activity signals. Components of the communication systems use the control signals to automatically select a denoising method appropriate to data of frequency subbands of the acoustic signals. The selected denoising method is applied to the acoustic signals to generate denoised acoustic signals when the acoustic signal includes speech and noise.
Dereverberation apparatus, dereverberation method, dereverberation program, and recording medium
Tue, 18 Jun 2013 08:00:00 EDT
A sound source model storage section stores a sound source model that represents an audio signal emitted from a sound source in the form of a probability density function. An observation signal, which is obtained by collecting the audio signal, is converted into a plurality of frequency-specific observation signals each corresponding to one of a plurality of frequency bands. Then, a dereverberation filter corresponding to each frequency band is estimated by using the frequency-specific observation signal for the frequency band on the basis of the sound source model and a reverberation model that represents a relationship for each frequency band among the audio signal, the observation signal and the dereverberation filter. A frequency-specific target signal corresponding to each frequency band is determined by applying the dereverberation filter for the frequency band to the frequency-specific observation signal for the frequency band, and the resulting frequency-specific target signals are integrated.
System and method for processing calls in a call center
Tue, 18 Jun 2013 08:00:00 EDT
A system and method for processing calls in a call center are described. A call session from a caller via a session manager and including incoming text messages of a verbal speech stream is assigned. The incoming text messages are progressively visually presented throughout the call session to a live agent on an agent console operatively coupled to the session manager. The incoming text messages are progressively processed through a customer support scenario interactively monitored and controlled by the live agent via the agent console. The incoming text messages are processed through automated script execution in concert with the live agent. Outgoing text messages are converted into a synthesized speech stream. The synthesized speech stream is sent via the agent console to the caller.
Systems and methods for structured voice interaction facilitated by data channel
Tue, 18 Jun 2013 08:00:00 EDT
A voice channel connection and a data channel connection are established with a structured voice interaction system. Navigation information for and provided by the structured voice interaction system is received over the data channel connection. The data channel navigation information is coordinated with navigation information provided by the structured voice interaction system over the voice channel connection.
Interactive assistant for managing telephone communications
Tue, 18 Jun 2013 08:00:00 EDT
An interactive assistant for managing telephone communications and services is disclosed. In one of many possible method embodiments, a chat interface is provided to a device associated with an intended recipient of an incoming voice call. A chat message from the intended recipient is received through the chat interface. The chat message is presented to a calling party associated with the incoming call prior to a disposition of the call being determined. In other embodiments, an interface provides controls for individually managing calls on a conference call, without affecting other calls on the conference call.
Image searching device and method, program and program recording medium
Tue, 18 Jun 2013 08:00:00 EDT
An encoded code stream is searched for a frame generally coincident with a specific frame without having to decoding the frame to its original image. The present invention provides an image search device that searches an object encoded code stream formed by compression coding of a plurality of frames for a frame generally coincident with a specific one, which includes a decoder for making entropy decoding of the object encoded code stream to generate quantization coefficients of each frame, a matching unit for making matching between the quantization coefficients of the specific frame and those of each frame which are generated by the decoder and correspond in sample position to those of the specific frame, and a judging unit for judging, based on the result of matching, whether the frame is generally coincident with the specific one.
Handheld electronic device providing confirmation of input, and associated method
Tue, 18 Jun 2013 08:00:00 EDT
A letter confirmation system is provided on a handheld electronic device. The letter confirmation provides highlighting of various letters that have been input to the handheld electronic device during a string of member input actuations. The letter confirmation system can additionally provide predictive linguistic elements that would be appropriate next inputs. Various types of highlights can be provided in various combinations to provide various indications to a user.

Language Selection

linkedin        twitter

Company Search


Advanced Search   


Services


logo inttranet


logo inttrastats


logo Inttranews


logo Inttrasearch


logo linguists of the year