...A Planet of Partners™

  • Increase font size
  • Default font size
  • Decrease font size

Patents

 
Methods and apparatuses for deferred object customization
Tue, 18 Nov 2014 08:00:00 EST
A method and apparatus to record one or more customization messages in a storage are described. Each customization message may include one or more predicates specifying applicability of the customization message for a plurality of objects. An operation on the objects may be performed to generate a configuration of a device in response to receiving an event. The configuration may include the objects. Each customization message may be selectively applied to the objects in the configuration to customize the configuration. An object may be updated via the customization message if the predicates match the object in the configuration. The device may be configured via the customized configuration.
Spell-check for a keyboard system with automatic correction
Tue, 18 Nov 2014 08:00:00 EST
User input is received, specifying a continuous traced path across a keyboard presented on a touch sensitive display. An input sequence is resolved, including traced keys and auxiliary keys proximate to the traced keys by prescribed criteria. For each of one or more candidate entries of a prescribed vocabulary, a set-edit-distance metric is computed between said input sequence and the candidate entry. Various rules specify when penalties are imposed, or not, in computing the set-edit-distance metric. Candidate entries are ranked and displayed according to the computed metric.
Call completion via instant communications client
Tue, 18 Nov 2014 08:00:00 EST
A system is disclosed for achieving completion of a telephone call by way of an instant communications client.
Systems and methods for identifying new words from a meta tag
Tue, 18 Nov 2014 08:00:00 EST
A computer-implemented method to identify new words from a meta tag is described. A Hyper-Text Markup Language (HTML) page is analyzed to identify a meta tag associated with the HTML page. At least one separator included in content of the meta tag is identified. The content of the meta tag is divided using the identified separator. A portion of the divided content is compared to content stored in a database. Upon determining that the portion of the divided content does not exist in the database, the portion of the divided content is added to the database.
Method and apparatus for accessing an audio file from a collection of audio files using tonal matching
Tue, 18 Nov 2014 08:00:00 EST
There is provided a method for accessing at least one audio file from a collection comprising more than one audio file stored within or accessible with an electronic device. The method includes generating one index comprising of information entries obtained from each of the more than one audio file in the collection, with each audio file in the collection information being linked to at least one information entry; receiving an input during an input reception mode; converting the input into a digital signal using an analog-digital converter; analyzing the digital signal using either frequency spectrum analysis or melody contour analysis into discrete portions; and comparing the discrete portions with the entries in the index. It is advantageous that the audio file is accessed when the discrete portions substantially coincide with at least one of the information entries in the index. The discrete portions may be musical notes, melody contours or waveforms. The at least one information entry may also be musical notes, melody contours or waveforms. An apparatus employing the aforementioned method is also disclosed.
Signal clipping protection using pre-existing audio gain metadata
Tue, 18 Nov 2014 08:00:00 EST
The application describes a method and an apparatus to prevent clipping of an audio signal when protection against signal clipping by received audio metadata is not guaranteed. The method may be used to prevent clipping for the case of downmixing a multichannel signal to a stereo audio signal. According to the method, it is determined whether first gain values (4) based on received audio metadata are sufficient for protection against clipping of the audio signal. The audio metadata is embedded in a first audio stream (1). In case a first gain value (4) is not sufficient for protection, the respective first gain value (4) is replaced with a gain value sufficient for protection against clipping of the audio signal. Preferably, in case no metadata related to dynamic range control is present in the first audio stream (1), the method may add gain values sufficient for protection against signal clipping.
Audio encoder/decoder with switching between first and second encoders/decoders using first and second framing rules
Tue, 18 Nov 2014 08:00:00 EST
An audio encoder for encoding audio samples has a first time domain aliasing introducing encoder configured to decode audio samples in a first encoding domain and having a first framing rule, a start window and a stop window. The audio encoder further has a second encoder configured to encode samples in a second encoding domain and having a predetermined frame size number of audio samples, and a coding warm-up period number of audio samples, the second encoder having a different second framing rule, a frame of the second encoder being an encoded representation of a number of successive audio samples that is equal to the predetermined frame size number of audio samples. The audio encoder further has a controller switching from the first to the second encoder and for modifying the second framing rule or for modifying the start or the stop window of the first encoder.
Systems, methods, and apparatus for gain factor smoothing
Tue, 18 Nov 2014 08:00:00 EST
In one embodiment, a method of signal processing includes calculating an envelope of a first signal that is based on a low-frequency portion of a speech signal, calculating an envelope of a second signal that is based on a high-frequency portion of the speech signal, and calculating a first plurality of gain factor values according to a time-varying relation between the envelopes of the first and second signals. The method includes, based on the first plurality of gain factor values, calculating a plurality of smoothed gain factor values. In one example, each of the plurality of smoothed gain factor values is based on a weighted sum of at least one of the first plurality of gain factor values and at least one smoothed gain factor value. In another example, the sum is adaptively weighted based on at least one distance among the plurality of gain factor values.
Quality assessment of text derived from an audio signal
Tue, 18 Nov 2014 08:00:00 EST
According to example configurations, a translation quality assessment application (e.g., system) receives a set of text derived from a translation of an audio signal. The translation quality assessment application aligns and compares the received set of text to reference text to identify occurrence of speech-to-text translation errors in the set of text. For each of the errors, the translation quality assessment application identifies a type associated with the error and retrieves an appropriate weighted coefficient. For example, the translation quality assessment application produces a respective weighted value for each error based on the weighted coefficients. A magnitude of each of the weighted values varies depending on a type of the error that occurred during the translation. The translation quality assessment application utilizes the weighted values to generate a metric indicating a level of quality of the set of captioned text with respect to the set of reference text.
Service orchestration for intelligent automated assistant
Tue, 18 Nov 2014 08:00:00 EST
Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A user request is received, the user request including at least a speech input received from the user. The user request is processed to obtain a representation of user intent, where the representation of user intent associates the user request with a task flow operationalizing a requested task, and the task flow is operable to invoke a plurality of services each supporting functions according to a respective plurality of service parameters. Based on the representation of user intent, one or more relevant task parameters are identified from a plurality of task parameters of the task flow. A subset of the plurality of services are selectively invoked during execution of the task flow, where the selectively invoked subset of the plurality of services support functions according to the identified one or more relevant task parameters.
Quality of user generated audio content in voice applications
Tue, 18 Nov 2014 08:00:00 EST
Methods and arrangements for improving quality of content in voice applications. A specification is provided for acceptable content for a voice application, and user generated audio content for the voice application is inputted. At least one test is applied to the user generated audio content, and it is thereupon determined as to whether the user generated audio content meets the provided specification.
Systems and methods for improving quality of user generated audio content in voice applications
Tue, 18 Nov 2014 08:00:00 EST
Methods and arrangements for improving quality of content in voice applications. A specification is provided for acceptable content for a voice application, and user generated audio content for the voice application is inputted. At least one test is applied to the user generated audio content, and it is thereupon determined as to whether the user generated audio content meets the provided specification.
System and method for combining geographic metadata in automatic speech recognition language and acoustic models
Tue, 18 Nov 2014 08:00:00 EST
Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.
System and method for answering a communication notification
Tue, 18 Nov 2014 08:00:00 EST
Disclosed herein are systems, methods, and computer readable-media for answering a communication notification. The method for answering a communication notification comprises receiving a notification of communication from a user, converting information related to the notification to speech, outputting the information as speech to the user, and receiving from the user an instruction to accept or ignore the incoming communication associated with the notification. In one embodiment, information related to the notification comprises one or more of a telephone number, an area code, a geographic origin of the request, caller id, a voice message, address book information, a text message, an email, a subject line, an importance level, a photograph, a video clip, metadata, an IP address, or a domain name. Another embodiment involves notification assigned an importance level and repeat attempts at notification if it is of high importance.
System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling
Tue, 18 Nov 2014 08:00:00 EST
The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations based on symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential pronunciations in a lexicon. Overgenerating potential pronunciations can include establishing a set of conversion rules for short sequences of letters, converting portions of the symbolic input into a number of possible lexical pronunciation variants based on the set of conversion rules, modeling the possible lexical pronunciation variants in one of a weighted network and a list of phoneme lists, and iteratively retraining the set of conversion rules based on improved pronunciations. Symbolic input can include multiple examples of a same spoken word. Speech data can be labeled explicitly or implicitly and can include words as text and recorded audio.
Electronic device and control method thereof
Tue, 18 Nov 2014 08:00:00 EST
Disclosed are an electronic device and a control method thereof, The electronic device includes a text-to-speech unit which converts a text into an audio signal, an audio output unit which outputs an audio corresponding to the converted audio signal; and a controller which controls the audio output unit to reoutput at least one of audios whose output is not completed if there is at least one audio which is not completely output among a plurality of audios output by the audio output unit.
Combination and federation of local and remote speech recognition
Tue, 18 Nov 2014 08:00:00 EST
Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.
Apparatus and method for analysis of language model changes
Tue, 18 Nov 2014 08:00:00 EST
An apparatus, a method, and a machine-readable medium are provided for characterizing differences between two language models. A group of utterances from each of a group of time domains are examined. One of a significant word change or a significant word class change within the plurality of utterances is determined. A first cluster of utterances including a word or a word class corresponding to the one of the significant word change or the significant word class change is generated from the utterances. A second cluster of utterances not including the word or the word class corresponding to the one of the significant word change or the significant word class change is generated from the utterances.
Method and apparatus of providing semi-automated classifier adaptation for natural language processing
Tue, 18 Nov 2014 08:00:00 EST
Example embodiments of the present invention may include a method that provides transcribing spoken utterances occurring during a call and assigning each of the spoken utterances with a corresponding set of first classifications. The method may also include determining a confidence rating associated with each of the spoken utterances and the assigned set of first classifications, and performing at least one of reclassifying the spoken utterances with new classifications based on at least one additional classification operation, and adding the assigned first classifications and the corresponding plurality of spoken utterances to a training data set.
Front-end processor for speech recognition, and speech recognizing apparatus and method using the same
Tue, 18 Nov 2014 08:00:00 EST
A method of recognizing speech is provided. The method includes the operations of (a) dividing first speech that is input to a speech recognizing apparatus into frames; (b) converting the frames of the first speech into frames of second speech by applying conversion rules to the divided frames, respectively; and (c) recognizing, by the speech recognizing apparatus, the frames of the second speech, wherein (b) comprises converting the frames of the first speech into the frames of the second speech by reflecting at least one frame from among the frames that are previously positioned with respect to a frame of the first speech.
Text data processing apparatus, text data processing method, and recording medium storing text data processing program
Tue, 18 Nov 2014 08:00:00 EST
Provided is to a text data processing apparatus, method and program to add a symbol at an appropriate position. The apparatus according to this embodiment is a text data processing apparatus that executes edit of a symbol in input text, the apparatus including symbol edit determination means 52 that determines whether symbol edit is necessary or not based on a frequency of symbol insertion in a block consisting of a plurality of divided text; and symbol edit position calculation means 53 that calculates likelihood of the symbol edit based on likelihood of symbol insertion for a word and a distance between the symbols and calculates a symbol edit position in the block in accordance with the likelihood of symbol edit or a word in the block when the symbol edit determination means determines that the symbol edit is necessary.
Voice emphasis device
Tue, 18 Nov 2014 08:00:00 EST
There is provided a voice emphasis device with which voice clarity can be improved. This voice emphasis device comprises a correlation component removal filter circuit that removes a correlation component from a voice signal produced at a specific sampling frequency, a multiplication circuit that produces an extracted signal by multiplying a specific gain coefficient by the output of the correlation component removal filter circuit, and an arithmetic circuit that adds or subtracts the extracted signal to or from the voice signal. The correlation component removal filter circuit is a lattice-type filter circuit that combines a feedforward filter and a feedback filter. The feedforward filter and the feedback filter update the filter coefficient at the specific sampling frequency based on the formula ki,j+1=ki,j+α×fi/bi−l.
Method of selecting one microphone from two or more microphones, for a speech processor system such as a “hands-free” telephone device operating in a noisy environment
Tue, 18 Nov 2014 08:00:00 EST
The method comprises the steps of: digitizing sound signals picked up simultaneously by two microphones (N, M); executing a short-term Fourier transform on the signals (xn(t), xm(t)) picked up on the two channels so as to produce a succession of frames in a series of frequency bands; applying an algorithm for calculating a speech-presence confidence index on each channel, in particular a probability a speech that is present; selecting one of the two microphones by applying a decision rule to the successive frames of each of the channels, which rule is a function both of a channel selection criterion and of a speech-presence confidence index; and implementing speech processing on the sound signal picked up by the one microphone that is selected.
Signal processing system, apparatus and method used on the system, and program thereof
Tue, 18 Nov 2014 08:00:00 EST
Provided is a signal separation system including a rendering unit which receives a first and a second input signal and positions the first input signal according to rendering information.
Smoothing method for suppressing fluctuating artifacts during noise reduction
Tue, 18 Nov 2014 08:00:00 EST
A smoothing method for suppressing fluctuating artifacts in the reduction of interference noise includes the following steps: providing short-term spectra for a sequence of signal frames, transforming each short-term spectrum by way of a forward transformation which describes the short-term spectrum using transformation coefficients that represent the short-term spectrum subdivided into its coarse and fine structures; smoothing the transformation coefficients with the respective same coefficient indices by combining at least two successive transformed short-term spectra; and transforming the smoothed transformation coefficients into smoothed short-term spectra by way of a backward transformation.
Noise detecting device and noise detecting method
Tue, 18 Nov 2014 08:00:00 EST
A difference signal calculating unit of a noise detecting device calculates a difference between the amplitudes of a residual signal at each sample timing and a residual signal at the preceding sample timing. A difference signal comparing unit determines whether or not an impulsive noise is present on the basis of the difference signal at the current sample timing, and the difference signal at each sample timing within a predetermined duration from the current sample timing.
Encoding device and encoding method, decoding device and decoding method, and program
Tue, 18 Nov 2014 08:00:00 EST
The present invention relates to an encoding device and an encoding method, a decoding device and a decoding method, and a program that reduce deterioration of sound quality due to encoding of audio signals. An envelope emphasis part (51) emphasizes an envelope (ENV). A noise shaping part (52) divides an emphasized envelope (D) formed by emphasis of the envelope (ENV) by a value larger than 1, and subtracts noise shaping (G) specified by information (NS) from a result of the division. A quantization part (14) sets a result of the subtraction as a quantization bit count (WL), and quantizes a normalized spectrum (S1) formed by normalization of a spectrum (S0) based on the quantization bit count (WL). A multiplexing part (53) multiplexes the information (NS), a quantized spectrum (QS) formed by quantization of the normalized spectrum (S1), and the envelope (ENV). The present invention can be applied to an encoding device encoding audio signals, for example.
Encoding apparatus, decoding apparatus, encoding method, and decoding method for adjusting a spectrum amplitude
Tue, 18 Nov 2014 08:00:00 EST
An encoding device is provided for increasing the quality of an encoded signal, even when encoding music signals. In the encoding device, a Code-Excited Linear Prediction (CELP) encoder generates first encoded data by encoding an input signal, and a CELP decoder generates a decoded signal by decoding the first encoded data input from the CELP encoder. Additionally, a characteristic parameter encoder calculates a parameter that expresses the degree of fluctuation in the ratio of the peak components and the floor components between the spectra of the decoded signal and the input signal.
Method and an apparatus for processing an audio signal
Tue, 18 Nov 2014 08:00:00 EST
An apparatus for processing an audio signal and method thereof are disclosed. The present invention includes receiving, by an audio processing apparatus, an audio signal including a first data of a first block encoded with rectangular coding scheme and a second data of a second block encoded with non-rectangular coding scheme; receiving a compensation signal corresponding to the second block; estimating a prediction of an aliasing part using the first data; and, obtaining a reconstructed signal for the second block based on the second data, the compensation signal and the prediction of aliasing part.
Audio signal loudness determination and modification in the frequency domain
Tue, 18 Nov 2014 08:00:00 EST
Methods of, apparatuses for, and computer readable media having instructions thereon that when executed cause carrying out methods of determining and modifying the perceived loudness of a frequency domain audio signal where the frequency resolution, and corresponding temporal coverage of the frequency domain information is not constant. The frequency (and thus temporal) resolution of the perceived loudness processing is maintained constant at the longest block size. One method includes a block combiner and a loudness modification interpolator.

Language Selection

linkedin        twitter

Company Search


Advanced Search   


Services


logo inttranet


logo inttrastats


logo Inttranews


logo Inttrasearch


logo linguists of the year