...A Planet of Partners™

Patents

 
Secure communication network
Tue, 30 Sep 2014 08:00:00 EDT
The present invention is directed to a secure communication network that enables multi-point to multi-point proxy communication over the network. The network employs a smart server that establishes a secure communication link with each of a plurality of smart client devices installed on local client networks. Each smart client device is in communication with a plurality of agent devices. A plurality of remote devices can access the smart server directly and communicate with agent devices via the secure communication link between the smart server and one of the smart client devices. This communication is enabled without complex configuration of firewall or network parameters by the user.
Monetizing service calls through advertising
Tue, 30 Sep 2014 08:00:00 EDT
A platform service that is used by an application may be linked to an advertising service. A cost of using the platform service may be reduced based on the linking.
Method, medium, and apparatus encoding and/or decoding multichannel audio signals
Tue, 30 Sep 2014 08:00:00 EDT
A method, medium, and apparatus for encoding and/or decoding a multichannel audio signal are provided. The method includes detecting the type of spatial extension data included in an encoding result of an audio signal; if the spatial extension data is data indicating a core audio object type related to a technique of encoding core audio data, detecting the core audio object type; decoding core audio data by using a decoding technique according to the detected core audio object type; if the spatial extension data is residual coding data, decoding the residual coding data by using the decoding technique according to the core audio object type; and up-mixing the decoded core audio data by using the decoded residual coding data. According to the method, the core audio data and residual coding data may be decoded by using an identical decoding technique, thereby reducing complexity at the decoding end.
Coding apparatus, coding method, decoding apparatus, decoding method, and program
Tue, 30 Sep 2014 08:00:00 EDT
A coding apparatus includes a generation unit configured to generate first coding information used for first coding of a first audio signal and second coding information used for second coding of a second audio signal, and generate third coding information used for the first coding of the second audio signal and fourth coding information used for the second coding of a third audio signal; a first coding unit configured to generate first data and second data; a second coding unit configured to generate third data and fourth data by performing the second coding on the third audio signal; and a multiplexing unit configured to generate a stream of the first audio signal and a stream of the second audio signal. The third data is decoded in place of the second data in a case where a loss or an error has occurred in the stream of the second audio signal.
Content customization
Tue, 30 Sep 2014 08:00:00 EDT
A content processing service may analyze an item of original content and identify several objects, attributes of those objects, and relationships between those objects present in the item of original content. The content processing service may also analyze a source graph, such as a social graph or supplemental graph, and identify several objects, attributes of those objects, and relationships between objects present in the source graph. The content processing service may customize the item of original content by selecting an original object and selecting a source graph object. One or more of the attributes or relationships of the selected original object in the item of original content may be replaced by one or more of the attributes or relationships of the selected source graph object. To customize items of audio content, audio content associated with the source graph object may replace audio content associated with the target graph object.
Suggested query constructor for voice actions
Tue, 30 Sep 2014 08:00:00 EDT
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suggesting a phrase for performing a voice action. The methods, systems, and apparatus include actions of classifying a received phrase as an incomplete phrase for performing a voice action based at least on determining that (i) the voice action requires a parameter, and (ii) that no term of the phrase corresponds to the parameter. Additional actions include, in response to classifying the phrase as an incomplete phrase, generating a prompt for entry of the parameter, and in response to the prompt, receiving data indicating an entered parameter. Further actions include providing, for output, a suggested complete phrase for performing the voice action using the entered parameter.
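The classification step described above can be illustrated with a minimal sketch. It is not the patented implementation: the action table, the prefix-matching rule, and the names `REQUIRED_PARAMETER`, `classify`, and `suggest_complete_phrase` are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical voice actions mapped to the parameter each one requires
# (None means the action needs no parameter).
REQUIRED_PARAMETER = {"call": "contact", "navigate to": "destination", "stop": None}


@dataclass
class Classification:
    action: str
    parameter: Optional[str]
    incomplete: bool


def classify(phrase: str) -> Optional[Classification]:
    """Mark a phrase incomplete when (i) its action requires a parameter
    and (ii) no term of the phrase supplies that parameter."""
    text = phrase.strip().lower()
    for action, parameter in REQUIRED_PARAMETER.items():
        if text == action or text.startswith(action + " "):
            remainder = text[len(action):].strip()
            incomplete = parameter is not None and remainder == ""
            return Classification(action, parameter, incomplete)
    return None  # not a known voice action


def suggest_complete_phrase(phrase: str, entered_parameter: str) -> str:
    """Build a suggested complete phrase from the parameter the user entered
    in response to a prompt; complete or unknown phrases are returned as-is."""
    result = classify(phrase)
    if result is None or not result.incomplete:
        return phrase
    return f"{result.action} {entered_parameter}"
```

In this sketch, `classify("Call")` is flagged as incomplete, so a caller would prompt for the missing contact and then pass the entered value to `suggest_complete_phrase`.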
Voice interface to NFC applications
Tue, 30 Sep 2014 08:00:00 EDT
Technologies for transferring Near Field Communications information on a computing device include storing information corresponding to services in a database on the computing device, receiving a voice input corresponding to a name of a requested service, and retrieving the information corresponding to the requested service from the database. Such technologies may also include loading the retrieved information corresponding to the requested service into a Near Field Communications tag emulated by the computing device and transferring the retrieved information to a portable computing device in response to the Near Field Communications tag being touched by a Near Field Communications reader of the portable computing device. The information corresponding to the requested service stored in the database, retrieved from the database, loaded into the Near Field Communications tag, and/or transferred to the portable computing device may include a Universal Resource Identifier and content-specific keywords corresponding to the requested service.
Rule generation
Tue, 30 Sep 2014 08:00:00 EDT
A method for implementing at least one rule for an application is described. The method includes receiving an input rule. Based on the input rule, a program executable code is generated. The generated program executable code can then be associated with the application.
System and method for excerpt creation by designating a text segment using speech
Tue, 30 Sep 2014 08:00:00 EDT
A method includes displaying content on a display of a device, receiving a speech input designating a segment of the content to be excerpted, and transferring the excerpted content to a predetermined location for storage and retrieval.
Automated third party verification system
Tue, 30 Sep 2014 08:00:00 EDT
A third party verification system, method, and computer readable medium comprising a response unit in communication with at least a first party, said response unit configured to automatically elicit information in the form of a dual tone multi-frequency (DTMF) response from the at least first party providing evidence of consent granted by the at least first party to enter into at least one transaction with at least a second party, and a print server in communication with said response unit, said print server configured to receive and store said information to provide a record for third party verification that the at least first party consented to enter into the at least one transaction with the at least one second party.
Systems and methods for responding to natural language speech utterance
Tue, 30 Sep 2014 08:00:00 EDT
Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for one or more users presenting questions or commands across multiple domains. The systems and methods create, store, and use extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech and non-speech communications and presenting the expected results for a particular question or command.
System for tuning synthesized speech
Tue, 30 Sep 2014 08:00:00 EDT
An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are made to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.
Speech recognition apparatus and method
Tue, 30 Sep 2014 08:00:00 EDT
A speech recognition apparatus is provided. The speech recognition apparatus includes a primary speech recognition unit configured to perform speech recognition on input speech and thus to generate word lattice information, a word string generation unit configured to generate one or more word strings based on the word lattice information, a language model score calculation unit configured to calculate bidirectional language model scores of the generated word strings selectively using forward and backward language models for each of words in each of the generated word strings, and a sentence output unit configured to output one or more of the generated word strings with high scores as results of the speech recognition of the input speech based on the calculated bidirectional language model scores.
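As a rough illustration of bidirectional scoring, the sketch below scores each word with either a forward or a backward bigram model and ranks candidate word strings by the total. The abstract does not state how the model is selected per word; taking the larger of the two probabilities is an assumption here, as are the names `bidirectional_score` and `best_word_strings`, the dictionary-based models, and the `floor` value for unseen pairs.

```python
import math


def bidirectional_score(words, forward, backward, floor=1e-6):
    """Sum per-word log-probabilities, choosing for each word the larger of
    the forward probability P(word | previous word) and the backward
    probability P(word | next word). Unseen pairs get the `floor` value."""
    padded = ["<s>"] + list(words) + ["</s>"]
    total = 0.0
    for i in range(1, len(padded) - 1):
        p_forward = forward.get((padded[i - 1], padded[i]), floor)
        p_backward = backward.get((padded[i + 1], padded[i]), floor)
        total += math.log(max(p_forward, p_backward))
    return total


def best_word_strings(candidates, forward, backward, n=1):
    """Return the n candidate word strings with the highest scores."""
    ranked = sorted(
        candidates,
        key=lambda words: bidirectional_score(words, forward, backward),
        reverse=True,
    )
    return ranked[:n]
```

Both models are plain dictionaries keyed by `(context_word, word)`, so a toy lattice can be scored without any trained model.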
Method and apparatus for speech recognition
Tue, 30 Sep 2014 08:00:00 EDT
A computer-implemented method, apparatus, and computer program product are disclosed. The computer-implemented method, performed by a computerized device, comprises: transforming a hidden Markov model to qubits; transforming data into groups of qubits, the data being determined upon the hidden Markov model and features extracted from an audio signal, the data representing a likelihood observation matrix representing likelihood of phoneme and state combinations in an audio signal; applying a quantum search algorithm for finding a maximal value of the qubits; and transforming the maximal value of the qubits into a number, the number representing an entry in a delta array used in speech recognition.
Conference call service with speech processing for heavily accented speakers
Tue, 30 Sep 2014 08:00:00 EDT
Speech recognition processing captures phonemes of words in a spoken speech string and retrieves text of words corresponding to particular combinations of phonemes from a phoneme dictionary. A text-to-speech synthesizer then can produce and substitute a synthesized pronunciation of that word in the speech string. If the speech recognition processing fails to recognize a particular combination of phonemes of a word, as spoken, as may occur when a word is spoken with an accent or when the speaker has a speech impediment, the speaker is prompted to clarify the word by entry, as text, from a keyboard or the like for storage in the phoneme dictionary such that a synthesized pronunciation of the word can be played out when the initially unrecognized spoken word is again encountered in a speech string to improve intelligibility, particularly for conference calls.
System and method of providing machine translation from a source language to a target language
Tue, 30 Sep 2014 08:00:00 EDT
A machine translation method, system for using the method, and computer readable media are disclosed. The method includes the steps of receiving a source language sentence and selecting a set of target language n-grams using a lexical classifier and based on the source language sentence. When selecting the set of target language n-grams, in at least one n-gram, n is greater than 1. The method continues by combining the selected set of target language n-grams as a finite state acceptor (FSA), weighting the FSA with data from the lexical classifier, and generating an n-best list of target sentences from the FSA. As an alternative to using the FSA, N strings may be generated from the n-grams and ranked using a language model. The N strings may be represented by an FSA for efficiency, but this is not necessary.
Realtime acoustic adaptation using stability measures
Tue, 30 Sep 2014 08:00:00 EDT
Methods, systems, and computer programs encoded on a computer storage medium for real-time acoustic adaptation using stability measures are disclosed. The methods include the actions of receiving a transcription of a first portion of a speech session, wherein the transcription of the first portion of the speech session is generated using a speaker adaptation profile. The actions further include receiving a stability measure for a segment of the transcription and determining that the stability measure for the segment satisfies a threshold. Additionally, the actions include triggering an update of the speaker adaptation profile using the segment, or using a portion of speech data that corresponds to the segment. Finally, the actions include receiving a transcription of a second portion of the speech session, wherein the transcription of the second portion of the speech session is generated using the updated speaker adaptation profile.
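The control flow of this adaptation loop can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed method: `run_session`, the `transcribe` and `update_profile` callables, the toy stand-ins, and the 0.8 threshold are all hypothetical.

```python
def run_session(portions, transcribe, update_profile, profile, threshold=0.8):
    """Transcribe each portion of a speech session with the current speaker
    adaptation profile, then update the profile from any segments whose
    stability measure satisfies the threshold, so that later portions are
    transcribed with the updated profile."""
    transcripts = []
    for portion in portions:
        # `transcribe` is assumed to return (text, stability) pairs.
        segments = transcribe(portion, profile)
        transcripts.append(" ".join(text for text, _ in segments))
        stable = [text for text, stability in segments if stability >= threshold]
        if stable:
            profile = update_profile(profile, stable)
    return transcripts, profile


# Toy stand-ins so the sketch runs without a real recognizer.
def toy_transcribe(portion, profile):
    # Each portion is already a list of (text, stability) pairs.
    return list(portion)


def toy_update_profile(profile, stable_segments):
    # The toy "profile" just counts how many stable segments it has seen.
    return profile + len(stable_segments)
```

A real system would replace the two toy functions with calls into a recognizer and an acoustic adaptation routine; only the thresholding and ordering logic is shown here.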
Systems and methods for segmenting and/or classifying an audio signal from transformed audio information
Tue, 30 Sep 2014 08:00:00 EDT
A system and method may be provided to segment and/or classify an audio signal from transformed audio information. Transformed audio information representing a sound may be obtained. The transformed audio information may specify magnitude of a coefficient related to energy amplitude as a function of frequency for the audio signal and time. Features associated with the audio signal may be obtained from the transformed audio information. Individual ones of the features may be associated with a feature score relative to a predetermined speaker model. An aggregate score may be obtained based on the feature scores according to a weighting scheme. The weighting scheme may be associated with a noise and/or SNR estimation. The aggregate score may be used for segmentation to identify portions of the audio signal containing speech of one or more different speakers. For classification, the aggregate score may be used to determine a likely speaker model to identify a source of the sound in the audio signal.
Method and system for segmenting phonemes from voice signals
Tue, 30 Sep 2014 08:00:00 EDT
A method and a system for segmenting phonemes from voice signals are provided. In the method, a histogram showing a peak distribution corresponding to an order is formed by using a high order concept, and a boundary indicating a starting point and an ending point of each phoneme is determined by calculating a peak statistic based on the histogram. The phoneme segmentation method can remarkably reduce the amount of calculation and can be applied to sound signal systems that perform sound coding, sound recognition, sound synthesizing, sound reinforcement, etc.
Method and system for assisting input of text information from voice data
Tue, 30 Sep 2014 08:00:00 EDT
A method may include converting voice data into text data and tagging at least one portion of the text data with at least one tag, the at least one tag indicating that the at least one portion of the text data includes a particular type of data. The method may also include displaying the text data on a display such that the at least one portion of text data is displayed with at least one associated graphical element indicating that the at least one portion of text data is associated with the at least one tag. The at least one portion of text data may be a selectable item on the display allowing a user interfacing with the display to select the at least one portion of text data in order to apply the at least one portion of text data to an application.
Training of voice-controlled television navigation
Tue, 30 Sep 2014 08:00:00 EDT
Systems and methods for training voice activation control of electronic equipment are disclosed. One example method includes receiving a selection corresponding to at least one command used to control the electronic equipment. The method further includes instructing a user to speak, and responsive to the instruction, receiving a digitized speech stream. The method further includes segmenting the speech stream into speech segments, storing at least one of the speech segments as an entry in a dictionary, and associating the dictionary entry with the selected command.
Spoken mobile engine for analyzing a multimedia data stream
Tue, 30 Sep 2014 08:00:00 EDT
Systems and methods are disclosed to operate a mobile device by capturing user input, transmitting the user input over a wireless channel to an engine, analyzing at the engine a music clip or video in a multimedia stream, and sending an analysis wirelessly to the mobile device.
Speech encoding utilizing independent manipulation of signal and noise spectrum
Tue, 30 Sep 2014 08:00:00 EDT
Some embodiments describe methods, programs, and systems for speech encoding. Among other things, a received input signal representing a property of speech is quantized to generate a quantized output signal. Prior to the quantization, a version of the input signal is supplied to a first noise shaping filter having a first set of filter coefficients effective to generate a first filtered signal. Following the quantization, the quantized output signal is supplied to a second noise shaping filter having a second set of filter coefficients, thus generating a second filtered signal. A noise shaping operation is performed to control a frequency spectrum of a noise effect in the quantized output signal caused by the quantization, wherein the noise shaping operation is based on both the first and second filtered signals. Finally, the quantized output signal is transmitted in an encoded signal.
Apparatus and method for isolating multi-channel sound source
Tue, 30 Sep 2014 08:00:00 EDT
In an apparatus and method for isolating a multi-channel sound source, the probability of speaker presence, calculated when estimating the noise of a sound source signal separated by GSS, is used to calculate a gain. Thus, it is not necessary to additionally calculate the probability of speaker presence when calculating the gain; the speaker's voice signal can be easily and quickly separated from peripheral noise, and reverb and distortion are minimized. As such, if several interference sound sources, each of which has directivity, and speakers are simultaneously present in a room with high reverb, a plurality of sound sources generated from several microphones can be separated from one another with low sound quality distortion, and the reverb can also be removed.
System enhancement of speech signals
Tue, 30 Sep 2014 08:00:00 EDT
A system enhances speech by detecting a speaker's utterance through a first microphone positioned a first distance from a source of interference. A second microphone may detect the speaker's utterance at a different position. A monitoring device may estimate the power level of a first microphone signal. A synthesizer may synthesize part of the first microphone signal by processing the second microphone signal. The synthesis may occur when the power level is below a predetermined level.
Encoder, decoder and methods thereof
Tue, 30 Sep 2014 08:00:00 EDT
An encoder is provided whereby the bit efficiency of encoding can be improved, thereby improving the quality of the decoded signals. In the encoder: a time-frequency converting unit (101) converts signals, which are to be encoded, to frequency domain signals; an adaptive spectrum formation encoding unit (102) determines an effective range in the frequency band of the frequency domain signals; and a pulse vector encoding unit (103) pulse vector encodes only the signal components within the effective range.
Method, device and system for voice encoding/decoding
Tue, 30 Sep 2014 08:00:00 EDT
A method, a device, and a system for voice encoding/decoding are disclosed in the present invention. The method includes: assembling an input pulse code modulation signal into one signal according to a designated time slot and assembly manner; and encoding the assembled signal according to a designated encoding manner to output an encoded voice signal. Because the process of assembling or splitting the signal may be implemented through software, voice encoding/decoding with a 7 K spectrum may be achieved in the current network without replacing the network's hardware.
Updating dictionary during application installation
Tue, 30 Sep 2014 08:00:00 EDT
Methods, computer program products and systems for updating a dictionary on a computer are provided including identifying a set of words included with an application program being installed on the computer and identifying one or more installed dictionaries within the computer. The dictionaries that are identified, typically by searching the computer for installed dictionaries, may then be updated by adding one or more words of the set to at least one of the installed dictionaries. Optionally, one or more installed dictionaries may be identified by the user responding to a query asking the user to identify dictionaries or applications that contain dictionaries installed on the computer. An application programming interface may add the words to the dictionaries by associating the installed dictionaries with one or more installed application programs and identifying an API that is specific for the application and using the API to update the dictionaries.
Mobile systems and methods of supporting natural language human-machine interactions
Tue, 30 Sep 2014 08:00:00 EDT
A mobile system is provided that includes speech-based and non-speech-based interfaces for telematics applications. The mobile system identifies and uses context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for users that submit requests and/or commands in multiple domains. The invention creates, stores, and uses extensive personal profile information for each user, thereby improving the reliability of determining the context and presenting the expected results for a particular question or command. The invention may organize domain specific behavior and information into agents that are distributable or updateable over a wide area network.
Method and system for natural language dictionary generation
Tue, 30 Sep 2014 08:00:00 EDT
A method and computer system for analyzing a text corpus in a natural language are provided. An initial morphological description having word inflection rules for various groups of words in the natural language is created by a linguist. A plurality of text corpuses are analyzed to obtain information on the occurrence of a plurality of word forms for each word token in each text corpus. A morphological dictionary is generated that contains information about each base form and word inflection rules for each word token with a verified hypothesis.
