Patents

 
Embedded system with web-based user interface, firmware structure thereof and method for providing information thereof
Tue, 29 Jul 2014 08:00:00 EDT
Embedded system with web-based user interface, firmware structure thereof, and method for providing information thereof are provided. The firmware of the embedded system includes an execution part and a presentation part, which are separated, wherein the execution part includes a web service program and a program interface while the presentation part includes a web page which includes a request for dynamic content associated with the program interface so as to obtain corresponding dynamic content. When the system takes the presentation part as the static content of its web-based user interface, the web service program, in response to a static content request, reads the web page from the non-volatile memory and outputs it to a device, and in response to the dynamic content request, invokes the program interface to obtain the corresponding dynamic content and output it to the device.
Method and apparatus for real-time multidimensional adaptation of an audio coding system
Tue, 29 Jul 2014 08:00:00 EDT
An adaptive controller for a configurable audio coding system including a fuzzy logic controller modified to use reinforcement learning to create an intelligent control system. With no knowledge of the external system into which it is placed the audio coding system, under the control of the adaptive controller, is capable of adapting its coding configuration to achieve user set performance goals.
Method and system for generating grammar rules
Tue, 29 Jul 2014 08:00:00 EDT
An information retrieval system including a natural language parser (3) for parsing documents of a document space (1) to identify key terms of each document based on linguistic structure, and for parsing a search query to determine the search term, a feature extractor (4) for determining an importance score for terms of the document space based on distribution of the terms in the document space, an index term generator (5) for generating index terms using the key terms identified by the parser and the extractor and having an importance score above a threshold level, and a query clarifier (16) for selecting from the index terms, on the basis of the search term, index terms for selecting a document from the document space. A speech recognition engine (12) generates the query, and a bi-gram language module (6) generates grammar rules for the speech recognition engine using the index terms.
Method and apparatus using historical influence for success attribution in network site activity
Tue, 29 Jul 2014 08:00:00 EDT
User actions prior to, and associated with, an online success event may be considered participating actions that may have influenced the user toward the success event. A previously measured success influence metric for the participating actions may be used to determine a historical influence score for each participating action leading up to the success event. Each participating action may be assigned a current success influence score based on that event's historical influence score as a percentage of a combined historical influence score of all the participating actions for the success event. Additionally, the assigned current success influence scores may be combined with the previously measured success influence metric for use as historical influence scores for success attribution regarding further instances of the success event.
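The proportional split described in this abstract can be illustrated with a minimal sketch. The function names, the dictionary layout, and the choice to combine scores by simple addition are assumptions made for illustration; the abstract does not specify how the current scores are combined with the previous metric.

```python
def attribute_success(historical_scores):
    """Split one success event's credit across its participating actions.

    Each action receives its historical influence score as a fraction of the
    combined historical influence score of all participating actions.
    """
    if not historical_scores:
        return {}
    total = sum(historical_scores.values())
    if total == 0:
        # Assumption: with no history at all, share the credit equally.
        share = 1.0 / len(historical_scores)
        return {action: share for action in historical_scores}
    return {action: score / total for action, score in historical_scores.items()}


def update_metric(previous_metric, current_scores):
    """Fold the current scores into the previously measured metric.

    Assumption: "combined" is modeled here as addition.
    """
    updated = dict(previous_metric)
    for action, score in current_scores.items():
        updated[action] = updated.get(action, 0.0) + score
    return updated


history = {"search_ad": 3.0, "email_click": 1.0}
current = attribute_success(history)      # {'search_ad': 0.75, 'email_click': 0.25}
history = update_metric(history, current)  # used for the next success event
```

Under these assumptions, an action that has historically preceded many successes receives a larger share of each new success, and the updated metric feeds the next attribution round.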
Voice activated cockpit management system for flight procedures and control of aircraft systems and flight management systems of single and multi-engine aircraft
Tue, 29 Jul 2014 08:00:00 EDT
A voice activated cockpit management system for flight procedures and control of aircraft systems and flight management systems of single and multi-engine aircraft, including a means to recognize and communicate commands, and to deploy procedures utilizing a NEXT-GEN voice recognition system. A specific feature of this mode of initiation is the automatic communication of flight procedures (normal or emergency procedures) and control procedures for aircraft systems and flight management systems by the utterance of specific key words, resulting in the automatic transference of these command words to executable procedure audio files. The cockpit management system enables cockpit-specific audio procedures through a wireless Bluetooth connection or wired communication, and generally includes a voice recognition and audio-display system, a mini-PC, a listening device, a microphone, a power source, a pairing system, Bluetooth software and hardware, and a method for voice recognition and audio display of procedures to the pilot.
Method and apparatus for smart voice recognition
Tue, 29 Jul 2014 08:00:00 EDT
A display device with a voice recognition capability may be used to allow a user to speak voice commands for controlling certain features of the display device. As a means for increasing operational efficiency, the display device may utilize a plurality of voice recognition units where each voice recognition unit may be assigned a specific task.
Method for processing the output of a speech recognizer
Tue, 29 Jul 2014 08:00:00 EDT
A method for processing speech, comprising semantically parsing a received natural language speech input with respect to a plurality of predetermined command grammars in an automated speech processing system; determining if the parsed speech input unambiguously corresponds to a command and is sufficiently complete for reliable processing, then processing the command; if the speech input ambiguously corresponds to a single command or is not sufficiently complete for reliable processing, then prompting a user for further speech input to reduce ambiguity or increase completeness, in dependence on a relationship of previously received speech input and at least one command grammar of the plurality of predetermined command grammars, reparsing the further speech input in conjunction with previously parsed speech input, and iterating as necessary. The system also monitors abort, fail or cancel conditions in the speech input.
System and method for auditory captchas
Tue, 29 Jul 2014 08:00:00 EDT
Disclosed herein are systems, methods, and computer readable-media for performing an audible human verification. The method includes determining that a human verification is needed, presenting an audible challenge to a user which exploits a known issue with automatic speech recognition processes, receiving a response to the audible challenge, and verifying that a human provided the response. The known issue with automatic speech recognition processes can be recognition of a non-word, in which case the user can be asked to spell the recognized non-word. The known issue with automatic speech recognition processes can be differentiation of simultaneous input for multiple audio streams. Multiple audio streams contained in the audible challenge can be provided monaurally. Verifying that a human provided the response can include confirming the contents of one of the multiple audio streams. Audible human verification can be performed in combination with visual human verification.
System and method for integrating gesture and sound for controlling device
Tue, 29 Jul 2014 08:00:00 EDT
Disclosed is a system for integrating gestures and sounds including: a gesture recognition unit that extracts gesture feature information corresponding to user commands from image information and acquires gesture recognition information from the gesture feature information; a background recognition unit acquiring background sound information using the predetermined background sound model from the sound information; a sound recognition unit that extracts the sound feature information corresponding to user commands from the sound information and extracts the sound feature information based on the background sound information and acquires the sound recognition information from the sound feature information; and an integration unit that generates integration information by integrating the gesture recognition information and the sound recognition information.
Systems and methods document narration
Tue, 29 Jul 2014 08:00:00 EDT
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. In some aspects, systems and methods described herein can include receiving a user-based selection of a first portion of words in a document where the document has a pre-associated first voice model and overwriting the association of the first voice model, by the one or more computers, with a second voice model for the first portion of words.
Method for segmenting utterances by using partner's response
Tue, 29 Jul 2014 08:00:00 EDT
An apparatus, method and program for dividing a conversational dialog into utterances. The apparatus includes: a computer processor; a word database for storing spellings and pronunciations of words; a grammar database for storing syntactic rules on words; a pause detecting section which detects a pause location in a channel making a main speech among conversational dialogs inputted in at least two channels; an acknowledgement detecting section which detects an acknowledgement location in a channel not making the main speech; a boundary-candidate extracting section which extracts boundary candidates in the main speech, by extracting pauses existing within a predetermined range before and after a base point that is the acknowledgement location; and a recognizing unit which outputs a word string of the main speech segmented by one of the extracted boundary candidates after dividing the segmented speech into optimal utterances in reference to the word database and grammar database.
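The boundary-candidate extraction step can be sketched as a simple window search. This is an illustrative sketch only: the function name, the use of seconds as the time unit, and the symmetric window are assumptions, and the later step of choosing the best boundary with the word and grammar databases is not modeled.

```python
def boundary_candidates(pause_times, ack_times, window):
    """For each acknowledgement in the listener's channel, collect the pauses
    in the main speaker's channel that fall within `window` seconds before or
    after it. Times are in seconds (an assumption for this sketch)."""
    return {
        ack: [p for p in pause_times if ack - window <= p <= ack + window]
        for ack in ack_times
    }


# Pauses detected in the main speech, and one "uh-huh" from the other channel.
pauses = [1.0, 2.5, 3.5, 5.0]
acks = [3.0]
candidates = boundary_candidates(pauses, acks, window=1.0)
# candidates == {3.0: [2.5, 3.5]}
```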
Systems, methods, and media for determining fraud patterns and creating fraud behavioral models
Tue, 29 Jul 2014 08:00:00 EDT
Systems, methods, and media for analyzing fraud patterns and creating fraud behavioral models are provided herein. In some embodiments, methods for analyzing call data associated with fraudsters may include executing instructions stored in memory to compare the call data to a corpus of fraud data to determine one or more unique fraudsters associated with the call data, associate the call data with one or more unique fraudsters based upon the comparison, generate one or more voiceprints for each of the one or more identified unique fraudsters from the call data, and store the one or more voiceprints in a database.
Confidence measure generation for speech related searching
Tue, 29 Jul 2014 08:00:00 EDT
A method of generating a confidence measure generator is provided for use in a voice search system, the voice search system including voice search components comprising a speech recognition system, a dialog manager and a search system. The method includes selecting voice search features, from a plurality of the voice search components, to be considered by the confidence measure generator in generating a voice search confidence measure. The method includes training a model, using a computer processor, to generate the voice search confidence measure based on selected voice search features.
Image display device for identifying keywords from a voice of a viewer and displaying image and keyword
Tue, 29 Jul 2014 08:00:00 EDT
It is an object of the present invention to make an act of viewing an image interactive and further enriched. A microphone 18 inputs a voice signal of a voice uttered by a viewer who is viewing a display image displayed on a display portion 17, and causes the voice signal to be stored in a buffer 19. A voice recognition portion 20 identifies at least one word from the voice uttered by the viewer based on the voice signal, and acquires them as a keyword. A counter 21 calculates the number of incidences of the keyword. A display driver 16 causes information including a keyword having a number of incidences that exceeds a threshold value or information derived from the keyword to be displayed together with the display image displayed on the display portion 17.
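The counting and threshold step in this abstract can be sketched as follows. The function name and the case-insensitive matching are assumptions for illustration; the speech recognition itself is represented only by a list of already recognized words.

```python
from collections import Counter


def keywords_to_display(recognized_words, threshold):
    """Return the keywords whose number of incidences exceeds the threshold,
    in the order they were first heard."""
    # Assumption: keywords are matched case-insensitively.
    counts = Counter(word.lower() for word in recognized_words)
    return [word for word, n in counts.items() if n > threshold]


heard = ["beach", "Beach", "sunset", "beach", "dog", "sunset"]
shown = keywords_to_display(heard, threshold=1)
# shown == ['beach', 'sunset']
```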
Speech signal processing system, speech signal processing method and speech signal processing method program using noise environment and volume of an input speech signal at a time point
Tue, 29 Jul 2014 08:00:00 EDT
A speech signal processing system that includes a speech input unit for inputting a speech signal; an input speech storage unit for storing an input speech signal that is the speech signal inputted through the speech input unit; a characteristic estimation unit for referring to the input speech signal stored in the input speech storage unit, and estimating characteristics of an input speech indicated by the input speech signal, the characteristics including an environmental sound included in the input speech signal; a reference speech output unit for outputting a predetermined speech signal that serves as a reference speech; and a characteristic adding unit for adding the characteristics of the input speech estimated by the characteristic estimation unit to a reference speech signal that is the speech signal output by the reference speech output unit.
Method and apparatus for automatically determining speaker characteristics for speech-directed advertising or other enhancement of speech-controlled devices or services
Tue, 29 Jul 2014 08:00:00 EDT
In addition to conveying primary information, human speech also conveys information concerning the speaker's gender, age, socioeconomic status, accent, language spoken, emotional state, or other personal characteristics, which is referred to as secondary information. Disclosed herein are both the means of automatic discovery and use of such secondary information to direct other aspects of the behavior of a controlled system. One embodiment of the invention comprises an improved method to determine, with high reliability, the gender of an adult speaker. A further embodiment of the invention comprises the use of this information to display a gender-appropriate advertisement to the user of an information retrieval system that uses a cell phone as the input and output device. The invention is not limited to gender and such secondary information can include, for example, any of information concerning the speaker's age, socioeconomic status, accent, language spoken, emotional state, or other personal characteristics.
Time/frequency two dimension post-processing
Tue, 29 Jul 2014 08:00:00 EDT
In accordance with an embodiment, a time-frequency post-processing method of improving perceptual quality of a decoded audio signal includes determining a time-frequency representation (such as filter bank analysis and synthesis) of an audio signal, estimating a time-frequency energy distribution of the audio signal from a time-frequency filter bank, computing a modification gain for each time-frequency representation point to obtain a modified time-frequency representation, and outputting an audio signal from the modified time-frequency representation.
Method and device for decorrelation and upmixing of audio channels
Tue, 29 Jul 2014 08:00:00 EDT
A device (1) for converting a first number (M) of input audio channels into a second, larger number (N) of output audio channels comprises: decorrelation units (3) for decomposing the input audio channels into a set of decorrelated auxiliary channels, at least one upmix unit (4) for combining the decorrelated auxiliary channels into the output audio channels, and at least one pre-processing unit (2) for pre-processing the input audio channels and feeding the pre-processed input audio channels to the decorrelation units (3). The pre-processing unit (2) and the upmix unit (4) are preferably controlled by audio parameters.
Speech processing method and apparatus for deciding emphasized portions of speech, and program therefor
Tue, 29 Jul 2014 08:00:00 EDT
A scheme to judge emphasized speech portions, wherein the judgment is executed by a statistical processing in terms of a set of speech parameters including a fundamental frequency, power and a temporal variation of a dynamic measure and/or their derivatives. The emphasized speech portions are used for clues to summarize an audio content or a video content with a speech.
Apparatus and method for converting an audio signal into a parameterized representation using band pass filters, apparatus and method for modifying a parameterized representation using band pass filter, apparatus and method for synthesizing a parameterized representation of an audio signal using band pass filters
Tue, 29 Jul 2014 08:00:00 EDT
Apparatus for converting an audio signal into a parameterized representation, has a signal analyzer for analyzing a portion of the audio signal to obtain an analysis result; a band pass estimator for estimating information of a plurality of band pass filters based on the analysis result, wherein the information on the plurality of band pass filters has information on a filter shape for the portion of the audio signal, wherein the band width of a band pass filter is different over an audio spectrum and depends on the center frequency of the band pass filter; a modulation estimator for estimating an amplitude modulation or a frequency modulation or a phase modulation for each band of the plurality of band pass filters for the portion of the audio signal using the information on the plurality of band pass filters; and an output interface for transmitting, storing or modifying information on the amplitude modulation, information on the frequency modulation or phase modulation or the information on the plurality of band pass filters for the portion of the audio signal.
Corrective feedback loop for automated speech recognition
Tue, 29 Jul 2014 08:00:00 EDT
Audio data that includes speech may be transcribed using a language model. The transcription may be provided to a user. The user may provide feedback on the transcription, and the language model may be updated based at least in part on the feedback. The feedback may include, for example, an affirmation of the transcription; a disapproval of the transcription; a correction to the transcription; a selection of an alternate transcription result; or any other kind of response.
Information processing apparatus, natural language analysis method, program and recording medium
Tue, 29 Jul 2014 08:00:00 EDT
An apparatus and method for calculating a score of matching a sentence with a query pattern having a dependency structure. The apparatus includes: an input unit acquiring an analysis target sentence, a query pattern and an index value indexing how a linguistic unit in the sentence tends to modify another; and a score calculation unit calculating a matching score indexing the degree of matching of the sentence with the query pattern. The matching score is represented by a function having an index value with which a dependency relation included in the query pattern is associated. The score is calculated by attempting association between a substructure of the query pattern and a range in the sentence and by performing recursive calculation in the substructure and the range while storing partial calculation result of the function in a memory area for reuse.
Behavior-driven multilingual stemming
Tue, 29 Jul 2014 08:00:00 EDT
User behavior data can be used with language-specific rule sets to generate stemming databases useful for such tasks as indexing and search query processing. The terms contained in user queries, as well as user behavior with respect to those queries or results returned for those queries, can be analyzed to determine a relative measure (e.g., relative frequency) of various forms of those terms. When generating a stemming database, language-specific rule sets can be used to determine appropriate stemming rules, and where more than one potential rule is identified the user behavior data can be used to select what is likely the appropriate rule, at least for the respective environment. Whitelists or other such components can be used to handle specific or irregular forms that do not follow the general rules or otherwise are exceptions that might not otherwise be processed correctly.
System and method for generating manually designed and automatically optimized spoken dialog systems
Tue, 29 Jul 2014 08:00:00 EDT
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for generating a natural language spoken dialog system. The method includes nominating a set of allowed dialog actions and a set of contextual features at each turn in a dialog, and selecting an optimal action from the set of nominated allowed dialog actions using a machine learning algorithm. The method includes generating a response based on the selected optimal action at each turn in the dialog. The set of manually nominated allowed dialog actions can incorporate a set of business rules. Prompt wordings in the generated natural language spoken dialog system can be tailored to a current context while following the set of business rules. A compression label can represent at least one of the manually nominated allowed dialog actions.
Adaptive multimodal communication assist system
Tue, 29 Jul 2014 08:00:00 EDT
A computer implemented method and system for assisting a user to learn and/or communicate in a visual communication language in one or more modes is provided. The multimodal communication assist application, provided on a user's computing device, determines the user's characteristic information based on one or more selected multimodal communication mappers. The multimodal communication assist application determines a delay factor based on the characteristic information. The multimodal communication assist application captures a modal input in one of the modes from the user via an interactive interface based on the delay factor and the characteristic information. The multimodal communication assist application processes and transforms the captured modal input in one of the modes into a modal output in another one or more of the modes and renders the modal output to the user via the interactive interface. The multimodal communication assist application generates learning components and testing components for the user.
Voice based addressing of messages
Tue, 29 Jul 2014 08:00:00 EDT
A system for transmitting messages from a caller location to a receiver location using a plurality of computers each coupled to another through a network such as the Internet. The system also has a plurality of access devices, which are coupled to the network through a telecommunication line. These access devices include computers, workstations, and the like. Each access device includes a voice conversion board for converting a voice message from a telephone device into digital data for transmission through the network.
Audio book editing method and apparatus providing the integration of images into the text
Tue, 29 Jul 2014 08:00:00 EDT
A method and apparatus for recording and editing audio books which indexes the text and audio recording to one another. This allows for ease in locating a portion of the audio recording corresponding to a portion of text. The audio portion can then be edited. Images can be integrated into the text. These images can be included in the completed file for multimedia presentation. The indexing can also be used for ease in placing various takes in the correct sequential order in the master recording. The original audio recording is maintained unchanged, while changes from editing are contained in a separate file. The method and apparatus can also be used to generate compressed audio and text files for ease of forwarding via e-mail.
Accurate fast forward rate when performing trick play with variable distance between frames
Tue, 29 Jul 2014 08:00:00 EDT
The present invention is directed to system(s), method(s), and apparatus for accurate fast forward rate when performing trick play with variable distance between frames. In one embodiment, there is presented a circuit for providing a fast forward video sequence. The circuit comprises a system time clock for providing a time reference, said time reference incremented at a predetermined fast forward rate; a comparator for comparing the time reference with timing information associated with a picture; and a controller for determining whether to display the picture based at least in part on the comparison between the timing information and the time reference.
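The clock-and-comparator idea in this abstract can be simulated in a few lines. This is a rough sketch under stated assumptions, not the patented circuit: the function name, the use of a sorted list of picture timestamps, and the rule of skipping pictures that are already late are all illustrative choices.

```python
def select_pictures(timestamps, rate, frame_period):
    """Simulate fast forward over pictures with uneven spacing.

    timestamps   -- sorted presentation times of the available pictures
    rate         -- fast forward rate (e.g. 4 for 4x)
    frame_period -- display interval, in the same units as the timestamps
    """
    if rate * frame_period <= 0:
        raise ValueError("rate and frame_period must be positive")
    shown = []
    stc = timestamps[0] if timestamps else 0  # system time clock (time reference)
    i = 0
    while i < len(timestamps):
        # Skip pictures that a later, already-due picture supersedes.
        while i + 1 < len(timestamps) and timestamps[i + 1] <= stc:
            i += 1
        # Comparator: display the picture only once the clock has reached it.
        if timestamps[i] <= stc:
            shown.append(timestamps[i])
            i += 1
        # The time reference advances at the fast forward rate.
        stc += rate * frame_period
    return shown


shown = select_pictures([0, 10, 15, 40, 45, 100], rate=4, frame_period=5)
# shown == [0, 15, 40, 45, 100]
```

Because the display decision depends on the comparison with the clock rather than on a fixed "every Nth picture" rule, the effective playback speed stays close to the requested rate even when the distance between pictures varies.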
Audio equipment and a signal processing method thereof
Tue, 29 Jul 2014 08:00:00 EDT
The present invention relates to audio equipment (1) and a signal processing method thereof, wherein the mismatches that can occur at the packet boundaries in an audio signal, which is processed in packets by utilizing the missing fundamental phenomenon and which is formed again by concatenating these packets, are eliminated.
