Patents

 
Systems and methods for rendering text onto moving image content
Tue, 14 Oct 2014 08:00:00 EDT
A method for rendering text onto moving image content. The method comprises receiving a request to translate dialog associated with moving image content, transmitting an interface, transmitting a time-stamped transcription, and receiving a translation of the dialog.
Presentation of an adaptive avatar
Tue, 14 Oct 2014 08:00:00 EDT
A system that incorporates teachings of the present disclosure may include, for example, an avatar engine having a controller to retrieve a user profile of a user, present the user an avatar having characteristics that correlate to the user profile, detect one or more responses of the user during a communication exchange between the avatar and the user, establish a communication session with a language translation system responsive to identifying from the one or more responses a need to engage in language translations, transmit to the language translation system content in a language format other than a language understood by the user, receive from the language translation system a translation of the content in the language understood by the user, and present the user an adaptation of the avatar that presents the translated content in the user's language. Other embodiments are disclosed.
Extensible input method editor dictionary
Tue, 14 Oct 2014 08:00:00 EDT
An extensible reading system is described that provides a method of extending the readings supported by an IME application without updating the entire application. The extensible reading system separates the IME reading dictionary from the IME application, so that the user can update or supplement the dictionary with new readings without modifying the IME application. The extensible reading system receives custom readings from a user that include a sequence of keyboard characters and a selection of a language character that is to be inserted into a document when a user inputs the sequence of keyboard characters. Thus, the extensible reading system allows the user to update the readings for mapping keyboard characters to language characters much more frequently.
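Illustrative sketch (not taken from the patent): one way to picture the separation described above is a reading dictionary object that lives apart from the IME logic and accepts user-supplied readings at run time. The class names `ReadingDictionary` and `InputMethodEditor`, and the sample reading, are hypothetical.

```python
class ReadingDictionary:
    """Hypothetical store of readings, kept separate from the IME itself."""

    def __init__(self):
        # Maps a sequence of keyboard characters to candidate language characters.
        self._readings = {}

    def add_reading(self, keys, character):
        """Register a custom reading supplied by the user."""
        candidates = self._readings.setdefault(keys, [])
        if character not in candidates:
            candidates.append(character)

    def lookup(self, keys):
        """Return a copy of the candidates for a key sequence (may be empty)."""
        return list(self._readings.get(keys, []))


class InputMethodEditor:
    """Hypothetical IME that only consults the dictionary it is given."""

    def __init__(self, dictionary):
        self.dictionary = dictionary

    def convert(self, keys):
        # Fall back to the raw keystrokes when no reading is known.
        candidates = self.dictionary.lookup(keys)
        return candidates[0] if candidates else keys


dictionary = ReadingDictionary()
ime = InputMethodEditor(dictionary)
before = ime.convert("ni")        # no reading yet, keystrokes pass through
dictionary.add_reading("ni", "你")  # user adds a reading; the IME code is untouched
after = ime.convert("ni")
```

Because the editor holds only a reference to the dictionary, new readings take effect without changing or redeploying the editor class, which is the property the abstract emphasizes.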
Systems and methods for providing information discovery and retrieval
Tue, 14 Oct 2014 08:00:00 EDT
This invention relates generally to software and computers, and more specifically, to systems and methods for providing information discovery and retrieval. In one embodiment, the invention includes a system for providing information discovery and retrieval, the system including a processor module, the processor module configurable to perform the steps of receiving an information request from a consumer device over a communications network; decoding the information request; discovering information using the decoded information request; preparing instructions for accessing the information; and communicating the prepared instructions to the consumer device, wherein the consumer device is configurable to retrieve the information for presentation using the prepared instructions.
System and method for creating, managing, and publishing audio microposts
Tue, 14 Oct 2014 08:00:00 EDT
A system and method for creating, managing, and publishing audio microposts is provided. An audio micropost comprises a short audio segment recorded and/or captured based on voice, speech, and/or other sound, which may be shared with and/or published to subscribers and/or other users. The system may enable creating a discussion and playlist based on the audio microposts. The discussion may be generated by identifying and/or selecting an audio micropost that may pose a question and/or topic for a discussion and/or debate. The system may further enable granting the ability to participate in the discussion to a selected group of participants. The playlist of audio microposts may be created by adding individual posts into the playlist and/or by using hashtags and/or keywords to search for audio microposts of interest.
Relational learning for system imitation
Tue, 14 Oct 2014 08:00:00 EDT
Technologies pertaining to learning a computer-executable imitation system that imitates behavior of an existing computer-executable system are described herein. Behavior of an existing computer-executable system can be monitored through monitoring data input to the existing computer-executable system and data output by the existing computer-executable system responsive to receipt of the input data. An imitation system that imitates the behavior of the existing system can be learned, wherein the imitation system comprises a relational model.
Audio encoding/decoding with aliasing switch for domain transforming of adjacent sub-blocks before and subsequent to windowing
Tue, 14 Oct 2014 08:00:00 EDT
An apparatus for encoding an audio signal includes a windower for windowing a first block of the audio signal using an analysis window having an aliasing portion and a further portion. The apparatus furthermore includes a processor for processing a first sub-block of the audio signal associated with the aliasing portion by transforming the sub-block from a domain into a different domain subsequent to windowing the first sub-block to obtain a processed first sub-block, and for processing a second sub-block of the audio signal associated with the further portion by transforming the second sub-block from the domain into the different domain before windowing the second sub-block to obtain a processed second sub-block. Thus, a critically sampled switch between two coding modes can be obtained.
Encoder, encoding system, and encoding method
Tue, 14 Oct 2014 08:00:00 EDT
An encoding device includes an estimation unit to estimate a decoded signal of a plurality of channels based on a down-mix signal obtained by down-mixing an input signal of the plurality of channels, similarity between the channels of the input signal, and an intensity difference between the channels of the input signal; an analysis unit to analyze a phase of the input signal and a phase of the decoded signal; a calculation unit to calculate phase information based on the phase of the input signal and the phase of the decoded signal; and a coding unit to encode the similarity between the channels of the input signal, the intensity difference between the channels of the input signal, and the phase information.
Speech translation system, first terminal apparatus, speech recognition server, translation server, and speech synthesis server
Tue, 14 Oct 2014 08:00:00 EDT
In conventional network-type speech translation systems, devices or models for recognizing or synthesizing speech cannot be changed in accordance with speakers' attributes, and therefore, accuracy is reduced or inappropriate output occurs in each process of speech recognition, translation, and speech synthesis. Accuracy of each process of speech recognition, translation, or speech synthesis is improved and appropriate output is performed in a network-type speech translation system by, based on speaker attributes, appropriately changing the server to perform speech recognition or the speech recognition model, appropriately changing the translation server to perform translation or the translation model, or appropriately changing the speech synthesis server or speech synthesis model.
Menu hierarchy skipping dialog for directed dialog speech recognition
Tue, 14 Oct 2014 08:00:00 EDT
A method and a processing device for managing an interactive speech recognition system are provided. Whether a voice input relates to expected input, at least partially, of any one of a group of menus different from a current menu is determined. If the voice input relates to the expected input, at least partially, of any one of the group of menus different from the current menu, skipping to the one of the group of menus is performed. The group of menus different from the current menu includes menus at multiple hierarchical levels.
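Illustrative sketch (not taken from the patent): the skipping behavior can be pictured as a lookup across the expected inputs of every menu other than the current one. The menu names, phrases, and the `resolve_menu` function are hypothetical, and simple substring matching stands in for whatever partial-match test a real recognizer would use.

```python
# Hypothetical menus at several hierarchical levels, each with its expected inputs.
MENUS = {
    "main": {"billing", "support"},
    "billing": {"pay bill", "check balance"},
    "support": {"reset password", "talk to agent"},
    "support/password": {"email reset link", "security question"},
}


def resolve_menu(current, utterance):
    """Return the menu to jump to, or the current menu if nothing else matches."""
    text = utterance.lower()
    for name, expected_inputs in MENUS.items():
        if name == current:
            continue  # only menus different from the current one are candidates
        if any(phrase in text for phrase in expected_inputs):
            return name  # skip directly to the matching menu
    return current


jump = resolve_menu("main", "I need to reset password please")
deep_jump = resolve_menu("main", "send me an email reset link")
stay = resolve_menu("main", "billing")
```

Here `jump` lands on the `support` menu and `deep_jump` lands two levels down, without the caller first navigating through the intermediate menus.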
Voice-activated signal generator
Tue, 14 Oct 2014 08:00:00 EDT
A voice-activated signal generator is a device to produce output signals responsive to spoken commands. The device accepts only predetermined commands and responsively generates specific output signals such as a pulse, a series of pulses, a voltage level, or a periodic waveform. The device is suitable for triggering an oscilloscope, or controlling a circuit under test, or activating another instrument. The invention also enables safely controlling a hazardous system such as a high voltage system, hands-free and with precise timing determined by the user. Also disclosed are fast, compact, robust algorithms for analyzing spoken commands, and particularly for detecting voiced and unvoiced sound, and for identifying commands by comparing the order of sound intervals in the spoken command to templates that represent the predetermined commands. The device may have one output or multiple outputs in parallel, all controlled by voice commands with precision output timing.
Speech-enabled content navigation and control of a distributed multimodal browser
Tue, 14 Oct 2014 08:00:00 EDT
Speech-enabled content navigation and control of a distributed multimodal browser is disclosed, the browser providing an execution environment for a multimodal application, the browser including a graphical user agent (‘GUA’) and a voice user agent (‘VUA’), the GUA operating on a multimodal device, the VUA operating on a voice server, that includes: transmitting, by the GUA, a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command; receiving, by the GUA, a voice utterance from a user, the voice utterance specifying a particular voice command; transmitting, by the GUA, the voice utterance to the VUA for speech recognition by the VUA; receiving, by the GUA, an event message from the VUA, the event message specifying a particular event corresponding to the particular voice command; and controlling, by the GUA, the browser in dependence upon the particular event.
Multisensory speech detection
Tue, 14 Oct 2014 08:00:00 EDT
A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
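Illustrative sketch (not taken from the patent): the chain from orientation to operating mode to speech detection parameters can be written as two small lookups. The mode names, angle thresholds, and parameter values below are assumptions chosen only to make the example concrete.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DetectionParams:
    start_energy: float   # hypothetical energy level at which detection begins
    end_silence_ms: int   # hypothetical trailing silence after which detection ends


# Hypothetical operating modes and their detection parameters.
MODE_PARAMS = {
    "handset": DetectionParams(start_energy=0.2, end_silence_ms=800),
    "handheld": DetectionParams(start_energy=0.4, end_silence_ms=500),
    "tabletop": DetectionParams(start_energy=0.6, end_silence_ms=1200),
}


def operating_mode(pitch_degrees):
    """Map a device pitch angle to an operating mode (thresholds are assumed)."""
    if pitch_degrees >= 60:
        return "handset"
    if pitch_degrees >= 20:
        return "handheld"
    return "tabletop"


def params_for_orientation(pitch_degrees):
    """Pick the speech detection parameters for the current orientation."""
    return MODE_PARAMS[operating_mode(pitch_degrees)]


upright = params_for_orientation(75.0)
flat = params_for_orientation(5.0)
```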
Comment recording apparatus, method, program, and storage medium that conduct a voice recognition process on voice data
Tue, 14 Oct 2014 08:00:00 EDT
A comment recording apparatus, including a voice input device and a voice output device for recording and playing back comment voice, includes a voice obtaining unit, a voice recognition unit, a morphological analysis unit, and a display generation unit. The voice obtaining unit obtains comment voice as voice data, and registers the obtained voice data to a voice database for each topic specified by a topic specification device and each comment-delivering participant identified from the voice data. The voice recognition unit conducts a voice recognition process on the voice data to obtain text information. The morphological analysis unit conducts a morphological analysis on the text information, and registers a keyword extracted from words obtained by the morphological analysis unit to a keyword database with topic and comment-delivering participant along with voice. The display generation unit displays the keyword in a matrix while relating the keyword to a topic and a comment-delivering participant.
Speech synthesis and coding methods
Tue, 14 Oct 2014 08:00:00 EDT
The present invention is related to a method for coding an excitation signal of a target speech comprising the steps of: extracting from a set of training normalized residual frames, a set of relevant normalized residual frames, said training residual frames being extracted from a training speech, synchronized on Glottal Closure Instant (GCI), pitch and energy normalized; determining the target excitation signal of the target speech; dividing said target excitation signal into GCI synchronized target frames; determining the local pitch and energy of the GCI synchronized target frames; normalizing the GCI synchronized target frames in both energy and pitch, to obtain target normalized residual frames; determining coefficients of linear combination of said extracted set of relevant normalized residual frames to build synthetic normalized residual frames close to each target normalized residual frame; wherein the coding parameters for each target residual frame comprise the determined coefficients.
Establishing a multimodal advertising personality for a sponsor of a multimodal application
Tue, 14 Oct 2014 08:00:00 EDT
Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.
System and method for pronunciation modeling
Tue, 14 Oct 2014 08:00:00 EDT
Systems, computer-implemented methods, and tangible computer-readable media for generating a pronunciation model. The method includes identifying a generic model of speech composed of phonemes, identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech, labeling the family of interchangeable phonemic alternatives as referring to the same phoneme, and generating a pronunciation model which substitutes each family for each respective phoneme. In one aspect, the generic model of speech is a vocal tract length normalized acoustic model. Interchangeable phonemic alternatives can represent a same phoneme for different dialectal classes. An interchangeable phonemic alternative can include a string of phonemes.
Leveraging back-off grammars for authoring context-free grammars
Tue, 14 Oct 2014 08:00:00 EDT
A system and method of refining context-free grammars (CFGs). The method includes deriving back-off grammar (BOG) rules from an initially developed CFG and utilizing the initial CFG and the derived BOG rules to recognize user utterances. Based on a response of the initial CFG and the derived BOG rules to the user utterances, at least a portion of the derived BOG rules are utilized to modify the initial CFG and thereby produce a refined CFG. The above method can be carried out iteratively, with each new iteration utilizing a refined CFG from preceding iterations.
Contextual speech recognition
Tue, 14 Oct 2014 08:00:00 EDT
A computer-implemented method can include receiving, by a computer system, a request to transcribe spoken input from a user of a computing device, the request including (i) information that characterizes the spoken input, and (ii) context information associated with the user or the computing device. The method can determine, based on the information that characterizes the spoken input, multiple hypotheses that each represent a possible textual transcription of the spoken input. The method can select, based on the context information, one or more of the multiple hypotheses for the spoken input as one or more likely intended hypotheses for the spoken input, and can send the one or more likely intended hypotheses for the spoken input to the computing device. In conjunction with sending the one or more likely intended hypotheses for the spoken input to the computing device, the method can delete the context information.
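Illustrative sketch (not taken from the patent): selecting among transcription hypotheses with context can be pictured as a rescoring step, followed by discarding the context. The `select_hypotheses` function, the `terms` key, the scores, and the boost value are all hypothetical.

```python
def select_hypotheses(hypotheses, context, top_n=1, boost=0.2):
    """Rescore (text, score) hypotheses using context terms, then drop the context.

    `context` is assumed to be a dict with a "terms" list, for example
    names from the user's contact list.
    """
    context_terms = {term.lower() for term in context.get("terms", [])}

    def rescored(item):
        text, score = item
        # Each hypothesis word that appears in the context earns a fixed boost.
        overlap = sum(1 for word in text.lower().split() if word in context_terms)
        return score + boost * overlap

    ranked = sorted(hypotheses, key=rescored, reverse=True)
    selected = [text for text, _ in ranked[:top_n]]
    context.clear()  # mirrors deleting the context information after responding
    return selected


hypotheses = [("call an", 0.55), ("call Ann", 0.50)]
context = {"terms": ["Ann", "Bob"]}
chosen = select_hypotheses(hypotheses, context)
```

Without the context, the acoustically stronger "call an" would win; with a contact named Ann present, the rescoring prefers "call Ann", and the context dict is empty afterward.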
Speech input device, speech recognition system and speech recognition method
Tue, 14 Oct 2014 08:00:00 EDT
A device for speech input includes a speech input unit configured to convert a speech of a user to a speech signal; an angle detection unit configured to detect an angle of the speech input unit; a distance detection unit configured to detect a distance between the speech input unit and the user; and an input switch unit configured to control on and off of the speech input unit based on the angle and the distance.
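Illustrative sketch (not taken from the patent): the input switch unit can be thought of as a predicate over the detected angle and distance. The function name `input_enabled` and every threshold below are assumptions.

```python
def input_enabled(angle_deg, distance_cm,
                  min_angle=30.0, max_angle=90.0, max_distance=15.0):
    """Return True when the speech input unit should be switched on.

    Assumed policy: the unit is on only while it is tilted toward the
    user's mouth and held close enough to the user.
    """
    angle_ok = min_angle <= angle_deg <= max_angle
    distance_ok = distance_cm <= max_distance
    return angle_ok and distance_ok


raised_to_mouth = input_enabled(angle_deg=45.0, distance_cm=8.0)
lying_on_desk = input_enabled(angle_deg=0.0, distance_cm=8.0)
held_far_away = input_enabled(angle_deg=45.0, distance_cm=60.0)
```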
Determining pitch cycle energy and scaling an excitation signal
Tue, 14 Oct 2014 08:00:00 EDT
An electronic device for determining a set of pitch cycle energy parameters is described. The electronic device includes a processor and executable instructions stored in memory. The electronic device obtains a frame, a set of filter coefficients and a residual signal based on the frame and the set of filter coefficients. The electronic device determines a set of peak locations based on the residual signal and segments the residual signal such that each segment includes one peak. The electronic device determines a first set of pitch cycle energy parameters based on a frame region between two consecutive peak locations and maps regions between peaks in the residual signal to regions between peaks in a synthesized excitation signal to produce a mapping. The electronic device determines a second set of pitch cycle energy parameters based on the first set of pitch cycle energy parameters and the mapping.
Adaptive time/frequency-based audio encoding and decoding apparatuses and methods
Tue, 14 Oct 2014 08:00:00 EDT
Adaptive time/frequency-based audio encoding and decoding apparatuses and methods. The encoding apparatus includes a transformation & mode determination unit to divide an input audio signal into a plurality of frequency-domain signals and to select a time-based encoding mode or a frequency-based encoding mode for each respective frequency-domain signal, an encoding unit to encode each frequency-domain signal in the respective encoding mode, and a bitstream output unit to output encoded data, division information, and encoding mode information for each respective frequency-domain signal. In the apparatuses and methods, acoustic characteristics and a voicing model are simultaneously applied to a frame, which is an audio compression processing unit. As a result, a compression method effective for both music and voice can be produced, and the compression method can be used for mobile terminals that require audio compression at a low bit rate.
Dynamic method for emoticon translation
Tue, 14 Oct 2014 08:00:00 EDT
A vehicle communication system is provided and may include at least one communication device that audibly communicates information within the vehicle. A controller may receive a character string from an external device and may determine if the character string represents an emoticon. The controller may translate the character string into a face description if the character string represents an emoticon and may audibly communicate the face description via the at least one communication device.
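Illustrative sketch (not taken from the patent): the translation step amounts to checking whether a character string is a known emoticon and, if so, substituting a face description before the text is spoken. The table contents and the `to_spoken_text` function are hypothetical; a real system would hand the result to a text-to-speech engine.

```python
# Hypothetical table mapping emoticon character strings to face descriptions.
FACE_DESCRIPTIONS = {
    ":)": "smiley face",
    ":(": "sad face",
    ";)": "winking face",
}


def to_spoken_text(message):
    """Replace each whitespace-separated emoticon with its face description."""
    tokens = message.split()
    return " ".join(FACE_DESCRIPTIONS.get(token, token) for token in tokens)


spoken = to_spoken_text("running late :(")
unchanged = to_spoken_text("see you at 5")
```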
Fraud detection using text analysis
Tue, 14 Oct 2014 08:00:00 EDT
In one embodiment, a method executed by at least one processor includes receiving text submitted by a user. The method also includes determining a text score for the received text by comparing a first set of phrases included in the received text to a second set of phrases. The second set of phrases includes phrases from stored text. The stored text includes stored text known to be genuine and stored text known to be fraudulent. The method also includes determining that the received text is fraudulent based on the text score.
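Illustrative sketch (not taken from the patent): one simple way to compare phrase sets is to break each text into word pairs and score the received text by how many of its pairs appear in known fraudulent text versus known genuine text. The scoring formula, the threshold, and the sample texts are assumptions, not the claimed method.

```python
def phrases(text, n=2):
    """Return the set of n-word phrases in a text (word pairs by default)."""
    words = text.lower().split()
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def text_score(received, genuine_texts, fraudulent_texts, n=2):
    """Assumed score: (fraud matches - genuine matches) / phrases in received text."""
    received_phrases = phrases(received, n)
    if not received_phrases:
        return 0.0
    genuine = set().union(*(phrases(t, n) for t in genuine_texts))
    fraudulent = set().union(*(phrases(t, n) for t in fraudulent_texts))
    fraud_hits = len(received_phrases & fraudulent)
    genuine_hits = len(received_phrases & genuine)
    return (fraud_hits - genuine_hits) / len(received_phrases)


def is_fraudulent(received, genuine_texts, fraudulent_texts, threshold=0.25):
    """Flag the text when its score exceeds a hypothetical threshold."""
    return text_score(received, genuine_texts, fraudulent_texts) > threshold


GENUINE = ["selling my used bike in good condition"]
FRAUDULENT = ["send payment by wire transfer today"]

suspicious_score = text_score("please send payment by wire transfer", GENUINE, FRAUDULENT)
suspicious = is_fraudulent("please send payment by wire transfer", GENUINE, FRAUDULENT)
ordinary = is_fraudulent("used bike in good condition", GENUINE, FRAUDULENT)
```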
System, method, and program for processing text using object coreference technology
Tue, 14 Oct 2014 08:00:00 EDT
System, method and program product for text processing using object coreference technology. In particular, the invention provides a text processing method which includes acquiring text to be processed; extracting subject words and entity words corresponding to the subject words from the text; grouping the subject words; determining entity words that reference a same concerned object according to the grouped subject words; and generating a processing policy for entity words that reference a same concerned object. The invention also includes a system with means for carrying out the method. The invention generally realizes automatic, more comprehensive, accurate, and efficient analysis and processing of text data. The invention can be used to mine a large amount of comment data about an entity, and can also be used to suggest a place in an article where an embedded advertisement can be inserted.
Generating Chinese language banners
Tue, 14 Oct 2014 08:00:00 EDT
Embodiments are disclosed for automatically generating a banner given a first scroll sentence and a second scroll sentence of a Chinese couplet. The first and/or second scroll sentence can be generated by an automatic computer system or by a human (e.g., manually generated and then provided as input to an automated banner generation system) or obtained from any source (e.g., a book) and provided as input. In one embodiment, an information retrieval process is utilized to identify banner candidates that best match the first and second scroll sentences. In one embodiment, candidate banners are automatically generated. In one embodiment, a ranking model is applied in order to rank banner candidates derived from the banner search and generation processes. One or more banners are then selected from the ranked banner candidates.
Natural language interface
Tue, 14 Oct 2014 08:00:00 EDT
The present disclosure involves systems, software, and computer implemented methods for providing a natural language interface for searching a database. One process includes operations for receiving a natural language query. One or more tokens contained in the natural language query are identified. A set of sentences is generated based on the identified tokens, each sentence representing a possible logical interpretation of the natural language query and including a combination of at least one of the identified tokens. At least one sentence in the set of sentences is selected for searching a database based on the identified tokens.
Method and system for smart mark-up of natural language business rules
Tue, 14 Oct 2014 08:00:00 EDT
Smart Mark-up or highlighting delimits a rule using ontology technology to identify words and fields as objects and/or possible values in the rule. These technologies support the user in formalizing parts of the rules in a manner consistent with the system's data.
System and method for automatic language translation for applications
Tue, 14 Oct 2014 08:00:00 EDT
System and method to translate displayed text of a computer application, the method including: intercepting a command to display text in a first language, the command comprising the text to display in the first language; extracting text to translate from the command; querying a translation mechanism by use of the extracted text; receiving translated text in a second language from the translation mechanism; and displaying the translated text in the second language.
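Illustrative sketch (not taken from the patent): the intercept, extract, query, and display sequence can be pictured as a wrapper around an application's display call. The `translating` wrapper, the tiny phrase table, and the list standing in for a screen are all hypothetical.

```python
def translating(display, translate):
    """Wrap a display function so its text is translated before being shown."""
    def intercepted(text):
        # The intercepted command carries the text in the first language.
        translated = translate(text)  # query the translation mechanism
        display(translated)           # show the text in the second language
    return intercepted


# Hypothetical translation mechanism: a small English-to-French phrase table.
PHRASES = {"Save": "Enregistrer", "Cancel": "Annuler"}


def lookup_translation(text):
    # Untranslatable text is passed through unchanged.
    return PHRASES.get(text, text)


screen = []  # stands in for the application's real display surface
show = translating(screen.append, lookup_translation)
show("Save")
show("Cancel")
show("OK")
```

The application keeps calling `show` as before; only the wrapper knows that a translation step sits between the command and the display.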
Creating and implementing language-dependent string pluralizations
Tue, 14 Oct 2014 08:00:00 EDT
Embodiments are directed to applying appropriate pluralization rules to text strings and to generating pluralization rules for multiple different languages. In an embodiment, a computer system identifies a user interface (UI) text string that includes a numerical amount for which an appropriate pluralization form is to be determined. The string is represented by a resource identifier (ID). The computer system receives an indication of which language the text string is to be displayed in and determines an appropriate resource ID from a set of pre-generated resource IDs based on the numerical amount and the determined language. The pre-generated resource IDs include various language-specific pluralization forms for localization of the text string. The computer system also returns the localized text string at the determined appropriate resource ID to the UI for display. In this manner, the localized text string is presented with the numerical amount and proper pluralization in the indicated language.
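Illustrative sketch (not taken from the patent): choosing a pre-generated resource ID from the numerical amount and the language can be done by first mapping the number to a plural category and then composing the ID. The resource ID scheme and string table are hypothetical; the English and Russian integer rules follow the commonly documented one/few/many categories.

```python
def plural_category(language, n):
    """Return the plural category for a non-negative integer in a language."""
    if language == "en":
        return "one" if n == 1 else "other"
    if language == "ru":
        if n % 10 == 1 and n % 100 != 11:
            return "one"
        if 2 <= n % 10 <= 4 and not 12 <= n % 100 <= 14:
            return "few"
        return "many"
    return "other"  # assumed fallback for languages not listed here


# Hypothetical pre-generated resources keyed by "<base>.<language>.<category>".
RESOURCES = {
    "file_count.en.one": "{n} file",
    "file_count.en.other": "{n} files",
    "file_count.ru.one": "{n} файл",
    "file_count.ru.few": "{n} файла",
    "file_count.ru.many": "{n} файлов",
}


def localized_string(base_id, language, n):
    """Pick the resource ID for this amount and language, then fill in the number."""
    resource_id = f"{base_id}.{language}.{plural_category(language, n)}"
    return RESOURCES[resource_id].format(n=n)


english = localized_string("file_count", "en", 1)
russian_few = localized_string("file_count", "ru", 22)
russian_many = localized_string("file_count", "ru", 11)
```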
