Technology Catalog

Download the Technology Catalog in pdf here.

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
    visuel KIT_speech to text
    Transcription of human speech into written word sequences read more
     

    Target users and customers

    Companies who want to integrate the transcription of human speech into their products.

    Application sectors

    Speech-to-Text technology is key to indexing multimedia content as it is found in multimedia databases or in video and audio collections on the World Wide Web, and to make it searchable by human queries. In addition, it offers a natural interface for submitting and executing queries.

    This technology is further part of speech-translation services. In combination with machine translation technology, it is possible to design machines that take human speech as input and translate it into a new language. This can be used to enable human-to-human combination across the language barrier or to access languages in a cross-lingual way.

     
    Q-Tech-INRA-TYDI-visuel
    A platform for the validation, structuration and export of termino-ontologies read more
     

    Target users and customers

    The primary use of TyDI is the design of termino-ontologies for the indexation of textual documents. It can therefore be of great help for most projects involved in natural language processing.

    Application sectors

    • Terminology structuring
    • Textual document indexing
    • Natural language processing
     
    Q-Tech-SYNAPSE-QA-Qristal1-newvisuel
    A Question Answering system allows the user to ask questions in natural language and to obtain one or several answers. For boolean and generic questions, our system is able to generate potential questions and to return the corresponding answers. Read more
     

    Target users and customers

    End-user application, Question-Answering is the easiest way to find information for everybody: ask the question as you want and obtain answers, not snippets or pages.

    Application sectors

    Search and find precise answers in any collection of texts, from the Web or any other source (voice recognition, optical character recognition, etc.), with eventual correction of the source text, ability to generate questions from generic requests, eventually a single word, ability to find similar questions and their answers, etc.

    Monolingual and multilingual Question-Answering system. Languages: English, French (+ Spanish, Portuguese, Polish, with partners using the same API).

     
    Q-Tech-Inria-SlopPy-visuel
    Slope One with Privacy read more
     

    Target users and customers

    The targeted users and customers are all the Internet actors providing personalized services to their users, interested by integrating recommender systems that are more respectful of their privacy.

    Application sectors

    • Personalization
    • Recommender systems
     
    ircammusicgenre
    Ircammusicgenre and Ircammusicmood softwares estimate automatically the belonging of a music track to a set of music genre (electronica, jazz, pop/rock…) and music mood classes (positive, sad, powerful, calming…) Read more
     

    Target users and customers

    Classification of music items are generally primarily based on their belonging to a music genre: electronica, jazz, pop/ rock… However, the editorial meta-data related to the genre are generally only accessible at the artist level (the whole set of music tracks produced by one artist will belong to the same music genre whatever the tracks content). Ircammusicgenre is a software which allows the automatic estimation of the belonging of a music track to music genres. The list of music genres considered by the software can be pre-determined by Ircam (electronica, jazz, pop/rock…) or can be adapted to categories relevant to the partner, provided a sufficient number of sound examples per category.

    Ircammusicgenre also allows to perform multi-labeling of a music track, i.e. assigning a set of genre labels instead of a single genre. In this case, a weighting is assigned to each estimated label.

    Ircammusicmood a software which allows the automatic estimate of the music mood of has music track to music mood. Music mood report to the “mood” that a track suggests: positive, sad, powerful, calming…

    As for the music genre, the list can be predetermined by Ircam or discussed with the partner. Multi-labels can also be applied to the music mood classification.

    Application sectors

    • Online music providers
    • Online music portals
     
    Q-Tech-Vocapia-automatic speechtranscription-visuel
    Vocapia Research develops core multilingual large vocabulary speech recognition technologies* for voice interfaces and automatic audio indexing applications. This speech-to-text technology is available for multiple languages. (* Under license from LIMSI-CNRS) read more
     

    Target users and customers

    The targeted users and customers of speech-to-text transcription technologies are actors in the multimedia and call center sector, including academic and industrial organizations interested in the automatic mining processing of audio or audiovisual documents.

    Application sectors

    This core technology can serve as the basis for a variety of applications: multilingual audio indexing, teleconference transcription, telephone speech analytics, transcription of speeches, subtitling…

    Large vocabulary continuous speech recognition is the key technology for enabling content-based information access in audio and audiovisual documents. Most of the linguistic information is encoded in the audio channel of audiovisual data, which once transcribed can be accessed using text-based tools.

    Via speech recognition, spoken document retrieval can support random access using specific criteria to relevant portions of audio documents, reducing the time needed to identify recordings in large multimedia
    databases. Some applications are data-mining, news-on-demand, and
    media monitoring.

     
    ircamaudiosim-1
    Ircamaudiosim estimates the acoustical similarity between two music tracks. It can be used to perform music recommendation based on music content similarity. Read more
     

    Target users and customers

    Ircamaudiosim allows the development of music recommendation based on music content similarity. It can therefore be used for any system (online or offline) requiring music recommendation, such as for the development of a recommendation engine for online music service or offline music collection browsing.

    Application sectors

    • Online music providers
    • Online music portals
    • Music players developers
    • Music software developers
     
    Q-Tech_A2iA_visuel_Document_Classification_HD
    Classification of all types of paper documents, Data Extraction and Mail Processing and Workflow Automation Read more
     

    Target users and customers

    • Independent Software Vendors
    • Business Process Outsourcers

    Application sectors

    Bank, Insurance, Administration, Telecom and Utility Companies, Historical Document Conversion

     
    Q-Tech-Technicolor-Audience-visuel
    Automatically characterize in-home audience and level of attention read more
     

    Target users and customers

    All content providers may be interested in the automatic characterization of the in-home audience. When personalization of video – either Video on Demand (VoD) or broadcast – or ads is targeted, these same providers will see an interest in having this module to help the automatic personalization of provided content.

    The audience characterization module may also be used by end users to manage their own content at home. Furthermore, content providers and advertisers will be interested in the ‘level of attention’ information provided by the module.

    Application sectors

    Provided content personalization
    Knowing what the audience is:

    • VoD portals may be personalized and proper home pages may be displayed;
    • Ads may be personalized;
    • According videos and broadcast programs may be proposed.
     
    PastedGraphic-2
    Recognition of complex events, like "wedding proposal" or "giving directions to a location". Read more
     

    Target users and customers

    Personal/professional managers of video collections

    Application sectors

    Video archives
    Multi-media document processing
     
 
 
Application demonstrators