^^

'contrib/sound' section


ALSA software loaders for specific hardware

A collection of software loaders for specific hardware: . cspctl - Sound Blaster 16 ASP/CSP control program hdsploader - firmware loader for the RME Hammerfall DSP cards mixartloader - firmware loader for Digigram's miXart board sound drivers pcxhrloader - firmware loader for Digigram pcxhr compatible soundcards sscape_ctl - SoundScape control utility and firmware loader usx2yloader - firmware loader for Tascam USX2Y USB soundcards vxloader - firmware loader for Digigram VX soundcards

/contrib/cicero/cicero

French and English Text-To-Speech for MBROLA

This Text-To-Speech (TTS) engine speaks French; a preliminary English support is also offered. The engine uses context-sensitive rules to produce phonemes from the text. It relies on MBROLA to generate actual audio output from the phonemes. The TTS engine is implemented using the Python programming language. . The upstream authors have come up with this TTS to try and meet their own needs as blind users. It's designed to be plugged as output to some screen-review software, firstly with BRLTTY. They favor speed and intelligibility over perfect pronunciation. Cicero is aimed to have a quick response time, the ability to quickly shut-up and skip to another utterance, intelligibility where it counts (not perfect pronunciation), the ability to track speech progression, relative simplicity (hackability) and relative small code size.

/contrib/fdkaac/fdkaac

command line encoder frontend for libfdk-aac

fdkaac is a command-line AAC encoder based on libfdk-aac. . It reads linear PCM audio in either WAV, raw PCM, or CAF format and encodes it into M4A / AAC files.


minimal British English male speaker for festival

This is a very small diphone database, which gives poor quality sound output but runs very fast. . The voice contained in this database is a British English male speaker.


mbrola-us2 voice support for festival

This package lets festival make use of the non-free mbrola-us2 sample-based voice instead of its own synthesis voices.


mbrola-en1 voice support for festival

This package lets festival make use of the non-free mbrola-en1 sample-based voice instead of its own synthesis voices.


mbrola-us1 voice support for festival

This package lets festival make use of the non-free mbrola-us1 sample-based voice instead of its own synthesis voices.


mbrola-us3 voice support for festival

This package lets festival make use of the non-free mbrola-us3 sample-based voice instead of its own synthesis voices.


British English male speaker for festival, 16khz sample rate

This is a diphone database for festival that uses 16k samples for high quality sound output. This is a large database (9 MB), and festival will convert text to speech more slowly using it, so you might want to install festvox-rablpc8k instead if you have low disk space or a slow computer. . The voice contained in this database is a British English male speaker.


British English male speaker for festival, 8khz sample rate

This is a diphone database for festival that uses 8k samples so it takes up a minimum of disk space. Using this diphone database, festival also converts text to speech faster than with the 16k version. However, the sound quality is not as good. . The voice contained in this database is a British English male speaker.


Adobe Flash Player platform support library for Esound and OSS

This is an open Source extension library for the Adobe Flash Player that enables support for otherwise unsupported sound systems. It provides the libflashsupport.so plugin. The sound system to use is automatically detected: . * It first tries to detect Esound, * Next, it checks for OSS. . If all of the above failed, it falls back to the ALSA driver that's built directly into FlashPlayer 9. . For PulseAudio support, look elsewhere.


Japanese male voice data for Open JTalk

This is a Japanese male voice data for Open JTalk, a Hidden Markov Model based speech synthesis system (a.k.a. HTS) for Japanese. . This voice data was trained by using the HTS voice data "NIT ATR503 M001" and released by HTS Working Group as a part of Open JTalk.


Microsoft Kinect sensor audio setup helpers

When the Kinect sensor is first plugged in, its audio input device shows up as a generic USB device; after the appropriate firmware is loaded, the device is reset and it becomes available as a USB Audio Class (UAC) device. . Since the firmware is not redistributable, kinect-audio-setup provides tools to download it from the Internet at installation time, and sets up udev rules to call the firmware loader when the device is plugged in to finally get the UAC device. . The UAC firmware is downloaded from the Microsoft Kinect Software Development Kit at http://www.microsoft.com/en-us/kinectforwindows/, the license of which can be found at http://research.microsoft.com/en-us/um/legal/kinectsdk-tou_noncommercial.htm

/contrib/mbrola/mbrola

Multilingual software speech synthesizer

Mbrola is Thierry Dutoit's phonemizer for multilingual speech synthesis. The various diphone databases are distributed on separate packages, but they must be used with and only with Mbrola because of license matters. Read the copyright for details. . Mbrola itself doesn't provide full TTS. It is a speech synthesizer based on the concatenation of diphones. It takes a list of phonemes as input, together with prosodic information (duration of phonemes and a piecewise linear description of pitch), and produces speech samples on 16 bits (linear), at the sampling frequency of the diphone database. . Use Mbrola along with Freephone, cicero or espeak to have a complete text-to-speech in English.


Speech Dispatcher: Voxin output module

Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications. . Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to yet again reinvent the wheel. . This package contains the output module for the Voxin speech synthesizer, which supports IBM TTS (aka Eloquence) and Vocalizer Embedded, which needs to be installed separately.


Speech Dispatcher: IBM TTS output module

Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications. . Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to yet again reinvent the wheel. . This package contains the output module for the IBM TTS (aka Eloquence), which needs to be installed separately.


Speech Dispatcher: Pico output module

Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications. . Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to yet again reinvent the wheel. . This package contains the output module for the pico speech synthesizer.


Speech Dispatcher: Kali output module

Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications. . Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to yet again reinvent the wheel. . This package contains the output module for the Kali speech synthesizer, which needs to be installed separately.


Speech Dispatcher: Baratinoo (VoxyGen) output module

Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications. . Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to yet again reinvent the wheel. . This package contains the output module for the Baratinoo speech synthesizer, also called VoxyGen, which needs to be installed separately.