Balabolka :: Command Line Utility for Online Services

The command line application provides access to online text-to-speech services: text files or subtitles can be converted to audio files. The utility can also be used for testing: it helps you choose a cloud service that satisfies your needs. A separate application for Yandex SpeechKit is available for download, because Yandex is a Russian IT company with close government ties.

Online services with speech technologies:

Google Cloud Text-to-Speech;
Amazon Polly;
Baidu Text-to-Speech;
CereVoice Cloud;
IBM Watson Text-to-Speech;
Microsoft Azure;
Naver Text-to-Speech;
Youdao Text-to-Speech;
Yandex SpeechKit.

Download the BAL4WEB command line utility

File size: MB

Version: Changelog

License: Freeware

Operating system:

Command line utility for Yandex SpeechKit: Download ( MB)
The program converts text or subtitles to audio files using the Yandex service.
To perform operations via the Yandex API, you must authenticate with an API key.

Command Line

The utility handles various command line parameters to read text aloud or save it as an audio file. The command line uses the syntax "bal4web [options ...]"; all parameters must be separated by spaces. Options can appear in any order as long as each option is paired with its related parameter. Run "bal4web -?" to get help on the command line syntax and parameters.

-s service_name: Sets the name of the online TTS service ("google" or "g", "amazon" or "a", "baidu" or "b", "cerevoice" or "c", "ibm" or "i", "microsoft" or "m", "naver" or "n", "youdao" or "y"). The default is "google".
-l language_name: Sets the language name for the online TTS service. The name is a combination of an ISO 639 two-letter lowercase culture code associated with a language and an ISO 3166 two-letter uppercase subculture code associated with a country or region.
For example: pl-PL, de-DE, fr-FR. The default is "en-US".
-g gender: Sets the gender for the online TTS service (if supported). The available values are "female" or "f", and "male" or "m". The default value is not defined. This option is supported by Amazon Polly, CereProc TTS, Google TTS, IBM Watson TTS, Microsoft Azure, Naver TTS.
-n voice_name: Sets the voice name for the online TTS service (if supported). The default value is not defined. This option is supported by Amazon Polly, CereProc TTS, Google Cloud TTS, IBM Watson TTS, Microsoft Azure, Naver TTS.
-r speech_rate: Sets the rate of the synthesized speech (if supported). The default is "1.0" (average human speech rate).
For Amazon Polly the rate values range from "0.2" to "2.0".
For CereProc TTS the rate values range from "0.3" to "4.0".
For Google TTS and Microsoft Azure the rate values range from "0.1" to "3.0".
For IBM Watson TTS the rate values range from "0.3" to "3.0".
For Naver TTS the rate values range from "0.5" to "1.5".
-p number: Sets the speaking pitch in a range of -20 to 20 (if supported). The default is 0.
This option is supported by Amazon Polly, CereProc TTS, Google Cloud TTS, IBM Watson, Microsoft Azure.
-v number: Sets the volume in a range of 0 to 200. The default is 100.
-m: Prints the list of supported languages (with genders and voice names, if available) for the online TTS service.
-f file_name: Opens the input text file. The command line may contain several -f options.
-fl file_name: Sets the name of the text file with the list of input files (one file name per line). The command line may contain several -fl options.
-w file_name: Saves the audio file in WAV format.
-c: Takes the input text from the clipboard.
-t text: Takes the input text from the command line. The command line may contain several -t options.
-i: Takes the input text from the standard input stream (STDIN).
-o: Writes the audio to the standard output stream (STDOUT). If this option is specified, the option -w is ignored.
--encoding encoding or -enc encoding: Sets the encoding of the text from the standard input stream ("ansi", "utf8" or "unicode"). The default is "ansi".
--silence-begin number or -sb number: Sets the length of silence at the beginning of the audio file (in milliseconds). The default is 0.
--silence-end number or -se number: Sets the length of silence at the end of the audio file (in milliseconds). The default is 0.
-ln number: Selects a line from the text file by its line number. Line numbering starts at "1". A range of numbers can be used to select more than one line (for example, "26-34"). The command line may contain several -ln options.
-e number: Sets the length of pauses between sentences (in milliseconds). The value must be less than 5000. If the option is not specified, the service will use its default pauses between sentences. This option is supported by Microsoft Azure only.
-d file_name: Applies a dictionary for pronunciation correction (*.BXD, *.DIC or *.REX). The command line may contain several -d options.
-lrc: Creates an LRC file. Lyrics will be synchronized with the speech in the output audio file.
-srt: Creates an SRT file. Subtitles will be synchronized with the speech in the output audio file.
-sub: Treats the input text as subtitles, which must be converted to an audio file with the specified time intervals taken into account. The option may be useful when the options -i or -c are specified.
-host host_name: Sets the hostname of the proxy server.
-port liczba: Sets the port number of the proxy server.
-fr number: Sets the output audio sampling frequency in kHz (8, 11, 16, 22, 24, 32, 44, 48). If the option is not specified, the default value for the selected service will be used.
--ignore-square-brackets or -isb: Ignore text in [square brackets].
--ignore-curly-brackets or -icb: Ignore text in {curly brackets}.
--ignore-angle-brackets or -iab: Ignore text in <angle brackets>.
--ignore-round-brackets or -irb: Ignore text in (round brackets).
--ignore-url or -iu: Ignore URLs.
--ignore-comments or -ic: Ignore comments in text. Single-line comments start with // and continue until the end of the line. Multiline comments start with /* and end with */.
-dp: Display progress information in a console window.
-cfg file_name: Sets the name of the configuration file with the command line options (a text file where each line contains one option). If the option is not specified, the file bal4web.cfg in the same folder as the utility will be used.
-h: Prints the description of the command line options.
--lrc-length number: Sets the maximum length of text lines in the LRC file (in characters).
--lrc-fname file_name: Sets the name of the LRC file. The option may be useful when the option -o is specified.
--lrc-enc encoding: Sets the encoding for the LRC file ("ansi", "utf8" or "unicode"). The default is "ansi".
--srt-length number: Sets the maximum length of text lines in the SRT file (in characters).
--srt-fname file_name: Sets the name of the SRT file. The option may be useful when the option -o is specified.
--srt-enc encoding: Sets the encoding for the SRT file ("ansi", "utf8" or "unicode"). The default is "ansi".
--lrc-offset number: Sets the time shift for the LRC file (in milliseconds).
--lrc-artist text: Sets a tag for the LRC file: artist.
--lrc-album text: Sets a tag for the LRC file: album.
--lrc-title text: Sets a tag for the LRC file: title.
--lrc-author text: Sets a tag for the LRC file: author.
--lrc-creator text: Sets a tag for the LRC file: creator of the file.
--lrc-sent: Inserts blank lines after sentences when creating the LRC file.
--lrc-para: Inserts blank lines after paragraphs when creating the LRC file.
--raw: Writes the audio as raw PCM data, without a WAV format header. The option is used together with the option -o.
--ignore-length or -il: Omits the length of the audio data from the WAV format header. The option is used together with the option -o.
--wss: Use the WebSocket protocol for Microsoft Azure. This improves the sound quality of audio files (24 kHz instead of 16 kHz). The option is ignored if the subscription key for the Microsoft Azure Cognitive Services is defined. Use the option -m to check whether a voice supports the WebSocket protocol.
--sub-format text: Sets the subtitle format ("srt", "lrc", "ssa", "ass", "smi" or "vtt"). If the option is not specified, the format is determined from the extension of the subtitle file name.
--sub-fit or -sf: Automatically increases the speech rate to fit time intervals (when the program converts subtitles to an audio file). This option is supported by Amazon Polly, CereProc TTS, Google TTS, Microsoft Azure, Naver TTS.
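
The rate ranges listed for the option -r differ between services, so a script that calls the utility may want to check a value before passing it on. The following Python sketch copies the ranges from the description of -r above; the names RATE_RANGES and rate_is_valid are hypothetical and are not part of bal4web.

```python
# Rate ranges per service, copied from the description of the -r option.
# The keys are the long service names accepted by the -s option.
RATE_RANGES = {
    "amazon": (0.2, 2.0),
    "cerevoice": (0.3, 4.0),
    "google": (0.1, 3.0),
    "microsoft": (0.1, 3.0),
    "ibm": (0.3, 3.0),
    "naver": (0.5, 1.5),
}


def rate_is_valid(service: str, rate: float) -> bool:
    """Return True if the rate lies inside the documented range.

    A ValueError is raised for services without a documented range
    (for example "baidu" or "youdao"), because no limits are listed for them.
    """
    bounds = RATE_RANGES.get(service.lower())
    if bounds is None:
        raise ValueError(f"no documented rate range for service {service!r}")
    low, high = bounds
    return low <= rate <= high
```

For example, rate_is_valid("naver", 1.6) reports that the value is outside the range listed for Naver TTS.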

--aws-keyid text or -ak text: Sets the AWS access key ID for Amazon Polly. Supplying your own key is recommended if you have one.
--aws-secret text or -as text: Sets the AWS secret access key for Amazon Polly.
--aws-region text or -ar text: Sets the AWS region for Amazon Polly.
--crv-email text or -ce text: Sets the email address used when registering on the CereProc website. This information is necessary for CereVoice Cloud API authorization. Supplying your own email address is recommended if you have an account.
--crv-pwd text or -cp text: Sets the password used when registering on the CereProc website. This information is necessary for CereVoice Cloud API authorization. Supplying your own password is recommended if you have an account.
--gc-apikey text or -gk text: Sets the API key for Google Cloud. Supplying your own key is recommended if you have one.
--ms-apikey text or -mk text: Sets the subscription key for the Microsoft Azure Cognitive Services. Supplying your own key is recommended if you have one.
--ms-region text or -mr text: Sets the subscription region for the Microsoft Azure Cognitive Services.
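
Credentials are often kept out of scripts and batch files. The Python sketch below builds an argument list for Amazon Polly and adds the credential options only when values are available. The environment variable names (BAL4WEB_AWS_KEYID and so on) and the function name polly_command are assumptions made for this example; bal4web itself is not known to read any environment variables.

```python
import os
from typing import List, Mapping, Optional


def polly_command(text_file: str, wav_file: str,
                  env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Build a bal4web argument list for Amazon Polly.

    The variable names below are a convention invented for this sketch.
    """
    env = os.environ if env is None else env
    args = ["bal4web", "-s", "amazon", "-f", text_file, "-w", wav_file]
    # Each credential option is appended only when its value is set.
    for option, variable in (("--aws-keyid", "BAL4WEB_AWS_KEYID"),
                             ("--aws-secret", "BAL4WEB_AWS_SECRET"),
                             ("--aws-region", "BAL4WEB_AWS_REGION")):
        value = env.get(variable)
        if value:
            args += [option, value]
    return args
```

The returned list can be passed to subprocess.run, which avoids quoting problems with paths that contain spaces.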

Examples

Create the text file LANGUAGE.TXT with the list of all supported languages and genders for the Google TTS service:

bal4web -s Google -m > language.txt

Convert text from BOOK.TXT to speech and save as BOOK.WAV:

bal4web -f "d:\Text\book.txt" -w "d:\Sound\book.wav" -s Google -l en-US -g female

Convert subtitles to speech and save as MOVIE.WAV:

bal4web -f "d:\Subtitles\movie.srt" -w "d:\Sound\movie.wav" -s m -l de-DE -n Conrad -r 1.1

Example of use together with LAME.EXE:

bal4web -f d:\book.txt -s Baidu -l en-US -o --raw | lame -r -s 16 -m m -h - d:\book.mp3

Example of use together with OGGENC2.EXE:

bal4web -f d:\book.txt -s Baidu -l en-US -o -il | oggenc2 --ignorelength - -o d:\book.ogg
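
The same kind of pipe can be set up from a script. The Python sketch below mirrors the LAME.EXE example above using the standard subprocess module; it assumes that bal4web and lame can be found on the PATH, and the function names lame_pipeline and run_pipeline are hypothetical.

```python
import subprocess
from typing import List, Tuple


def lame_pipeline(text_file: str, mp3_file: str) -> Tuple[List[str], List[str]]:
    """Return the two argument lists used in the LAME.EXE example."""
    producer = ["bal4web", "-f", text_file, "-s", "Baidu", "-l", "en-US",
                "-o", "--raw"]
    # "-" tells lame to read the raw PCM data from its standard input.
    consumer = ["lame", "-r", "-s", "16", "-m", "m", "-h", "-", mp3_file]
    return producer, consumer


def run_pipeline(producer_cmd: List[str], consumer_cmd: List[str]) -> Tuple[int, int]:
    """Connect STDOUT of the first program to STDIN of the second."""
    producer = subprocess.Popen(producer_cmd, stdout=subprocess.PIPE)
    consumer = subprocess.Popen(consumer_cmd, stdin=producer.stdout)
    # Close our copy of the pipe so the producer notices if the consumer exits.
    producer.stdout.close()
    consumer_code = consumer.wait()
    producer_code = producer.wait()
    return producer_code, consumer_code
```

Calling run_pipeline(*lame_pipeline(r"d:\book.txt", r"d:\book.mp3")) starts both programs and returns their exit codes.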

Configuration File

Options can be stored in the configuration file "bal4web.cfg" in the same folder as the console application.

Example of the file contents:

-f d:\Text\book.txt
-w d:\Sound\book.wav
-s Google
-l de-DE
-g female
-d d:\Dict\rules.bxd
-lrc
--lrc-length 75
--lrc-enc utf8

The program can combine options from the configuration file and the command line.
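
A configuration file in this layout can also be generated by a script. The Python sketch below renders one option per line, as in the example above. The function name render_config is hypothetical, and the sketch does not quote values, because the quoting rules for the configuration file are not described here.

```python
from typing import Iterable, Optional, Tuple


def render_config(options: Iterable[Tuple[str, Optional[str]]]) -> str:
    """Render (option, value) pairs as configuration file text.

    Use None as the value for switches without a parameter, such as -lrc.
    """
    lines = []
    for option, value in options:
        # One option per line, with its parameter after a single space.
        lines.append(option if value is None else f"{option} {value}")
    return "\n".join(lines) + "\n"
```

The returned text can then be saved as bal4web.cfg next to the utility, or under another name that is passed with the option -cfg.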

License

You are free to use and distribute the software for noncommercial purposes. For commercial use or distribution, you need to get permission from the copyright holder.