Files
deepspeech-demo/README.md
2023-08-01 14:20:25 +02:00

52 lines
1.5 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# DeepSpeech Demo
![Screenshot](public/screenshot.png)
## About
This is a simple web-interface and Node server that uses [DeepSpeech](https://github.com/mozilla/DeepSpeech) to create a local Speech-to-Text service.
I mostly built it for myself, in order to play around with DeepSpeech, but it can be used
as a starting point for your own experiments.
## Prerequisites
The demo relies on [SoX](http://sox.sourceforge.net/) being available
on your system and in your PATH.
A quick seach for `how to install SoX on (windows|ubuntu|mac)` should do the trick
for you if you don't know how to install it, at least it did for me.
## Install
First, clone the repository:
```bash
git clone git@github.com:asciidisco/deepspeech-demo.git
```
Then change into the directory & perform an npm install:
```bash
npm install
````
As this downloads the pre-trained models from the DeepSpeech releases as a postinstall step,
which are roughly 1.4 GB, it might take a while.
After that, start the server using:
```bash
npm start
```
And navigate to [http://localhost:3000](http://localhost:3000)
## Usage
The user interface is quite reduced, after you´ve agreed that the page is allowed
to use your microphone, you just need to hit the "Listen" button and speak a few words
into your microphone. After you finished your sentence, you need to hit the "Listening..."
button once again, so that the recorded sound can be processed.
After a few seconds, you should see the text in the result box below the button.