tpro
Transcript Processing! tpro takes JSON-formatted transcripts produced by
various speech-to-text services and converts them to standardized
formats.
Installation and Usage
Non-pip Requirement: Stanford NER JAR
- download and unzip the Stanford NER package
- put these files in /usr/local/bin/:
- stanford-ner.jar
- classifiers/english.all.3class.distsim.crf.ser.gz
- you might have to update Java on Linux
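Before running tpro, it can help to confirm that the two Stanford NER files are where the list above says they should be. The following is a minimal sketch, not part of tpro itself: the function name `missing_ner_files` is hypothetical, and the only things taken from this README are the two file names and the `/usr/local/bin/` directory.

```python
from pathlib import Path

# File names and default directory come from the requirement list above.
REQUIRED_FILES = (
    "stanford-ner.jar",
    "classifiers/english.all.3class.distsim.crf.ser.gz",
)


def missing_ner_files(base_dir="/usr/local/bin"):
    """Return the required Stanford NER files that are absent from base_dir.

    Hypothetical helper for illustration; tpro does not ship this function.
    """
    base = Path(base_dir)
    return [name for name in REQUIRED_FILES if not (base / name).is_file()]


if __name__ == "__main__":
    missing = missing_ner_files()
    if missing:
        print("Missing Stanford NER files:", ", ".join(missing))
    else:
        print("Stanford NER files found.")
```

An empty list means both files are in place; anything else names the files that still need to be copied.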
Pip
$ pip install tpro
Usage
$ tpro --help
Usage: tpro [OPTIONS] JSON_PATH_OR_DATA [amazon|gentle|speechmatics]
            [universal_transcript|viral_overlay]

Options:
  -s, --save TEXT  save to file
  --help           Show this message and exit.
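To call the command from a script, the argument order shown in the help text can be assembled programmatically. This is a rough sketch under a few assumptions: `build_tpro_command` is a hypothetical helper, `transcript.json` and `out.json` are placeholder file names, and both positional choices are assumed to be passed explicitly. Only the service names, the output format names, and the `--save` option are taken from the help output above.

```python
# Choices copied from the `tpro --help` output above.
SERVICES = ("amazon", "gentle", "speechmatics")
OUTPUT_FORMATS = ("universal_transcript", "viral_overlay")


def build_tpro_command(json_path_or_data, service, output_format, save_path=None):
    """Build an argument list for the tpro CLI (hypothetical helper).

    Raises ValueError when the service or output format is not one of the
    choices listed in the help text.
    """
    if service not in SERVICES:
        raise ValueError(f"unknown service: {service!r}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"unknown output format: {output_format!r}")

    command = ["tpro"]
    if save_path is not None:
        # --save takes the file name to write to.
        command += ["--save", save_path]
    command += [json_path_or_data, service, output_format]
    return command


if __name__ == "__main__":
    cmd = build_tpro_command(
        "transcript.json", "amazon", "universal_transcript", save_path="out.json"
    )
    print(" ".join(cmd))
    # To actually run it (requires tpro on PATH):
    #   import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting problems when JSON data, rather than a path, is passed as the first argument.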
Example
STT Services
Planned
Output Formats
- Universal Transcript (JSON)
- viraloverlay (JSON)
Planned
- Word (.doc, .docx)
- text files
- SRT (subtitles)