ed2ece3dfb4966f58b4aaf6d7ff9fc79d0669d4b
tpro
Transcript Processing! tpro takes JSON-formatted transcripts produced by
various speech-to-text services and converts them to various standardized
formats.
Installation and Usage
Non-pip Requirement: Stanford NER JAR
- download and unzip this
- put these files in in /usr/local/bin/:
- stanford-ner.jar
- classifiers/english.all.3class.distsim.crf.ser.gz
- you might have to update Java on Linux
Pip
$ pip install tpro
Usage
$ tpro --help
Usage: tpro [OPTIONS] JSON_PATH_OR_DATA [amazon|gentle|speechmatics|google]
[universal_transcript|viral_overlay]
Options:
-s, --save TEXT save to file
--help Show this message and exit.
STT Services
Planned
Output Formats
- Universal Transcript (JSON)
- viraloverlay (JSON)
Planned
- Word (
.doc,.docx) - text files
- SRT (subtitles)
Description
Languages
Python
100%
