srt3 is a simple yet featureful Python library for parsing, modifying, and composing SRT files. Take a look at the quickstart for a basic overview of the library. Detailed API documentation is also available.
Want to see some examples of its use? Take a look at the tools shipped with the library.
- Parses broken SRT files other libraries can't and fixes them
- Support for Asian-style SRT formats (ie. "fullwidth" SRT format)
- Extremely lightweight with a Well Documented API
- Includes tools that allow you to perform tasks using the library
- No Dependencies outside of the Standard Library
- High quality test suite using Hypothesis
- ~30% faster than pysrt on typical workloads
- 100% Unicode Compliant
- Portable — runs on Windows, OSX, and Linux
- Released under a highly permissive license (MIT)
There are a number of tools shipped with the library to manipulate, process, and fix SRT files. Here's an example using hanzidentifier to strip out non-Chinese lines:
$ cat pe.srt 1 00:00:33,843 --> 00:00:38,097 Only 3% of the water on our planet is fresh. 地球上只有3%的水是淡水 2 00:00:40,641 --> 00:00:44,687 Yet, these precious waters are rich with surprise. 可是这些珍贵的淡水中却充满了惊奇 $ srt match -m hanzidentifier -fm hanzidentifier.has_chinese -i pe.srt 1 00:00:33,843 --> 00:00:38,097 地球上只有3%的水是淡水 2 00:00:40,641 --> 00:00:44,687 可是这些珍贵的淡水中却充满了惊奇
These tools are easy to chain together. For example, you have a subtitle containing Chinese and English, and another containing French. You only want Chinese French. The Chinese and English subtitle is also 5 seconds late. That's easy enough to sort out:
$ srt match -m hanzidentifier -fm hanzidentifier.has_chinese -i chs+eng.srt | > srt fixed_timeshift --seconds -5 | > srt mux --input - --input fra.srt
See the srt/tools/ directory for more information.
Detailed API documentation is available, but here are the basics:
>>> # list() is needed as srt.parse creates a generator
>>> subs = list(srt.parse('''\
... 1
... 00:00:33,843 --> 00:00:38,097
... 地球上只有3%的水是淡水
...
... 2
... 00:00:40,641 --> 00:00:44,687
... 可是这些珍贵的淡水中却充满了惊奇
...
... 3
... 00:00:57,908 --> 00:01:03,414
... 所有陆地生命归根结底都依赖於淡水
...
... '''))
>>> subs
[Subtitle(index=1, start=datetime.timedelta(0, 33, 843000), end=datetime.timedelta(0, 38, 97000), content='地球上只有3%的水是淡水', proprietary=''),
Subtitle(index=2, start=datetime.timedelta(0, 40, 641000), end=datetime.timedelta(0, 44, 687000), content='可是这些珍贵的淡水中却充满了惊奇', proprietary=''),
Subtitle(index=3, start=datetime.timedelta(0, 57, 908000), end=datetime.timedelta(0, 63, 414000), content='所有陆地生命归根结底都依赖於淡水', proprietary='')]
>>> print(srt.compose(subs))
1
00:00:33,843 --> 00:00:38,097
地球上只有3%的水是淡水
2
00:00:40,641 --> 00:00:44,687
可是这些珍贵的淡水中却充满了惊奇
3
00:00:57,908 --> 00:01:03,414
所有陆地生命归根结底都依赖於淡水
To install the latest stable version from PyPi:
pip install -U srt3
To install the latest development version directly from GitHub:
pip install -U git+https://github.com/switchupcb/srt3.git@develop
You can contribute to this repository using its Contribution Guidelines.