-
Notifications
You must be signed in to change notification settings - Fork 176
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
added
alignment.locate()
and updated all docstring
-added `alignment.locate()` to locate where in specific words / phrase are spoken in an audio without transcribing; significantly faster than transcribing then `result.WhisperResult.find()` -updated all docstring to be consistent with more common practices (to ease future documentation generation) -renamed the parameter `original_spit` to `original_split` for `alignment.align()` -the parameters: `time_scale`, `input_sr`, `demucs_output`, `demucs_device` are deprecated for all functions and method, except for `input_sr` which is not deprecated for `non_whisper.transcribe_any()` -fixed `alignment.align()` not work if `text` is an instance of `result.WhisperResult` without tokens but words -added the method `to_display_str()` to `result.Segment` as a consistent was to format a segment for printing out when `verbose=True` for all transcription functions that uses it -improved efficiency of segment splitting for `alignment.align()` when `original_split=True`; significantly faster, especially with extremely long `text`. -added parameters: `demucs`, `demucs_options`, `only_voice_freq` to `alignment.refine()` -refactored the audio preprocessing in most transcription functions into `audio.prep_audio()` -the parameter, `demucs` now also accept instance of a Demucs model instance instead of a bool; model can be loading with `audio.load_demucs_model()` -remove `__is_whisper_repo_version` from `utils.py` so that `result.py` does not depend on Whisper -added `utils.format_timestamp()` and `utils.make_safe()` from `whisper.utils.py` -added `utils.safe_print()`, a wrapper for printing content returned by `utils.make_safe()` -changed the parameter `audio` such that it is always expected to be 16kHz if `audio` a `torch.Tensor` or `numpy.ndarray` -added the parameter, `demucs_options` to `whisper_word_level.load_faster_whisper.faster_transcribe` so that `demucs_option` can used with faster-whisper -set `action="extend"` for all CLI keyword arguments that take multiple values; allowing for `-o` example to be use like: `-o 1.srt -o 2.srt 3.srt` instead of only `-o 1.srt 2.srt 3.srt`
- Loading branch information
Showing
15 changed files
with
1,660 additions
and
864 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
__version__ = "2.12.3" | ||
__version__ = "2.13.0" |
Oops, something went wrong.