Riffusion v0.3 #48
hmartiro
announced in
Announcements
Replies: 1 comment 1 reply
-
This is the most exciting project I have seen on the web in a long time! Thank you for making it! |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
🖊️ Full Rewrite
This release contains a full rewrite of the Riffusion codebase to go from a hack to a quality software project.
riffusion-inference
toriffusion
.SpectrogramParams
class that contains all conversion parameters, with sane defaults.SpectrogramConverter
class that converts between spectrogram tensors and audio.SpectrogramImageConverter
class that converts between spectrogram images and audio.AudioSegment
in more places rather than raw numpy arrays.util
package.integrations
package. Share most of the code so they greatly simplify.pyproject.toml
for tool configuration🚨 This release is API compatible with the web app, but code that used this repository directly will need to be updated.
👩💻 Riffusion CLI
Extensible command line interface for performing common tasks. See the README for details.
🤾♂️ Riffusion Playground
Extensible Streamlit app for interactive exploration of Riffusion. See the README for details.
🔥 MPS and CPU Backends
Riffusion now can run on MPS and CPU backends in addition to CUDA. See the README for details.
Also adds graceful detection and fallback of devices.
Closes: #15
👓 Stereo Spectrograms
Add tools to encode and decode stereo audio as spectrograms, using the G and B channels for left and right.
🖼️ Encode Spectrogram Params in Image EXIF
Add the ability to store spectrogram conversion parameters in EXIF metadata of the images, and the ability to decode back to audio from those params. This allows more flexibility for usage without assuming default parameters.
The
SpectrogramParams
class has methods to convert to and from EXIF.🔉 Post-Processing Filters
Add a capability to apply normalization and compression to audio using pydub.
🟢 Test Suite
Add a suite of tests in the
test/
package, and check in some test data.They are automatically run on pull requests, configured from ci.yml.
audio_to_image_test.py
image_to_audio_test.py
image_util_test.py
linter_test.py
print_exif_test.py
sample_clips_test.py
spectrogram_converter_test.py
spectrogram_image_converter_test.py
🧹 Lint Tools
These tools run in CI and must pass cleanly to merge.
ruff --fix .
)black .
)mypy .
)PRs
Full Changelog: v0.2.0...v0.3.0
This discussion was created from the release Riffusion v0.3.
Beta Was this translation helpful? Give feedback.
All reactions