Best Open Source Python Sound/Audio Software

LAME (LAME Ain't an MP3 Encoder)

A high quality MP3 encoder

LAME is an educational tool for learning about MP3 encoding. The goal of the LAME project is to improve the psychoacoustics, quality, and speed of MP3 encoding. Note: we provide source code only!

Downloads: 20,183 This Week

Last Update: 2023-12-18


spotDL

Download your Spotify playlists and songs along with album art

spotDL finds songs from Spotify playlists on YouTube and downloads them - along with album art, lyrics and metadata.

Downloads: 126 This Week

Last Update: 2025-10-08


Spleeter

Deezer source separation library including pretrained models

Spleeter is Deezer's source separation library, written in Python on top of TensorFlow, with pretrained models included. It makes it easy to train music source separation models (assuming you have a dataset of isolated sources) and provides already-trained, state-of-the-art models for several flavours of separation. The 2-stem and 4-stem models achieve state-of-the-art performance on the musdb dataset. Spleeter is also fast: run on a GPU, it can separate audio files into 4 stems 100x faster than real time. We designed Spleeter so you can use it straight from the command line as well as directly in your own development pipeline as a Python library. It can be installed with Conda or pip, or used with Docker.

1 Review

Downloads: 92 This Week

Last Update: 2021-09-03


savonet

Audio toolkit for streaming, aimed at complex Internet radio systems. Our stream generator is highly extensible and flexible. Script-based, it can run as a fully automatic daemon streamer, but is also used for live shows with a user-friendly GUI.

7 Reviews

Downloads: 346 This Week

Last Update: 2016-05-24


EasyABC

EasyABC is an open source ABC editor

EasyABC lets the user create, edit, view, play, and convert music written in the ABC music notation language. The program was originally written in Python 2.7 and wxPython by Nils Liberg and runs on Windows, OS X, and Linux. Jan Wybren de Jong has converted it to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OS X. EasyABC depends on external programs such as abc2midi, abcm2ps, and fluidsynth. If you install the Windows or Mac executables, most of these programs are included automatically.

8 Reviews

Downloads: 249 This Week

Last Update: 2025-01-12


MediaCrate — Video/Audio Downloader

Download video and audio from more than 1,000 websites with one click

MediaCrate is a lightweight desktop application for downloading video and audio from various websites, including YouTube, Instagram, TikTok, Facebook and many others. It is simple to use: paste a link, select a format and quality, and download. MediaCrate is designed with performance and simplicity in mind, maintaining minimal CPU usage while idle and a small memory footprint during downloads. The application is fully open source, runs entirely on your device, and only downloads content you explicitly request.

Project links:
Website: justagwas.com/projects/mediacrate
GitHub: github.com/Justagwas/MediaCrate
Documentation: github.com/Justagwas/MediaCrate/wiki
VirusTotal scan result: https://www.virustotal.com/gui/file/ae5eaca0b6a58697bd46aa9767625b6be085696df766e3022a68039c3f0dd6e7

2 Reviews

Downloads: 146 This Week

Last Update: 5 hours ago


DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and is implemented with Google's TensorFlow. A pre-trained English model, along with other important inference material, can be downloaded from the DeepSpeech releases page by following the instructions in the usage docs.

Downloads: 15 This Week

Last Update: 2021-04-08


SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. It can recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file, show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details), and listen to a microphone in the background, among other useful recognizer features. The easiest way to install it is with pip install SpeechRecognition. The library requires Python 2.6, 2.7, or 3.3+. PyAudio is required if and only if you want to use microphone input (Microphone); version 0.2.11+ is needed, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.

Downloads: 15 This Week

Last Update: 2025-12-31


GNU Solfege

GNU Solfege is *free* ear training software written in Python 3.4 using the Gtk+ 3 toolkit. The program is designed to be easily extended with lesson files (data files), so the user can create new exercises.

23 Reviews

Downloads: 56 This Week

Last Update: 2016-06-24


Radio Tray

Radio Tray is a streaming player for listening to online radio stations. Bugs and feature requests should be reported at https://bitbucket.org/carlmig/radio-tray/issues

23 Reviews

Downloads: 50 This Week

Last Update: 2013-05-01


cwtext text to morse code converter

Convert text to International Morse Code. Input is ASCII text. Output can be "- . -..- -" on the console, raw 8-bit PCM suitable for piping to /dev/audio, .wav files, or even (mp3|ogg). Good for headlines on your MP3 player or code practice.

7 Reviews

Downloads: 58 This Week

Last Update: 2013-03-22
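
To give a feel for what a text-to-Morse converter of this kind does, here is a minimal Python sketch. It is not cwtext's source code; the function names (text_to_morse, morse_to_pcm) and the parameters (rate, freq, unit_ms) are hypothetical and chosen only for illustration. It assumes standard International Morse timing: a dot lasts one unit, a dash three units, with gaps of one unit between symbols, three between letters, and seven between words.

```python
import math

# Illustrative sketch only; this is NOT how cwtext itself is implemented.
MORSE = {
    "A": ".-", "B": "-...", "C": "-.-.", "D": "-..", "E": ".", "F": "..-.",
    "G": "--.", "H": "....", "I": "..", "J": ".---", "K": "-.-", "L": ".-..",
    "M": "--", "N": "-.", "O": "---", "P": ".--.", "Q": "--.-", "R": ".-.",
    "S": "...", "T": "-", "U": "..-", "V": "...-", "W": ".--", "X": "-..-",
    "Y": "-.--", "Z": "--..",
    "0": "-----", "1": ".----", "2": "..---", "3": "...--", "4": "....-",
    "5": ".....", "6": "-....", "7": "--...", "8": "---..", "9": "----.",
}


def text_to_morse(text):
    """Return Morse code with letters separated by spaces and words by ' / '."""
    words = []
    for word in text.upper().split():
        # Characters without a table entry (punctuation etc.) are skipped.
        codes = [MORSE[ch] for ch in word if ch in MORSE]
        if codes:
            words.append(" ".join(codes))
    return " / ".join(words)


def morse_to_pcm(morse, rate=8000, freq=600.0, unit_ms=100):
    """Render a Morse string as unsigned 8-bit mono PCM bytes."""
    n = rate * unit_ms // 1000  # samples per timing unit

    def tone(units):
        # Sine wave centred on 128, the midpoint of unsigned 8-bit audio.
        return bytes(
            int(127.5 + 100 * math.sin(2 * math.pi * freq * i / rate))
            for i in range(n * units)
        )

    def silence(units):
        return bytes([128]) * (n * units)

    out = bytearray()
    for symbol in morse:
        if symbol == ".":
            out += tone(1) + silence(1)
        elif symbol == "-":
            out += tone(3) + silence(1)
        else:
            # ' ' or '/': each adds 2 units, so a letter gap totals 3 units
            # and the ' / ' word separator totals 7.
            out += silence(2)
    return bytes(out)


if __name__ == "__main__":
    print(text_to_morse("text"))
```

The returned bytes are raw unsigned 8-bit samples; they could be wrapped in a .wav container with the standard-library wave module (one channel, sample width 1) if a file is wanted.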


Curlew Multimedia Converter

Easy to use Multimedia Converter for Linux

8 Reviews

Downloads: 70 This Week

Last Update: 2018-05-26


PyTube Downloader

Let's quickly download YouTube videos & playlists with one click.

PyTube Downloader lets you quickly download YouTube videos and playlists with one click. Choose from 144p to 8K quality and download multiple videos simultaneously.

1 Review

Downloads: 150 This Week

Last Update: 2026-01-06


NovaSR

A lightning fast audio upsampler

NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design. NovaSR is especially valuable for post-processing tasks in speech enhancement, TTS pipelines, and dataset restoration where low sampling rates degrade perceived audio clarity; the minimal model size also makes it suitable for edge and embedded use cases where memory is at a premium. Its performance can reach thousands of times realtime on modern GPUs, allowing massive audio batches to be processed with negligible compute overhead.

Downloads: 8 This Week

Last Update: 4 days ago


Spotify Music Downloader

Download music from Spotify and other music sources.

1 Review

Downloads: 8 This Week

Last Update: 2022-04-27


FF Multi Converter

FF Multi Converter is a simple graphical application which enables you to convert audio, video, image and document files between all popular formats, using and combining other programs. It uses ffmpeg for audio/video files, unoconv for document files and ImageMagick for image file conversions.

6 Reviews

Downloads: 63 This Week

Last Update: 2017-10-19


Swing Music

Swing Music is a beautiful, self-hosted music player

Swing Music is a beautiful, self-hosted music player and streaming server that lets you bring your personal audio library online with a modern browser-based interface, giving you a private alternative to mainstream streaming services. Designed to be both elegant and powerful, the project scans your local music files (like MP3s or FLACs), organizes metadata, and streams them on-demand to any device with a browser or its Android client. It includes features like folder browsing, playlist creation, related artists/albums exploration, and intelligent metadata normalization so your collection feels clean and connected. The interface handles fuzzy search so typos don’t stop you from finding tracks, supports silence detection and crossfading for a smoother listening experience, and provides statistics about your listening habits to help you understand your personal music library.

Downloads: 7 This Week

Last Update: 2 days ago
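
The typo-tolerant search mentioned above can be illustrated with a small Python sketch. How Swing Music actually implements fuzzy search is not described here, so this is only a stand-in built on the standard-library difflib module; the function name fuzzy_find, its parameters, and the sample titles are all hypothetical.

```python
import difflib


def fuzzy_find(query, titles, limit=3, cutoff=0.6):
    """Return up to `limit` titles that approximately match `query`."""
    # Compare case-insensitively, but return the original spellings.
    lowered = {title.lower(): title for title in titles}
    hits = difflib.get_close_matches(
        query.lower(), list(lowered), n=limit, cutoff=cutoff
    )
    return [lowered[hit] for hit in hits]


if __name__ == "__main__":
    library = ["Bohemian Rhapsody", "Paranoid Android", "Hey Jude"]
    # A misspelled query still ranks the intended track first.
    print(fuzzy_find("bohemain rapsody", library))
```

Raising cutoff makes matching stricter; lowering it tolerates heavier misspellings at the cost of more unrelated results.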


Spatial Media

Specifications and tools for 360° video and spatial audio

spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is present. By separating metadata handling from encoding, the tools slot into existing editing/export workflows without requiring specialized NLE support. They help ensure assets play back with the right projection, field of view, and head-tracked audio in compliant players. For archive and validation use cases, the scripts can also read existing flags to diagnose why a video isn’t being recognized as immersive.

Downloads: 6 This Week

Last Update: 2025-10-10


StreamTuner2 ♪♬#

Internet radio directory browser

Streamtuner2 is an internet radio station and video browser. It lists stations in categories from different directories and launches your preferred media apps for playback. It is now built in Python, but retains UI similarity with the original StreamTuner 0.99.

6 Reviews

Downloads: 42 This Week

Last Update: 2022-02-22


LiveAvatar

Streaming Real-time Audio-Driven Avatar Generation

LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system optimizations, such as block-wise autoregressive processing and fast sampling strategies, to deliver real-time frame rates (e.g., ~45 FPS on appropriate GPU clusters) while handling non-stop generation without quality degradation. LiveAvatar focuses on delivering not just high-quality visuals but also the responsiveness necessary for immersive conversational experiences, making it suitable for advanced AI agents, virtual assistants, and interactive streaming contexts.

Downloads: 5 This Week

Last Update: 2026-01-30


PersonaPlex

PersonaPlex code

PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.

Downloads: 5 This Week

Last Update: 2026-01-26


Quod Libet

Music player and music library manager for Linux, Windows, and macOS

Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet. Quod Libet is a GTK+-based audio player written in Python, using the Mutagen tagging library. It’s designed around the idea that you know how to organize your music better than we do. It lets you make playlists based on regular expressions (don’t worry, regular searches work too). It lets you display and edit any tags you want in the file, for all the file formats it supports. Unlike some, Quod Libet will scale to libraries with tens of thousands of songs. It also supports most of the features you’d expect from a modern media player.

Downloads: 5 This Week

Last Update: 2025-05-31


Tauon

The music player of today

Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. It offers excellent CUE sheet support, an original smart playlist system, network playback from koel or Airsonic servers, scrobbling to Last.fm, ListenBrainz, and Maloja, and MPRIS2 support for desktop integration. Tauon is a playlist-oriented music player that assumes folders are albums. (You may find the words "album" and "folder" used interchangeably within the interface and documentation.) While some features and interface elements may not be immediately intuitive, I hope that once you are familiar, you will find that Tauon is pleasant, fast, and fun to use.

Downloads: 5 This Week

Last Update: 2026-01-19


Internet DJ Console

A feature packed DJ console and internet radio client for Linux users

Conceived as an internet radio Shoutcast/Icecast client and DJ console, IDJC has two main media players, a background track player, effects buttons, a crossfader, WebM, AAC, Ogg, and MP3 streaming, stream automation timers, aux input, and voice and VoIP integration. Supported media file formats include mp3, ogg, flac, wma, wav, and m4a, along with m3u, xspf, and pls playlists and cue sheets. It also offers IRC track and station announcements and uses the JACK Audio Connection Kit to provide a flexible audio chain. This list of features is by no means complete. More info at the project homepage.

32 Reviews

Downloads: 18 This Week

Last Update: 2026-01-10


iPod shuffle database builder

This program, called "rebuild_db", allows the iPod shuffle to be used without iTunes. It rebuilds the iPod shuffle's databases based on the actual content of the iPod's file system, without restrictions regarding the directory structure.

4 Reviews

Downloads: 24 This Week

Last Update: 2013-04-16

