Showing 71 open source projects for "audio samples"

View related business solutions
  • Find Hidden Risks in Windows Task Scheduler Icon
    Find Hidden Risks in Windows Task Scheduler

    Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

    Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.
    Download Free Tool
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Step-Audio-EditX

    Step-Audio-EditX

    LLM-based Reinforcement Learning audio edit model

    Step-Audio-EditX is an open-source, 3 billion-parameter audio model from StepFun AI designed to make expressive and precise editing of speech and audio as easy as text editing. Rather than treating audio editing as low-level waveform manipulation, this model converts speech into a sequence of discrete “audio tokens” (via a dual-codebook tokenizer) — combining a linguistic token stream and a semantic (prosody/emotion/style) token stream — thereby abstracting audio editing into high-level token operations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Giada

    Giada

    Your Hardcore Loop Machine.

    ...Process samples or audio/MIDI input signals with VST instruments from your plug-ins collection. Control other software or synchronize physical MIDI devices by using Giada as a MIDI master sequencer.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Sonic Pi

    Sonic Pi

    Sonic Pi is your free code-based music creation and performance tool

    Sonic Pi is a new kind of musical instrument. Instead of strumming strings or whacking things with sticks - you write code, live. Sonic Pi is a complete open source programming environment originally designed to explore and teach programming concepts within schools through the process of creating new sounds. In addition to being an engaging education resource it has evolved into an extremely powerful and performance-ready live coding instrument suitable for professional artists and DJs....
    Downloads: 30 This Week
    Last Update:
    See Project
  • 4
    OuteTTS

    OuteTTS

    Interface for OuteTTS models

    OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines. The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Atera - The depth of a full-stack IT platform, with the power of AI. Icon
    Atera - The depth of a full-stack IT platform, with the power of AI.

    Atera introduces your autonomous AI agent - Ensure operational efficiency at any scale with 24/7 autonomous IT support.

    Atera prioritizes security and compliance through robust protections that align with industry standards. Our AI-driven features were built on responsible AI principles and empower IT teams to work efficiently while maintaining trust and compliance.
    Learn More
  • 5
    Generative AI

    Generative AI

    Sample code and notebooks for Generative AI on Google Cloud

    Generative AI is a comprehensive collection of code samples, notebooks, and demo applications designed to help developers build generative-AI workflows on the Vertex AI platform. It spans multiple modalities—text, image, audio, search (RAG/grounding) and more—showing how to integrate foundation models like the Gemini family into cloud projects. The README emphasises getting started with prompts, datasets, environments and sample apps, making it ideal for both experimentation and production-ready usage. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    ElevenLabs Python

    ElevenLabs Python

    The official Python SDK for the ElevenLabs API

    elevenlabs-python is the official Python SDK for the ElevenLabs API, giving developers a convenient way to access ElevenLabs’ high-quality, lifelike voices. The library wraps the HTTP API into a typed Python client, so you can perform text-to-speech, streaming, voice cloning, voice management, and agents-related operations with simple method calls. It exposes ElevenLabs’ main models such as Eleven Multilingual v2, Eleven Flash v2.5, and Eleven Turbo v2.5, each targeting different trade-offs...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    VibeVoice ComfyUI

    VibeVoice ComfyUI

    ComfyUI integration for Microsoft's VibeVoice text-to-speech model

    VibeVoice ComfyUI is a comprehensive wrapper that integrates Microsoft’s VibeVoice text-to-speech models directly into ComfyUI workflows. It exposes VibeVoice as a set of custom nodes so you can build single-speaker and multi-speaker voice generation pipelines visually, combining TTS with other audio or generative components. The integration supports high-quality single-speaker synthesis as well as scripted multi-speaker conversations, with optional voice cloning from audio samples for each speaker. It includes advanced control over generation parameters like attention backend, diffusion steps, sampling temperature, guidance scale, and quantization settings, allowing users to tune the trade-offs between quality, VRAM usage, and speed. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    StreamSpeech

    StreamSpeech

    StreamSpeech is a seamless model for offline speech recognition

    StreamSpeech is an “all-in-one” speech model designed to perform offline and simultaneous speech recognition, speech translation, and speech synthesis within a single unified architecture. Developed as part of an ACL 2024 paper, it targets streaming and low-latency scenarios where intermediate results and final translations or synthetic speech must be produced continuously as audio is being received. The model supports eight tasks: offline ASR, speech-to-text translation, speech-to-speech...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Git Large File Storage

    Git Large File Storage

    Git extension for versioning large files

    An open source Git extension for versioning large files. Git Large File Storage (LFS) replaces large files such as audio samples, videos, datasets, and graphics with text pointers inside Git, while storing the file contents on a remote server like GitHub.com or GitHub Enterprise. Download and install the Git command line extension. Once downloaded and installed, set up Git LFS for your user account. In each Git repository where you want to use Git LFS, select the file types you'd like Git LFS to manage (or directly edit your .gitattributes). ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Say goodbye to broken revenue funnels and poor customer experiences Icon
    Say goodbye to broken revenue funnels and poor customer experiences

    Connect and coordinate your data, signals, tools, and people at every step of the customer journey.

    LeanData is a Demand Management solution that supports all go-to-market strategies such as account-based sales development, geo-based territories, and more. LeanData features a visual, intuitive workflow native to Salesforce that enables users to view their entire lead flow in one interface. LeanData allows users to access the drag-and-drop feature to route their leads. LeanData also features an algorithms match that uses multiple fields in Salesforce.
    Learn More
  • 10
    dash.js

    dash.js

    A reference client implementation for the playback of MPEG DASH

    A reference client implementation for the playback of MPEG DASH via JavaScript and compliant browsers. If your intent is to use the player code without contributing back to this project, then use the MASTER branch which holds the approved and stable public releases. If your goal is to improve or extend the code and contribute back to this project, then you should make your changes, and submit a pull request. dash.js is an initiative of the DASH Industry Forum to establish a production...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Orange Juice Audio Enhancer for Linux
    ...Orange Juice called "Tangerine Fix" in past. To see Usage, Please see Project Samples bellow. How to install "Orange Juice" This Audio Enchancer requires pipewire environment on your system. sudo apt install easyeffects lsp-plugins calf-plugins Download tar.gz and unarchive, these 3 files put into /home/{users folder}/.config/easyeffects/output note: MX-23 uses "rubberband" for pitch effect, MX-25 uses "soundtouch" for pitch effect.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    SoundTracker
    SoundTracker is a pattern-oriented music editor (similar to the DOS program 'FastTracker'). Samples are lined up on tracks and patterns which are then arranged to a song. Supported module formats are XM and MOD; the player code is the one from Open
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    AudiooPy

    AudiooPy

    Audio manager in Python Object-Oriented Programming

    AudiooPy stands for "Audio Manager in Python Object-Oriented Programming." This library provides a range of useful operations for sound files and audio fragments. It processes audio at the frame level, working with signed integer samples of 8, 16, or 32 bits, stored in byte-like objects. Key features include: - Reading and writing WAV files using Python's standard library
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    IMYplay

    IMYplay

    Plays iMelody (IMY) files using many sound systems

    IMYplay is a player for iMelody (IMY) ringtones / files (https://en.wikipedia.org/wiki/IMelody). IMYplay can use the following outputs to produce sound: 1. The Allegro game library (http://alleg.sf.net), 2. SDL (Simple DirectMedia Layer; http://www.libsdl.org), 3. ALSA (Advanced Linux Sound Architecture; http://alsa-project.org), 4. OSS (Open Sound System), 5. Libao (http://xiph.org/ao), 6. PortAudiov19 (http://www.portaudio.com), 7. PulseAudio (http://www.pulseaudio.org), 8....
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15

    speech intonator

    The purpose of the project is to develop audio processing algorithms

    The initial version of the main branch of the project has been completed. The main name of the project is "Java audio mixer Summaha". The second name of the project is "Sound Arithmometer". Main purpose - production of musical sound remixes from a set of samples. The name "Summaha" rhymes well with 'Yamaha' and creates motivation and inspiration to achieve a sound quality comparable to with a well-known brand. Detailed documentation in 'read' signature files. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LoopAuditioneer

    LoopAuditioneer

    Software for loop and cue handling in .wav files.

    Since 2024-02-28 development of LoopAuditioneer has moved to GitHub. Please go to https://github.com/GrandOrgue/LoopAuditioneer for recent updates and releases.
    Leader badge
    Downloads: 65 This Week
    Last Update:
    See Project
  • 17

    nbo-plugins

    An entire virtual orchestra in LV2 plugin format

    The No Budget Orchestra Plugins are a collection of virtual (software) musical instruments. comprising a full, classical orchestra There are flute, piccolo, oboe, bassoon, clarinet, violin, viola, cello, contrabass, trumpet, trombone, tuba, french horn, and various percussion sections. The various sections include articulations. For example, the violin section can play legato, pizzicato, staccato, and tremulo. There are subgroups as well, for example Violins I and II. The plugins are in...
    Downloads: 55 This Week
    Last Update:
    See Project
  • 18
    PlayerPro music editor and SoundTracker

    PlayerPro music editor and SoundTracker

    Play SoundTracker media on your computer.

    PlayerPro is a music composer and sound editor ( SoundTracker ) for MOD S3M XM IT UMX . Uses XI, Quicktime or SoundFonts. Currently runs on MacOS and OSX; playback engines also available for Windows and BeOS. Upcoming version 6 will support ALSA (Linux).
    Leader badge
    Downloads: 34 This Week
    Last Update:
    See Project
  • 19
    SVoice (Speech Voice Separation)

    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 20

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Universal Data Tool

    Universal Data Tool

    Collaborate & label any type of data, images, text, or documents etc.

    An open-source tool and library for creating and labeling datasets of images, audio, text, documents and video in an open data format. The Universal Data Tool can be used by anyone on your team, no data or programming skills needed. Simplicity without sacrificing any powerful developer features and integrations. Use the Universal Data Tool directly from a web browser or with a Windows, Mac or Linux desktop application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Vivid3D-Sharp

    Vivid3D-Sharp

    Modern 3D engine and IDE written using C# and C++.

    Modern 3D engine and IDE written using C# and C++. Vivid features all the required functionality to write modern apps ang games. Including a fully working custom GL UI called "ResonanceUI". It features lighting and shadows(Unified) and can even embed and playback videos via FFMPEG, in the UI or full screen, with fully synced audio+picture. It offers two scripting solutions: C# and a custom language called VividScript, which is an OOP style language. The Editor is based on other...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Deepvoice3_pytorch

    Deepvoice3_pytorch

    PyTorch implementation of convolutional neural networks

    An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DC-TTS

    DC-TTS

    TensorFlow Implementation of DC-TTS: yet another text-to-speech model

    DC-TTS is a TensorFlow implementation of the DC-TTS architecture, a fully convolutional text-to-speech system designed to be efficiently trainable while producing natural speech. It follows the “Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention” paper, but the author adapts and extends the design to make it practical for real experiments. The model is split into two networks: Text2Mel, which maps text to mel-spectrograms, and SSRN...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Vital Sign Simulator

    Vital Sign Simulator

    Patient vital sign simulator for medical emergency training purposes

    The vital sign simulator simulates a patient monitor and is intended for use in medical emergency training simulations. In combination with a (cheap) cpr-manikin, it offers a low-cost alternative to commercial high-tech patient simulation manikins. It is used with a dual monitor system, one monitor with controls for the operator and one providing the vital signs to the trainees. Heart rate, oxygen saturation, etCO2, respiratory rate, blood pressure and various moving ecg-samples can be set...
    Leader badge
    Downloads: 31 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next