Skip to content
Help Build a Better ComfyUI Knowledge Base Become a Patron
NewsAlibaba Open Sources InspireMusic: An Innovative Framework for Music, Song and Audio Generation

Alibaba Open Sources InspireMusic: An Innovative Framework for Music, Song and Audio Generation

Alibaba’s research team has officially open-sourced the InspireMusic project, an innovative unified framework aimed at breaking boundaries in music, song, and audio generation. InspireMusic combines advanced AI technology to bring new possibilities for music creation, generation, and experience.

InspireMusic Project Overview

InspireMusic is a multi-functional platform capable of efficiently generating music and songs while supporting various audio synthesis tasks. Its core is based on the FunAudioLLM framework, which has been widely applied in speech understanding and generation. InspireMusic further extends this technological advantage to music generation.

Key Features

  • Unified Framework: InspireMusic builds a unified generation framework with advanced AI technology at its core, supporting multiple music generation tasks.
  • Deep Learning Models: Utilizing the latest deep learning models to generate high-quality, creatively unlimited musical works.
  • Diverse Application Scenarios:
    • Automatic music composition
    • Personalized background music generation
    • Film and game soundtrack design
    • Intelligent song generation service

Open Source Information

InspireMusic is now fully open-sourced on GitHub, providing rich tools and flexible interfaces for developers, musicians, and AI researchers. The open-source nature allows developers to explore its underlying technology and contribute code to drive continuous project improvement.

Future Prospects

The Alibaba research team states that InspireMusic will continuously optimize framework performance and introduce more innovative features through collaboration with global developers and music creators. In the future, the platform will provide richer support for music creation, lower creation barriers, and aid digital innovation in the music industry.

Technical Highlights

  • Unified Audio Generation Framework: Supporting music, song, and audio generation, providing diverse generation possibilities.
  • Flexible Controllable Output: Generate music with precise style and structure through text prompts and musical feature descriptions.
  • User-Friendly: Offering simplified model fine-tuning and inference tools for efficient training and improvement.

InspireMusic Models and Resource Downloads

InspireMusic offers various pre-trained models supporting 24kHz and 48kHz audio generation. Here are some key model links:

Model NameModel LinkNotes
InspireMusic-Base-24kHzModelScope24kHz mono, 30-second music generation
InspireMusic-1.5B-LongHuggingFace48kHz, supports over 5-minute long music generation
WavTokenizer (75Hz)ModelScopeUltra-low bitrate audio encoder for 24kHz audio

Community and Discussion

You can join the InspireMusic community through the following links: