File:Generative AI Audio with Audiocraft 3 – Audiogen with Gradio.webm

From Wikimedia Commons, the free media repository

Original file (WebM audio/video file, VP9/Opus, length 10 min 31 s, 3,360 × 2,100 pixels, 1.23 Mbps overall, file size: 92.6 MB)


Summary

Description
English: This tutorial series covers generating music and audio from a simple text description of the sound, using Facebook's Audiocraft library.

In the last video, we set up a web form that submits descriptions to a Python backend and returns the generated audio file for download.

In this video we'll replace our complicated frontend and backend with a few very simple UI components and functions using Gradio, the interface library commonly seen on Hugging Face Spaces (https://huggingface.co/spaces/facebook/MusicGen).
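As a rough illustration of the form, submit button, and audio player described above, here is a minimal sketch of a Gradio Blocks app. It is not the tutorial's code (see the links below for that). The names `generate_audio` and `build_demo` are hypothetical, and the generation step is a placeholder that writes a sine tone to a WAV file instead of calling AudioGen, so the example runs without downloading a model. The 16 kHz sample rate is likewise an assumption made for this placeholder.

```python
import math
import struct
import tempfile
import wave

SAMPLE_RATE = 16000  # assumed rate for this placeholder, in Hz


def generate_audio(description: str, duration: float = 2.0) -> str:
    """Placeholder for the model call: write a sine tone and return the WAV path.

    In a real app this is where the text description would be passed to
    the audio model; here the description only influences the pitch.
    """
    # Hypothetical mapping so that different prompts give different tones.
    frequency = 220.0 + 10.0 * (len(description) % 40)
    n_samples = int(SAMPLE_RATE * duration)
    frames = b"".join(
        struct.pack(
            "<h",
            int(0.3 * 32767 * math.sin(2 * math.pi * frequency * i / SAMPLE_RATE)),
        )
        for i in range(n_samples)
    )
    # Reserve a temporary file name, then write 16-bit mono PCM to it.
    with tempfile.NamedTemporaryFile(suffix=".wav", delete=False) as tmp:
        path = tmp.name
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(frames)
    return path


def build_demo():
    """Assemble the UI: a text box, a button, and an audio player."""
    # Imported here so generate_audio can be used without Gradio installed.
    import gradio as gr

    with gr.Blocks() as demo:
        gr.Markdown("# Text-to-audio demo")
        prompt = gr.Textbox(label="Describe the sound")
        submit = gr.Button("Generate")
        player = gr.Audio(label="Generated audio")
        # On click, pass the text box value to generate_audio and
        # load the returned file path into the audio player.
        submit.click(fn=generate_audio, inputs=prompt, outputs=player)
    return demo
```

With Gradio installed (`pip install gradio`), calling `build_demo().launch()` starts a local web server and prints its URL. Swapping the placeholder body of `generate_audio` for a real model call should leave the UI code unchanged, as long as the function still returns a path to an audio file.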

Starting code: https://github.com/heaversm/audiocraft-tutorials/tree/tutorials/03-audiogen-flask

Final code: https://github.com/heaversm/audiocraft-tutorials/tree/tutorials/04-audiogen-gradio

About Me: I am a Staff Design Technologist on Mozilla's Innovation Team. All opinions and explorations are my own. Learn more about Mozilla Innovation at future.mozilla.org

0:00 - review
0:45 - gradio interfaces on hugging face
1:30 - remove all the complicated frontend and backend code
2:00 - installing gradio
3:25 - creating a simple gradio frontend
4:45 - understanding gradio blocks and components
5:40 - running the gradio app
6:20 - setting up our form, submission, and audio player with gradio
7:30 - modifying our generate audio function to work with gradio
8:35 - testing our final app
9:20 - wrapping up and next steps
Date
Source YouTube: Generative AI Audio with Audiocraft 3: Audiogen with Gradio
Author Practical AI through Prototypes

Licensing

This video, screenshot or audio excerpt was originally uploaded on YouTube under a CC license.
Their website states: "YouTube allows users to mark their videos with a Creative Commons CC BY license."
To the uploader: You must provide a link (URL) to the original file and the authorship information if available.
This file is licensed under the Creative Commons Attribution 3.0 Unported license.
Attribution: Practical AI through Prototypes
You are free:
  • to share – to copy, distribute and transmit the work
  • to remix – to adapt the work
Under the following conditions:
  • attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
This file, which was originally posted to an external website, has not yet been reviewed by an administrator or reviewer to confirm that the above license is valid. See Category:License review needed for further instructions.

File history


Date/Time: 22:18, 11 September 2024 (current)
Dimensions: 10 min 31 s, 3,360 × 2,100 (92.6 MB)
User: Prototyperspective
Comment: Imported media from https://www.youtube.com/watch?v=V6AE_itHWFA


Transcode status

Format Bitrate Status Encode time
VP9 1080P 563 kbps Completed 22:39, 11 September 2024 21 min 29 s
VP9 720P 329 kbps Completed 22:28, 11 September 2024 10 min 49 s
VP9 480P 210 kbps Completed 22:24, 11 September 2024 5 min 39 s
VP9 360P 163 kbps Completed 22:22, 11 September 2024 4 min 43 s
VP9 240P 134 kbps Completed 22:22, 11 September 2024 4 min 51 s
WebM 360P 455 kbps Completed 22:24, 11 September 2024 5 min 53 s
Streaming 144p (MJPEG) 952 kbps Completed 22:19, 11 September 2024 54 s
Stereo (Opus) 95 kbps Completed 22:23, 11 September 2024 14 s
Stereo (MP3) 128 kbps Completed 22:23, 11 September 2024 19 s

Metadata