Google Gemini: Lyria 3 Now Creates Music from Text & Images
Google is expanding the creative capabilities of its Gemini application with the integration of Lyria 3, a new music generation model developed by its DeepMind research division. The rollout of this feature is currently underway for users who are of age and will be available in all languages supported by the app.
New Musical Horizons for Gemini Users
Following the introduction of tools for creating images and videos, Google is now extending Gemini’s functionality to include the production of short musical pieces. These tracks are generated from text descriptions or content uploaded by the user, such as images and documents.
How Lyria 3 Works
Lyria 3 allows users to generate tracks approximately 30 seconds in length by describing a specific genre, mood, or theme. For example, a user could request a relaxing jazz piece with piano and a slow tempo, or an instrumental backing track with particular rhythmic characteristics. The system also allows users to upload files – photos, documents, or presentations – and use their content as inspiration for musical composition.
Users can even transform moments, topics, or phrases into shareable tracks. As an example, a prompt could be: “Create a 1990s skate punk track to tell my roommate Ryan to do the dishes, with a lot of energy and fast drums.”
Improvements with Lyria 3
Google highlights three key improvements in Lyria 3 compared to previous versions: automatic lyric generation based on prompts, increased control over elements like style, voice, and rhythm, and a more articulate and realistic musical output.
Expanding to YouTube
Lyria 3 will also be progressively integrated into Dream Track on YouTube, available to all creators. This function is designed for creating short, customized tracks for Shorts content.
AI Content Identification and Copyright Protection
Music produced through the app includes SynthID, a digital watermark developed by Google to identify content generated by artificial intelligence systems. Google has also extended its verification tools to audio, allowing users to check files for the presence of the watermark or other signals indicating AI-generated content from Google.
Google states that Lyria 3 was developed in collaboration with the music community, with consideration given to copyright issues and agreements with partners during the model’s training. When a prompt references a specific artist, the system is designed to interpret it as a general stylistic indication, avoiding direct imitation. Filters are also in place to limit the production of content that could potentially infringe on existing works. Google provides a reporting tool for potential copyright violations, acknowledging that its approach may not be foolproof.
To use these products, Google requires adherence to its Terms of Service and Policies for Generative AI use, which aim to prevent intellectual property violations.
Frequently Asked Questions
What length are the musical tracks generated by Lyria 3?
The tracks generated by Lyria 3 are approximately 30 seconds in length.
Can I use my own files to inspire the music creation?
Yes, the system allows users to upload files like photos, documents, or presentations and use their content as a basis for musical composition.
How does Google address copyright concerns with Lyria 3?
Google developed Lyria 3 in collaboration with the music community and has implemented filters to limit the production of potentially infringing content, as well as a reporting tool for copyright violations.
As AI-powered music creation becomes more accessible, how do you envision these tools impacting the future of music and artistic expression?