Sony Group's research team has developed a tool that can identify the existing songs used to train or influence an AI-generated track. The tool can also estimate how much each original song contributed to the output; for instance, it might report that a track is “30% X and 10% Y”. The goal is to give rights-holders an understanding of how their music was used and to help them secure fair compensation.
The tool can work in two ways. When AI developers cooperate, it can analyze the training data or the base models directly to gather evidence. When they don't cooperate, it can still work by analyzing the output and comparing it to known songs.
Here are the main points from the reports about Sony's tool:
1. The tool can identify the existing songs used to train or influence an AI-generated track even when multiple originals were used.
2. It can estimate the percentage contribution of each work to the AI-generated track.
3. Sony thinks this tool could be used to create a licensing or royalty system for works that contributed to AI outputs.
The music industry is interested in this tool because generative music models are trained on collections of existing recordings and scores. Rights-holders have been complaining that many music-generation systems reuse copyrighted material without permission. A tool that can identify which songs were used and how much they contributed could change the dynamics.
There are a few ways Sony's tool could work:
* Neural fingerprinting: This involves computing a fingerprint for each known song and for the AI-generated track and then matching patterns.
* Model-interrogation: This involves analyzing the base model or training logs to see which songs were used and how much they contributed.
* Spectral or musical feature decomposition: This involves breaking down the AI-generated track into its component parts and trying to figure out which known songs could have produced those parts.
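To make the fingerprinting idea concrete, here is a minimal sketch in Python. It reduces a waveform to a coarse spectral-energy vector (a stand-in for whatever learned embedding Sony's system actually uses; the function names and band count here are illustrative, not drawn from the reports) and compares tracks by cosine similarity.

```python
import numpy as np

def fingerprint(audio: np.ndarray, n_bands: int = 32) -> np.ndarray:
    """Reduce a waveform to a coarse spectral-energy fingerprint.

    This is a toy stand-in for a learned neural embedding: it takes
    the magnitude spectrum, averages it over n_bands frequency bands,
    and L2-normalizes the result.
    """
    spectrum = np.abs(np.fft.rfft(audio))
    bands = np.array_split(spectrum, n_bands)
    energies = np.array([band.mean() for band in bands])
    return energies / (np.linalg.norm(energies) + 1e-9)

def similarity(fp_a: np.ndarray, fp_b: np.ndarray) -> float:
    """Cosine similarity between two fingerprints (1.0 = identical)."""
    return float(fp_a @ fp_b)
```

In practice a rights-holder would precompute fingerprints for an entire catalog, fingerprint the AI-generated track, and rank catalog entries by similarity. A real system would use a learned, transformation-robust embedding rather than raw band energies, which is exactly where the brittleness mentioned below comes from.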
Each of these approaches has strengths and weaknesses. Neural fingerprinting, for example, is fast, but it can be brittle when the AI output transforms the content heavily. Model interrogation can be more definitive, but it requires cooperation from the AI developer.
When Sony's tool estimates the percentage contribution of each work, it's not always clear what that means. It might mean that the AI-generated track shares a percentage of its learned features with the original song, or that it sounds a certain percentage similar to it.
There are also limitations and risks to consider. If the AI developer doesn't cooperate, the tool has to rely on output analysis, which is less reliable. And if the AI-generated track is a statistical blend of many songs, it can be hard to determine which ones were used and how much each contributed.
The music industry is already moving to develop detection technology like Sony's tool. SoundPatrol, for instance, has been working with Universal and Sony Music on fingerprinting systems.
If a trustworthy detector like Sony's tool exists, it could enable new business models. Platforms could, for example, pay into a fund that's split according to detected contributions, or use the tool to clear generated tracks against a rights registry.
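As a toy illustration of the fund model, assuming per-track contribution shares have already been produced by a detector, a pro-rata payout in whole cents might look like this (the function name and interface are hypothetical):

```python
def split_fund(fund_cents: int, shares: dict[str, float]) -> dict[str, int]:
    """Split a royalty pool (in cents) pro rata by detected shares.

    Uses largest-remainder rounding so the payouts always sum
    exactly to the fund, with no cents lost to truncation.
    """
    total = sum(shares.values())
    raw = {k: fund_cents * v / total for k, v in shares.items()}
    payouts = {k: int(r) for k, r in raw.items()}
    leftover = fund_cents - sum(payouts.values())
    # hand the remaining cents to the largest fractional remainders
    for k in sorted(raw, key=lambda k: raw[k] - payouts[k],
                    reverse=True)[:leftover]:
        payouts[k] += 1
    return payouts
```

For the “30% X and 10% Y” example, a $10.00 pool attributed only to those two works would split 750/250 cents. The hard part, of course, is not the arithmetic but agreeing on the shares themselves.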
Overall, Sony's tool has the potential to change how the music industry handles AI-generated music: it could identify which songs were used and how much they contributed, and give rights-holders a path to compensation.
Paid training access is important because rights-holders can sell their training datasets to AI developers, with detection used to verify that the terms are being followed.

For finding and removing infringing AI tracks, rights-holders can use the tool to request takedowns or seek compensation.
All of these steps depend on the detector being credible, on cross-vendor standards, and on backing in law.
9) Why independent validation is essential
Technology that has financial and legal consequences needs to be checked by neutral third parties. Benchmarks, challenge datasets, and competitions such as an “AI music attribution challenge” would help the community measure how well the tool works, including its false-positive and false-negative rates and its robustness to transformations.
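A benchmark harness would score the detector per track against ground-truth attributions. A minimal sketch of such scoring (the metric names and set-based interface are assumptions, not part of any published challenge):

```python
def attribution_metrics(predicted: set[str], actual: set[str]) -> dict[str, float]:
    """Score one track's attribution against ground truth.

    False positives are songs the detector flagged that were not
    actually used; false negatives are songs it missed.
    """
    tp = len(predicted & actual)
    fp = len(predicted - actual)
    fn = len(actual - predicted)
    return {
        "precision": tp / (tp + fp) if predicted else 0.0,
        "recall": tp / (tp + fn) if actual else 0.0,
        "false_positives": float(fp),
        "false_negatives": float(fn),
    }
```

A real challenge would aggregate these scores over thousands of tracks and, crucially, over adversarial transformations (pitch shifts, tempo changes, re-synthesis) to probe the robustness mentioned above.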
Sony's research may be sound, but until the company shares its methods and code, or standard benchmarks are published and evaluated, the tool should be seen as promising research rather than a finished product.
10) A realistic timeline for rollout
First, the research prototype is published with details of the method, controlled experiments, and error rates. This is where Sony is now.
Next, pilot integrations happen with partners who agree to provide access to their training models for attribution.
Then an industry consortium forms and standards emerge, with labels, tech companies, and standards bodies agreeing on formats and minimum requirements for auditing.
Only after independent audits and standardization will courts or government agencies rely on it. The path to real use is therefore multi-year and requires work on social, legal, and technical fronts.
11) Things to consider ethically
There's a balance between transparency and trade secrets: AI companies may resist sharing their training data for intellectual-property and competitive reasons.
Overbroad attribution could discourage sampling and reinterpretation that is culturally valuable, so clear safe harbors for legitimate remixing and parody must be preserved.
Detection systems trained on Western music catalogs may not work well for non-Western music.
12) The bottom line is cautious optimism
A tool that can reliably trace original music in AI outputs would be a big deal, giving rights-holders a technical advantage. But the details of how it works matter, cooperation from developers matters, and independent validation matters. The world needs benchmarks and third-party audits before any attribution system is treated as authoritative.






