Saturday, January 11, 2025

Ethical Ai - call to create synthetic datasets

Ethical Ai - a call to create original "synthetic" datasets.

With regards to music prediction, what is ethical Ai?

The issue lies in the datasets utilized. The ethical considerations surrounding the technology underpinning large language models (LLMs) are often overstated. LLMs derive their predictive capabilities related to syntax, semantics, and ontologies embedded within human-generated corpora; however, inheriting inaccuracies and biases present in the training data. The nature of the learning data within a given dataset ultimately determines whether the outputs produced by the LLM are deemed ethical. If a organization develops an AI system and generates modified versions of existing musical works for personal enjoyment, this practice is generally regarded as ethical. Conversely, if the objective is to distribute or profit from these alternative renditions of established music, such actions would be considered unethical.

A studio might develop two moral AI systems. First, a dataset of all the music the artist has produced over the years. Based on earlier compositions, this technique could be used to forecast new music ideas or compositions. An LLM system that uses a dataset made out of created input data is the endeavor. I'm going to demonstrate a system that can be applied to the quest for new music. As AI prediction advances and the original dataset and predictions are re-incorporated into the dataset, the second idea ultimately becomes the first.

There are many ways to generate MIDI input data for AI datasets, including sequencers, drum machines, MAX, Pure Data, Audiomulch, noise, voltages, and scientific data. The potential for discovering new music genres is limitless when exploring the predicted MIDI outputs derived from these original sources. It is clear that large language models (LLMs) can play a significant role in music exploration and can facilitate the creation of new compositions from original datasets. The music industry would benefit from more artists developing their own AI models and utilizing unique datasets to advance this emerging genre. I employ Pure Data to generate my MIDI input, focusing on structuring algorithms that produce MIDI data sculpted to a genre type and in a given musical key(A#b). I advocate the importance of artists building their own datasets and moving away from a reliance on historical and commercial data to produce music.

Technologies establish cyclical procedures for our adherence. Engaging in the iterative refinement of a process is essential to achieving the desired outcome. Is there work to be accomplished? Indeed, determining the specific type of input data required for your dataset and devising methods for generating that data is a time-intensive endeavor. If the objective is to create something original, like band practice, it becomes a labor of love.

dataset procedures that are suggested online

No comments:

Post a Comment