Hazard idea : AI sourced Data #209

Open
dsmukilan opened this issue Jul 3, 2024 · 2 comments

@dsmukilan

I am suggesting a new category of Data Hazard called "AI Sourced Data". Suggested symbol: Ouroboros

These would be cases in which data scraped from the internet or other sources turns out to be AI-generated. The scraped data is then used to train more AI models, creating a self-reinforcing feedback loop in which each generation of models is trained on worse data and performs worse.
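To make the loop concrete, here is a toy sketch in Python. It is only an illustration under strong assumptions, not a model of any real training pipeline: the "model" is a Gaussian fitted to a small sample, and each generation is trained only on samples drawn from the previous generation's fit. The function name `simulate_recursive_training` and all parameter values are hypothetical.

```python
import random
import statistics


def simulate_recursive_training(generations=1000, sample_size=10, seed=0):
    """Repeatedly fit a Gaussian to data sampled from the previous fit.

    Returns the fitted standard deviation of each generation.
    """
    rng = random.Random(seed)
    # Generation 0 sees "real" data from a standard normal distribution.
    data = [rng.gauss(0.0, 1.0) for _ in range(sample_size)]
    spreads = []
    for _ in range(generations):
        mu = statistics.fmean(data)
        sigma = statistics.stdev(data)
        spreads.append(sigma)
        # The next generation only ever sees the previous model's output.
        data = [rng.gauss(mu, sigma) for _ in range(sample_size)]
    return spreads


if __name__ == "__main__":
    spreads = simulate_recursive_training()
    print("first generation spread:", spreads[0])
    print("last generation spread: ", spreads[-1])
```

In this toy setting the fitted spread tends to shrink over many generations, so later "models" cover less and less of the original distribution. The small sample size is chosen to make the effect visible quickly.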

This can be intentional in some cases, for example "Nightshade", which poisons training data to protect copyrighted work. But in many cases it can result from oversight during training or from deliberate sabotage.

Models trained on AI-sourced data can also further reinforce other data hazards, such as existing bias, privacy issues, and more.
[Image: Serpiente_alquimica]

dsmukilan changed the title from "Hazard Idea : AI sourced Data" to "Hazard idea : AI sourced Data" on Jul 3, 2024
NatalieZelenka self-assigned this on Jul 24, 2024
@NatalieZelenka
Contributor

Hi, thanks for the suggestion!

I really like this idea as it's very specific! I think it might fit under "reinforces existing bias". We are thinking about a more specific categorisation of the hazard labels, where they would be related to each other in a knowledge graph.

For example, something like: AI sourced data ---causes---> Reinforcement of existing biases.
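A relation like this could be stored as subject/relation/object triples. The sketch below is only one possible representation, not the project's actual design: the helper names `build_graph` and `downstream` are hypothetical, and the single triple is the example given above.

```python
from collections import defaultdict


def build_graph(triples):
    """Index (subject, relation, object) triples by subject."""
    graph = defaultdict(list)
    for subject, relation, obj in triples:
        graph[subject].append((relation, obj))
    return graph


def downstream(graph, start):
    """Return every label reachable from `start` by following edges."""
    seen, stack = [], [start]
    while stack:
        node = stack.pop()
        for _relation, target in graph.get(node, []):
            if target not in seen:
                seen.append(target)
                stack.append(target)
    return seen


# Hypothetical data: the one relation suggested in this thread.
TRIPLES = [
    ("AI sourced data", "causes", "Reinforcement of existing biases"),
]
GRAPH = build_graph(TRIPLES)

if __name__ == "__main__":
    print(downstream(GRAPH, "AI sourced data"))
```

With more triples, `downstream` would also pick up indirect effects, for example a hazard that causes another hazard which in turn causes a third.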

I'm on mat leave at the moment (hence the slow response) but I look forward to thinking about how this could fit in and getting some feedback from others!

@dsmukilan
Author

Adding a recent reference: https://doi.org/10.1038/s41586-024-07566-y
