Knowledge

Data labeling: a call for ethical regulation of AI in Europe

Written by

Aïcha

Published on

2023-06-23

Reading time

This is some text inside of a div block.

min

📘 CONTENTS

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

💡 Artificial intelligence (AI) is revolutionizing our world, but to ensure its ethical and responsible use, proper regulation is essential.

‍

In this article, we wanted to address the importance of the data labeling process in the construction of AI products, data annotation, crowdsourcing and ethical labeling. We call on the European Union (EU) to adopt theEU AI Act while highlighting the shortcomings of the current text with regard to the AI Supply Chain and data management.

‍

Data labeling for AI

‍

Data labeling is a key step in the development of AI. It involves assigning tags or labels todatasets, enabling machine learning algorithms to understand and interpret the information. However, it is imperative to ensure that this labeling is carried out with care, precision and under ethical conditions, in order to avoid bias and prejudice.

‍

Ethical data annotation

‍

Data annotation requires human expertise. It involves adding additional information to the data, i.e. a semantic layer associated with images, videos, etc. videos or text, such as metadata or detailed descriptions. In the context of AI, it is essential that data annotation is carried out ethically. This means that annotators (or Data Labelers) must follow strict guidelines to ensure the integrity and objectivity of annotated data, avoiding stereotyping, discrimination and value judgments. They must also work in good conditions (decent working hours, stability, career prospects) and be accompanied (training and support) to produce quality data.

‍

The importance of crowdsourcing in labeling processes

‍

Crowdsourcing is an effective method for labeling and annotating large quantities of data. By calling on a community of contributors, it is possible to obtain fast, accurate results. However, it is crucial to put in place rigorous quality control mechanisms to guarantee the reliability of the data produced by crowdsourcing . It's also important to remember that this is not the only way to label large quantities of data: it's often more effective to call on a panel of functional specialists to annotate data, and to accept their gradual increase in competence, rather than demanding an immediate maximum level of quality (which is often the case in crowdsourcing-based labeling processes). Data Labeling is an important job, and the people willing to invest themselves in it, the Data Labelers, must be treated with dignity and regarded as AI specialists in the same way as a Data Scientist.

‍

Ethical labelling: a fundamental requirement

‍

Ethical labeling is a fundamental aspect of responsible AI. It aims to ensure that the data used to train AI models is collected, labeled and annotated in an ethical and human-friendly way. Transparency and fairness are key principles of ethical labeling, helping to avoid bias and discrimination in automated decision-making.

‍

Weaknesses of the EU AI Act: Supply Chain AI and data management

‍

Despite the progress made by the EU AI Act project, it still has certain weaknesses with regard to the AI Supply Chain and data management. It is essential to put in place clear measures to ensure transparency and ethics throughout the lifecycle of AI systems, from data collection to use. Accountability and control mechanisms must be put in place to ensure proper data management and avoid abuse.

‍

Data Labeling for ethical AI: conclusion

‍

It is imperative that the European Union adopts sound ethical regulations to frame the development and use of AI. Regulation is necessary and must not hold back innovation. Data labeling and ethical sourcing are key to ensuring responsible AI and an AI data supply chain that respects human life and fundamental rights. However, it is also important to take into account the current weaknesses of the EU AI Act in terms of AI Supply Chain and data management, in order to strengthen the protection of these rights.

You might like:

Top 10 image annotation platforms for AI / Computer Vision projects [2025]

Data annotation for Machine Learning, our complete guide

Data or entity annotation is essential for transforming raw information into resources that can be exploited by ML models.

Data Labeling is a profession, not a menial job

The end of crowdsourcing: data annotation is a profession in its own right, not a casual job!