Transparency for whom? Designing data documentation with data workers
The lack of transparency in datasets poses a significant challenge to creating inclusive and intelligible machine learning (ML) systems. Various AI ethics initiatives have addressed this issue by proposing standardized dataset documentation frameworks grounded in the value of transparency. In this talk, I propose a shift of perspective: from documenting for transparency to documenting for reflexivity. Drawing on a long-term project with outsourced data workers in Argentina, Bulgaria, and Syria, I argue for the need to design documentation starting from the needs and experiences of the workers who collect, sort, and label the data that trains ML models. This requires considering the historical inequalities, working conditions, and epistemological standpoints that shape both data work and datasets.