How to Make Language Technologies More Inclusive
NEWS |

How to Make Language Technologies More Inclusive

DIRK HOVY SUGGESTS A FAIRER WAY FOR AUTOMATIC TRANSLATION SYSTEMS TO DEAL WITH MODERN PRONOUNS AND ADVOCATES TECHNOLOGIES THAT ADAPT TO THE USERS, INSTEAD OF THE OPPOSITE

Language technologies like automatic translation or voice-activated commands have made giant leaps in the last ten years, from laughingstock for kids to working tools for everyone. “Now, they should take some additional steps towards inclusivity,” said Dirk Hovy, Associate Professor at Bocconi Department of Computing Sciences, anticipating his speech at the 27 June workshop Fairness in AI.
 
One area where automatic translation struggles is the use of modern pronouns. People with a female name and a male identity, for example, might choose to use feminine pronouns for themselves; non-binary people are frequently featured as they/them in English; the mid central vowel schwa (ə) is sometimes used in place of the gendered suffixes -o/i and -a/e in Italian. When translating into other languages, issues of intelligibility, grammatical correctness, and true representation of individuals arise.
 
Professor Hovy is studying the problem as a part of his ERC-funded INTEGRATOR project with Research Fellows Anne Lauscher and Debora Nozza. “Currently, automatic translation systems choose what they think to be the best translation, a sort of default choice, regardless of intelligibility, grammatical correctness, or representativeness” he says. “In theory, other choices could be possible: not translating – even though that sounds extreme – or, better, letting the user choose. What we say is that an automatic translation system could highlight the positions where pronouns do not fit its training and experience, and ask readers to make their own choice.”
 
Link to related stories. Image: rainbow colors. Story headline: Pride: STEM Disciplines Fight Algorithmic Bias Link to related stories. Image: CPU processor. Story headline: When Machines Learn Prejudices Link to related stories. Image: a hooded person and symbols recalling cyber bullying. Story headline: Machines Get It Wrong: How to Avoid that Woman and Gay Are Mistaken as Bad Words Link to related stories. Image: a gavel on a computer. Story headline: How to Protect User Rights Against an Algorithm

To train a machine translation system, you feed it with large amounts of texts and let it learn by experience. If you feed it documents mostly written by middle-aged males, the translations will also sound as if men in that age bracket wrote them, as Hovy and co-authors showed in a paper. “These systems are now commonly used to translate a wide range of documents,” said Hovy, “thus normalizing and reinforcing stereotypes. The risk is that, in the long run, only a middle-aged, straight male’s way of speaking would seem legitimate.”
 
The same is true for voice-activated commands that tend to work well for adult males, but not for women, kids, or people with strong local accents. “They are often trained with low frequency voices and struggle to adapt to higher pitches,” Professor Hovy says. “Kids and women must lower their tone if they want to be understood. It’s a clear example of a technology that requires users to adapt to the system, instead of the system adapting to the user. But this setup is preventing users from expressing themselves through their way of speaking.”

by Fabio Todesco
Bocconi Knowledge newsletter

People

  • Kapacinskaite Nominated Among Top 5 for Two Dissertation Awards at AOM

    The Academy of Management leads the discussion on the world's most prominent organizational and management issues  

  • Catherine De Vries in the 50 Influential Researchers List by Apolitical Foundation

    A list of scholars from around the world whose research could help cultivate reflective, representative, and informed politicians  

Seminars

  August 2022  
Mon Tue Wed Thu Fri Sat Sun
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31        

Seminars

  • ELLIS@Milan Artificial Intelligence workshop

    GABOR LUGOSI - Department of Economics, Pompeu Fabra University
    RICARDO BAEZA-YATES - Khoury College of Computer Sciences Northeastern University
    NOAM NISAN - School of Computer Science and Engineering, Hebrew University of Jerusalem
    MICHAL VALKO - Institut national de recherche en sciences et technologies du numérique

    AS02 DEUTSCHE BANK - Roentgen building

  • tbd

    ANDREW KING - Questrom School of Business

    Meeting room 4E4SR03 (Roentgen) 4