Lazy coders are training artificial intelligences to be sexist

Technology

Not Like Us is Aviva Rutkin鈥檚 monthly column exploring the minds of intelligent machines 鈥� and how we live with them

5 December 2016

A woman flits between three tables of vintage items — Vintage sexism can show up in modern algorithms
Martyn Goddard/REX/Shutterstock

Employers: do the ladies on your payroll have any 鈥渇emale weaknesses鈥� that would make them mentally or physically unfit for the job?

The question comes to you courtesy of the year 1943. It was posed in a , written for the flummoxed male supervisors at Transportation Magazine tasked with integrating a new female workforce during a wartime shortage of manpower.

Back then, you wouldn鈥檛 be surprised to see logical reasoning like 鈥淢en are to programmers as women are to homemakers鈥�. Or 鈥淢en are to surgeons what women are to nurses鈥�. Or 鈥淢en are bosses. Women are receptionists鈥�.

But to have these associations littering software in 2016? That鈥檚 exactly what at Microsoft Research and his colleagues found at the Fairness, Accountability, and Transparency in Machine Learning workshop in New York City. The group let a data mining algorithm loose on Google news articles, where it examined the word associations it found there. When they scoured the associations it had come up with, they discovered a trove of familiar stereotypes coded into occupations, with some weighed heavily as either masculine or feminine. The sexism was straight out of the 1943 playbook, with jobs ranked as male including philosopher, captain, warrior and boss. The top jobs on the 鈥渟he鈥� end of the spectrum? Homemaker, nurse and receptionist.

It鈥檚 tempting to throw your hands up and blame sexism in the tech industry, but the story is more subtle than that. The problem is partly down to the way computers learn our language 鈥� and all the inadvertent sexist, racist and otherwise unsavory predispositions it carries.

Picasso = painter

When we humans hear a word like 鈥渞ose鈥�, it might elicit a rush of related memories and associations: romance, the color red, Shakespeare鈥檚 famous line.

But for a machine, there aren鈥檛 many clues about meaning in the arrangement of a handful of letters. So, to help computers form associations, programmers often turn to a popular technique called 鈥渨ord-embedding鈥�. The computer crunches through a pile of text, mapping words as 鈥渧ectors鈥� that demonstrate their relationships to each other.

Through these maps, machines can learn the subtle linguistic links that come intuitively to humans. For example, a king and a queen pair together 鈥� they鈥檙e both royalty 鈥� but one is male and the other female. It鈥檚 similar for uncle and aunt. Einstein was the scientist and Picasso the painter. Beijing is probably the capital of China, not Germany. Pile all these relationships together, and you鈥檝e got some semblance of meaning.

Inevitably, less agreeable associations are also hiding in those calculations, and these are what Kalai has been hunting. He thinks it鈥檚 valuable to find these flaws, because technology has so much power to amplify our stereotypes. Imagine, for example, that you鈥檙e doing a web search for 鈥淐MU computer science phD student鈥�. The search engine wants to give you the most relevant results, so perhaps it decides to show you links to male students first, sidelining women to the second page. In an unfortunate loop, this also makes women look even less likely to be programmers, reinforcing the bias.

This mechanism can launder a multitude of sins. In of word-embeddings, and his colleagues at Princeton University looked at how closely words were associated with pleasant terms (鈥渓ove鈥�, 鈥減eace鈥�, 鈥渉appy鈥�) and unpleasant ones (鈥渄eath鈥�, 鈥渄isaster鈥�, 鈥渧omit鈥�). Flowers, for example, mapped more closely to pleasant words, while insects related more closely to the unpleasant. Musical instruments ranked as more pleasant than weapons.

Again, worrisome relationships surfaced. Female names were more closely associated with home and the arts, while male names dovetailed with career and mathematics. It wasn鈥檛 just sexism: European-American names (Adam, Stephanie, Greg) ranked as more 鈥減leasant鈥� than African-American names (Darnell, Yolanda). 鈥淲e have found every linguistic bias we looked for,鈥� they write.

Lazy bias

What should we make of findings like these? Well, in short, even the most unbiased algorithm is going to flag up the biases of a slanted culture. If you don鈥檛 take steps to remove it, you should assume prejudice is well-represented in all your software. Worse: if you don鈥檛 get rid of it, the glossy appearance of impartiality conferred by search engines and algorithms could actually amplify our subtle biases, Kalai says.

It鈥檚 a lesson we keep having to re-learn. We learned it when it came out that Google serves higher-paying job ads to men, or that Uber drivers cancel more often on riders of colour, or that artificially intelligent hiring programs or sentencing programs might carry historical baggage.

So how do we get rid of it? It鈥檚 less about stamping out prejudice than about not being lazy, says , a web developer in San Francisco. His imaginary culprits are tech bros 鈥淐had and Brad鈥� 鈥� 鈥渕ental shorthand for developers who are just trying to crush out some code on deadline, and don鈥檛 think about the wider consequences of their actions,鈥� explained Ceg艂owski at a talk this month at the Direction16 conference in Sydney, Australia. They don鈥檛 mean to algorithmically punish you for being female or having an ethnic name or living in a low-income neighborhood. They were just hustling to push a product out. 鈥淭he tech industry slaps this stuff together in the expectation that the social implications will take care of themselves.鈥�

Perhaps we could make algorithms that can strip these mistakes back out of software. Kalai鈥檚 group has come up with tools to tweak the word maps without losing much of the original meaning. For example, certain words could be reset to gender-neutral. Other words, like 鈥済randma鈥� and 鈥済randpa鈥�, could be 鈥渆qualised鈥�, making them more similar in meaning without losing the gender essential to their definition.

Their group is hopeful, and maybe we can be too. After all, we know to laugh ruefully when we see the language of 1943. Maybe we can teach our machines the same trick.

Topics: Artificial intelligence / Language / Software

麻豆传媒

Technology