Twitter Moderators Turn to Automation Amid a Reported Surge in Hate Speech – Amidst reports of an increase in hate speech on the social media network, Elon Musk’s Twitter relies heavily on automation to censor content, according to the company’s new head of trust and safety. Ella Irwin told Reuters that Musk, who acquired the firm in October, was keen on increasing the use of automation, noting that Twitter had previously erred by relying on labor-intensive and time-consuming human assessments of harmful content.
“He’s encouraged the team to take more risks, move fast, get the platform safe,” she said. On child safety Irwin said Twitter had shifted toward automatically taking down tweets reported by trusted figures with a track record of accurately flagging harmful posts. Twitter is also more aggressively restricting abuse-prone hashtags and search results in areas including child exploitation, regardless of potential impacts on “benign uses” of those terms, she said.
People Also Read: Meta Fined €265m Over Data Protection Breach That Hit More Than 500m Users
“The biggest thing that’s changed is the team is fully empowered to move fast and be as aggressive as possible,” Irwin said. Her comments come as researchers reported a surge in hate speech on the social media service, after Musk announced an amnesty for accounts suspended under the company’s previous leadership that had not broken the law or engaged in “egregious spam.”
Since Musk slashed half of Twitter’s personnel and given an ultimatum to work long hours, resulting in the loss of hundreds more employees, the firm has faced critical questions over its ability and desire to filter damaging and unlawful content. In a meeting with the French president, Emmanuel Macron, on Friday, Musk committed to “significantly strengthen content control and the protection of free speech.”
Irwin said Musk encouraged the team to worry less about how their actions would affect user growth or revenue, saying safety was the company’s top priority. “He emphasises that every single day, multiple times a day,” she said. Researchers say the number of tweets containing hateful content on Twitter rose sharply in the week before Musk tweeted on 23 November that impressions, or views, of hateful speech were declining.
Tweets containing words that were anti-Black that week were triple the number seen in the month before Musk took over, while tweets containing a gay slur were up 31%, a study from the Center for Countering Digital Hate showed. Irwin said layoffs did not significantly affect full-time employees or contractors working on what the company referred to as its “health” divisions, including in “critical areas” like child safety and content moderation.
People Also Read: Twitter Reportedly Disbands Brussels Office, Leading to Compliance Concern
She said Twitter took down about 44,000 accounts involved in child safety violations, in collaboration with cybersecurity group Ghost Data. Twitter is also restricting hashtags and search results frequently associated with abuse, like those aimed at looking up “teen” pornography. Past concerns about the impact of such restrictions on permitted uses of the terms were gone, she said. The use of “trusted reporters” was “something we’ve discussed in the past at Twitter, but there was some hesitancy and frankly just some delay,” said Irwin.