Facebook’s Working On A Tool To Help The Blind “See” Images

Comment

Facebook connects 1.5 billion people all over the world, but for the blind and visually impaired, it can be difficult to gain access to what has become a vast platform for connectivity. That’s where Facebook’s accessibility team, led by Jeff Wieland, comes in. The sole purpose of the accessibility team is to help people with disabilities have a seamless experience on Facebook, and ultimately help the social network achieve its mission of connecting the world.

Right now, blind and visually impaired people who have access to screen readers — tools used to identify what’s displayed on a screen — can listen to what people are writing on Facebook, but there’s currently no way to figure out what’s going on in the millions of photos shared on Facebook every day.

“You just think about how much of your news feed is visual — and is probably most of it — and so often people will make a comment about a photo or they’ll say something about it when they post it, but they won’t really tell you what is in the photo,” Matt King, Facebook’s first blind engineer, told TechCrunch. “So for somebody like myself, it can be really like, ‘Ok, what’s going on here? What’s the discussion all about?’”

That’s why Facebook is currently working on an artificial intelligence-based object recognition tool to help blind users get an idea of what’s in all of the photos people share on Facebook. King, who started at the company just three months ago, recently showed me how he uses a screen reader to navigate Facebook.

“My view of the page is totally sequential,” King explained to me. “I can’t see the whole thing at one time. I see a little piece.”

As he scrolled down the page, the screen reader would tell King that he’s at a list of six items, which referred to the number of notifications he had at the time. It also told him when he reached a “convo box,” which signaled to him that he could interact with that element and leave a comment.

King eventually scrolled to a friend’s post that featured text and a photo. His friend, Anne, wrote, “Ready for picture day of first grade” accompanied with a photo. Thanks to the object recognition technology Facebook is prototyping, King heard: “This image may contain, colon, one or more people. Child.” Without it, all King would’ve known was that Anne wrote, “Ready for picture day of first grade,” and that she posted a photo — but nothing about what was in the photo. For another photo, the tool told him: “This image may contain colon nature, outdoor, cloud, foliage, grass, tree.” 

In the photo gallery below, you’ll see what a blind person hears read aloud when they’re using a screen reader to browse photos on Facebook.

“This might not be 100 percent yet, but even if it’s just halfway there, the level of engagement that’s possible, the amount of enjoyment I can get — that’s like going from zero percent to at least 50 percent of what you might get,” King said. “That’s a huge jump, and it’s only going to get better from here. I personally find Facebook’s willingness to invest in ways like that just really powerful and exciting, and just one more way to make connecting people with disabilities a great experience.”

Unfortunately, there’s a bit of a learning curve with screen readers. That’s why one of King’s goals is to make that on-boarding process a lot simpler for the blind and visually impaired people. He wants to make it as easy for people with disabilities to access web connectivity as it is for people who don’t have disabilities, regardless of where they are in the world.

“We start thinking of access to information and information technology almost like a human right,” King said. “I mean, it’s the gateway to employment, it’s the gateway to opportunity of all different kinds — participating in your government and everything. So when we can flatten that on-ramp, I see that as the ultimate goal in accessibility and I think that Facebook is really uniquely positioned to help do that. So that gets me, just really excited. It’s a way of giving dignity to every person with a disability in the world by helping them get connected to everybody else.”

Ideally, the team hopes to release this product by end of the year to one platform — either web or iOS — and allow people to opt in to experience it.

“We want to make sure that the concepts we deliver, we feel strongly that they are in the photo,” Wieland said. “We don’t want to get that wrong. So we definitely need to continue investing in AI to make this great. We’re optimistic we can ship this in the relatively short term.”

More TechCrunch

Arati Prabhakar, profiled as part of TechCrunch’s Women in AI series, is director of the White House Office of Science and Technology Policy.

Women in AI: Arati Prabhakar thinks it’s crucial to get AI ‘right’

AniML, the French startup behind a new 3D capture app called Doly, wants to create the PhotoRoom of product videos, sort of. If you’re selling sneakers on an online marketplace…

Doly lets you generate 3D product videos from your iPhone

Elon Musk’s AI startup, xAI, has raised $6 billion in a new funding round, it said today, in one of the largest deals in the red-hot nascent space, as he…

Elon Musk’s xAI raises $6B from Valor, a16z, and Sequoia

Indian startup Zypp Electric plans to use fresh investment from Japanese oil and energy conglomerate ENEOS to take its EV rental service into Southeast Asia early next year, TechCrunch has…

Indian EV startup Zypp Electric secures backing to fund expansion to Southeast Asia

Last month, one of the Bay Area’s better-known early-stage venture capital firms, Uncork Capital, marked its 20th anniversary with a party in a renovated church in San Francisco’s SoMa neighborhood,…

A venture capital firm looks back on changing norms, from board seats to backing rival startups

The families of victims of the shooting at Robb Elementary School in Uvalde, Texas are suing Activision and Meta, as well as gun manufacturer Daniel Defense. The families bringing the…

Families of Uvalde shooting victims sue Activision and Meta

Like most Silicon Valley VCs, what Garry Tan sees is opportunities for new, huge, lucrative businesses.

Y Combinator’s Garry Tan supports some AI regulation but warns against AI monopolies

Everything in society can feel geared toward optimization – whether that’s standardized testing or artificial intelligence algorithms. We’re taught to know what outcome you want to achieve, and find the…

How Maven’s AI-run ‘serendipity network’ can make social media interesting again

Miriam Vogel, profiled as part of TechCrunch’s Women in AI series, is the CEO of the nonprofit responsible AI advocacy organization EqualAI.

Women in AI: Miriam Vogel stresses the need for responsible AI

Google has been taking heat for some of the inaccurate, funny, and downright weird answers that it’s been providing via AI Overviews in search. AI Overviews are the AI-generated search…

What are Google’s AI Overviews good for?

When it comes to the world of venture-backed startups, some issues are universal, and some are very dependent on where the startups and its backers are located. It’s something we…

The ups and downs of investing in Europe, with VCs Saul Klein and Raluca Ragab

Welcome back to TechCrunch’s Week in Review — TechCrunch’s newsletter recapping the week’s biggest news. Want it in your inbox every Saturday? Sign up here. OpenAI announced this week that…

Scarlett Johansson brought receipts to the OpenAI controversy

Accurate weather forecasts are critical to industries like agriculture, and they’re also important to help prevent and mitigate harm from inclement weather events or natural disasters. But getting forecasts right…

Deal Dive: Can blockchain make weather forecasts better? WeatherXM thinks so

pcTattletale’s website was briefly defaced and contained links containing files from the spyware maker’s servers, before going offline.

Spyware app pcTattletale was hacked and its website defaced

Featured Article

Synapse, backed by a16z, has collapsed, and 10 million consumers could be hurt

Synapse’s bankruptcy shows just how treacherous things are for the often-interdependent fintech world when one key player hits trouble. 

2 days ago
Synapse, backed by a16z, has collapsed, and 10 million consumers could be hurt

Sarah Myers West, profiled as part of TechCrunch’s Women in AI series, is managing director at the AI Now institute.

Women in AI: Sarah Myers West says we should ask, ‘Why build AI at all?’

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: OpenAI and publishers are partners of convenience

Evan, a high school sophomore from Houston, was stuck on a calculus problem. He pulled up Answer AI on his iPhone, snapped a photo of the problem from his Advanced…

AI tutors are quietly changing how kids in the US study, and the leading apps are from China

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Well,…

Startups Weekly: Drama at Techstars. Drama in AI. Drama everywhere.

Last year’s investor dreams of a strong 2024 IPO pipeline have faded, if not fully disappeared, as we approach the halfway point of the year. 2024 delivered four venture-backed tech…

From Plaid to Figma, here are the startups that are likely — or definitely — not having IPOs this year

Federal safety regulators have discovered nine more incidents that raise questions about the safety of Waymo’s self-driving vehicles operating in Phoenix and San Francisco.  The National Highway Traffic Safety Administration…

Feds add nine more incidents to Waymo robotaxi investigation

Terra One’s pitch deck has a few wins, but also a few misses. Here’s how to fix that.

Pitch Deck Teardown: Terra One’s $7.5M Seed deck

Chinasa T. Okolo researches AI policy and governance in the Global South.

Women in AI: Chinasa T. Okolo researches AI’s impact on the Global South

TechCrunch Disrupt takes place on October 28–30 in San Francisco. While the event is a few months away, the deadline to secure your early-bird tickets and save up to $800…

Disrupt 2024 early-bird tickets fly away next Friday

Another week, and another round of crazy cash injections and valuations emerged from the AI realm. DeepL, an AI language translation startup, raised $300 million on a $2 billion valuation;…

Big tech companies are plowing money into AI startups, which could help them dodge antitrust concerns

If raised, this new fund, the firm’s third, would be its largest to date.

Harlem Capital is raising a $150 million fund

About half a million patients have been notified so far, but the number of affected individuals is likely far higher.

US pharma giant Cencora says Americans’ health information stolen in data breach

Attention, tech enthusiasts and startup supporters! The final countdown is here: Today is the last day to cast your vote for the TechCrunch Disrupt 2024 Audience Choice program. Voting closes…

Last day to vote for TC Disrupt 2024 Audience Choice program

Featured Article

Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Among other things, Whittaker is concerned about the concentration of power in the five main social media platforms.

3 days ago
Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Lucid Motors is laying off about 400 employees, or roughly 6% of its workforce, as part of a restructuring ahead of the launch of its first electric SUV later this…

Lucid Motors slashes 400 jobs ahead of crucial SUV launch