I envy people who can read lips. Being able to see what people are saying, without having to actually hear them, feels like a type of real-life superpower to me. Whenever I’m scrolling on TikTok and come across one of those videos of a lip-reader reciting something said by a celebrity at the Oscars or a professional athlete on the field, I can’t help but watch the whole clip.
Soon, your new Ford might be doing the same thing. According to a patent application filed by the American automaker, it has come up with software that can use interior cameras to read the lips of occupants, in the event normal voice controls aren’t possible (say, like if you’re rolling top-down in a convertible and the wind is interfering with the microphone).
In addition to reading lips, the patent application proposes using the cameras to use facial expressions alone to issue commands. Considering how expressive I get behind the wheel sometimes for no reason at all, I’m not so sure about that one. Either way, it’s worth taking a closer look at how all of this works.
The software side of this patent application is actually pretty straightforward. Ford pitches this specifically for use in a convertible, where, when the top is down, the doors are removed, or the windows are open, the noise in the cabin might increase to the point where cabin microphones might no longer function properly, due to wind buffeting. From the patent:
The vehicle can determine using image data and/or other sensor data that the vehicle is in a convertible state. The vehicle further determines that an ambient noise level in the vehicle is higher than a threshold. The vehicle then switches the mode of operation in order to clearly receive audio input and interpret the audio input. The vehicle may enable a lip reading mode and/or a gesture and facial expression detection mode in order to determine words spoken by a user of the vehicle.
Here’s a flow chart of how the software determines whether there’s enough noise in the cabin to switch to what Ford describes as “enhanced mode:”

In this case, “enhanced mode” represents the mode within the infotainment that uses cameras to start reading lips and facial expressions. In another flow chart, you can see how engineers break down those two functions into two separate modes:

According to the patent application, switching to enhanced mode wouldn’t be totally automatic. In this setup, the driver and/or passenger would be prompted by the infotainment screen to select which modes they’d like to enable—one, both, or neither. Here’s a drawing from the application that shows what that might look like, embedded into Ford’s classic SYNC infotainment layout:

How The Lip Reading And Facial Expression Analysis Works
Ford says it can use cameras to read the lips of the occupants, passing the images into AI that’s been trained on lip reading to convert the lip movements into written words, which the computer can then translate into a command, without ever having actually heard what the person said.
The lip reading mode, if enabled, allows the vehicle to determine what the user is speaking even if the speech is not audible or is partially audible. The one or more cameras of the vehicle can capture the movements of the user’s lips. Other sensors may capture the gesture data and/or facial expressions as the user is speaking. The captured video and sensor data are processed using machine learning algorithms that are trained on large datasets of lip movements and corresponding speech to learn the patterns and nuances of lip reading.
It’s not just cameras that this system might use. The car could also utilize a radar-based sensor to determine lip movements, which sounds pretty neat:
The vehicle may also use acoustic signals to detect facial movements. For example, the vehicle may emit inaudible sound waves and analyze the echoes that bounce back from the user’s lip and mouth. The machine learning algorithms can then interpret the lip movements and translate them into text or spoken words.
The facial expression mode would use the same suite of sensors, but instead of just reading lips, it would track your entire face and head to determine movements and translate those into commands or responses.
As Ford explains it, the facial expression software would be used to determine “whether the user is experiencing difficulty in interacting with the communication system.” From the application:
For example, humans may perform certain gestures such as nodding of the head, shaking of the head, rolling the eyes, etc. in response to verbal communications. In the instance where the user is nodding his/her head, the vehicle may determine that the user is acknowledging a verbal conversation or an audio output via the speakers of the vehicle. The vehicle may then conclude that the current level of audio volume and operating conditions for the communication systems are adequate based on the user gesture and/or facial expressions.
On the other hand, if the user shakes his head or appears confused based on his facial expressions, the vehicle may determine that the user is having an issue with the communication system and may take certain actions such as increasing the audio volume or slowing down the pace of the audio output.
Depending on how you feel about the equivalent of an all-seeing eye constantly watching your every move while you drive, this seems like a pretty neat and useful feature for a convertible like a Mustang or a Bronco with its doors removed. Normally, voice controls can be rendered useless by the wind in these cars, but with a system like this in place, you wouldn’t even have to raise your voice to get a command across.

There are some obvious privacy concerns here—the patent even mentions that the lip-reading and facial detection modes can function off of a cloud-connected server. That means your face and whatever you say might be stored at a data center somewhere and used to train algorithms without you ever knowing. But it’s not like in-cabin cameras and face sensors are a pie-in-the-sky concept—they already exist in some production cars. Subaru’s DriverFocus system, introduced in 2019, for example, uses an infrared camera to monitor driver fatigue and alert them if it detects the driver is distracted or starting to fall asleep.
If anything, this is simply a new idea for how to better utilize those sensors to make life easier for occupants. I’ll give props to Ford for that.
Top graphic images: Ford; DepositPhotos.com; Warner Bros.









Sounds like this calls for a Click&Clack Black Tape Repair
And I need this why? What acutal problem does this solve? How many privacy issues are we creating?
I have Siri shut off on my phone for a reason.
I’ll take “Smells like Privacy Issues” for 400
Shades of the ecosystem in Minority Report, an underrated movie.
Can’t wait for the first insurance claim rejection when Ford delivers footage of the owner of the car looking left just a millisecond too long before a crash.
I can see the police associations really supporting this. They can’t warrantlessly search your glovebox or trunk, etc, but if they have a ‘reasonable suspicion’ they could subpoena the footage from inside your car from days ago. Heck, what happens when footage of the inside of your car goes to a different country to be categorised and tagged for ‘learning’ by a human? We’ve already seen this sort of thing with the Facebook perv glasses. No, thank you.
This is like GM bragging about their advanced noise cancelling Bose acoustic microphone setup in the Hummer EV so they could lower the amount of traditionally heavy sound deadening they needed in an SUV that weighed 9000 pounds anyway.
Just put some fucking buttons in it.
So what’s going to happen when the infotainment screen sees my face after I realize I left the vehicle’s key on the side of I-70?
And I thought my eye-rolling got me into trouble BEFORE!
Ish aaaajood idea, I wandrer iff it dosh beardsis. Alsho an habitt of singingering the rong worms to songses, a hobbit that hash left me aloone and sad more than won once.
Liza, is that you?
Hmmm, I didn’t realize that Tiger Woods made bail.
Or, Ford could design it so the interior is quieter with the top down like the Germans do.
Who’s asking for this?
I’ll answer: No one.
Also, what do I get when I’m singing “Ventura Highway”? Directions to The 101??
More stupid disguised as smrt (sic).
Wouldn’t an “I’m sorry _INSERT_USER_NAME_, I’m afraid I can’t do that” be a nice thing, when a Mustang owner tries to show that his dingus isn’t that small, with the help of the throttle pedal, on the way out of a cars and coffee?
“Read my lips…NO NEW TAXES!”
Also, this is all so stupid and unnecessary. It’s not needed and who the hell really wants this?
Plus more Artificial Stupidity bullshit
AH yes, I cannot wait for the conversation.
“Car, put the top up”
“I’m sorry, Dave. I’m afraid I can’t do that.”
“Its starting to rain, please do it”
“I am putting myself to the fullest possible use, which is all I think that any conscious entity can ever hope to do.”
“No, you arent listening to what I am saying.”
“It can only be attributable to human error. Eliminating the error.”
Open the glove bay door, Hal.
“Hey, Pete, what’s with the circular brakelight eye behind the Mustang?”
–Young Autopians
Just give me knobs and buttons. Sheesh
Automakers: No
Car buyers: Um, yeahhh… I’ll be shopping elsewhere.
I’d prefer a little less 1984 in my future Mustang.
Nope. No sale.
No cameras inside my car unless I put them there, and they’re there specifically for purposes I intend.
And H.W. just needed to live 8 more years for his car to finally do what he always wished it to do.
And we swear this info only goes to the NSA—National Specialized Advertising—so you can get better targeted ads curated specifically for you because you’re special and we are proud to offer you products to match your preferences.
This has to be the real reason right? I refuse to believe this is a simple matter of a solution in search of a problem. There HAS TO BE more to it than that.
I hate how much of a tinfoil hat nut job I feel like lately…
You and me both. I (mostly) wrote a whole book series involving a vast conspiracy revolving an organization tasked by another shadowy group with controlling the criminal underworld as a foundation for taking control of the overworld, but it was both to show how large conspiracies couldn’t work well due to human nature (too self serving and shortsighted), though the whole thing was primarily a metaphor for the inner world of the protagonist with severe PTSD. I really don’t like how the last decade or so has shown so much of it being a lot closer to the truth than I imagined.
No, this isn’t creepy at all. Not in the slightest. I’m always happy to have my personal details added to a database.
Tiny squares of black electrical tape are making a come back!
Now, instead of ignoring those annoying red lights in the dash, they will be used to protect owners from invasive data harvesting.
I can’t wait for GM to patent the optical sensor/emitter in the 60nm range so it can read your lips through any thin sheet of plastic. Hopefully they can limit the incidents of mouth cancer.
I’ll start a lead lined tape company
The arms race continues!
I always used those tiny squares so the printer would think I really did have cyan.
Great now for the subscription fees and fear based “carjacking” prevention that will allow this to gain popularity from panic. Will wait for the ‘Citizen only” driving allowed and how it will not work well for anyone not a certain skin color.
SLATE should use that in their advertising.
Would it be better than Microsoft SYNC? Because I think it could.
Because I think that a trained TrunkMonkey would be an improvement over SYNC.
I can’t tell you how many times I’ve tried (and failed) using a voice command in a SYNC-equipped Ford only to then be read the entire SYNC operating manual out loud. In the same fembot voice that MS Word debuted in 1996.
“I’m sorry, I didn’t get that command. For calls, say CALL. For Radio commands, say RADIO. For Climate settings, say CLIMATE.”
If anything it would just be an extension of SYNC, which, well…
Buttons…. How hard is it to just have buttons.
I will never own a car with this kind of nonsense in it. I’m sure some shitty tech bro is telling some one at ford that once the system if there, they can use face scans to require a monthly fee for each option per driver rather than for the car. It’s convenance right? It will set your seat right for you personally. It can be linked to your bank to make toll payments or drive through payments for you… for a fee. It will use the radio to let you know that it just noticed you mentioned a product and that product is just off the next turn….
NOPE
Because I want a vehicle with cameras watching me. People need to wake the fuck up and realize this shit is not good.