Where did you think the training data was coming from?

2026-03-1113:335014idiallo.com

When the news broke that Meta's smart glasses were feeding data directly into their Facebook servers, I wondered what all the fuss was about. Who thought AI glasses used to secretly record people woul

Show article

The camera on your laptop is pointed at you right now. When activated, it can record everything you do. When Zuckerberg posted a selfie with his laptop visible in the background, people were quick to notice that both the webcam and the microphone had black tape over them. If the CEO of one of the largest tech companies in the world doesn't trust his own device, what are the rest of us supposed to do?

On my Windows 7 machine, I could at least assume the default behavior wasn't to secretly spy on me. With good security hygiene, my computer would stay safe. For Windows 10 and beyond, that assumption may no longer hold. Microsoft's incentives have shifted. They now require users to create an online account, which comes with pages of terms to agree to, and they are in the business of collecting data.

As part of our efforts to improve and develop our products, we may use your data to develop and train our AI models.

That's your local data being uploaded to their servers for their benefit. Under their licensing agreement (because you don't buy Windows, you only license it) you are contractually required to allow certain information to be sent back to Microsoft:

By accepting this agreement or using the software, you agree to all of these terms, and consent to the transmission of certain information during activation and during your use of the software as per the privacy statement described in Section 3. If you do not accept and comply with these terms, you may not use the software or its features.

The data transmitted includes telemetry, personalization, AI improvement, and advertising features.

On a Chromebook, there was never an option to use the device without a Google account. Google is in the advertising business, and reading their terms of service, even partially, it all revolves around data collection. Your data is used to build a profile both for advertising and AI training.

None of this is a secret. It's public information, buried in those terms of service agreements we blindly click through. Even Apple, which touts itself as privacy-first in every ad, was caught using user data without consent. Tesla employees were found sharing videos recorded inside customers' private homes.

While some treat the Ray-Ban glasses story as an isolated incident, here is Yann LeCun, Meta's former chief AI scientist, describing transfer learning using billions of user images:

We do this at Facebook in production, right? We train large convolutional nets to predict hashtags that people type on Instagram, and we train on literally billions of images. Then we chop off the last layer and fine-tune on whatever task we want. That works really well.

That was seven years ago, and he was talking about pictures and videos people upload to Instagram. When you put your data on someone else's server, all you can do is trust that they use it as intended. Privacy policies are kept deliberately vague for exactly this reason. Today, Meta calls itself AI-first, meaning it's collecting even more to train its models.

Meta's incentive to collect data exceeds even that of Google or Microsoft. Advertising is their primary revenue source. Last year, it accounted for 98% of their forecasted $189 billion in revenue.

Yes, Meta glasses record you in moments you expect to be private, and their workers process those videos at their discretion. We shouldn't expect privacy from a camera or a microphone, or any internet-connected device, that we don't control. That's the reality we have to accept.

AI is not a magical technology that simply happens to know a great deal about us. It is trained on a pipeline of people's information: video, audio, text. That's how it works. If you buy the device, it will monitor you.

Read the original article

speckx

Karma: 29759

@Hacker__News
@hacker._news

Comments

By goodmythical 2026-03-1116:16

I thought it was common knowledge that we've been providing free training for machnine learning since the around the first introduction of captcha.

Not sure if the absolute first human verification systems were machine training, but it definitely became that quite quickly.

Like, did everyone just forget about the "provide feedback" button on your search pages? Or when google maps used to ask if its information was accurate?

And the fact that google/youtube/facebook/etc have almost always used your interactions to train algorithms to tune the machine learning not just for you but for everyone?

Why should it be any surprise that every new offering from these companies tracks user data in order to improve the economic efficiency of their models just as every single prior offering from these companies has always done?

By tracker1 2026-03-1117:04

This is why I like the kill switch that Framework laptops have to disable the webcam and mic physically. I wish it was standard... especially with some of the more creepy articles of school issued laptop admins being able to "observe" remotely.

By youknownothing 2026-03-1115:301 reply

I've said it for a long time: the main reason Meta wanted to created glasses or VR/AR is to circumvent the privacy rules that Google and Apple slowly introduced in their App Stores.

By 10000truths 2026-03-1115:431 reply

More specifically, it's because Meta views the lack of ownership of their own hardware platform as an existential threat. They see AR/VR as the next revolutionary platform, so they're betting the farm on being first movers in a mass-market AR/VR space that they anticipate to exist in the future.

Hacker News