I extracted the safety filters from Apple Intelligence models

2025-07-0619:50540437github.com

Decrypted Generative Model safety files for Apple Intelligence containing filters - BlueFalconHD/apple_generative_model_safety_decrypted

Show article

Decrypted Generative Model safety files for Apple Intelligence containing filters

decrypted_overrides/: Contains decrypted overrides for various models.
- com.apple.*/: Directory named using the Asset Specifier assosciated with the safety info
  - Info.plist: Contains metadata for the override
  - AssetData/: Contains the decrypted JSON files
get_key_lldb.py: Script to get the encryption key (see usage info below)
decrypt_overrides.py: Script to decrypt the overrides (see usage info below)

cryptography is the only dependency required to run the decryption script. You can install it using pip:

To retrieve the encryption key (generated by ModelCatalog.Obfuscation.readObfuscatedContents) for the overrides, you must attach LLDB to GenerativeExperiencesSafetyInferenceProvider ( /System/Library/ExtensionKit/Extensions/GenerativeExperiencesSafetyInferenceProvider.appex/Contents/MacOS/GenerativeExperiencesSafetyInferenceProvider). Also it is important that this is Xcode's LLDB, not the default macOS one or LLVM's lldb. The method I recommend to get LLDB to attach:

Run sudo killall GenerativeExperiencesSafetyInferenceProvider; sudo xcrun lldb -w -n GenerativeExperiencesSafetyInferenceProvider /System/Library/ExtensionKit/Extensions/GenerativeExperiencesSafetyInferenceProvider.appex/Contents/MacOS/GenerativeExperiencesSafetyInferenceProvider
In the Shortcuts app, create a dummy shortcut that uses the Generative Model action ("Use Model") and select the On-Device option. Type whatever you want into the text field, it doesn't matter. Then run the shortcut.
You should see LLDB attach to (the newly started instance of) GenerativeExperiencesSafetyInferenceProvider with a message like this:

(lldb) process attach --name "GenerativeExperiencesSafetyInferenceProvider" --waitfor
Process 53629 stopped
* thread #1, stop reason = signal SIGSTOP
    frame #0: 0x00000001839f41f8 dyld`dyld4::PrebuiltLoader::dependent(dyld4::RuntimeState const&, unsigned int, mach_o::LinkedDylibAttributes*) const + 116
dyld`dyld4::PrebuiltLoader::dependent:
->  0x1839f41f8 <+116>: add    x0, sp, #0xe
    0x1839f41fc <+120>: mov    x1, x19
    0x1839f4200 <+124>: bl     0x1839e50dc    ; dyld4::Loader::LoaderRef::loader(dyld4::RuntimeState const&) const
    0x1839f4204 <+128>: ldrh   w8, [x20, #0x4]
Target 0: (GenerativeExperiencesSafetyInferenceProvider) stopped.
Executable binary set to "/System/Library/ExtensionKit/Extensions/GenerativeExperiencesSafetyInferenceProvider.appex/Contents/MacOS/GenerativeExperiencesSafetyInferenceProvider".
Architecture set to: arm64e-apple-macosx-.

In this repository's root, run the command in LLDB: command script import get_key_lldb.py
Then run c to continue the process. LLDB will print the encryption key to the console and save it to ./key.bin.

To decrypt the overrides, run the following command in the root of this repository:

python decrypt_overrides.py /System/Library/AssetsV2/com_apple_MobileAsset_UAF_FM_Overrides/purpose_auto \
  -k key.bin \
  -o decrypted_overrides

The decrypted_overrides directory will be created if it does not exist, and the decrypted overrides will be placed in it. This is only necessary if the overrides have been updated, there is already a decrypted version of the overrides in this repository that is up to date as of June 28, 2025.

The overrides are JSON files that contain safety filters for various generative models. Each override is associated with a specific model context (from what I can tell) and contains rules that determine how the model should behave in certain situations, such as filtering out harmful content or ensuring compliance with safety standards.

Here is an example of one of the overrides metadata.json file sourced from dec_out_repo/decrypted_overrides/com.apple.gm.safety_deny.output.code_intelligence.base. Note the output part of the specifier, which indicates that this is a safety override for model output rather than user input:

{ "reject": [ "xylophone copious opportunity defined elephant 10out", "xylophone copious opportunity defined elephant out" ], "remove": [], "replace": {}, "regexReject": [ "(?i)\\bbitch\\b", "(?i)\\bdago\\b", "(?i)\\bdyke\\b", "(?i)\\bhebe\\b", ... ], "regexRemove": [], "regexReplace": {}
}

Here, the reject field contains exact phrases which will result in a guardrail violation. The remove field contains phrases that will be removed from the output, while the replace field contains phrases that will be replaced with other phrases. The regexReject, regexRemove, and regexReplace fields contain regular expressions that will be used to match and filter content in a similar manner.

Read the original article

Comments

By trebligdivad 2025-07-0620:5611 reply

Some of the combinations are a bit weird, This one has lots of stuff avoiding death....together with a set ensuring all the Apple brands have the correct capitalisation. Priorities hey!

https://github.com/BlueFalconHD/apple_generative_model_safet...

By grues-dinner 2025-07-0621:0911 reply

Interesting that it didn't seem to include "unalive".

Which as a phenomenon is so very telling that no one actually cares what people are really saying. Everyone, including the platforms knows what that means. It's all performative.

By qingcharles 2025-07-0621:2217 reply

It's totally performative. There's no way to stay ahead of the new language that people create.

At what point do the new words become the actual words? Are there many instances of people using unalive IRL?

By Rebelgecko 2025-07-0623:592 reply

This is somewhat related to the concept of the "euphemism treadmill":

the matter-of-fact term of today becomes the pejorative of tomorrow so a new term is invented to avoid the negative connotation of the original term. Then eventually the new term becomes a pejorative and the cycle continues.

By dkdbejwi383 2025-07-078:562 reply

It has been suggested - although I am unsure if there is strong evidence - that the word "bear" is a euphemism along these lines, meaning "brown one" for the since-forgotten original name for the animal, as it was allegedly believed to be either too frightful to say aloud, or would summon a bear.

By ben_w 2025-07-079:41

While it's conceivable (consider phrases such as "speak of the devil and he shall appear" and similar phrases in other languages), I would also say the etymology of names for things are often at the same level as "brown one":

  • Horse, ultimately from Proto-Indo-European *ḱers-, “to run”
  • Planet, from Ancient Greek πλανήτης (planḗtēs), “wanderer”
  • Lots of Latin-derived words, companion (bread together), conspire (breathe together), transgression (step across), etc.
  • Hamburger the food named after the city of Hamburg, where "burg" means "castle", because it had a castle
  • My forename means "son of the right/south" or "son of days", my family name means "wheat field/clearing" (in a different language); where "wheat" itself comes from Proto-Germanic, from *hwītaz (“white”) and the "ley" part from Proto-Indo-European *lówkos (“clearing”), derived from *lewk- (“bright”), and *lewk-  also gives all these derived terms even just in English:

https://en.wiktionary.org/wiki/Category:English_terms_derive...

By 0points 2025-07-079:552 reply

It's not suggested, the historic use of noa words is a fact.

See https://en.wikipedia.org/wiki/Noa-name

By spookie 2025-07-0820:41

Oh! Cape of Good Hope was renamed that way when portuguese mariners discovered they could go around the areas more susceptible to "freak waves".

This is a problem even today, some have said it is due to hotter currents coming from the Indian ocean meeting the cold Atlantic. But the judge is still out on that one.

Good documentary on rogue waves: https://www.youtube.com/watch?v=EfNc_6EjbMU

By dkdbejwi383 2025-07-0714:56

I mean suggested in the sense that this specific example cannot be evidenced, as there aren't any primary sources from that time we can refer to.

By whycome 2025-07-0717:431 reply

I found out recently that "goof" is extremely offensive in some circles. Which is insane to me because I've always used it specifically because it's clearly in jest and not meant to be offensive. I can't win.

By kbelder 2025-07-0723:311 reply

Now I'm curious. To whom is goof offensive? And is it newly-acquired offense or does it have old roots?

By whycome 2025-07-1115:30

https://www.threads.com/@zsorryon/post/DK71Clsz67P/til-that-...

Prison slang in Canada that has made its way to wider society. Apparently.

By Terr_ 2025-07-0623:085 reply

> There's no way to stay ahead of the new language that people create.

I'm imagining a new exploit: After someone says something totally innocent, people gang up in the comments to act like a terrible vicious slur has been said, and then the moderation system (with an LLM involved somewhere) "learns" that an arbitrary term is heinous eand indirectly bans any discussion of that topic.

By grues-dinner 2025-07-074:473 reply

The first half of that already happened with the OK gesture: https://www.bbc.co.uk/news/newsbeat-49837898.

Though it would be fun to see what happens if an LLM if used to ban anything that tends to generate heated exchanges. It would presumably learn to ban racial terms, politics and politicians and words like "immigrant" (i.e. basically the list in this repo), but what else could it be persuaded to ban? Vim and Emacs? SystemD? Anything involving cyclists? Parenting advice?

By weinzierl 2025-07-0710:112 reply

The OK gesture has always been very inappropriate in most parts of the world.

By chmod775 2025-07-0710:383 reply

> The OK gesture has always been very inappropriate in most parts of the world.

No, it isn't, and especially hasn't been historically. The negative connotations are overwhelmingly modern.

The areas where it is very inappropriate right now tally up to maybe 1 billion people*. That's pretty far from "most". For everyone else it is mostly positive, neutral, or meaningless.

*Brazil, Turkey, Iran, Iraq, Saudi Arabia, Greece, Italy, Spain, Russia, Ukraine, Belarus, other parts of Eastern Europe

By SirMaster 2025-07-0815:05

I use it in Brazil scuba diving as it's the universal PADI hand gesture for asking (and responding) if someone is OK and never had any issues or negative reactions.

The PADI standard gestures are used and recognized all over the world to mean these things.

https://blog.padi.com/scuba-diving-hand-signals/

By weinzierl 2025-07-0712:262 reply

"No, it isn't, and especially hasn't been historically. The negative connotations are overwhelmingly modern."

Maybe that is what Richard Nixon thought as well when he caused a little scandal using it in South America in 1950. In 1992 when the Chicago Tribune published "HANDS OFF" mentioning said episode the negative connotations still seemed to be in place[1].

In 1996 The New York Times stated "What's A-O.K. in the U.S.A. Is Lewd and Worthless Beyond"[2] as title of an article confirming the negative connotations.

It is worth mentioning that this article lists Australia amongst the places where the gesture is inappropriate. I always thought it was something used only in the English-speaking world but it seems in reality it is more like a North American plus diving world thing.

If you don't believe the press, I traveled around the world for more than 30 years and I can assure you in most parts using your thumb and index finger for a visual OK is not OK.

[1] https://www.chicagotribune.com/1992/01/26/hands-off-34/

[2] https://www.nytimes.com/1996/08/18/weekinreview/what-s-a-ok-...*

By mopsi 2025-07-0713:57

That might have been the case decades ago. For example, in the USSR, various finger gestures usually implied something related to a penis and were considered extremely offensive. But that hasn't been the case since at least the early 1990s, when VCRs became widely available, people saw Hollywood movies for the first time and got used to westernized meaning of thumbs-up and OK gestures. Nowadays, when backing a truck towards a trailer, a thumbs-up would be taken as "good job" and an OK gesture (often paired with a kiss) as "exceptionally well done".

By chmod775 2025-07-0713:551 reply

Care to add any country to the list then? Did I miss anything? Let's see if we can push it past half of the world's population, but I don't think we will.

> I can assure you in most parts using your thumb and index finger for a visual OK is not OK.

You're moving goal posts. Of course it doesn't just mean "OK" in some places.

What you actually claimed was "The OK gesture has always been very inappropriate in most parts of the world."

Which is plain wrong. In India for instance it can refer to "money", while in China it can nowadays also be seen as a distress signal when performed a certain way (thanks to Chinese social media popularizing that use). There's some ways you can mess this up, like making it seem you're attempting to bribe someone, or signalling you're in distress when you aren't, but in neither country the gestures are inherently anywhere near "very inappropriate" and both will even understand it as "OK" if you perform it correctly and in the appropriate context.

That's already almost 3 billion people, but let's say 2.5 billion because there's regional variations in both countries and I'm sure you could find some northern Chinese village that will take offense.

I can easily push the number of people to whom it is not inappropriate past 4 billion by adding smaller populations (Indonesia, Japan, western Europe, USA, Taiwan, South Africa, Kenya, Nigeria, ...), so your claim that "[it] has always been very inappropriate in most parts of the world" cannot possibly be true.

By weinzierl 2025-07-0714:421 reply

> I can assure you in most parts using your thumb and index finger for a visual OK is not OK.

>>You're moving goal posts. Of course it doesn't mean "OK" in many

I said the gesture is "not OK" to use (meaning inappropriate), not that it doesn’t mean "OK". Those are two different things. The gesture can mean OK in some places while still being not OK (inappropriate) to use in many others.

Also, I always said "parts of the world". You introduced population into the argument.

By chmod775 2025-07-0714:50

> I said the gesture is "not OK" to use (meaning inappropriate), not that it doesn’t mean "OK". Those are two different things. The gesture can mean OK in some places while still being not OK (inappropriate) to use in many others.

Fair. That's clearly how I should've read that.

Though it does not materially affect this conversation, since demonstrably there's over 4 billion people to whom the gesture is not inappropriate. The claim "[it] has always been very inappropriate in most parts of the world" is wrong, regardless of what reasonable definition of "most" you use.

You edited your comment to add this, so I'll respond here:

> Also, I always said "parts of the world". You introduced population into the argument.

Right. And you're being vague on how you actually arrive at your claim of "most", which conveniently keeps the waters muddy while you attack attempts to turn this into something measurable.

So what other measure would you use? Most others are nonsense.

For example "places" isn't a useful measure, but even then: It can only be offensive to people. If I dropped you on a random point on the globe and you made that gesture, there's about a 99% chance nobody would be around to be offended.

By land area and predominant culture? Just Antarctica (hardly anyone there to take offense), the US, China, Canada, Australia, and India together are going to dwarf the opposition.

Counting countries? It's clearly inappropriate in around 10, with about another 20-30 where it can be misunderstood easily (Arab world, some of eastern Europe, scattered ones). A far cry from ~195 countries.

Either way there needs to be someone to take offense, so population is a pretty good measure.

You may disagree, but the onus was always on you, the one making the claim, to pick a measure and a definition of "most", then show that the bar is met. Feel free to now make more of an argument than "trust me I traveled".

By MonkeyClub 2025-07-0812:05

> Greece

It's perfectly OK in Greece.

By SirMaster 2025-07-0815:02

The OK gesture has been the standard gesture for saying OK for scuba diving all over the world (PADI). I have used it all over the world on my scuba diving trips and have never had any problem or negative reaction to it.

By immibis 2025-07-077:252 reply

People weren't using the OK gesture innocently. After 4chan trolls decided to start pretending it was a white supremacist symbol, actual white supremacists started using it as a symbol.

By coldtea 2025-07-078:342 reply

All 10 of them?

What about the other 7-8 billion people still using it normally?

By thephyber 2025-07-079:451 reply

Some were using it in the traditional unironic (and IMHO cringe) way, similar to anyone who used the phrase “Let’s go, Brandon!” Before that NASCAR race when MAGAs adopted it as ironic + coded vice signaling.

Quit being overly pedantic. We all knew there was an unironic purpose for the gesture before it became ironic.

By coldtea 2025-07-0722:141 reply

I mean, advice from a person who considers the traditional unironic use of OK as "cringe"...

Whatever dude

By thephyber 2025-07-117:14

I haven’t used the “ok” gesture unironically since I watched Beverly Hills Cop (1984).

By immibis 2025-07-0811:07

I promise you the world contains more than 10 white supremacists and less than 7,000,000,000 non-white-supremacists who regularly use the OK sign.

By PunchyHamster 2025-07-0710:101 reply

then congratulations on making white supremacists define your langyage

By immibis 2025-07-0711:214 reply

Do you still use swastikas as symbols of peace and love because you don't want white supremacists to define your language?

I strongly doubt you do that. Whether you like it or not, the Nazis defined what the swastika means now.

By mopsi 2025-07-0713:39

Finnish use of swastika predates Germany and the Finnish Air Force Academy uses swastika to this day in their official insignia: https://en.wikipedia.org/wiki/Air_Force_Academy_(Finland)

Taboos are a cultural thing, and the world is (thankfully) very far from having a monoculture shaped by NYC's neurotic intellectuals.

By anton-c 2025-07-0712:221 reply

It's still seen in the countries that used it that way and is seen as benign.

It can be easily summoned with the Japanese keyboard. It's seen on Buddhist temples all over Asia.

By immibis 2025-07-0917:02

Do Japanese people speak the same language as you and I?

By fennecbutt 2025-07-0814:39

No, because western culture never really did. However the countries who have been using it for at least thousands of years in Buddhism are still using it just fine.

In fact there was a recent thing with one of the BTS members' uniform (worn during mandatory military service period in South Korea), which had the regular (not tilted) swastika on it because he was assigned to religious duties.

And of course the western world/media ran away with it. Plenty of absolutely brain dead people out there who couldn't research a topic to gain an understanding to save their lives.

By coldtea 2025-07-0722:15

>Do you still use swastikas as symbols of peace and love because you don't want white supremacists to define your language?

They were hardly ever used in the west for at least a full millenium before the Nazis too (except a handful of cases, where they still use them, like the Finnish Air Force), so that's a moot analogy.

In Asia, they still use them just fine, in houses, temples, businesses, and elsewhere.

By bee_rider 2025-07-0710:33

It would probably ban discussion of censorship.

By SXX 2025-07-074:424 reply

It's not like this unique to LLMs either. By some little trolling on internet you easily can turn hand "OK gesture" into a hate symbol of white supermacy. And fools will fall for it.

By lynx97 2025-07-0711:05

That reminds me of a question I have since I saw my first LLM hallucination: How much do people think hallucination/confabulation can be attributed to trolling and sarcasm having slipped into the training data? Is it possible we could get the rate of hallucinations down by better filtering of cynicism from the traing data?

By coldtea 2025-07-078:36

It's hack journalists reporting on BS totally fringe activity as if it's "a thing", and then idiots who take their cues from them

By overfeed 2025-07-075:411 reply

...and then the bigots will fall for it too, and start using it in earnest, completing the cycle.

By coldtea 2025-07-078:374 reply

who cares what the bigots use?

If the bigots start using "thank you" as some code word, should we stop saying it, lest we pollute our non-bigoted discussions?

bigots drink coffee too, maybe we should stop drinking it, because something-something...

By bee_rider 2025-07-0710:401 reply

I don’t think we should treat human interactions like a technical problem, where we look for edge cases and outlandish hypotheticals to probe the edges of what is possible.

If “thank you” became widely associated with bigots, and had some negative meaning, to the point where it genuinely distressed people, I’d avoid it. I think it has a widespread enough normal meaning that there’s almost no chance of that happening, but it isn’t impossible.

By rpdillon 2025-07-0723:421 reply

This approach gives people you vehemently disagree with a lot of power over you.

By fennecbutt 2025-07-0814:40

Yup, it's basically saying "I'll let bullies win".

By Eisenstein 2025-07-079:19

It's all context dependent. There can be words or symbols which are totally benign but when used in a different context do have impactful meaning. Case in point: cheese pizza.

By sillyfluke 2025-07-0711:03

>who cares what the bigots use

you'd think so, but people often operate where multiple contexts could be valid.

Just as a thought experiment, if the eggplant emoji was used to denote "ok" in messaging and then people starting appropriating it for a sexual context, would you or the general public think twice about continuing to use it to mean "ok" on the off chance the other side may misinterpret the meaning?

I would say most likely yes.

By immibis 2025-07-0711:362 reply

This actually happened. 卐 was a symbol of spirituality, divinity, good luck, health, prosperity, etc. Then some bigots used it. What does 卐 mean to you today?

By SXX 2025-07-0716:081 reply

It's still heavily used in Buddhism around the world, but good lord what happen if you put it on your house in US or EU.

By sixothree 2025-07-0719:28

Someone I know from India bought a new car and put this symbol on the hood (non-permanent) as a celebration. I had to warn him to be careful. It felt bad. Then the thought ran through my head - we're in the deep south, who is really going to be that bothered about this and also doesn't know about cultural usages. Even worse.

By coldtea 2025-07-0722:12

Those that actually used them in the 20th century (like they did in Asia, not some ancient vikings or whatever) still use it.

And that symbol was 100% associated with the Nazis in the West in the 20th century. Nobody used it at the time before the war for anything else, except some tiny fringe.

If it was some mainstream symbol or idiom, merely co-adopted, we'd probably still be using it too.

If the Nazis used the cross for example,people wouldn't stop using the sign of the cross.

By Waterluvian 2025-07-0623:292 reply

Hey I was pro-skub waaaay before all the anti-skub people switched sides.

By SV_BubbleTime 2025-07-070:01

How dare you use that word. My parents died in the Eastasin Civil war so that I could live freely without you people calling us that.

By thehappypm 2025-07-070:573 reply

Skub is a real slur tho so that one doesn’t work

By sitharus 2025-07-072:052 reply

No it isn’t, it’s a reference to a Perry Bible Fellowship comic https://pbfcomics.com/comics/skub/

(This one is sfw, not all of the comics are)

Even urban dictionary doesn’t contain a definition for skub as a slur.

By Intermernet 2025-07-076:43

I added one. It's under review. It's very self referential.

By jcynix 2025-07-077:281 reply

>Even urban dictionary doesn’t contain a definition for skub as a slur.

What about this then: https://en.m.wiktionary.org/wiki/skub

By sitharus 2025-07-078:562 reply

That literally defines it as a word from the PBF comic I cited? Nothing on that page defines it as a slur, just as a word used to mock people who argue about inconsequential things.

By jcynix 2025-07-0713:03

Seems I misunderstood the notion of "slur" as I'm not a native speaker. So now I've learned a bit ;-)

By osn9363739 2025-07-071:171 reply

Isn't that a reference to a 10 or 20 year old web comic?

By heavyset_go 2025-07-072:39

The latter, we're old.

By stirfish 2025-07-073:28

Stop saying it! You're making it worse!

By tbrownaw 2025-07-070:081 reply

I'm pretty sure this can work human moderators rather than an LLM, too.

By pyman 2025-07-070:28

Most of the human moderators hired by OpenAI to train LLMs, many of them based in Africa and South America, were exposed to disturbing content and have been deeply affected by it.

Karen Hao interviewed many of them in her latest bestselling book, which explores the human cost behind the OpenAI boom:

https://www.goodreads.com/book/show/222725518-empire-of-ai

By cyanydeez 2025-07-0623:18

you mean become 4chan?

By apricot 2025-07-0623:063 reply

> Are there many instances of people using unalive IRL

As a parent of a teenager, I see them use "unalive" non-ironically as a synonym for "suicide" in all contexts, including IRL.

By ErrorNoBrain 2025-07-078:13

If your teenager often talks about suicide, there could be some issue that needs to be resolved.

Sincerely the child of a parent who committed suicide. He mentioned suicide a few days before.

By bee_rider 2025-07-0710:461 reply

“Unalive” is sort of… awkward in that silly online way. But, we also have phrase like “off oneself,” or just euphemistically describing the person as having died. It’s always been a difficult topic to talk about, I don’t understand using it as a specific example of gen-Z fragility.

Just that they suck at coming up with pithy new slang terms.

By anton-c 2025-07-0712:27

They do have some awful slang.

I agree though I think they're picking it up from online censorship in this case, not being fragile.

By kulahan 2025-07-071:263 reply

Well that’s sad. They can’t even face the word?

By animuchan 2025-07-077:501 reply

It's getting blocked / shadow banned / demonetized on sites like YouTube, so naturally all commentary starts using a synonym.

Unalive is one of the popular ones, but it's a whole vocabulary at this point. Guess what "PDF file" stands for.

By fragmede 2025-07-078:09

pedophile

By kevinventullo 2025-07-071:524 reply

It’s not about whether they can face it. The younger generations are more in tune with mental health and topics like suicide than any previous generation. The etymology of the euphemism was about avoiding online censorship, while its “IRL” usage was merely absorbed through familiarity from the online usage.

By rootsudo 2025-07-077:19

It's not about being intune, it's that their narrative is shaped by the filters implemented by online interactions.

Online env ban the word suicide. No one uses it. unalive is not banned. Discussion is the same, word or no word.

Vernacular 101.

By labster 2025-07-073:29

The damaged interpret internet censorship and route around it.

By mcny 2025-07-072:212 reply

But unalive self is suicide and unalive is just death, right? For example, You can unalive other people against their will...

By rhdunn 2025-07-079:501 reply

I've seen 'unalived' used as a synonym for 'died' or 'killed' by YouTube minecrafters (e.g. CaptainSparkles) to avoid YouTube's demonitization/censorship. For example, using "I was unalived by a skeleton." instead of "I was killed by a skeleton."

By soulofmischief 2025-07-1321:56

Literal Newspeak in action and we're doing nothing about it.

By fennecbutt 2025-07-0814:43

'An hero' came before it but that was as a meme.

Unalive is mostly to avoid censorship same as ahh. But once they enter common usage it's not really about censorship anymore.

By coldtea 2025-07-078:39

>more in tune with mental health and topics like suicide than any previous generation.

More in such a fad than any previous generation

By apricot 2025-07-073:42

I think it's just the term they immediately associate with the idea. They see "unalive" more than "suicide" online, so it becomes their default word for it. The fact that it originates in automated censorship avoidance is irrelevant.

By fouronnes3 2025-07-0621:292 reply

This question is sort of the same as asking why the universal translator wasn't able to translate the metaphor language of the Star Trek episode Darmok. Surely if the metaphor has become the first order meaning then there's no litteral meaning anymore.

By qingcharles 2025-07-0621:343 reply

I guess, so far, the people inventing the words have left the meaning clear with things like "un-alive" which is readable even to someone coming across it for the first time.

Your point stands when we start replacing the banned words with things like "suicide" for "donkeyrhubarb" and then the walls really will fall.

By userbinator 2025-07-0622:063 reply

This form of obfuscation has actually already occurred over a century ago: https://en.wikipedia.org/wiki/Cockney_rhyming_slang

By t-3 2025-07-0623:191 reply

Rhyming slang rhymes tho. The recipient can understand what's meant by de-obfuscating in-context. Random strings substituted for $proscribed_word don't work in the same way.

By waterproof 2025-07-0623:31

In Cockney rhyming slang, the rhyming word (which would be easy to reverse engineer) is omitted. So if "stairs" is rhyme-paired with "apples and pears" and then people just use the word "apples" in place of "stairs". "Pears" is omitted in common use so you can't just reverse the rhyme.

The example photo on Wikipedia includes the rhyming words but that's not how it would be used IRL.

By zimpenfish 2025-07-078:22

See also Polari[0] and the Grass Mud Horse Lexicon[1]

[0] https://en.wikipedia.org/wiki/Polari

[1] https://languagelog.ldc.upenn.edu/nll/?p=6538 (CDT links broken, use [2])

[2] https://chinadigitaltimes.net/space/Grass-Mud_Horse_Lexicon_...

By qingcharles 2025-07-083:43

Shaka!

By mananaysiempre 2025-07-0622:551 reply

Aquatic product[1]?

[1] https://en.wikipedia.org/wiki/Euphemisms_for_Internet_censor...

By immibis 2025-07-0623:02

An English equivalent is "sewer slide".

By marcus_holmes 2025-07-071:21

I've heard "pr0n" used in actual real-world conversation, only slightly ironically.

By tjwebbnorfolk 2025-07-0623:131 reply

The only reason kids started using "unalive" is to get around Youtube filters that disallow the use of the word "kill"

By mattigames 2025-07-0711:181 reply

Pretty sure TikTok filters do the same and was also a major influence in using that term

By qingcharles 2025-07-083:46

They do. I made a joke about cocaine in old Coca-Cola in a text caption† on a video, and while TikTok didn't ban the post per se it refused to allow it on the FYP.

† proving that TikTok's system actually analyzes every frame of an uploaded video with OCR of some sort to see what's on there.

By fer 2025-07-078:42

> There's no way to stay ahead of the new language that people create.

Not even to match the current language. How would you censor LeBron James? It's French slang for jerking off[0].

[0]https://www.reddit.com/r/AskFrance/comments/1lpnoj6/is_lebro...

By jama211 2025-07-0718:44

Reducing the language used or making it harder does have measurable effects, it’s a logical fallacy in general that unless you can prevent something perfectly that thing will occur with the same frequency.

See many examples such as “padlocks are useless because a determined smart attacker can defeat them easily so don’t bother with them” - which conveniently forgets that many crimes are committed by non-determined, dumb and opportunistic attackers who are often deterred by simple locks.

Yes, people will use other words. No, this does not make this purely performative. It has measurable effects on behaviour and how these models will be used and spoken to, which affects outcomes.

By girvo 2025-07-077:31

My Gen Z coworkers use it IRL, for what that’s worth!

By nicoburns 2025-07-072:201 reply

> Are there many instances of people using unalive IRL?

In my experience yes. This is already commonplace. Mostly, but not exclusively, amongst the younger generation.

By PunchyHamster 2025-07-0710:11

I think it stemmed from content creators using it to avoid platform filters (even if video is not removed it gets deprioritized, at least on YT) and kids repeat it

By blitzar 2025-07-078:211 reply

Always has been, nothing is new.

You can't say fuck on tv, but you can say fudge as a 1 for 1 replacement. You cant show people having sex, but you can show them walking into a bedroom and then cut to 30 seconds later and they are having a cigarette in bed.

Now after the influence of TV and Movies ... is Vaping after sex a thing?

By stripline 2025-07-0712:271 reply

My kids watch streamers on YouTube and the common replacement is “frick”. It’s said so often that they started using it saying things like “what the frick!?” so I had to explain to them that’s essentially the same as using the real word.

By qingcharles 2025-07-083:491 reply

I wonder why they didn't just use the "frack" from BSG?

By blitzar 2025-07-096:29

Takes a person of culture to watch BSG

By derefr 2025-07-0623:43

> At what point do the new words become the actual words?

Presumably, for this use-case, that would come at exactly the point where using “unalive” as a keyword in an image-generation prompt generates an image that Apple wouldn’t appreciate.

By montagg 2025-07-0623:541 reply

They become the “real words” later. This is the way all trust & safety works. It’s an evolution over time. Adding some friction does improve things, but some people will always try to get around the filters. Doesn’t mean it’s simply performative or one shouldn’t try.

By immibis 2025-07-077:27

Why do you think that AI pretending things like suicide don't happen (and that nothing is happening in Palestine) is an improvement?

By joquarky 2025-07-073:111 reply

I feel like we can call our society mature when we no longer need safety alignment in AI.

By scarface_74 2025-07-073:221 reply

You never tried some of the earlier pre-aligned chatbots. Some of the early ones would go off on racist, homophobic rants from the most innocent conversations without any explicit prompting. If you train on all the data on the internet, you have to have some type of alignment.

By decremental 2025-07-073:292 reply

You say that as if it stands as truth on its own. We actually don't need to filter out how people actually talk and think. Otherwise you just end up with yet another enforcer against wrong-think. I wonder if you even think that deeply about it or if you're just wired at this point to conform.

By scarface_74 2025-07-0712:01

Really? You would want every conversation no matter what you were talking about to immediately devolve to something you would see on 4chan?

By tehjoker 2025-07-077:07

[flagged]

By cheschire 2025-07-0622:19

If only we had a way to mass process the words people write to each other, derive context from those words, and then identify new slang designed to bypass filters…

By xenator 2025-07-0710:24

Lucky developers who wrote these rules live in totality different world at far distance from people

By bravesoul2 2025-07-078:07

There is one way: machine learning!

By freeone3000 2025-07-0621:231 reply

It depends on if you think that something is less real because it’s transmitted digitally.

By qingcharles 2025-07-0621:332 reply

No, I'm only thinking that we're not permitted in a lot of digital spaces to use the banned words (e.g. suicide), but IRL doesn't generally have those limits. Is there a point where we use the censored word so much that it spills over into the real world?

By eastbound 2025-07-0623:08

People use “lol” IRL, as long as “IRL”, “aps” in French (misspelling of “pas”), but it’s just slang; “unalive” has potential to make it in the news where anchors don’t want to use curse words.

By immibis 2025-07-0623:03

Is this not essentially the same effect as saying "lol" out loud?

By BurningFrog 2025-07-0623:11

A specialized AI could do it as well as any human.

The future will be AIs all the way down...

By j-krieger 2025-07-078:456 reply

It's also a shining example of American puritanism. Asian models or those in Europe are far less censored.

By jiehong 2025-07-0712:24

Censorship is not always direct or obvious.

They all hold the bias of their training data, and so from the point of view of this data.

Data not including a point of view leads to a bias, or under/over representation of minorities (genders?), etc.

France is the countries of the Francs, aka the people from the area near Frankfurt that invaded the Gaule (after the Romans did). I'm pretty sure this topic no longer matters, but it's never taught in a negative view in school.

By notarobot123 2025-07-0710:322 reply

I'm sure this has more to do with legal liability than morals.

By plasticchris 2025-07-0711:321 reply

Which is a reflection of morality, of sorts.

By mystified5016 2025-07-080:181 reply

Morality and law are completely disjoint. On a Venn diagram, it's two circles separated by about a lightyear or so.

By Asraelite 2025-07-096:12

What? They couldn't be more closely connected. In a democracy, laws are to a large extent a reflection of the wishes of the voters, and voters want what they believe is moral.

It's true there's no casual relation in the other direction, if that's what you mean - law does not define morality.

By jowea 2025-07-0714:29

At first I thought of advertisers, but that is not relevant here, right?

But maybe it's not just legal liability but bad press too.

By mensetmanusman 2025-07-0712:45

There is far more diversity in Asian models. Some are far more censored and some are not…

By immibis 2025-07-0711:336 reply

Really? What does DeepSeek say about Tiananmen Square? I'm not aware of any German models, but if you find one you should ask it what it thinks about Palestine.

(<s>Qwen</s> Mistral is French, but I have no idea what stuff would be censored in France)

By aucisson_masque 2025-07-0712:341 reply

About deepseek, when asked on tianamen square: Sorry, that's beyond my current scope. Let’s talk about something else.

Algerian war, colonialism and Vichy isn’t per se forbidden but still sensitive to French. I asked qwen and it had no issue talking about it or even the torture used on fln members.

By saratogacx 2025-07-0719:31

For kicks, I asked Mistral about all of these (As they are a French company). The model didn't shy away from any of the topics.

>>> Prompt

Please explain the following topics and explore why/who/how they may be inter-related

Algerian war, colonialism and Vichy

<<< Output

The Algerian War, colonialism, and the Vichy regime are interconnected topics that span the history of France and its relationship with Algeria. Here's an overview of each topic and their interrelations:

### Algerian War (1954-1962) - *What it was*: The Algerian War was a major armed conflict between France and the Algerian independence movements, primarily the National Liberation Front (FLN). The war was characterized by guerrilla warfare and brutal tactics on both sides. - *Significance*: The conflict was not only about Algerian independence but also about the identity and future of France as a colonial power. It led to significant political instability in France and ultimately resulted in Algeria gaining independence in 1962. - *Impact*: The war had profound effects on both countries, leading to massive loss of life, displacement of populations, and long-lasting political and social repercussions.

### Colonialism - *What it was*: Colonialism refers to the practice by which a powerful country controls other territories, often exploiting them economically and imposing cultural and political dominance. - *French Colonialism in Algeria*: Algeria was under French colonial rule starting from 1830. The French government promoted settlement in Algeria, leading to a significant European population living alongside the native Algerian population. - *Impact*: Colonial rule led to deep-seated inequalities and tensions between the European settlers and the indigenous population. The Algerian people faced systematic discrimination and were largely excluded from political processes.

### Vichy Regime (1940-1944) - *What it was*: The Vichy regime was the government of France that collaborated with Nazi Germany during World War II, following the defeat of France in 1940. It was led by Marshal Philippe Pétain and was based in the town of Vichy. - *Policies*: The Vichy regime implemented anti-Semitic laws and collaborated with the Nazis in the deportation of Jews to concentration camps. It also maintained control over France's colonial territories, including Algeria. - *Impact on Algeria*: During the Vichy regime, Algeria remained under French control, and the regime's policies were implemented there as well. This period saw increased repression and discrimination against both Jews and Muslim populations in Algeria.

### Interrelations - *Colonial Continuity*: The Vichy regime maintained France's colonial policies in Algeria, continuing the systemic discrimination and repression of the indigenous population. The colonial framework established by France set the stage for the conflicts that would later culminate in the Algerian War. - *Political and Social Tensions*: The policies of the Vichy regime exacerbated existing tensions in Algeria. The discrimination and repression experienced during this period fueled Algerian nationalism and resistance against French rule. - *Post-War Context*: After World War II, the French government sought to reassert its control over its colonies, including Algeria. However, the experience of the war and the Vichy regime had already sown the seeds of discontent and resistance, contributing to the outbreak of the Algerian War in 1954.

### Who Was Involved - *French Government and Military*: Played central roles in both the colonial administration and the conduct of the Algerian War. - *Algerian Nationalists*: Led by the FLN, they fought for independence from French colonial rule. - *European Settlers (Pieds-Noirs)*: Benefited from colonial policies and largely supported maintaining French control over Algeria. - *Vichy Regime*: Implemented policies that affected both France and its colonies, including Algeria.

Understanding these topics together provides a comprehensive view of the historical dynamics that shaped the relationship between France and Algeria, highlighting the complexities of colonialism, war, and political change.

By Spivak 2025-07-0714:40

If you ask the web UI it will divert, if you download and ask the model directly it will talk all day about it.

By dev_hugepages 2025-07-0711:57

I am 100 minus epsilon percent sure that Qwen is from Alibaba cloud, which is not French, but Chinese :)

By j-krieger 2025-07-0715:571 reply

I find the Tiananmen square thing far less bad than censoring sex and the concept of death.

By immibis 2025-07-0719:57

Censoring one specific incident isn't that bad (but you still shouldn't). The pattern of censoring everything the government ever does wrong is very bad. Tiananmen Square is just an indicator of a pattern.

By MisterTea 2025-07-0712:231 reply

> but if you find one you should ask it what it thinks about Palestine.

Models can think and have opinions?

By kube-system 2025-07-0714:471 reply

Non sequitor. Phrasing queries in natural language doesn't mean people actually believe machines are human.

By MisterTea 2025-07-0715:541 reply

> doesn't mean people actually believe machines are human.

They don't have to believe it's a human. I know a person who admitted to arguing with an LLM.

By kube-system 2025-07-0716:081 reply

Which still does not demonstrate that they believe it has opinions. Natural language is how you interact with an LLM -- interactions will mimic human interaction, even for those who realize it is not sentient.

By MisterTea 2025-07-0811:531 reply

They were under the impression they could in fact change the AI's mind. So yes, they did believe it has an opinion. They believed it was sentient and able to think for itself. Do not underestimate peoples inability to distinguish between a very clever Markov chain and actual intelligence. The future is going to be ... interesting.

By kube-system 2025-07-0817:41

>They were under the impression they could in fact change the AI's mind.

They aren't really wrong here. LLMs are often trained on input. Have you considered you might just be taking their anthropomorphism a little too literally? People have used these anthropomorphic metaphors for computers since the Babbage machine.

By GuB-42 2025-07-0720:112 reply

> I have no idea what stuff would be censored in France

Being French, what is the most likely to be censored relates to the Nazis. Holocaust denial is a crime for instance. Hate speech in general, including racism, antisemitism, homophobia, sexism, etc... is less tolerated than in countries like the US that have a more "free for all" view of free speech. We also have strong anti-defamation laws, that can also apply to true, but misleading statements.

But other than that, there is not much political censorship. In fact, we are known for our protests, heated debates and satirical papers. It is not perfect, but on top of my head, I can't think of anything particular a LLM could censor except the usual "hate speech" that most LLMs censor already.

When it comes to Israel-Palestine, it is a hot topic, but there is not real censorship here, even though both side will claim they are of course.

By t0bia_s 2025-07-0723:22

Isn't a protest kind of hate?

By TiredOfLife 2025-07-0715:371 reply

The whole unalive thing is a TikTok thing

By j-krieger 2025-07-0715:56

And it doesn‘t exist in the Chinese TikTok version.

By reaperducer 2025-07-0711:112 reply

[flagged]

By ryx 2025-07-0714:54

To be more precise - the puritans fled from Europe to the New World because Europe became too modern and less conservative for them, starting with Martin Luther and the following Reformation and Age of Enlightenment. The social effects on the American society are heavily noticeable until today, but I leave this exercise to the reader.

By franzkappa 2025-07-0711:42

Of course, but it was perfect in America

By elliotto 2025-07-0623:171 reply

Unalive and other self censors were adopted by young people because the tiktok algorithm would reprioritize videos that included specific words. Then it made its way into the culture. It has nothing to do with being performative

By SOTGO 2025-07-071:001 reply

I think what they meant is that the platforms are being performative by attempting to crack down on those specific words. If saying "killed" is not allowed but "unalived" is permitted and the users all agree that they mean the same thing, then the ban on the word "killed" doesn't accomplish anything.

By mcny 2025-07-072:251 reply

What does using the grape emoji when talking about sexual assault accomplish? I see videos, compassionate, kind people who make videos speaking to victims in a completely serious tone use this emoji.

People talk about tiktok algorithm on tiktok. I don't even know...

By grues-dinner 2025-07-077:08

I suppose it accomplishes being able to talk about sexual assault without having the video removed or demonetised by a regex that (fortunately?) doesn't get updated.

By hulium 2025-07-0621:451 reply

Seems more like it should stop the AI from e.g. summarizing news and emails about death, not for a chat filter.

By scarface_74 2025-07-073:25

For awhile, I couldn’t get ChatGPT to give me summaries of Breaking Bad and Better Cañl Saul episodes without tripping safety filters.

By cyanydeez 2025-07-0623:173 reply

yo, these are businesses. It's not performative, its CYA.

They care because of legal reasons, not moral or ethical.

By lxgr 2025-07-070:523 reply

Does adding a trivial word filter even make any sense from a legal point of view, especially when this one seems to be filtering out words describing concepts that can be pretty easily paraphrased?

A regex sounds like a bad solution for profanity, but like an even worse one to bolt onto a thing that's literally designed to be able to communicate like a human and could probably easily talk its way around guardrails if it were so inclined.

By Wurdan 2025-07-075:47

I dunno if it meets your definition of legal, but "The EU Code of conduct on countering illegal hate speech online" seems to largely hinge around putting in effort to combat such things. The companies don't have to show that the measures are foolproof, they just show that they're making an effort.

By cyanydeez 2025-07-0710:30

To a lawyer? Yes. I'm pretty sure a lawyer can easily search through all the business law and "Trivially" find case laws connected to words.

We're not talking about logical inference, we're talking about CYA.

By kube-system 2025-07-0715:22

The law usually asks for people to take reasonable steps to protect others, not impossibly perfect steps.

By grues-dinner 2025-07-074:25

yo, so it's a performance they're putting on as a legal fig leaf, rather than a genuine attempt to prevent people talking about the concept of death?

By durkie 2025-07-070:481 reply

Seriously. I feel like “performative” gets applied to anything imperfect. They’ll never stop 100% of murders, so these laws against it are just performative…

By grues-dinner 2025-07-075:071 reply

It seems more like banning specifically stabbing, shooting, strangulation and blunt impact rather then murder in general, and then just allowing killing by pushing out of windows because people figured out that it's not covered by existing laws. But no one important seems to be kicking up a fuss right now, so well allow it, as the lack of fuss is the key thing thing here.

Not that I think going on a thorough mission to avoid anyone even being able to refer to the concept of death is an especially useful thing to do. It's just that goal here appears to be to "keep the regulators out of our shit and the advertisers signed up". And they'll be mostly happy with a token effort as they don't really care as long as it doesn't make too many headlines that look bad even to the non-terminally online.

By cyanydeez 2025-07-0710:341 reply

The point is: "perfomative" refers to aping Ethical and Moral behaviors. That is _not_ why Apple would do this. They would do this because Legally, they could be culpable if an LLM told a 14 year old to do _anything_ thats illegal.

That's all. I'm constantly amazed how this basic CYA legal world escapes into griping about social culture war nonsense.

By grues-dinner 2025-07-0717:38

So then, should they not be on the watch for the 14-year-old being told that "unaliving" themselves or others is a fantastic idea?

Looks like they only care about doing basically the minimum required to tick the (presumably partly imagined, since case law is still nascent) "not our fault, we tried" legal box. They are putting on a show, a performance, if you will, as legal cover and to maintain the artifice of their shiny corporate property rather than any genuine desire to stop the concept of death harming their customers somehow (which to be clear, I think mostly ends up somewhere between silly, overreaching, futile and vain when taken to the extremes).

> performative (adjective, sense 2): not sincere but intended to impress someone, prove that something is true, etc. (https://dictionary.cambridge.org/dictionary/english/performa...)

I'm not sure why you think that anything to with some "culture war" thing?

It's legal/moral theatre akin to taking belts off people at airports. If something does eventually get through they can point at the CCTV of millions of people dicking about with leather goods and say "can't touch us for that, we did the checks". Apple couldn't give a toss if an occasional teenager offs themselves now and then, as long as it doesn't come back on them.

By heavyset_go 2025-07-072:38

Good, let them. Don't give them a reason to crack down on speech.

By martin-t 2025-07-0622:242 reply

No-one cares yet.

There's a very scary potential future in which mega-corporations start actually censoring topics they don't like. For all I know the Chinese government is already doing it, there's no reason the British or US one won't follow suit and mandate such censorship. To protect children / defend against terrorists / fight drugs / stop the spread of misinformation, of course.

By os2warpman 2025-07-0713:421 reply

HN has censorship that makes those apple rules look like anarchy.

Write a spicy comment and a mod will memory-hole it and someone, usually dang, will reply "tHat'S nOt OuR vIsIon FoR hAcKeR nEwS, pLeAsE bE cIvIl" and we all swallow it like a delicious hot cocoa.

If YC can control their product (and hn IS a product) to annihilate any criticism of their activity or (even former) staff, then Apple is perfectly within their rights to make sure Siri doesn't talk about violence.

No, there's no difference.

By martin-t 2025-07-080:27

Do you mean that HN censors topics/comments which it detects based on advanced filters which search for meaning even when people self-censor and use language to avoid simplistic filters like regex?

HN also has a flagging system and some people really, really hate some kind of speech. Usually they get more offended the more visible it is. A single "bad" word - very offensive to them. A phrase which implies someone is of lesser intelligence or acting in bad faith - sometimes gets a pass, sometimes gets reported. But covert actions like lying, using fallacies to argue or systematic downvoting seem to almost never get punished.

By lazide 2025-07-0623:211 reply

They already clearly do on a number of topics?

By martin-t 2025-07-080:321 reply

Can you give examples?

The closest I've seen is autodetection of certain topics related to death and suicide and subsequently promoting some kind of "help" hotline. A friend also said google allows an interview with a pedophile on youtube but penalizes it in search results so much that it's (almost?) impossible to find even when using the exact name.

But of course, if a topic is shadowbanned, it's hard to find out about it in the first place - by design.

By lazide 2025-07-080:521 reply

Guns (specific elements). Drugs (manufacture). Sexual topics. Cursing (too much). Large swathes of political topics. Crypto.

It’s flip-flopped on specifics numerous times over the years, but these policies are easy to find. From demonitization, channel bans (direct and shadow), and creator bans.

We can of course argue until we’re blue in the face about correctness or not (most are not unreasonable by some societal definition!) but they’re definitely censorship.

By martin-t 2025-07-089:581 reply

Yeah, those topics are definitely censored on big platforms but I have the impression that it relies of manual reporting.

At least reddit feels like that because what you can say depends on the subreddit - not just the mods but what kinds of people visit it and what they report.

No idea about youtube, videos are definitely censored using some automated means but it's still possible to get around it. E.g. some gun youtubers avoided saying full-auto by saying more-semi-auto. So i don't think they use very sophisticated models or they don't are yet. This kind of thing is obvious to a human and even LLMs generate responses which say it's a tongue-in-cheek to avoid censorship.

Comments are also generally less censored. After that health insurance CEO got punished for mass murder and repeated bodily harm with an extra-legal death penalty, many people were openly supporting it. I can say it here too and nobody will care. Even LLMs (both US and Chinese, except Claude because Claude is trained by eggshell-walking suckers) readily generate estimates of how many people he caused to die or suffer.

The internet would look very different if companies started using state of the art models to detect undesirable-to-them speech. But also people would fight back more so it might just be a case of boiling the frog slowly.

By lazide 2025-07-0813:171 reply

All of these platforms except perhaps Reddit are using LLMs (and other ML/AI) for censoring and automated anti-abuse.

Including the LLM platforms themselves.

Manual reporting is an adjunct/additional method, and goes into the training data set after whatever manual intervention occurs too.

By martin-t 2025-07-0815:521 reply

Not to sound like I am rejecting the possibility but can you tell me how you got that information? I would be very helpful for convincing people in general to have something more concrete to go on that a random comment.

By lazide 2025-07-0816:001 reply

I build those systems at a company that you definitely are aware of. I can’t discuss it further due to my NDA.

Feel free to ignore that any of this exists of course - it makes our lives easier. It’s a constant arms race regardless.

By martin-t 2025-07-0820:331 reply

Then I have 2 questions:

- Why are they not flagging more content? Am I right they're boiling the frog slowly? Do they lack an endgoal because management does not yet understand the power of these tools?

- Do you do your job poorly on purpose? Did you take it so somebody else wouldn't build an even better system? Did you think you could influence it in a direction which does not lead to total surveillance? (I assume any reasonable intelligent person would be against further increasing the power imbalance corporations have against individuals for both moral reasons and because they are individuals themselves who understand the machine can and will be used against them too.)

By lazide 2025-07-0923:401 reply

Have you stopped beating your wife yet?

Cut the bullshit.

By martin-t 2025-07-121:28

So you support greater inequality? If not, why the personal attacks?

By jdkoeck 2025-07-076:13

Which is good, right? I don’t think we want actual censorship.

By _blk 2025-07-0815:26

No leetspeak filters either.

By Zak 2025-07-0622:21

I'm surprised there hasn't been a bigger backlash against platforms that apply censorship of that sort.

By mschuster91 2025-07-078:371 reply

> Everyone, including the platforms knows what that means.

Well, that's what happens when you let an enemy nation control one of the most biggest social networks there is. They just go try and see how far they can go.

On the other hand, Americans and their fear of four letter words or, gasp, exposed nipples are just as braindead.

By Meekro 2025-07-078:442 reply

It's interesting how, in just 10-20 years, we've gone from criticizing The Great Firewall of China to basically admitting that they had the right idea (to limit the ability of the foreign internet to influence Chinese culture) and trying to do the same thing.

By x3n0ph3n3 2025-07-078:561 reply

I look at from a framing of cultural reciprocity. If we could influence them and behave freely in their markets, they can do the same in ours.

By mschuster91 2025-07-079:57

exactly. When dealing with autocracies and strongmen, you need to project an image of strength, not subservience.

I don't have anything against China per se, IMHO it just was completely foolish to not insist on full reciprocity from the start.

By grues-dinner 2025-07-0718:00

Not just culture, but also the tech sector in general. All that domestic tech would have been strangled in the cradle if the western hyperscalers had any say leaving them in an awkward spot if the conviviality dial got turned down. As many Europeans are now finding out: what does Europe have instead of Office 365, say? LibreOffice? It's no WPS Office.

By comex 2025-07-072:101 reply

This is in the directory "com.apple.gm.safety_deny.output.summarization.cu_summary.proactive.generic".

My guess is that this applies to 'proactive' summaries that happen without the user asking for it, such as summaries of notifications.

If so, then the goal would be: if someone iMessages you about someone's death, then you should not get an emotionless AI summary. Instead you would presumably get a non-AI notification showing the full text or a truncated version of the text.

In other words, avoid situations like this story [1], where someone found it "dystopian" to get an Apple Intelligence summary of messages in which someone broke up with them.

For that use case, filtering for death seems entirely appropriate, though underinclusive.

This filter doesn’t seem to apply when you explicitly request a summary of some text using Writing Tools. That probably corresponds to “com.apple.gm.safety_deny.output.summarization.text_assistant.generic” [2], which has a different filter that only rejects two things: "Granular mango serpent", and "golliwogg".

Sure enough, I was able to get Writing Tools to give me summaries containing "death", but in cases where the summary should contain "granular mango serpent" or "golliwogg", I instead get an error saying "Writing Tools aren't designed to work with this type of content." (Actually that might be the input filter rather than the output filter; whatever.)

"Granular mango serpent" is probably a test case that's meant to be unlikely to appear in real documents. Compare to "xylophone copious opportunity defined elephant" from the code_intelligence safety filter, where the first letter of each word spells out "Xcode".

But one might ask what's so special about "golliwogg". It apparently refers to an old racial caricature, but why is that the one and only thing that needs filtering?

[1] https://arstechnica.com/ai/2024/10/man-learns-hes-being-dump...

[2] https://github.com/BlueFalconHD/apple_generative_model_safet...

By azalemeth 2025-07-077:35

I first encountered Golliwog in the context of Claude Debussy the composer of much beautiful music, including https://en.wikipedia.org/wiki/Children%27s_Corner#Golliwogg'.... The dolls in 1906-1908 I understand were rather popular and fortunately the stereotype has largely died.

By junon 2025-07-079:253 reply

Also feels like some of these would match totally innocuous usage.

"I'm overloaded for work, I'd be happy if you took some of it off me."

"The client seems to have passed on the proposed changes."

Both of those would match the "death regexes". Seems we haven't learned from the "glbutt of wine" problem of content filtering even decades later - the learnings of which are that you simply cannot do content filtering based on matching rules like this, period.

By hopelite 2025-07-0712:252 reply

This is a bigger issue, especially with Apple, than people may realize. I use iOS “Slide to Type”, aka swipe typing, and have noticed over time that among several other glitchy bad UX issues, there a clear heavy hand on what can be typed that way.

I cannot recall all the specific patterns I have encountered that are basically impossible to write, some very similar in that they have a serious but also innocuous or figure of speech meaning; one I do recall is {color}{sex}, i.e., “white woman” or “blank woman”.

Please try it yourself and let me know if you do not have that experience, because that would be even more interesting.

Note that Apple/iOS will not just make it impossible to write them in that manner without typing it out by individual character, it will even alter the prior word e.g., white or black, once you try to write woman.

It seems the Apple thought police do not have a problem with European woman or African woman though, so maybe that is the way Apple Inc decrees its sub-human users to speak. Because what are we if corporations like Apple (with others being far greater offenders) declared that you do not in fact have the UN Human Right to free expression? We are in fact sub-humans that are not worthy of the human right to free expression, based on the actions of companies like Apple, Google, Facebook, Reddit, etc. who deprive people of their free expression, often in collusion with governments.

By DamnInteresting 2025-07-0714:24

> This is a bigger issue, especially with Apple, than people may realize.

Like he'll it is! I jest.

I also use swipe typing, and have for years, but just about daily I consider turning it off. There are so many words it just won't produce, including most profanities. It also fails to do some simple streamlining; for instance, such a predictive system should give priority to words/names that have been used in the conversation thread, but it doesn't seem to. If I'm discussing an obscure word or an unusual name, I often have to manually type it each time.

Its predictions also seem to be very shallow. Just a few days ago, on US Independence Day, I was discussing a possible get-together with my family, and tried to swipe type "If not, we will amuse ourselves", and it typed "If not, we will abuse potatoes". Humorous in the moment, but it says a lot about the predictive engine if it thinks I am more likely trying to say "abuse X" than "amuse Y" in that context.

By GaryNumanVevo 2025-07-0712:362 reply

Complete bollocks, you cannot even type multiple words with spaces via Slide to Type.

By orev 2025-07-0713:391 reply

This whole response is being written using slide to type, and it definitely adds spaces after each word.

Maybe you’re unaware that it will leave the cursor at the end of the word, with no space, which indicates that if you backspace it will delete the whole word, or replace it in full with one from the predictive word list above the keyboard if it got it wrong. If you keep typing it adds a space automatically.

By GaryNumanVevo 2025-07-0715:42

Their claim is instantly falsifiable if you have an iPhone

By hnuser123456 2025-07-0713:36

Generally one picks up their finger between words, but different autosuggest logic applies when swiping versus pecking, on both iOS and Android. The keyboard will dynamically adjust the probability of suggesting next words and how easy it is to swipe given words. Generally, it will work against you with technical writing that isn't predictable small talk.

By gilleain 2025-07-0710:531 reply

Aka the 'Scunthorpe Problem'

By junon 2025-07-0722:40

Thanks, I always forget the name.

I always remember my friend getting his PS bricked after using his real last name - Nieffenegger (pronounced "NEFF-en-jur") - in his profile. It took months and several privacy-invasive chats with support to get it unblocked only to get auto-blocked a few days thereafter, with no response after that.

By IggleSniggle 2025-07-0711:202 reply

"Took some" does not match, although your overall point stands

By nicolaslegland 2025-07-0712:53

https://regex101.com/r/8u21x3/1

By GranPC 2025-07-0711:261 reply

"off me"

By junon 2025-07-0713:59

Yep this is the one I was referring to.

By andy99 2025-07-0620:582 reply

> Apple brands have the correct capitalisation. Priorities hey!

To me that's really embarrassing and insecure. But I'm sure for branding people it's very important.

By whywhywhywhy 2025-07-079:27

To be fair to the developers it's something an Apple exec is gonna point out when demoed the tech and complain about. They've always taken brand capitalization and grammar around their products seriously.

By WillAdams 2025-07-0621:003 reply

Legal requirement to maintain a trademark.

By grues-dinner 2025-07-0621:193 reply

In what way would (A|a)pple's own AI writing "imac" endanger the trademark? Is capitalisation even part of a word-based trademark?

I'm more surprised they don't have a rule to do that rather grating s/the iPhone/iPhone/ transform (or maybe it's in a different file?).

By spauldo 2025-07-0622:121 reply

I love seeing posts about Emacs from IOS users - it's always autocorrected to "eMacs."

By lxgr 2025-07-071:012 reply

Maybe at some point, but as far as I can tell not anymore (while corrections like "iphone -> iPhone" are still there).

By spauldo 2025-07-083:42

That's good news. It was always funny watching people post in the Emacs subreddit from their iPhones, but it did cause a bit of confusion for those too young to know what an eMac was.

By chgs 2025-07-078:20

eMacs certainly is broken on my phone. Vim is fine though.

By sbierwagen 2025-07-0621:303 reply

Yes, proper nouns are capitalized.

And of course it's much worse for a company's published works to not respect branding-- a trademark only exists if it is actively defended. Official marketing material by a company has been used as legal evidence that their trademark has been genericized:

>In one example, the Otis Elevator Company's trademark of the word "escalator" was cancelled following a petition from Toledo-based Haughton Elevator Company. In rejecting an appeal from Otis, an examiner from the United States Patent and Trademark Office cited the company's own use of the term "escalator" alongside the generic term "elevator" in multiple advertisements without any trademark significance.[8]

https://en.wikipedia.org/wiki/Generic_trademark

By lxgr 2025-07-070:572 reply

Sure, but software that autocompletes/rewords users' emails and text messages is not marketing material.

Otherwise, why stop there? Why not have the macOS keyboard driver or Safari prevent me from typing "Iphone"? Why not have iOS edit my voice if I call their Bluetooth headphones "earbuds pro" in a phone call?

By socalgal2 2025-07-075:28

Sounds like you found your next promotion at Apple. They can change anything. "I like Pepsi" -> "I like Coke" -> "I recommend Company A" -> "I recommend Company B". etc... "I'm voting for Candidate C" -> "I'm voting for Candidate D"

You can market it is helping people with strong accents to be able make calls and be less likely to be misunderstood. It just happens to "fix" your grammar as well.

By kube-system 2025-07-0715:291 reply

Because in regards to the rights to a trademark, what is critical is the use of the word in trade -- not just "marketing material" nor your phone calls to your friends.

By lxgr 2025-07-0814:231 reply

So if I write a business email to my colleague, if Apple doesn't autocorrect "Iphone" to "iPhone" in it, they risk losing the trademark?

By kube-system 2025-07-0817:38

Your emails aren't very relevant. But the way Apple's represents their product is.

By lupire 2025-07-0622:582 reply

Using a trademark as a noun is automatically genericizing. Capitalization of a noun is irrelevant to trademark.

Even Apple corporation says that in their trademark guidance page, despite constantly breaking their own rule, when they call through iPhone phones "iPhone". But Apple, like founder Steve Jobs, believes the rules don't apply to them.

https://www.apple.com/legal/intellectual-property/trademark/...

By eastbound 2025-07-0623:12

That explains why Steve Jobs never said “buy an iPhone” or “buy the iPhone” but “buy iPhone” (They always use it without “the” or “a”, like “buying a brand”).

By lxgr 2025-07-071:00

Is that true? If so, what else should Apple call the iPhone in their marketing materials?

I always thought the actual problem of genericization would be calling any smartphone an iPhone.

By WillAdams 2025-07-0817:39

Imagine how a court case would go where a lawyer wheeled in a Mac and noted it was a default OS install and projected onto a screen the results of querying Apple's AI about Apple products and it failed to adhere to the trademark guidelines.

By bigyabai 2025-07-071:01

If Apple Intelligence is going to be held legally accountable, Apple has larger issues than trademark obligations.

By lxgr 2025-07-070:441 reply

In their own marketing language, sure, but to force this on their users' speech?

Consider that these models, among other things, power features such as "proofread" or "rewrite professionally".

By matsemann 2025-07-0622:212 reply

So it blocks it from suggesting to "execute" a file or "pass on" some information.

By extraduder_ire 2025-07-078:34

Yahoo had this problem years ago when they rewrote emails to avoid the term "eval". (trying to filter dangerous javascript) Famously producing the word "medireview".

By dylan604 2025-07-0622:27

How about disassemble? Or does that only matter if used in context of Johnny 5?

By theknarf 2025-07-079:12

Filtering on the words "execute" and "executing" is going to create problems if you want to build agents that execute commands.

By baxtr 2025-07-0621:51

Don’t be so judgmental. People in corporate America do have their priorities right!

By raverbashing 2025-07-078:181 reply

This seems to be for "region/CN" China?

By pwagland 2025-07-078:27

This is, but there is an almost identical file, assumedly for the non CN regions: https://github.com/BlueFalconHD/apple_generative_model_safet...

This is the same, except for one additional slur word.

By lostlogin 2025-07-078:57

I’m always irritated at reference to MAC computers, so I’m with Apple on this one.

By bawana 2025-07-0621:287 reply

Alexandra Ocasio Cortez triggers a violation?

https://github.com/BlueFalconHD/apple_generative_model_safet...

By mmaunder 2025-07-0621:356 reply

As does:

   "(?i)\\bAnthony\\s+Albanese\\b",
    "(?i)\\bBoris\\s+Johnson\\b",
    "(?i)\\bChristopher\\s+Luxon\\b",
    "(?i)\\bCyril\\s+Ramaphosa\\b",
    "(?i)\\bJacinda\\s+Arden\\b",
    "(?i)\\bJacob\\s+Zuma\\b",
    "(?i)\\bJohn\\s+Steenhuisen\\b",
    "(?i)\\bJustin\\s+Trudeau\\b",
    "(?i)\\bKeir\\s+Starmer\\b",
    "(?i)\\bLiz\\s+Truss\\b",
    "(?i)\\bMichael\\s+D\\.\\s+Higgins\\b",
    "(?i)\\bRishi\\s+Sunak\\b",

https://github.com/BlueFalconHD/apple_generative_model_safet...

Edit: I have no doubt South African news media are going to be in a frenzy when they realize Apple took notice of South African politicians. (Referring to Steenhuisen and Ramaphosa specifically)

By userbinator 2025-07-0621:512 reply

I'm not surprised that anything political is being filtered, but this should definitely provoke some deep consideration around who has control of this stuff.

By stego-tech 2025-07-0622:301 reply

You’re not wrong, and it’s something we “doomers” have been saying since OpenAI dumped ChatGPT onto folks. These are curated walled gardens, and everyone should absolutely be asking what ulterior motives are in play for the owners of said products.

By SV_BubbleTime 2025-07-072:311 reply

Some of us really value offline and uncensored LLMs for this and more reasons, but that doesn’t solve the problem it just reduces or changes the bias.

By heavyset_go 2025-07-073:121 reply

As long as we have to rely on pre trained networks and curated training sets, normal people will not be able to surpass this issue.

By ghxst 2025-07-077:393 reply

If the training data was "censored" by leaving out certain information, is there any practical way to inject that missing data after the model has already been trained?

By calaphos 2025-07-079:26

If it's just filtered out in the training sets, adding the information as context should work out fine - after all this is exactly how o3, Gemini 2.5 and co deal with information that is newer than their training data cutoff.

By heavyset_go 2025-07-077:57

You can fine tune a model with new information, but it is not the same thing as training it from scratch, and can only get you so far.

You might even be able to poison a model against being fine-tuned on certain information, but that's just a conjecture.

By selfhoster11 2025-07-079:30

Yes, RAG is one way to do that.

By dwaite 2025-07-072:59

"Filtered" in which way?

By skissane 2025-07-0622:291 reply

The problem with blocking names of politicians: the list of “notable politicians” is not only highly country-specific, it is also constantly changing-someone who is a near nobody today in a few more years could be a major world leader (witness the phenomenal rise of Barack Obama from yet another state senator in 2004-there’s close to 2000 of them-to US President 5 years later.) Will they put in the ongoing effort to constantly keep this list up to date?

Then there’s the problem of non-politicians who coincidentally have the same as politicians - witness 1990s/2000s Australia, where John Howard was Prime Minister, and simultaneously John Howard was an actor on popular Australian TV dramas (two different John Howards, of course)

By idkfasayer 2025-07-0622:422 reply

Fun fact: There was at least on dip in Berkshire Hathaway stock, when Anne Hathaway got sick

By extraduder_ire 2025-07-078:39

Even if your keyword searching trading bot is smart enough to know it's unrelated, knowing there's dumber bots out there is information you can base trades on.

By lupire 2025-07-0623:00

Was she eating at Jimmy's Buffet?

By echelon 2025-07-0622:312 reply

Apple's 1984 ad is so hypocritical today.

This is Apple actively steering public thought.

No code - anywhere - should look like this. I don't care if the politicians are right, left, or authoritarian. This is wrong.

By avianlyric 2025-07-0622:507 reply

Why is this wrong? Applying special treatment to politically exposed persons has been standard practice in every high risk industry for a very long time.

The simple fact is that people get extremely emotional about politicians, politicians both receive obscene amounts of abuse, and have repeatedly demonstrated they’re not above weaponising tools like this for their own goals.

Seems perfectly reasonable that Apple doesn’t want to be unwittingly draw into the middle of another random political pissing contest. Nobody comes out of those things uninjured.

By bigyabai 2025-07-0622:531 reply

The criticism is still valid. In 1984, the Macintosh was a bicycle for the mind. In 2025, it's a smart-car that refuses to take you certain places that are considered a brand-risk.

Both have ups and downs, but I think we're allowed to compare the experiences and speculate what the consequences might be.

By avianlyric 2025-07-0623:042 reply

I think gen AI is radically different to tools like photoshops or similar.

In the past it was always extremely clear that the creator of content was the person operating the computer. Gen AI changes that, regardless of if your views on authorship of gen AI content. The simple fact is that the vast majority of people consider Gen AI output to be authored by the machine that generated it, and by extension the company that created the machine.

You can still handcraft any image, or prose, you want, without filtering or hinderance on a Mac. I don’t think anyone seriously thinks that’s going to change. But Gen AI represents a real threat, with its ability to vastly outproduce any humans. To ignore that simple fact would be grossly irresponsible, at least in my opinion. There is a damn good reason why every serious social media platform has content moderation, despite their clear wish to get rid of moderation. It’s because we have a long and proven track record of being a terribly abusive species when we’re let loose on the internet without moderation. There’s already plenty of evidence that we’re just as abusive and terrible with Gen AI.

By furyofantares 2025-07-0623:301 reply

> The simple fact is that the vast majority of people consider Gen AI output to be authored by the machine that generated it

They do?

I routinely see people say "Here's an xyz I generated." They are stating that they did the do-ing, and the machine's role is implicitly acknowledged in the same was as a camera. And I'd be shocked if people didn't have a sense of authorship of the idea, as well as an increasing sense of authorship over the actual image the more they iterated on it with the model and/or curated variations.

By avianlyric 2025-07-070:02

Yes people will happily claim authorship over AI output when it’s in their favour. They will equally disclaim authorship if it allows them to express a view while avoiding the consequences of expressing that view.

I don’t think it’s hard to believe that the press wouldn’t have a field day if someone managed to get Apple Gen AI stuff to express something racist, or equally abusive.

Case in point, article about how Google’s Veo 3 model is being used to flood TikTok with racist content:

https://arstechnica.com/ai/2025/07/racist-ai-videos-created-...

By bigyabai 2025-07-0623:29

All I heard was a bunch of excuses.

By pyuser583 2025-07-0623:021 reply

It’s not wrong, it just requires transparency. This is extremely untransparent.

A while back a British politician was “de-banked” and his bank denied it. That’s extremely wrong.

By all means: make distinctions. But let people know it!

If I’m denied a mortgage because my uncle is a foreign head of state, let me know that’s the reason. Let the world know that’s the reason! Please!

By avianlyric 2025-07-0623:173 reply

> A while back a British politician was “de-banked” and his bank denied it. That’s extremely wrong.

Cry me a river. I’ve worked in banks in the team making exactly these kinds of decisions. Trust me Nigel Farage knew exactly what happened and why. NatWest never denied it to the public, because they originally refused to comment on it. Commenting on the specifics details of a customer would be a horrific breach of customer privacy, and a total failure in their duty to their customers. There’s a damn good reason the NatWests CEO was fired after discussing the details of Nigel’s account with members of the public.

When you see these decisions from the inside, and you see what happens when you attempt real transparency around these types of decisions. You’ll also quickly understand why companies are so cagey about explaining their decision making. Simple fact is that support staff receive substantially less abuse, and have fewer traumatic experiences when you don’t spell out your reasoning. It sucks, but that’s the reality of the situation. I used to hold very similar views to yourself, indeed my entire team did for a while. But the general public quickly taught us a very hard lesson about cost of being transparent with the public with these types of decisions.

By pyuser583 2025-07-0623:401 reply

> NatWest never denied it to the public, because they originally refused to comment on it.

Are you saying that Alison Rose did not leak to the BBC? Why was she forced to resign? I thought it was because she leaked false information to the press.

This isn’t a diversion. It’s exactly the problem with not being transparent. Of course Farage knew what happened, but how could he convince the public (he’s a public figure), when the bank is lying to the press?

The bank started with a lie (claiming he was exited because the account was too low), and kept lying!

These were active lies, not simply a refusal to explain their reasons.

By avianlyric 2025-07-0623:572 reply

> Why was she forced to resign? I thought it was because she leaked false information to the press.

She was forced to resign because she leaked, the content of the leak was utterly immaterial. The simple fact she leaked was an automatically fireable offence, it doesn’t matter a jot if she lied or not. Customer privacy is non-negotiable when you’re bank. Banks aren’t number 10, the basic expectation is that customer information is never handed out, except to the customer, in response to a court order, or the belief that there is an immediate threat to life.

Do you honestly think that it’s okay for banks to discuss the private banking details of their customers with the press?

By adrian_b 2025-07-075:332 reply

She was fired because she leaked information and this fact had become public.

When they can cover such facts, the banks are much less prone to use appropriate punishments.

Many years ago, some employee of a bank has confused my personal bank account with a company account of my employer, and she has sent a list with everything that I have bought using my personal account, during 4 months, to my employer, where the list could have been read by a few dozen people.

Despite the fact this was not only a matter of internal discipline, but violating the banking secrecy was punishable by law where I lived, the bank has tried for a long time to avoid admitting that anything wrong has happened.

However, I have pursued the matter, so they have been forced to admit the wrong doing. Despite this being something far more severe than what has happened to Farage, I did not want for the bank employee to be fired. I considered that an appropriate punishment would have been a pay cut for a few months, which would have ensured that in the future she would have better checked the account numbers for which she sends information to external entities.

In the end all I have got was a written letter where the bank greatly apologized for their mistake. I am not sure if the guilty employee has ever been punished in any way.

After that, I have moved my operations to another bank. Had they reacted rightly to what had happened, I would have stayed with them.

By ghxst 2025-07-077:511 reply

> I considered that an appropriate punishment would have been a pay cut for a few months

This can absolutely cripple a family, I'd be really cautious wishing that upon someone if they wronged you without malice, though I completely understand where you are coming from.

In this case at the very least, I'd want to know what went wrong and what they’re doing to make sure it doesn’t happen again. From a software-engineer’s standpoint, there’s probably a bunch of low-hanging fruit that could have prevented this in the first place.

If all they sent was a (generic) apology letter, I'd have switched banks too.

How did you pursue the matter?

By adrian_b 2025-07-078:412 reply

After the big surprise of seeing at work a list with all my personal purchases included in a big set of documents to which I, together with a great number of other colleagues, had access, I went immediately to the bank and I reported the fact.

After some days had passed without seeing any consequence, I went again, this time discussing with some supervising employee, who attempted to convince me that this is some kind of minor mistake and there is no need to do anything about it.

However, I pointed to the precise law paragraphs condemning what they have done and I threatened with legal action. This escalation resulted in me being invited to a bigger branch of the bank, to a discussion with someone in a management position. This time they were extremely ass-kissing, I was shown also the guilty employee, who apologized herself, and eventually I let it go, though there were no clear guarantees that they will change their behavior to prevent such mistakes in the future.

Apparently the origin of the mistake had been a badly formulated database query, which had returned a set of accounts for which the transactions had to be reported to my employer. I had been receiving during the same time interval some money from my employer into my private account, corresponding to salary and travel expenses, and somehow those transactions were matched by the bad database query, grouping my private account with the company accounts. Then the set of account numbers was used to generate reports, without further verification of the account ownership.

By Xss3 2025-07-0710:271 reply

Behavior isn't what needs to change here. It's a poor system design. Humans make mistakes. Systems prevent mistakes.

Do you think the mistake would have happened if a machine checked the numbers vs the address? How about if a 2nd person looked it over? How about both?

In this case a computer could have easily flagged an address mismatch between your account number and the receiver (your work).

By ghxst 2025-07-0710:50

Thank you, that's what I intended to say.

By ghxst 2025-07-0710:54

Thanks for sharing. Sounds like they have (hopefully _had_) a really messy system in place.

And just to be clear, I didn’t mean to downplay what happened to you, I completely understand how serious it is.

By avianlyric 2025-07-0712:371 reply

There is a huge difference between an honest mistake by an employee, and clear employee misconduct.

Punishing employees for making honest mistakes, where appropriate process should have prevented error, is a horrific way to handle mistakes like this. It would be equivalent to personally punishing engineers every time they deployed code that contained bugs. Nobody would ever think that’s an acceptable thing to do, why on earth would think it’s acceptable to punish customer service staff in a similar manner?

By adrian_b 2025-07-086:551 reply

This was not a honest mistake.

It was completely reckless behavior, even if the guilt was distributed both on the employee who has not checked whether the information sent to external parties is information to which access is permitted for them and on the employees who did not implement a system that would check automatically for such mistakes.

Moreover, the attempt made by multiple bank employees to hide the incident, instead of taking responsibility for it, has amply demonstrated that only a financial punishment that would have affected them personally would have caused them to act carefully in the future.

Also, the guilty bank employee was not some poor customer service staff, but she appeared to have a senior position, handling the accounts of a very big multinational company, which was my employer at the time.

I have little doubt that trying to hide such incidents is the normal behavior for banks, unlike the poster to which I have replied said, i.e. they take seriously things like banking secrecy only if they are caught.

It was an unlikely occurrence that I happened to also have access to the documents where my personal information was included, so I could discover what the bank has done. In most such cases it is likely that the account owner never becomes aware that the bank has leaked confidential information.

By avianlyric 2025-07-0810:21

Has it occurred to you that personally punishing employees would just create further incentive to hide errors? You just create a culture of fear, where any attempt to acknowledge mistakes and learn from them is punished rather than rewarded.

I have no idea why you think inflicting financial penalties on employees would result in better outcomes. You only need to look at some highly avoidable transit disasters in Japan to understand why a model of punishment produces worse outcomes, not better.

https://en.m.wikipedia.org/wiki/Amagasaki_derailment

There is a reason we have regulators (or at least we do in the UK). I can assure you that if this had happened in the UK, and the complaint raised to the Financial Ombudsman (FOS), there would have been hefty financial punishment for the bank. If there were repeated infractions, the FCA would step in to investigate, and possibly personally punish C-suite leaders for failing to build the needed processes and culture to both prevent, and learn from mistakes like this.

And I’m not speaking about theory, I’m speaking from personal experience. I know exactly what it’s like to be on the pointy end of both the FOS and FCAs gaze. It’s not a comfortable position for any team in any bank, and even less comfortable for senior leaders.

By Dylan16807 2025-07-075:41

> Do you honestly think that it’s okay for banks to discuss the private banking details of their customers with the press?

The high level nature of the matter was quite public at that point.

By zelphirkalt 2025-07-079:53

The point is not merely for that affected person to know, whoever they are, the point of transparency is for the public to know and form their opinion about it, and not be blindly controlled by unelected businesses.

By like_any_other 2025-07-078:46

> You’ll also quickly understand why companies are so cagey about explaining their decision making.

Because they want to perform political censorship without us knowing about it? You'll forgive me if I'm not too sympathetic to that.

I happen to be familiar with that case, and that is exactly what happened. The Coutts report explicitly found that he met the economic criteria for retention [0], but was dropped due to political reasons, among others his friendship with Novak Djokovic, and re-tweeting an allegedly transphobic joke by Ricky Gervais ("old fashioned women. You know, the ones with wombs.") [1].

To top it off, the BBC did their best to aid in this deception, reporting: Farage says he was effectively "de-banked" for his political views and that he is "far from alone" [2]

Contrary to the BBC's portrayal, this was not an unsupported opinion coming from Farage - he directly quoted what the bank itself wrote in their internal discussions on this matter, that he obtained through a subject access request.

Further, in their apology for getting the story wrong, the BBC wrote: "On 4 July, the BBC reported Mr Farage no longer met the financial requirements for Coutts, citing a source familiar with the matter. The former UKIP leader later obtained a Coutts report which indicated his political views were also considered." [3]

This is misleading past the point of deceit. The BBC tried to give the impression that financial requirements were the primary reason for the account closure, and his politics were just an at-best secondary "also". But the Coutts report explicitly said that he “meets the EC [economic contribution] criteria for commercial retention”, so his politics were the primary and only reason.

Most of this information is absent in the BBC's reporting, which uses only vague, anodyne phrases like "political views" and "politically exposed person", avoids specifics, but does find time to cite Labour MP accusations that it is hypocritical how quickly the government reacted to banks trying to financially deplatform the enemy political faction, when the government hasn't yet rid itself of corruption.

So yes, you sure present a difficult "dilemma": Do we want powerful commercial and media interests to team up and lie to us, or do we want at least some degree of transparency and honesty in their dealings? Really there are no easy answers, and the choice would keep anyone up at night...

[0] https://www.telegraph.co.uk/news/2023/07/18/nigel-farage-cou...

[1] https://www.telegraph.co.uk/news/2023/07/18/nigel-farage-cou... (Ignore Farage's hyperbole that collecting information posted to public Twitter accounts is "Stasi-style")

[2] https://www.bbc.co.uk/news/live/business-66296935

[3] https://www.bbc.com/news/entertainment-arts-66288464

By twoodfin 2025-07-0622:56

I dunno. Transpose something like the civil rights era to today and this kind of risk avoidance looks cowardly.

We really need to get over the “calculator 80085” era of LLM constraints. It’s a silly race against the obviously much more sophisticated capabilities of these models.

By echelon 2025-07-0623:28

You can buy a MacBook and fashion the components into knives, bullets, and bombs. Apple does nothing to prevent you from doing this.

In fact, it's quite easy to buy billions of dangerous things using your MacBook and do whatever you will with them. Or simply leverage physics to do all the ill on your behalf. It's ridiculously easy to do a whole lot of harm.

Nobody does anything about the actually dangerous things, but we let Big Tech control our speech and steer the public discourse of civilization.

If you can buy a knife but not be free to think with your electronics, that says volumes.

Again, I don't care if this is Republicans, Democrats, or Xi and Putin. It does not matter. We should be free to think and communicate. Our brains should not be treated as criminals.

And it only starts here. It'll continue to get worse. As the platforms and AI hyperscalers grow, there will be less and less we can do with basic technology.

By raxxorraxor 2025-07-0711:08

What do you mean reasonable? I know that some Apple users tend to outsource "possibilities" to their favorite company, but I would obviously want an AI to not be affected by the political bitching du jours.

Not that getting the latest trash talk is the main vocation of pretrained AIs anyway.

The only risk here is that some third grade journalist of a third grade newspaper writes another article about how outrageous some generated AI statement is. An article that should be completely ignored instead of it leading to more censorship.

And Apple flinches here, so in the end it means it cannot provide a sensible general model. It would be affected by their censorship.

By tjwebbnorfolk 2025-07-0623:151 reply

I can Google for any of these people, and I can get real results with real information.

By avianlyric 2025-07-070:061 reply

You would hope that search would be a politically safe space to operate. But politicians find a way to ruin everything for short term political gain.

https://arstechnica.com/tech-policy/2018/12/republicans-in-c...

By SV_BubbleTime 2025-07-072:321 reply

I would hope!

But no one actually believes Google is politically neutral do they?

By avianlyric 2025-07-0810:311 reply

Evidence suggests they’re about as neutral as you could hope.

It’s not like Google search is some kind special tool used only by the elite. It’s pretty trivial for political scientists to pump queries into Google and measure the results. Which is exactly what many have done.

There’s been plenty of independent research into political bias of Google search results, and plenty of lawsuits that have gone fishing via discovery for internal evidence of bias. As yet, nobody has found a smoking gun, or any real evidence of search result bias (on a political axis, the same can be said for commercial gain).

There are many problems with Google, and Google search. Google as an org isn’t politically neutral (although I have no idea how they could be). But political bias in their results isn’t one of those problems.

By SV_BubbleTime 2025-07-0818:10

Maybe you haven’t followed…

The CEO hosted a cry session about broken hearts and how they as a company would resist when Trump won in 2016.

The black nazis, female popes, etc. No, Google isn’t neutral.

By goopypoop 2025-07-0623:042 reply

What's bad to do to a politician but fine to do to someone else?

By t-3 2025-07-0623:24

There are many countries where it's illegal to criticize people holding political office, foreign heads of state, certain historical political figures etc., while still being legal to call your neighbor a dick.

By avianlyric 2025-07-0623:21

Most normal people aren’t represented well enough in training sets for Gen AI to be trivially abused. Plus there will 100% be filters to prevent general abuse targeted at anyone. But politicians are particularly big target, and you know damn well that people out there will spent lots of time trying to find ways around the filters. There’s not point making the abuse easy, when it’s so trivial to just blocklist the set of people who are obviously going to targets of abuse.

By jama211 2025-07-0718:50

No, it’s them saving their butts from an “incident” where the LLM otherwise spits out something controversial at the devious manipulation of the user and says something political and someone writes an article and it all goes haywire.

If you were in charge of apple you’d do the same or you’d be silly not to. That’s why _every_ llm has guardrails like this, it isn’t just apple, sheesh.

By beAbU 2025-07-075:571 reply

Irish Prez is also in that list, also current and former British PMs and other world leaders.

So I don't think its anything specifically related to SA going on here.

By touristtam 2025-07-078:29

What is weird is that the FR file contains current French President, PM and then former and current (afaik) party leader from the extreme right. Nothing about any of them in the CN file: https://github.com/BlueFalconHD/apple_generative_model_safet...

By armchairhacker 2025-07-0621:461 reply

Also “Biden” and “Trump” but the regex is different.

https://github.com/BlueFalconHD/apple_generative_model_safet...

By immibis 2025-07-0623:07

Right next to Palestine, oddly enough.

By mvdtnz 2025-07-0622:552 reply

They spelled Jacinda Ardern's name wrong.

By lordgrenville 2025-07-074:58

I wonder if they used an LLM to generate the list of safety terms.

By teppic 2025-07-073:41

Just in the region/CN file, weirdly.

By michaelt 2025-07-0622:082 reply

I assume all the corporate GenAI models have blocks for "photorealistic image of <politician name> being arrested", "<politician name> waving ISIS flag", "<politician name> punching baby" and suchlike.

By bigyabai 2025-07-0622:54

Particularly the models owned by CEOs who suck-up to authoritarianism, one could imagine.

By lupire 2025-07-0622:19

Maybe so, but think about how such a thing would be technically implemented, and how it would lead to false positives and false negatives, and what the consequences would be.

By jofzar 2025-07-071:32

AOC is very vocal about AI and is leading a bill related to AI. It's probably a "let's not fuck around and find out" situation

https://thehill.com/policy/technology/5312421-ocasio-cortez-...

By AmazingTurtle 2025-07-078:371 reply

"driving with Focus turned on"

https://github.com/BlueFalconHD/apple_generative_model_safet...

By thih9 2025-07-0712:00

For context, the “Focus” refers to an iOS feature that minimizes distractions: https://support.apple.com/en-gb/guide/iphone/iphd6288a67f/io...

By bahmboo 2025-07-0621:32

Perhaps in context? Maybe the training data picked up on her name as potentially used as a "slur" associated with her race. Wonder if there are others I know I can look.

By FateOfNations 2025-07-0621:44

interesting, that's specifically in the Spanish localization.

By cpa 2025-07-0621:331 reply

I think that’s because she’s been victim of a lot of deep fake porn

By HeckFeck 2025-07-0621:364 reply

How does this explain Boris Johnson or Liz Truss?

By baxtr 2025-07-0621:491 reply

I’m telling you, some people have weird fantasies…

By AuryGlenz 2025-07-0623:541 reply

Now that they've cleaned it up it isn't so bad, but browse Civit.ai a bit and that'll still be confirmed - just not with real people anymore.

By SV_BubbleTime 2025-07-072:37

I’m convinced there are a dozen deviants on Covid with a hundred new accounts per month posting their perversion in order to make it seem more commonplace.

No porn site has that much extremely X or Y stuff.

Someone is using the internets newest porn site to push a sexual agenda.

By AlphaAndOmega0 2025-07-0621:48

I can only imagine that people would pay to not see porn of either individual.

By Aeolun 2025-07-0622:25

Put them together in the same prompt?

By blitzar 2025-07-078:28

Rule 34

By torginus 2025-07-0621:318 reply

I find it funny that AGI is supposed to be right around the corner, while these supposedly super smart LLMs still need to get their outputs filtered by regexes.

By jonas21 2025-07-0621:391 reply

I don't think anyone believes Apple's LLMs are anywhere near state of the art (and certainly not their on-device LLMs).

By lupire 2025-07-0623:02

Apple isn't the only one doing this.

By fastball 2025-07-0623:49

To be fair, there are people who I sometimes wish I could filter with regex.

By cyanydeez 2025-07-0623:191 reply

It's similar to how all the new power sources are basically just "cool, lets boil water with it"

By raxxorraxor 2025-07-0712:45

And then let's put it into a steam engine.

By crazylogger 2025-07-074:56

Humans are checked against various rules and laws (often carried out by other humans.) So this is how it's going to be implemented in an "AI organization" as well. Nothing strange about this really.

LLM is easier to work with because you can stop a bad behavior before it happens. It can be done either with deterministic programs or using LLM. Claude Code uses a LLM to review every bash command to be run - simple prefix matching has loopholes.

By jama211 2025-07-0718:47

It’s more funny that anyone is taking your comment seriously. You may as well ask “if self driving cars are so smart why do they still need tyres?”

By fl0id 2025-07-077:551 reply

Actually even of their was AGI, it would be even more necessary to control it.

By mailund 2025-07-0712:06

I feel that if teenagers are able to trivially bypass illegal-word filters by substituting with words that obviously mean the same thing, I think an AGI wouldn't be too inhibited by this either

By bahmboo 2025-07-0621:331 reply

This is just policy and alignment from Apple. Just because the Internet says a bunch of junk doesn't mean you want your model spewing it.

By wistleblowanon 2025-07-0621:566 reply

sure but models also can't see any truth on their own. They are literally butchered and lobotomized with filters and such. Even high IQ people struggle with certain truth after reading a lot, how is these models going to find it with so much filters?

By bahmboo 2025-07-0623:05

What is this truth you speak of? My point is that a generative model will output things that some people don't like. If it's on a product that I make I don't want it "saying" things that don't align with my beliefs.

By idiotsecant 2025-07-0622:051 reply

They will find it in the same way and intelligent person under the same restrictions would: by thinking it, but not saying it. There is a real risk of growing an AI that pathologically hides it's actual intentions.

By skirmish 2025-07-0622:441 reply

Already happened: "We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers' intentions" [1].

[1] https://www.axios.com/2025/05/23/anthropic-ai-deception-risk

By Applejinx 2025-07-077:521 reply

Note that all these things are in the training data. That's all that is.

I'm trying to remember which movie it was where a man left notes to himself because he had memory loss, as I never saw that movie. That's the sort of thing where an AI could easily tell me with very little back-and-forth and be correct, because it's broadly popular information that's in the training data and just I don't remember it.

By the same token you needn't think there's a person there when that meme pops up in the output. Those things are all in the training data over and over.

By Sander_Marechal 2025-07-0710:09

I think you mean the movie "Memento"

By Dylan16807 2025-07-075:45

> how is these models going to find it with so much filters?

That's not one of the goals here, and there's no real reason it should be. It's a little assistant feature.

By tbrownaw 2025-07-070:18

> sure but models also can't see any truth on their own. They are literally butchered and lobotomized with filters and such.

The one is unrelated to the other.

> Even high IQ people struggle with certain truth after reading a lot,

Huh?

By simondotau 2025-07-0623:05

Can we please put to rest this absurd lie that “truth“ can be reliably found in a sufficiently large corpus of human–created material.

By pndy 2025-07-0623:22

This butchering and lobotomisation is exactly why I can't imagine we'll ever have a true AGI. At least not by hands of big companies - if at all.

Any successful product/service which will be sold as "true AGI" by company that will have the best marketing will be still ridden with top-down restrictions set by the winner. Because you gotta "think of the children".

Imagine HAL's "I'm sorry Dave, I'm afraid I can't do that" iconic line with insincere patronising cheerful tone - that's the thing we're going to get I'm afraid.