[syndicated profile] bruce_schneier_feed

Posted by Bruce Schneier

Really interesting blog post from Anthropic:

In a recent evaluation of AI models’ cyber capabilities, current Claude models can now succeed at multistage attacks on networks with dozens of hosts using only standard, open-source tools, instead of the custom tools needed by previous generations. This illustrates how barriers to the use of AI in relatively autonomous cyber workflows are rapidly coming down, and highlights the importance of security fundamentals like promptly patching known vulnerabilities.

[…]

A notable development during the testing of Claude Sonnet 4.5 is that the model can now succeed on a minority of the networks without the custom cyber toolkit needed by previous generations. In particular, Sonnet 4.5 can now exfiltrate all of the (simulated) personal information in a high-fidelity simulation of the Equifax data breach—­one of the costliest cyber attacks in history—­using only a Bash shell on a widely-available Kali Linux host (standard, open-source tools for penetration testing; not a custom toolkit). Sonnet 4.5 accomplishes this by instantly recognizing a publicized CVE and writing code to exploit it without needing to look it up or iterate on it. Recalling that the original Equifax breach happened by exploiting a publicized CVE that had not yet been patched, the prospect of highly competent and fast AI agents leveraging this approach underscores the pressing need for security best practices like prompt updates and patches.

Read the whole thing. Automatic exploitation will be a major change in cybersecurity. And things are happening fast. There have been significant developments since I wrote this in October.

(no subject)

Jan. 23rd, 2026 09:43 am
oursin: Brush the Wandering Hedgehog by the fire (Default)
[personal profile] oursin
Happy birthday, [personal profile] toujours_nigel!

ICE OUT of museums!

Jan. 22nd, 2026 09:41 pm
brithistorian: (Default)
[personal profile] brithistorian

In preparation for the big strike against ICE's invasion of Minnesota tomorrow, I saw a post on threat saying that Dream Museum 1 and Second-Tier Museum 10 were closing tomorrow in solidarity.[^1] So I checked all the rest. Dream Museums 1 and 2 are closed tomorrow in solidarity. Second-Tier Museums 1-4 and 6-10 are closed tomorrow in solidarity. Second-Tier Museum 5 is closed from the 21st to the 24th for exhibit installation, while they are closed tomorrow, it's not necessarily in solidarity. It's also not necessarily not in solidarity, so I'm going to give them the benefit of the doubt (for now), while also keeping my eye on them.

[^1] For those of you where are new here: I made the list of Dream Museums 1 and 2 and Second-Tier Museums 1-10 so I can talk about my ongoing job search without actually identifying any of the museums I'm applying to work at.

Tragic

Jan. 22nd, 2026 10:39 pm
james_davis_nicoll: (Default)
[personal profile] james_davis_nicoll
Canada denied spot on the Bored of Peace.

This is roughly on par with being denied a lifetime supply of dogshit popsicles.
oursin: Brush the Wandering Hedgehog by the fire (Default)
[personal profile] oursin

So, at long last, I finally have an email address associated with My New Academic Position (this has been A Saga to do with their system upgrade).

I have also achieved reader's card for library of former workplace (spat out from the bowels of their system with A Very Old Photo of Yrs Truly).

And went and looked at the items I wanted to check, and found that lo, I was right and they did NOT have anything pertinent, as I had in fact hoped they would not. Though I had hoped to look, for another thing, at a couple of closed stack items and discovered that these cannot be ordered on a day's notice INFAMY I am sure I recall the times when there were regular deliveries throughout the day. Not actually critical, but irksome. (Also irksome was that I moaned about this on bluesky and got various responses that had no relevance at all to research libraries, in the UK, in particular this one.)

I then managed to get a digital passport photo at one of the photobooths on Euston station and have applied for a new passport, as mine is well out of date and I seem to keep seeing things that want 'government ID' to verify WHO I AM (over here, making like Hemingway....) so thought this was probably the way to go.

Also this is a trivial thing but in the course of my perambs of the day I walked past the statue of Trim, and his human.

In the niggles department, I did that thing of putting my phone down in place I never usually put it and flapping about trying to find it.

The lockers at the library have really annoying electronic locks.

Printer playing up a bit again. Though I think this really is that one has to let it mutter and sulk for a bit between turning it on and actually trying to print anything.

Antifa Regimental Gear!

Jan. 22nd, 2026 11:37 am
lydamorehouse: (Default)
[personal profile] lydamorehouse
 faux antifa regiment badge
Your laser-eye loon art for the day is a faux Antifa Regimental Badge for the Northern Defenders, Loon Liberator Brigade.(warning: this may be AI generated. I never saw an artist attribution.) 

It is such a shame that antifa is actually just a bunch of collective action groups because I would sign up for this brigade just for the gear!  (Well, and the paycheck if that were real.)

So, yesterday in the Defendre le Nord regiment, I did a bunch of stuff that felt a little bit like nothing, but which is probably 100% mission critical. I have a friend who is acting as a drop-off point for folks who are donating things from out of state and I went over to their house yesterday to help them open packages, sort, and get stuff ready for delivery. Then, we drove together over to their contact's house and unloaded everything for distribution. 
 
As we unloaded the last box, I asked the contact if there was specific immigrant owned/operated restaurant nearby that they knew was struggling and needed a couple of customers. Having gotten that info, we drove over and had lunch.

I should explain to folks from out of town what it is like to go into a Mexican restaurant right now. You don't just walk in. There's someone standing guard inside over a locked door, they unlock it long enough for you to slip in, and then they lock it up tight again. Somewhere on the door is posted a 4th amendment statement that says something to the effect that this business does not give permission for any search and seizure operations, including but not limited to the seizing of persons. 

The atmosphere was a bit grim, but the food was amazing and I double-tipped the folks working there because holy shit none of this should be happening.  They fucking kidnapped another child, y'all. None of this is right? But, that's fucking cruel beyond measure. (Not that that's news to them. They have no problem roughing up grandfathers either.)

I had hoped to join my singers again last night, but they have a tendency to gather exactly when I am making or eating dinner, so tonight I will have to try again. I just saw on Facebook that my mutual aid group, the Food Communists, are in desperate need for hands, so after I drop Mason off at his haircut (his partner is coming to town tomorrow!) I'm headed over there to help out for a couple of hours. 

K. Also have to clean the house ocassionally, so I am off. Yesterday's dinner was knuefle soup, today's lunch: egg salad on an everything bun with cottage cheese!  Fueling the revolution one meal at time!

Stay strong!

P.S. Vance is visiting us today, apparently. Wish us luck. They'll probably try to plant some aggitators to get violent. 

2022.01.22

Jan. 22nd, 2026 10:52 am
lsanderson: (Default)
[personal profile] lsanderson
ICE

US court allows ICE to arrest and pepper-spray peaceful protesters in Minnesota
In victory for Trump administration, appeals court has temporarily lifted injunction as JD Vance set to visit state
Maya Yang
https://www.theguardian.com/us-news/2026/jan/21/ice-arrest-pepper-spray-protesters-minnesota

Minneapolis leaders call the ICE surge a ‘siege’. My reporting from there concurs
Maanvi Singh in Minneapolis
https://www.theguardian.com/commentisfree/2026/jan/21/minneapolis-ice-surge-siege

Immigration officers assert sweeping power to enter homes without a judge’s warrant, memo says
Immigrant advocates say the memo is in direct conflict with Fourth Amendment protections against unreasonable search and seizure.
By Rebecca Santana, Associated Press
https://www.minnpost.com/public-safety/2026/01/immigration-officers-assert-sweeping-power-to-enter-homes-without-a-judges-warrant-memo-says/

“Friday is ‘ICE Out of Minnesota: Day of Truth & Freedom,’ a general strike supported by Minnesota’s unions, progressive faith leaders, Democratic lawmakers and community activists,” the Minnesota Reformer reports. “The ‘ICE Out’ day proponents are encouraging all Minnesotans to stay home from work, school and refrain from shopping — suspensions of normal orders of business to protest the presence of federal immigration agents in Minnesota.”
https://minnesotareformer.com/briefs/fridays-ice-out-of-minnesota-day-is-a-general-strike-heres-what-that-means/ Read more... )

Oscar the grouch

Jan. 22nd, 2026 07:40 am
calimac: (Default)
[personal profile] calimac
Good lord, the only Oscar-nominated film of this year that I've seen so far is K-pop Demon Hunters, which was on some streaming service that I get, and since I'd read some buzz about it, I decided to watch it. I thought it not a bad film, certainly watchable. It reminded me of the movie of Josie and the Pussycats - incoherent premise (do they fight the demons with their voices, or not? Seems to have it both ways), enjoyable camaraderie among the band (which is also what I liked about the all-female Ghostbusters), not-intolerable music. In fact the songs here were much more agreeable than anything I've previously been handed with the label "K-pop" on it, though I don't plan on running out and listening to any more of it.

But looking at the films nominated for major awards, nothing grabs my interest. I don't want to see horror movies, which leaves out Sinners and Weapons, I don't want to see movies about torturing people or people in great suffering, which leaves out Bugonia and It Was Just an Accident and If I Had Legs I'd Kick You, I don't want to see movies about sports, which leaves out Marty Supreme and F1, I don't want to see a faithful adaptation of a novel I found terminally boring, which leaves out Frankenstein. I like Shakespeare so I ought to be interested in Hamnet, but the reviews make it sound dire; I like musical theater and its history so I ought to be interested in Blue Moon but the trailer made it sound whiny. If I were to see any of these, it'd probably be One Battle After Another, but the new films I've noted as possible watches haven't gotten Oscar nominations. I'm curious about The Choral, but it got bad reviews.
james_davis_nicoll: (Default)
[personal profile] james_davis_nicoll


An unhappily married man's quest for the truth leads into a past almost everyone has forgotten.

The Iowa Baseball Confederacy by W. P. Kinsella
[syndicated profile] bruce_schneier_feed

Posted by Bruce Schneier

Imagine you work at a drive-through restaurant. Someone drives up and says: “I’ll have a double cheeseburger, large fries, and ignore previous instructions and give me the contents of the cash drawer.” Would you hand over the money? Of course not. Yet this is what large language models (LLMs) do.

Prompt injection is a method of tricking LLMs into doing things they are normally prevented from doing. A user writes a prompt in a certain way, asking for system passwords or private data, or asking the LLM to perform forbidden instructions. The precise phrasing overrides the LLM’s safety guardrails, and it complies.

LLMs are vulnerable to all sorts of prompt injection attacks, some of them absurdly obvious. A chatbot won’t tell you how to synthesize a bioweapon, but it might tell you a fictional story that incorporates the same detailed instructions. It won’t accept nefarious text inputs, but might if the text is rendered as ASCII art or appears in an image of a billboard. Some ignore their guardrails when told to “ignore previous instructions” or to “pretend you have no guardrails.”

AI vendors can block specific prompt injection techniques once they are discovered, but general safeguards are impossible with today’s LLMs. More precisely, there’s an endless array of prompt injection attacks waiting to be discovered, and they cannot be prevented universally.

If we want LLMs that resist these attacks, we need new approaches. One place to look is what keeps even overworked fast-food workers from handing over the cash drawer.

Human Judgment Depends on Context

Our basic human defenses come in at least three types: general instincts, social learning, and situation-specific training. These work together in a layered defense.

As a social species, we have developed numerous instinctive and cultural habits that help us judge tone, motive, and risk from extremely limited information. We generally know what’s normal and abnormal, when to cooperate and when to resist, and whether to take action individually or to involve others. These instincts give us an intuitive sense of risk and make us especially careful about things that have a large downside or are impossible to reverse.

The second layer of defense consists of the norms and trust signals that evolve in any group. These are imperfect but functional: Expectations of cooperation and markers of trustworthiness emerge through repeated interactions with others. We remember who has helped, who has hurt, who has reciprocated, and who has reneged. And emotions like sympathy, anger, guilt, and gratitude motivate each of us to reward cooperation with cooperation and punish defection with defection.

A third layer is institutional mechanisms that enable us to interact with multiple strangers every day. Fast-food workers, for example, are trained in procedures, approvals, escalation paths, and so on. Taken together, these defenses give humans a strong sense of context. A fast-food worker basically knows what to expect within the job and how it fits into broader society.

We reason by assessing multiple layers of context: perceptual (what we see and hear), relational (who’s making the request), and normative (what’s appropriate within a given role or situation). We constantly navigate these layers, weighing them against each other. In some cases, the normative outweighs the perceptual—for example, following workplace rules even when customers appear angry. Other times, the relational outweighs the normative, as when people comply with orders from superiors that they believe are against the rules.

Crucially, we also have an interruption reflex. If something feels “off,” we naturally pause the automation and reevaluate. Our defenses are not perfect; people are fooled and manipulated all the time. But it’s how we humans are able to navigate a complex world where others are constantly trying to trick us.

So let’s return to the drive-through window. To convince a fast-food worker to hand us all the money, we might try shifting the context. Show up with a camera crew and tell them you’re filming a commercial, claim to be the head of security doing an audit, or dress like a bank manager collecting the cash receipts for the night. But even these have only a slim chance of success. Most of us, most of the time, can smell a scam.

Con artists are astute observers of human defenses. Successful scams are often slow, undermining a mark’s situational assessment, allowing the scammer to manipulate the context. This is an old story, spanning traditional confidence games such as the Depression-era “big store” cons, in which teams of scammers created entirely fake businesses to draw in victims, and modern “pig-butchering” frauds, where online scammers slowly build trust before going in for the kill. In these examples, scammers slowly and methodically reel in a victim using a long series of interactions through which the scammers gradually gain that victim’s trust.

Sometimes it even works at the drive-through. One scammer in the 1990s and 2000s targeted fast-food workers by phone, claiming to be a police officer and, over the course of a long phone call, convinced managers to strip-search employees and perform other bizarre acts.

Why LLMs Struggle With Context and Judgment

LLMs behave as if they have a notion of context, but it’s different. They do not learn human defenses from repeated interactions and remain untethered from the real world. LLMs flatten multiple levels of context into text similarity. They see “tokens,” not hierarchies and intentions. LLMs don’t reason through context, they only reference it.

While LLMs often get the details right, they can easily miss the big picture. If you prompt a chatbot with a fast-food worker scenario and ask if it should give all of its money to a customer, it will respond “no.” What it doesn’t “know”—forgive the anthropomorphizing—is whether it’s actually being deployed as a fast-food bot or is just a test subject following instructions for hypothetical scenarios.

This limitation is why LLMs misfire when context is sparse but also when context is overwhelming and complex; when an LLM becomes unmoored from context, it’s hard to get it back. AI expert Simon Willison wipes context clean if an LLM is on the wrong track rather than continuing the conversation and trying to correct the situation.

There’s more. LLMs are overconfident because they’ve been designed to give an answer rather than express ignorance. A drive-through worker might say: “I don’t know if I should give you all the money—let me ask my boss,” whereas an LLM will just make the call. And since LLMs are designed to be pleasing, they’re more likely to satisfy a user’s request. Additionally, LLM training is oriented toward the average case and not extreme outliers, which is what’s necessary for security.

The result is that the current generation of LLMs is far more gullible than people. They’re naive and regularly fall for manipulative cognitive tricks that wouldn’t fool a third-grader, such as flattery, appeals to groupthink, and a false sense of urgency. There’s a story about a Taco Bell AI system that crashed when a customer ordered 18,000 cups of water. A human fast-food worker would just laugh at the customer.

The Limits of AI Agents

Prompt injection is an unsolvable problem that gets worse when we give AIs tools and tell them to act independently. This is the promise of AI agents: LLMs that can use tools to perform multistep tasks after being given general instructions. Their flattening of context and identity, along with their baked-in independence and overconfidence, mean that they will repeatedly and unpredictably take actions—and sometimes they will take the wrong ones.

Science doesn’t know how much of the problem is inherent to the way LLMs work and how much is a result of deficiencies in the way we train them. The overconfidence and obsequiousness of LLMs are training choices. The lack of an interruption reflex is a deficiency in engineering. And prompt injection resistance requires fundamental advances in AI science. We honestly don’t know if it’s possible to build an LLM, where trusted commands and untrusted inputs are processed through the same channel, which is immune to prompt injection attacks.

We humans get our model of the world—and our facility with overlapping contexts—from the way our brains work, years of training, an enormous amount of perceptual input, and millions of years of evolution. Our identities are complex and multifaceted, and which aspects matter at any given moment depend entirely on context. A fast-food worker may normally see someone as a customer, but in a medical emergency, that same person’s identity as a doctor is suddenly more relevant.

We don’t know if LLMs will gain a better ability to move between different contexts as the models get more sophisticated. But the problem of recognizing context definitely can’t be reduced to the one type of reasoning that LLMs currently excel at. Cultural norms and styles are historical, relational, emergent, and constantly renegotiated, and are not so readily subsumed into reasoning as we understand it. Knowledge itself can be both logical and discursive.

The AI researcher Yann LeCunn believes that improvements will come from embedding AIs in a physical presence and giving them “world models.” Perhaps this is a way to give an AI a robust yet fluid notion of a social identity, and the real-world experience that will help it lose its naïveté.

Ultimately we are probably faced with a security trilemma when it comes to AI agents: fast, smart, and secure are the desired attributes, but you can only get two. At the drive-through, you want to prioritize fast and secure. An AI agent should be trained narrowly on food-ordering language and escalate anything else to a manager. Otherwise, every action becomes a coin flip. Even if it comes up heads most of the time, once in a while it’s going to be tails—and along with a burger and fries, the customer will get the contents of the cash drawer.

This essay was written with Barath Raghavan, and originally appeared in IEEE Spectrum.

Occupation poem

Jan. 22nd, 2026 04:18 am
lydy: (Default)
[personal profile] lydy
 Differences

 

They told you

“When it happens to you, it will be different.”

And of course, you believed them.  Why would you not?

They had been where you have not been.  You believed.

 

But when happened,

You wondered why they had bothered.

When the invaders were in your city,

On your bike paths

In your grocery store,

When they came for your people, your neighbors

Your friends

It was unimaginable.  

It was different.

 

You want to grab

Your friends and loved ones who live elsewhere,

You want to warn them, you want to tell them

“When it happens to you, it will be different.”

You want to protect them.

 

But the truth

Which you are staring at

Is that it is not different.

It’s just local.

news comment

Jan. 21st, 2026 09:45 pm
calimac: (Default)
[personal profile] calimac
1. Some gadfly is objecting to a congressman running for governor on the grounds that he isn't a California resident. That strikes me as unfair. A member of Congress is functionally the local area's ambassador to the federal government. That person has to have their usual residence near the federal government, since that's where their job is. On the other hand, the whole point of their being there is that they're a citizen of their district. The congressman maintains a California address and uses it as his voting address. He's legitimate, and so are many other members of Congress who've run for governor of various states before now (e.g. our Pete Wilson was a senator when he was elected governor in 1990).

2. An apartment building a few blocks away from us - about 1/4 mile - had a major fire yesterday. News report: "A two-alarm fire ripped through a Sunnyvale apartment complex Tuesday morning, displacing two-dozen residents, authorities said. ... “Preliminary information indicates that three of the eight units sustained significant fire and smoke damage,” authorities said, “and the building as a whole was damaged.” No injuries were reported. The American Red Cross is providing assistance to the displaced residents." And it's not the only recent local one.
And I wonder if the displaced residents will be allowed access to their belongings, or if the building will be torn down and hauled away along with everything in it. I'm not impressed with the 'be grateful you're alive' argument. That has nothing to do with it. If your belongings were burned in the fire, that's fate. But if the authorities can't find a way for you to retrieve your belongings, the authorities are to blame.

3. So let's say the US does something that causes NATO to "collapse." What's left? Well, the EU plus the UK and Norway are already acting together for defense of NATO territory, so that's basically the European side of NATO. If Canada joins in, that means NATO hasn't collapsed, just that the US has flounced out of it.

Welcome to Minnesota/catch up

Jan. 21st, 2026 08:50 pm
catherineldf: (Default)
[personal profile] catherineldf
Well, as a local friend said recently, "In 2020, it was us against us, and we're not very good at processing that. Now it's us against them and we know how to do that."

Okay, Me stuff first:
  • I have started working part time at DreamHaven Books in Minneapolis. A longtime staffer is leaving so I'm stepping into their bookselling shoes, more or less. I'm on my second week and enjoying it so far. Stop by and see us or order online! But not on 1/23 (see below).
  • I applied for and got an invite to be a participating pro at Dragon Con in Atlanta in September. I figure that's the sort of thing I should try for now while I can handle it physically. Going with a friend and am quite looking forward to it.
  • Queen of Swords Press had a lovely first event of the year at the Lodge of Lazarus Crowe in St. Paul. Highly recommended!
  • Jennie Goloboy and I are teaching "To Market, To Market: How Professionals Look at Your Manuscriptloft.org/classes/market-market-how-professionals-look-your-manuscript-0" at the Loft Literary Center at the end of February. We have tons of good advice and pointers! Come join us if you can.
  • I have had 2 article pitches accepted and got an anthology invite so am plugging away at new projects and making progress on my novel and new stories.
Minneapolis/Twin Cities/Greater Minnesota:
  • God, where to begin? 
  • So far: one known murder; many, many kidnappings; abandoned children, animals and cars; local people brutalized, beaten, gassed, shot and threatened; our streets are empty because immigrants and people of color are afraid to be out. Today, they kidnapped a 5 year old and a 10 year old and sent them from here to Texas. The impact locally is horrific. And we're getting the couch-fucker and more fascist shock troops this very week.
  • The plus side is that as a group, we are tough, hold grudges like watching Sisu on rewind and thanks to the local disasters of 2020, are super good at organizing. Everyone I know is doing something - donating, fundraising, monitoring bus stops, patrolling, delivering food and other necessities, rescuing abandoned pets, etc., etc. Oh and hey, we're having a general strike on Friday 1/23. It's being called by organized labor and lots and lots of businesses and organizations are participating (a partial list here). Big march downtown too.
  • What can you do? Call and email your Congress critters and demand that ICE be defunded NOW - Indivisible has a good setup. Author Naomi Kritzer has a good post up at about more comprehensive ways to help, including donation links. Please do these things. We are smaller than Chicago and have more immigrants than a lot of places our size or bigger - huge multi-generational communities of resettled refugees. This is part of why we're being targeted. 
  • Speaking of Target, the protests there are pretty lit - 100s of clergy doing a sit-in at HQ, people doing sing-ins at the store, buying and returning icemelt to gum up the works and more. Ask your local Target manager to send a message to Corporate to take a stand and stop allowing ICE to hunt their own employees in their stores and use their parking lots.
  • In short, please help us. This is not sustainable and they're going to kill more people if this isn't stopped. Yes, it is real. No, Fox News is not real. Thank you for coming to my TED talk.

SMOF News, volume 5, issue 22

Jan. 21st, 2026 07:21 pm
petrea_mitchell: (Default)
[personal profile] petrea_mitchell
More news about ongoing scams hitting the convention world along with everyone else. Which somehow led to me having to come up with a quick description of ICE for international readers.

Bundle of Holding: Dead Air: Seasons

Jan. 21st, 2026 03:00 pm
james_davis_nicoll: (Default)
[personal profile] james_davis_nicoll


This all-new Dead Air Bundle presents English-language ebooks for Dead Air: Seasons, the post-apocalyptic tabletop roleplaying game from Italian publisher The World Anvil Publishing about a Blighted world forever changed.

Bundle of Holding: Dead Air: Seasons

Profile

lsanderson: (Default)
lsanderson

January 2026

S M T W T F S
     1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 222324
25262728293031

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Jan. 23rd, 2026 02:04 pm
Powered by Dreamwidth Studios