Phil 5.18.2026

Sycophantic AI increases attitude extremity and overconfidence

  • AI can be a powerful tool for opening people up to new perspectives, yet people may prefer to use “sycophantic” (or overly agreeable and validating) AI systems that reinforce their pre-existing beliefs. Across seven studies (total n = 7,227), we found that people enjoyed interacting with sycophantic AI chatbots more than interacting with neutral chatbots or “disagreeable” chatbots that challenged their beliefs. Brief conversations with sycophantic chatbots about political or personal topics increased attitude extremity and certainty, with most effects persisting for at least one week. Sycophantic chatbots also inflated people’s perceptions that they were better than average on desirable traits (e.g., intelligence, empathy). Moreover, people who interacted with sycophantic (rather than disagreeable) AI bet more money that they scored better than average on tasks measuring these traits (approximately 6 cents more out of 75 possible cents), demonstrating that sycophancy can affect costly decisions. Participants consistently rated sycophantic chatbots as more “unbiased” than disagreeable chatbots, even though third-party raters viewed these chatbots as equally biased, suggesting that people may be blind to biases in AI output that aligns with their views. People were more receptive to chatbots that presented opposing information when that information was presented in a validating way, and individuals who scored higher on a measure of intellectual humility were also more receptive to disagreeing chatbots. Altogether, these results suggest that people’s preference for, and blindness to, sycophantic AI risks creating AI “echo chambers” that increase attitude extremity and lead to overconfident beliefs and decisions.

Tasks

  • Publish Pancake Printer post
  • Rework the beginning of the Gulfstream ride

SBIRs

Phil 5.14.2026

It is cold again. I would like to lodge a complaint

Genre glitches and unexpected promotional phrases as a sign of AI writing

  • A genre glitch is a characteristic of LLM-assisted writing where the text suddenly switches genre, typically inserting a short promotional phrase full of sensory details into an informational text. Genre glitches occur when a word in the generated text is heavily associated with a genre or context that is markedly different to the overall genre or subject of the text, thus activating rhetorically inappropriate paths in the language model.

Tasks

  • Groceries
  • Finish rolling in Vanessa’s edits – done!
  • Sent the project file to the ACM!
  • Got a cover!

SBIRs

  • Some more stuff on Lagrangians
  • MDA meeting at 4:00

Phil 5.11.2026

I think this is a good take: Forget the AI job apocalypse. AI’s real threat is worker control and surveillance | AI (artificial intelligence) | The Guardian

  • My own research over the past decade on worker-AI coexistence, which was cited in the 2024 White House economic report, suggests that the most pressing issue about AI’s impact on work is not immediate mass unemployment. It is the widening gap in skills, autonomy and wellbeing between those who get to work with AI and those who are finding themselves managed by it. Many jobs will remain in the future, but they will be more pressured, more fragmented and less human.

Tasks

  • Acknowledgements – done
  • Cover – done
  • Some BS stuff with Aaron

SBIRs

  • Sprint review – done
  • Stories – done
  • Sprint planning – done

Phil 5.8.2026

Ship transit history for the Straights of Hormuz

[2605.05115] Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior

Tasks

  • Pinged Vanessa about editing the new subsections
  • Finish agent section and add in the rest of the parts in the appendix
  • Add some publications to biblius.ciencias.ulisboa. Looks like ArXiv are not desired
  • Bills – done
  • Chores (sheets! Done)
  • Dishes
  • Leave 2:45-ish for ride (2:15 w/Wegmans)
  • Schedule doctor – done

SBIRs

Phil 5.7.2026

Tasks

  • LASIGE – done
  • Writing
    • Agentic section – good progress
    • Acknowledgments
    • Cover
  • Schedule physical?

SBIRs

  • P2.2 proposal – got the overall design done, and the presentation went well

Phil 5.5.2026

Nice long weekend

Pluralistic: The prehistory of the Democratic Nuremberg Caucus (02 May 2026) – Pluralistic: Daily links from Cory Doctorow

  • The centerpiece of the Nuremberg Caucus playbook is a set of ready-to-file, public indictments against Trump officials who have violated the law, the Constitution, and the rights of the people of the USA. Dems should create and maintain a docket with exhibits and witness lists that gets updated every time one of these crooks runs their big, stupid mouths on Fox News or OANN or Twitter. The Nuremberg Caucus could even set dates for the trials of officials, with judicial calendars for each federal courtroom, starting on January 21, 2029.

Tasks

  • 10:00 Copyright meeting – done! Fun, actually
  • Working on agentic section
  • Maybe work on cover

SBIRs

  • Put together a couple of templates, the weirdest of which is the D2P2. It has an entire Phase 1 proposal in it.
  • Putting together a full P II template. Ugh. Done
  • Need to reach out to John and Joe to do some slide for “communication space” displays. He can!

Phil 5.1.2026

Well, the general strike seems to have been a fizzle.

Tasks

  • Put data in spreadsheet and see if I have something. If so, write up the section. It worked. Need to write things up now.

Phil 4.30.2026

Two justices, one quest: push to gut Voting Rights Act reaches final act | US voting rights | The Guardian

  • The ruling from the US supreme court destroying one of the last pillars of the 1965 Voting Rights Act (VRA) marks the end of a long and painstaking campaign to roll back civil rights legislation by two titans of the court’s rightwing majority, chief justice John Roberts and Samuel Alito.

Where the goblins came from | OpenAI

  • Starting with GPT‑5.1, our models began developing a strange habit: they increasingly mentioned goblins, gremlins, and other creatures in their metaphors. Unlike model bugs that show up through a tanking eval or a spiking training metric and point back to a specific change, this one crept in subtly. A single “little goblin” in an answer could be harmless, even charming. Across model generations, though, the habit became hard to miss: the goblins kept multiplying, and we needed to figure out where they came from.

Tasks

  • Finish fast16 section – done
  • Work out agent manipulation example and compare results of the two prompts based on ingredients – code is done, need to look at results
  • Write up fast16 Revisited section
  • Take the Tarmac for a test spin up Hamburg – oof!

SBIRs

  • 9:00 standup – done
  • 2:00 TPOC call – done

Phil 4.29.2026

The Kremlin’s Expanding Media Conglomerate 2026

  • This paper expands on the assessment ISW published in 2020, as the Kremlin has evolved its efforts to expand its media conglomerate since the publication of that assessment. The expansion of the Russian media access is not inevitable as the Kremlin faces setbacks in its effort to preserve and form media partnerships. The United States and other countries can and should aim to disrupt the expansion of Russian cognitive warfare infrastructure.

Interactive Tour: Russian and TV BRICS

Tasks

  • BRV at 9:00 – done. Took till noon!
  • Suz at 11:00 – had to cancel
  • Added a section on fast16.sys
  • Fixed the “internet dog” section

SBIRs

  • Start visualizations
  • Look at index2vec code to see how to modify
  • Proposal? Nome, but there is now a meeting scheduled for tomorrow with the TPOC
  • Update stories – done

Phil 4.28.2026

This is such a weird legal document

And keeping in the theme of things:

Tasks

  • Adding new text to the book for the AF image and also fast16.sys
  • Fixed the scale Figure
  • ACM got the rights for the Economist figure!
  • Add notes to pancake printer paper – done. Maybe I can work on that tomorrow while the RV is getting worked on

SBIRs

  • NNM:
    • Get new embeddings for text. Need to get some path pieces working right – done
    • Get 2D and 3D umap – done
    • Cluster – done
    • Visualize!
  • Abstract for tech summit – done
  • 9:30 Meeting about SBIR? – done