Monthly Archives: September 2023

Phil 9.11.2023

Twenty-two years ago, I remember this day starting as a crisp autumn morning with infinite, clear blue skies.

Sam Bankman-Fried’s jail conditions offer a glimpse at systemic failure

Everything I’ll forget about prompting LLMs

SBIRs

  • Submit expenses
  • We need another story. In this case, it’s another war room vignette, but this time from the defense’s side. Maybe with M again? Of course, part of this is figuring out what defenses might actually look like. One thing I’d like to re-use is the idea of diverse operator teams looking for misbehaving models. In this case though, maybe the models are trained to be honeypots for attacks? They go along in their day-to-day, sending emails, running dummy companies, having dates, etc. When they start acting too aligned, then it’s time to start looking for trouble. Maybe digital twins of important people?
  • 9:00 Sprint demos. Make Slides
  • 2:00 Weekly MDA meeting
  • 3:00 Sprint planning

GPT Agents

  • Start filling out IRB form

Phil 9.8.2023

SBIRs

  • Had a good chat with Rukan yesterday. What worked with the hdfproc data didn’t work with the new offsets? He’s going to run some tests
  • I really want to add a new project to the LLM IRAD. Something like NNMap-enabled group support. Need a better name, some slides (mentioning “killer app” and all the possible uses), and a schedule.
  • Tweaked the Jan6 AI subsection to integrate better into the rest of the section
  • Need to add a “Detect and Defend” section
  • Need to add an “AI Arms Control for Societal AI Weapons” section. Show that this is in everyone’s best interests. Authoritarian regimes are potentially at greater risk, particularly for Spanner and Lobotomy attacks.

Phil 09.07.2023

SBIRs

  • 9:00 standup
  • LLM schedule planning with Aaron. Done
  • 2:00 Dahlgren follow-up meeting
  • More scale paper. Add QAnon as the other main component of Jan6

GPT Agents

  • Tests with Roger and/or Aaron?

Phil 09.06.2023

SBIRs

  • Submitted my technical fellows stuff
  • Steve’s presentation – added comments
  • Installing sw on the laptop – done
  • MDA next steps (intersection of TI and current time allows for sync. We’d need several points, but not too many)
  • LLM planning. Need to create schedules?

GPT Agents

  • 3:00 meeting? Yup. Alden seems to be finding traction

Phil 9.5.2023

Nice three-day weekend, but boy did it end hot!

SBIRs

  • Q6 Report:
    • Add an overview of SEG’s white paper to commercialization section. Done
    • Submit – done!
    • Check with Aaron about the white paper to see if it’s good to go in – done
    • Spending a lot of time with Rukan checking whether the propagation is the same for the two setups
    • Work through the Solid getting-started documentation. Once I have a framework up and running, I can load the StampedeTheory chapter summaries into Supabase, then access them using LangChain
    • MCWL meeting

GPT Agents

  • Ping Roger for some testing?

Phil 9.1.2023

Wow. September. And the Halloween candy displays have been up for a while already

SBIRs

GPT Agents

  • Sent out an email to the team to schedule some user testing

And I’ve run out of gas. Going to clean house