Author Archives: pgfeldman

Phil 4.4.2024

Dont you (forget NLP): Prompt injection with control characters in ChatGPT

  • As part of this work, we recently observed some unusual behavior with two popular large language models from OpenAI, in which control characters (like backspace) are interpreted as tokens. This can lead to situations where user-controlled input can circumvent system instructions designed to constrain the question and information context. In extreme cases, the models will also hallucinate or respond with an answer to a completely different question.

SBIRs

  • 2 hour meeting in Moorestown with a 2.5 hour drive each way. Not sure it’s worth it, but the meeting went well,
  • Closed out my stories, made slides, and created new stories. Still need to do one for the IS Dept presentation on the 22nd

Phil 4.3.2024

5:00 power washer

CUI 2024 paper:

SBIRs

  • Slide deck and stories for Monday – put in a story to make a sandbox project using a makefile. cookiecutter, name, venv, and source.

GPT Agents

  • 3:00 Alden

Phil 4.2.2025

Need to move the crown appointment

Review board faults Microsoft for ‘cascade’ of errors in China hack

  • The report details what it calls a “cascade of avoidable errors.” For instance, Microsoft had not noticed the presence of an old signing key from 2016 that should have been disabled but wasn’t. “That one just sat for years, kind of forgotten,” a second person said. Part of the problem was that Microsoft was supposed to switch from a manual key rotation to an automated system that minimized the chance of human error. But that switch never happened. “They never prioritized fixing the problem,” the first person said.

SBIRs

  • More CUI paper. Bring in papers on dark patterns in CUIs and CSCW
  • 9:00 Standup
  • Ron’s in today? Maybe more setup. Done! Wrote up a story too.
  • AI Ethics?
  • MCML?

Phil 4.1.2024

Call powerwasher

SBIRs

  • Working on the CUI paper
  • Did the vague letter thing. Still not sure why anything should happen, but it’s an encantation maybe?

Phil 3.28.2024

Pick up the truck if the rain stays away this morning – done!

Good Organizational Lobotomy writeup: The Descent of Elon Musk

SBIRs

  • Made good progress on the LLM notes yesterday. Waiting for the AWS instance to get turned on so I can play with UMAP. Got a lot done, though there are resolution issues that need to be worked out.
  • 9:00 Standup – done
  • 11:30 CSC touchpoint – done
  • 1:00 Cyber COP presentation – done, and went well.

GPT Agents

  • LLM Meeting – went over the NNM work with Shimei
  • Started on CUI provocation. As part of that, I’ve started reading Tim O’Reily’s biography(?) of Frank Herbert. It’s really good. He also says that Dune is an alternate Foundation, and Paul is The Mule. Fascinating.
    • In Dune, each of the players—the Emperor, the Baron Harkonnen (archenemy of the Atreides), the monopolistic Spacing Guild, even the seemingly wise Bene Gesserit gene manipulators—tries either to dominate the situation or to control it in such a way as to minimize his own risks. And in the end all are overwhelmed. The elemental forces of history can only be ridden, not controlled. Paul alone is victorious, because he chooses to ride the whirlwind. He risks everything. His initiation by the Fremen into riding the sandworms is symbolic of his choice. These predators represent all the elemental forces of Arrakis: their native name means “maker,” and they are the heart of the ecological matrix of the planet, source of the spice, the sand, and thief of water. And, like nature itself, they abhor artificial boundaries; they are drawn irresistibly to destroy the protective energy shields relied on by off-worlders. They close the desert to all who try to isolate themselves from it; only the Fremen “sandriders,” who move with the rhythms of the desert, and mount the fearsome worm, can brave its wilds.
  • Start White Hat AI section – tomorrow?

Phil 3.27.2024

AI experts: Racist audio of Baltimore County principal’s voice is fake – The Baltimore Banner

  • The Eiswert audio, Lyu said, shows the danger of the degree to which artificial intelligence could be used to harm individuals. When deepfakes are used on celebrities or well-known political figures, they are easier to detect, both because there’s an abundance of video and audio of their voices and so the public is more likely to believe and spot a fake. “If they are focusing on less prominent people … the damage they are causing is bigger.”

Example: Pandas Excel output with a chart

##############################################################################
#
# An example of converting a Pandas dataframe to an xlsx file with a chart
# using Pandas and XlsxWriter.
#
# SPDX-License-Identifier: BSD-2-Clause
# Copyright 2013-2024, John McNamara, jmcnamara@cpan.org
#

import pandas as pd


# Create a Pandas dataframe from some data.
df = pd.DataFrame({"Data": [10, 20, 30, 20, 15, 30, 45]})

# Create a Pandas Excel writer using XlsxWriter as the engine.
writer = pd.ExcelWriter("pandas_chart.xlsx", engine="xlsxwriter")

# Convert the dataframe to an XlsxWriter Excel object.
df.to_excel(writer, sheet_name="Sheet1")

# Get the xlsxwriter workbook and worksheet objects.
workbook = writer.book
worksheet = writer.sheets["Sheet1"]

# Create a chart object.
chart = workbook.add_chart({"type": "column"})

# Get the dimensions of the dataframe.
(max_row, max_col) = df.shape

# Configure the series of the chart from the dataframe data.
chart.add_series({"values": ["Sheet1", 1, 1, max_row, 1]})

# Insert the chart into the worksheet.
worksheet.insert_chart(1, 3, chart)

# Close the Pandas Excel writer and output the Excel file.
writer.close()

SBIRs

  • Spent some time adding references to the slide deck
  • Add a “to_excel()” method – done
  • Meet with Aaron to set up AWS for UMAP? Things are working, but I don’t have an instance. Blocked for now
    • Install ecco
    • Install HFace
    • Maybe langchain?
  • 1:30: LLM UI Tools – done

GPT Agents

  • Meeting with Alden

Phil 3.26.2024

Here’s Google’s generative API/playground/documentation site: Vertex AI

SBIRs

  • 9:00 standup. Going to go over the IUI conference
  • Working on the NNM again. Just getting back into things and generated a cosine similarity (by layer) to the average vector for each layer:
  • Although it looks interesting, I think it would make more sense to export as a spreadsheet to show numbers and colors. Also, I’m not convinced that this is the right way to go. I’d like to try UMAP, but can’t get it to behave on my box. Going to try spinning up an instance on AWS

Phil 3.25.2024

On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial

  • The development and popularization of large language models (LLMs) have raised concerns that they will be used to create tailor-made, convincing arguments to push false or misleading narratives online. Early work has found that language models can generate content perceived as at least on par and often more persuasive than human-written messages. However, there is still limited knowledge about LLMs’ persuasive capabilities in direct conversations with human counterparts and how personalization can improve their performance. In this pre-registered study, we analyze the effect of AI-driven persuasion in a controlled, harmless setting. We create a web-based platform where participants engage in short, multiple-round debates with a live opponent. Each participant is randomly assigned to one of four treatment conditions, corresponding to a two-by-two factorial design: (1) Games are either played between two humans or between a human and an LLM; (2) Personalization might or might not be enabled, granting one of the two players access to basic sociodemographic information about their opponent. We found that participants who debated GPT-4 with access to their personal information had 81.7% (p < 0.01; N=820 unique participants) higher odds of increased agreement with their opponents compared to participants who debated humans. Without personalization, GPT-4 still outperforms humans, but the effect is lower and statistically non-significant (p=0.31). Overall, our results suggest that concerns around personalization are meaningful and have important implications for the governance of social media and the design of new online environments.

Tasks

  • Dentist!
  • Truck? – Nope. Should be ready Friday
  • Eclipse email

SBIRs

  • Expense report
  • Put together a set of slides that covered what I thought was interesting at the conference
  • CUI Provocation
  • Killer Apps post for HCAI Medium Magazine(?)
  • 4:00 meeting with Bob S. to go over the MP part of the LM white paper
  • Need to get back to NNM

Phil 3.24.2024

Just realized that Fritz Lang’s Metropolis is an example of a deepfake/KillerApp:

 …the false Maria unleashes chaos throughout Metropolis, driving men to murder and stirring dissent among the workers.

https://en.wikipedia.org/wiki/Metropolis_(1927_film)

Deepfake Kari Lake video shows coming chaos of AI in elections

  • They brainstormed ideas for about a week and enlisted the help of a tech-savvy friend. On Friday, Stephenson published the piece, which included three deepfake clips of Lake.

Phil 3.23.2024

Drove back from Greenville NC yesterday. Eleven hours or so. Electric cars are still not common on the long stretches between cities. I live in the Baltimore/DC region, and I’d say 1 in 20 cars that I see on the roads at this point is electric, with a big mix of manufacturers. Still mostly Teslas. On my drive, I only saw Teslas, and not very many of them. Charging infrastructure is solid. Leaving with a full charge, it required 2 SuperCharger recharges to cover the 535 miles. Probably $20 in electricity? And autopilot makes everything much nicer, though you always have to be ready for phantom braking.

Need to ping Mario, Ossi, and James about Eclipse coordination

Phil 3.21.2024

IUI 2024 Notes

Information visualization and Visual Analytics

25% confident

Applications of Language Models

Generative AI: Theory and Applications

Phil 3.20.2024

Got accepted for my talk at the 92nd MORS symposium!

IUI 2024 Notes

I had an interesting chat yesterday with Ossi about Active Measures Leading up to and since October 7. We need some time to sit down and talk. My sense is that all sides have been under prolonged external influence for a long time with the specific intent to raise the political temperature so that exactly this situation happened.

Keynote: Prof. Krzysztof Gajos (check Google Scholar for references)

  • Predictive text can manipulate the users, who wind up reflection the biases of the predictive text model. Change the organization’s model, change the bias of the organization.
  • The mere presence of an explanation increases the credibility of AI assistance, regardless of the content. Fact-like assertions increase the perceived competence of the AI. This is a dark pattern that needs to be detected.
  • Learning means that cognitive engagement occurs, but AI answers vs. cognitive forcing does not impact the amount of learning
  • Providing the material to support a decision but not a decision suggestion, worked better than any answer-based decision aids. This may be a key for complementation
  • Denial of a request is treated emotionally, not cognitively. This is another vector that needs to be recognized and adapted to. A source could be paper submission rejects.
  • Critical techical practice – question assumptions

HCAI, Bias and Fairness in AI

Lunch ride!

AI Tools, User Interfaces and Interaction

AI for Health

Dinner Banquet – fun 🙂

Had a thought for the day. For a learning assignment, have students build a context prompt that lets an LLM answer a question on the rubric correctly. Bonus points if the models is able to answer a question that is outside the domain where a raw LLM has struggled. This way you have a project that requires students learning the topic, and also exposes them to weaknesses and strengths of LLMs. Not sure if this is a good idea, but it could be worth poking at.

Phil 3.19.2024

Did my talk and demo at HAI-GAN yesterday. It went well, but I had forgotten my poster! I ad-hoc-ed on with the hotel printer and my slide deck:

Some really interesting talks:

Main conference starts today. I’ll post notes here:

  • Keynote – pathsto effective AI for diverse real people
    • Stumble forward empirracally
    • Draw from the field of education (e.g learning goals imply associated measures)
    • Add “after actin review” to your AI design – one of the things that seems to help is letting users come up with their own labels for things, which helps building abstractions that help overall understanding
    • Measure inclusiveness, and determine the why (GenderMag survey? Also SocioeconomicMag, AgeMag, InclusiveMag)
      • More inclusive designs of humans+AI ecosystem
      • Persona-based AI debugging

Got to chat with Mauro Martino about maps a bit

Sesssion 1:AI in Personalization, Recommendation and Search

Multimodal Models and Interaction

Poster session. Met a bunch of nice folks, and had a longer chat with Mauro. Need to follow up.

Phil 3.17.2024

No idea what to make of this:

  • Red Dragon 1949.com is the premiere global web location for all updated information focused on the People’s Republic of China. Our intent is to develop a mutual cooperation and understanding of how the Internet and connected systems can be used by a nation state as a military weapon system.

Dammit

Phil 3.16.2024

Chores

  • Drop off truck
  • Haircut
  • Slide backup
  • Laundry
  • Get gas for mower
  • Weed
  • Mow lawn
  • Clean house
  • Pack!
  • Note!
  • Trash

Kind of an interesting take on a way that white hat AI could work. Something like AI-generated annotations for all the manipulations we are being exposed to and the way that it is supposed to make you feel. And a knob that replaces the manipulative text with neutral text (possibly with whitelist fact checks):