Phil 11.8.2024

This is good and thoughtful: The Lesson: The real lesson we should draw from what occurred Tuesday. I also think there are some tactical issues: Undecided voters didn’t believe that some of the highest profile things that happened during Trump’s presidency—even if they saw these things negatively—were his fault.

Perplexity is now returning citations: “Effective immediately, all API users will see citations returned as part of their requests by default. This is not a breaking change. The *return_citations* parameter will no longer have any effect. Refer to our docs.”

Chores

  • 8:00 chat with Matt – done
  • Clean house – done
  • Dishes – done
  • Bills – done
  • Yard – nope, tomorrow
  • Work on book?
  • Ping Carlos about HQA collaboration – Monday, but maybe write up some notes first
  • Ping Nathan – done
  • 5:00 Verizon
  • Call Barbara – pinged

Phil 11.5.2024

Welp, here we go:

Tasks:

SBIRs

  • Up to NJ today. Maybe chat with Aaron while I’m driving. Good meeting! I think we have a plan. Need to write things up this week.

GPT Agents

Phil 10.31.2024

Tasks

  • Call Jim Donnie’s

SBIRs

  • When building the randomizer:
    • Sign of the weave trig functions
    • Size of the envelope
    • range +/-
    • height +/-
  • 9:00 standup – done
  • Write a nice reply to RSA to see if he could provide any introductions to people who might be interested in supporting NNM research – done
  • 4:30 Book club – done
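
A minimal sketch of what the randomizer's parameter draw could look like, going by the list above (sign of the weave trig functions, envelope size, range +/-, height +/-). The names `WeaveParams` and `random_params`, and the default limits, are placeholders I made up for illustration, not the project code:

```python
import random
from dataclasses import dataclass


@dataclass
class WeaveParams:
    """One random draw of trajectory parameters (all names hypothetical)."""
    horiz_sign: int       # +1 or -1, multiplies the horizontal weave trig term
    vert_sign: int        # +1 or -1, multiplies the vertical weave trig term
    envelope: float       # size of the weave envelope (amplitude)
    range_offset: float   # +/- perturbation applied to range
    height_offset: float  # +/- perturbation applied to height


def random_params(rng, max_envelope=1.0, range_jitter=10.0, height_jitter=5.0):
    """Draw one parameter set; the limits are assumed values, not measured ones."""
    return WeaveParams(
        horiz_sign=rng.choice((-1, 1)),
        vert_sign=rng.choice((-1, 1)),
        envelope=rng.uniform(0.0, max_envelope),
        range_offset=rng.uniform(-range_jitter, range_jitter),
        height_offset=rng.uniform(-height_jitter, height_jitter),
    )


# Seeding the generator makes a batch of draws repeatable.
rng = random.Random(42)
batch = [random_params(rng) for _ in range(5)]
```

Passing in a seeded `random.Random` rather than using the module-level functions keeps each dataset reproducible.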

Phil 10.30.2024

I watched KH’s “closing argument” speech and it was quite good. At the same time, Aaron Rupar put together a back-to-back sample of DJT speeches from the beginning of his first campaign and his speech from yesterday. The change in Trump’s energy is stunning.

I’ve also been thinking about ways to detect manipulative images for WH/BH/AI. It could be easier to reverse engineer a prompt, then have an LLM examine that for manipulative intent. It looks like the tools exist in some form. Here’s the CLIP-based prompt generator:

Tasks

SBIRs

  • 9:00 RayTune
  • Continue with trajectory experimentation. I realize that I can break up a trajectory by parts. Also, I need to start using 3D.
  • Looks like I can generate 50k trajectories of 1,000 samples in a bit over 3 seconds! This may work.
  • And I was able to split the trajectory into parts and work on them separately:
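
Roughly what splitting a trajectory into parts could look like. This is a standard-library sketch only: the arc-plus-weave curve in `make_trajectory` is an assumed stand-in for the real generator, both function names are hypothetical, and a pure-Python loop like this will not reproduce the 50k-trajectory timing above.

```python
import math


def make_trajectory(n_samples=1000, weave_amp=0.1, weave_freq=8.0):
    """Return a list of (x, y, z) points: an assumed arc plus a sinusoidal weave."""
    points = []
    last = max(n_samples - 1, 1)  # avoid dividing by zero for a 1-sample trajectory
    for i in range(n_samples):
        t = i / last
        phase = 2.0 * math.pi * weave_freq * t
        x = t
        y = weave_amp * math.sin(phase)                         # horizontal weave
        z = 4.0 * t * (1.0 - t) + weave_amp * math.cos(phase)   # arc + vertical weave
        points.append((x, y, z))
    return points


def split_trajectory(points, n_parts):
    """Split points into n_parts contiguous chunks whose sizes differ by at most one."""
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    size, extra = divmod(len(points), n_parts)
    parts, start = [], 0
    for i in range(n_parts):
        end = start + size + (1 if i < extra else 0)  # first `extra` chunks get one more
        parts.append(points[start:end])
        start = end
    return parts


trajectory = make_trajectory(1000)
launch, midcourse, terminal = split_trajectory(trajectory, 3)
```

The chunks are contiguous and in order, so concatenating them gives back the original trajectory. The `launch`/`midcourse`/`terminal` names are just labels for the example.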

Phil 10.29.2024

Experimental narratives: A comparison of human crowdsourced storytelling and AI storytelling | Humanities and Social Sciences Communications

  • The paper proposes a framework that combines behavioral and computational experiments employing fictional prompts as a novel tool for investigating cultural artifacts and social biases in storytelling both by humans and generative AI. The study analyzes 250 stories authored by crowdworkers in June 2019 and 80 stories generated by GPT-3.5 and GPT-4 in March 2023 by merging methods from narratology and inferential statistics. Both crowdworkers and large language models responded to identical prompts about creating and falling in love with an artificial human. The proposed experimental paradigm allows a direct and controlled comparison between human and LLM-generated storytelling. Responses to the Pygmalionesque prompts confirm the pervasive presence of the Pygmalion myth in the collective imaginary of both humans and large language models. All solicited narratives present a scientific or technological pursuit. The analysis reveals that narratives from GPT-3.5 and particularly GPT-4 are more progressive in terms of gender roles and sexuality than those written by humans. While AI narratives with default settings and no additional prompting can occasionally provide innovative plot twists, they offer less imaginative scenarios and rhetoric than human-authored texts. The proposed framework argues that fiction can be used as a window into human and AI-based collective imaginary and social dimensions.

Tasks

  • Call Jim Donnie’s
  • Halloween treats

SBIRs

  • Start on trajectory experimentation
  • 9:00 Standup
  • 10:00 LM/SA chat

Phil 10.28.2024

Just a bit over a week until they start counting votes.

This looks like a nice way of creating a first pass at code documentation: lmdocs: Generative AI for code documentation

Tasks

  • Call Jim Donnie’s
  • Vote! Plenty of time between the morning and afternoon meetings – done!

SBIRs

  • Start looking at the trade show project. I think the first thing I’ll do is set up an overleaf project. Then create a data generator to ease back into coding
    • Underlying curve with additional horizontal and vertical weave patterns.
    • Goal is to generate at least 10,000 samples fast
    • Calculate intersections of a straight line to points on the curve. For each point, calculate the time for iterators on the two lines to intersect. It might be possible to project this into a 2D space, since in this case the lines are functions, which means the intersection is a function, too.
    • Or maybe, just have the data generator extrapolate a straight line, calculate the intercept to that, and see if at that time, the two source lines are within a threshold. I think I like that. This should be pretty fast and generate nice data.
  • Do a getting started on PyTorch 2.5.
  • Train a model to predict something that supports the heat map display. I think it could simply be the distance between the points at the time of intersection with the projected line.
  • 10:00 – 11:30 SimAccel review. Some nice stuff! I need to talk to Ron about using some of the (RayTune at least?) pipeline for the demo project, because I kind of like being able to specify datasets and a range of architectures, and let it decide what the best/fastest model is for learning a new trajectory/intersection dataset.
  • Uploaded the proposal to the ASRC overleaf. Some last-second tweaks, so redid that.
  • 3:00 Tradeshow demo tagup.
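
A sketch of the "extrapolate a straight line, calculate the intercept, check the threshold" idea from the data generator notes above, in 2D and standard library only. Everything here is my assumption about one way to set it up: the function names, a constant-speed interceptor starting at `q0`, and a finite-difference estimate of the target velocity.

```python
import math


def intercept_time(p0, v, q0, speed):
    """Smallest tau > 0 with |p0 + v*tau - q0| == speed*tau, or None if unreachable."""
    dx, dy = p0[0] - q0[0], p0[1] - q0[1]
    a = v[0] ** 2 + v[1] ** 2 - speed ** 2
    b = 2.0 * (dx * v[0] + dy * v[1])
    c = dx * dx + dy * dy
    if abs(a) < 1e-12:  # equal speeds: the quadratic collapses to b*tau + c = 0
        if abs(b) < 1e-12:
            return None
        tau = -c / b
        return tau if tau > 0 else None
    disc = b * b - 4.0 * a * c
    if disc < 0:
        return None
    root = math.sqrt(disc)
    taus = [t for t in ((-b - root) / (2 * a), (-b + root) / (2 * a)) if t > 0]
    return min(taus) if taus else None


def miss_distance(target_fn, t0, dt, q0, speed):
    """Distance between the true target and the projected-line intercept point."""
    p0, prev = target_fn(t0), target_fn(t0 - dt)
    v = ((p0[0] - prev[0]) / dt, (p0[1] - prev[1]) / dt)  # straight-line extrapolation
    tau = intercept_time(p0, v, q0, speed)
    if tau is None:
        return None
    projected = (p0[0] + v[0] * tau, p0[1] + v[1] * tau)
    actual = target_fn(t0 + tau)
    return math.hypot(actual[0] - projected[0], actual[1] - projected[1])


def weaving(t):
    """Assumed weaving target: steady motion in x with a sinusoidal weave in y."""
    return (t, math.sin(t))


miss = miss_distance(weaving, t0=0.0, dt=0.1, q0=(4.0, 3.0), speed=0.75)
hit = miss is not None and miss < 0.5  # threshold value is arbitrary
```

The miss distance is also a candidate for the training label mentioned above (the distance between the points at the time of intersection with the projected line). For a target that really does move in a straight line it comes out as zero, which makes a handy sanity check.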

Phil 10.25.2024

Found a good source for election interference that’s been vetted by the IC: Election Security | Cybersecurity and Infrastructure Security Agency CISA

Day off for real this time!

Put together an intercept version for the M2M ride. They are supposed to leave at 9:00, so I’ll leave at 9:30?

Chores

  • Get stove, dishwasher, and washing machine running – done
  • Scrap metal and textiles – done
  • Groceries – done
  • Clean house – done
  • Dishes – done
  • Vote! Tried, but there was a line and I didn’t have time
  • Bills – done

Phil 10.24.2024

Today in malicious use of AI: American creating deep fakes targeting Harris works with Russian intel, documents show

  • The documents show that John Mark Dougan, who also served in the U.S. Marines and has long claimed to be working independently of the Russian government, was provided funding by an officer from the GRU, Russia’s military intelligence service. Some of the payments were made after fake news sites he created began to have difficulty accessing Western artificial intelligence systems this spring and he needed an AI generator — a tool that can be prompted to create text, photos and video.

And today in Unintended Consequences for vulnerable groups: Can A.I. Be Blamed for a Teen’s Suicide?

  • “It’s going to be super, super helpful to a lot of people who are lonely or depressed,” Noam Shazeer, one of the founders of Character.AI, said on a podcast last year.
  • Now, when a headline is in the form of a question, the rule of thumb is that the answer is “no.” However, this aligns more with what could happen if weapons-grade AI, used in an apparently innocuous app, identified easily manipulated targets and exploited them. Replika is another example of this sort of accidental effect that could easily be weaponized.

10:00 MCC meeting. Turns out that AdAstra might have the capability. I mean, it should!

SBIRs

  • 9:00 standup
  • Wrap up proposal? Made a quad_main.tex file to hold the chart. Smallest LaTeX ever!
\documentclass[12pt]{article}
\usepackage{style/govstyle}
%opening
\pagestyle{fancy}
\rfoot{UNCLASSIFIED}
\lfoot{\textbf{ADS Quad Chart}}
\renewcommand{\headrulewidth}{0pt}


\begin{document}
\pagenumbering{gobble}

\fbox{\includegraphics[scale=0.8,angle=-90]{assets/Quad-chart.pdf}}

\end{document}
  • More tweaks on the proposal – nothing major
  • 4:30 book club – fun! Quick!

GPT Agents

  • 2:45 meeting – Nope? Wrong link?

Phil 10.22.2024

Move some $$ around for contracting. May have to fire up the home equity LoC.

Testing and Evaluation of Health Care Applications of Large Language Models: A Systematic Review

  • Of 519 studies reviewed, published between January 1, 2022, and February 19, 2024, only 5% used real patient care data for LLM evaluation. The most common health care tasks were assessing medical knowledge such as answering medical licensing examination questions (44.5%) and making diagnoses (19.5%). Administrative tasks such as assigning billing codes (0.2%) and writing prescriptions (0.2%) were less studied. For NLP and NLU tasks, most studies focused on question answering (84.2%), while tasks such as summarization (8.9%) and conversational dialogue (3.3%) were infrequent. Almost all studies (95.4%) used accuracy as the primary dimension of evaluation; fairness, bias, and toxicity (15.8%), deployment considerations (4.6%), and calibration and uncertainty (1.2%) were infrequently measured. Finally, in terms of medical specialty area, most studies were in generic health care applications (25.6%), internal medicine (16.4%), surgery (11.4%), and ophthalmology (6.9%), with nuclear medicine (0.6%), physical medicine (0.4%), and medical genetics (0.2%) being the least represented.

SBIRs

  • 9:00 standup
  • Work on proposal. I think finish up Technical, and start to figure out the SOW – done with the first draft of both! Tomorrow is the Quad chart