Phil 8.18.2022

OPT model for online use: opt.alpa.ai

Book

Got a decline from Princeton
Sent off the last proposal to University of Toronto. Next are the for-profit presses
Got a nice decline from the University of Toronto, which suggested I look at NYU press
Sent an example proposal to Jimmy, and got a very encouraging response back!
- I took a look through Dr. Feldman’s proposal. It’s fascinating stuff; very appropriate and timely with regard to how people get their messaging from social media. I’ve forwarded it to one of my colleagues who signs in the social sciences area to see if it might be a good fit for her portfolio. If she thinks it is, I’d be happy to make introductions.
- With regard to the proposal itself, I don’t really have any comments. It’s well done and compelling. I just have one comment about the sample chapter, unrelated to the proposal as a whole. Dr. Feldman includes several 3rd party figures that are not properly attributed, and with therefore questionable permissions. For instance, Figure 2: Ruby Bridges with U.S. Marshals. The caption should include information about where the image came from and should indicate that permission was given to use it. The same goes for figures 1 and 4. Perhaps he’s planning to sort that all out once he has a contract but authors do need to be very careful about any third party content they include, even in a sample chapter.

Getting in contact with a copy editor via school – got the email and sent an intro. She’s $20/hr. I need to send a sample

SBIRs

Continue to push MORS up hill – Submitted!
More markdown documentation for Chirp – Done! Also update PyPI – Done!
9:15 standup
11:30 CSC
Quarterly report

GPT Agents

Install and play with Top2Vec?

Phil 8.17.2022

GPT-Agents

Continue Chirp submission
See if Top2Vec works, and if it can tell the difference between ivermectin and Paxlovid posts
IUI 2023
- Full papers
- Demos
I have a fun idea for a paper. Use a mad-libs approach to min-dalle prompt generation and see how well the system(s) perform as the prompts go from normal to borderline. We could use machine image description to validate.
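A minimal sketch of what the mad-libs part could look like, assuming a single sentence template with named slots. The template and word lists below are made up for illustration (hypothetical, not from any real experiment), and nothing here calls an image model; it only enumerates the prompts that would be fed to one.

```python
from itertools import product

# Hypothetical template and slot values, ordered from ordinary to odder.
TEMPLATE = "a {adjective} {subject} {action} in a {place}"
SLOTS = {
    "adjective": ["happy", "angry"],
    "subject": ["dog", "clown"],
    "action": ["sleeping", "shouting"],
    "place": ["park", "courtroom"],
}


def generate_prompts(template, slots):
    """Yield one prompt for every combination of slot values."""
    names = list(slots)
    for values in product(*(slots[name] for name in names)):
        yield template.format(**dict(zip(names, values)))


prompts = list(generate_prompts(TEMPLATE, SLOTS))
```

Because the combinations come out in a fixed order, the same list can be replayed against different systems and the generated images compared prompt by prompt.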

SBIRs

Quarterly report
Chirp
MORS
RCSNN/3D graphics

Book

University of Toronto press? Looked at this last night, but I had just come back from the dentist and didn’t have the motivation. It’s a letter, so it should be straightforward. Then I think the National Academic press is pretty vague, and may just be a letter too. Then it’s time to poke at the for-profit academic press
Send Jimmy an example proposal that he can pass on for a sanity check with his editor.

Phil 8.16.2022

You can really tell that the days are getting shorter

Efficient Training of Language Models to Fill in the Middle (This is basically the reverse GPT concept)

We show that autoregressive language models can learn to infill text after we apply a straightforward transformation to the dataset, which simply moves a span of text from the middle of a document to its end. While this data augmentation has garnered much interest in recent years, we provide extensive evidence that training models with a large fraction of data transformed in this way does not harm the original left-to-right generative capability, as measured by perplexity and sampling evaluations across a wide range of scales. Given the usefulness, simplicity, and efficiency of training models to fill-in-the-middle (FIM), we suggest that future autoregressive language models be trained with FIM by default. To this end, we run a series of ablations on key hyperparameters, such as the data transformation frequency, the structure of the transformation, and the method of selecting the infill span. We use these ablations to prescribe strong default settings and best practices to train FIM models. We have released our best infilling model trained with best practices in our API, and release our infilling benchmarks to aid future research.
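To make the transformation concrete, here is a rough sketch of the "move a middle span to the end" idea. The sentinel strings, the uniform choice of cut points, and the 0.5 default rate are assumptions for illustration, not the paper's actual tokens or recommended settings.

```python
import random

# Placeholder sentinels for this sketch only (assumed, not the real special tokens).
PRE, SUF, MID = "<PRE>", "<SUF>", "<MID>"


def fim_transform(doc: str, rng: random.Random) -> str:
    """Cut doc into prefix | middle | suffix and move the middle to the end."""
    # Two random cut points; sorting keeps them in left-to-right order.
    a, b = sorted(rng.randint(0, len(doc)) for _ in range(2))
    prefix, middle, suffix = doc[:a], doc[a:b], doc[b:]
    return f"{PRE}{prefix}{SUF}{suffix}{MID}{middle}"


def maybe_fim(doc: str, rng: random.Random, fim_rate: float = 0.5) -> str:
    """Transform roughly a fim_rate fraction of documents; leave the rest as-is."""
    return fim_transform(doc, rng) if rng.random() < fim_rate else doc
```

A model trained on the transformed text still predicts left to right; it just sees the prefix and suffix first and then generates the middle.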

Patching open-vocabulary models by interpolating weights

Open-vocabulary models like CLIP achieve high accuracy across many image classification tasks. However, there are still settings where their zero-shot performance is far from optimal. We study model patching, where the goal is to improve accuracy on specific tasks without degrading accuracy on tasks where performance is already adequate. Towards this goal, we introduce PAINT, a patching method that uses interpolations between the weights of a model before fine-tuning and the weights after fine-tuning on a task to be patched. On nine tasks where zero-shot CLIP performs poorly, PAINT increases accuracy by 15 to 60 percentage points while preserving accuracy on ImageNet within one percentage point of the zero-shot model. PAINT also allows a single model to be patched on multiple tasks and improves with model scale. Furthermore, we identify cases of broad transfer, where patching on one task increases accuracy on other tasks even when the tasks have disjoint classes. Finally, we investigate applications beyond common benchmarks such as counting or reducing the impact of typographic attacks on CLIP. Our findings demonstrate that it is possible to expand the set of tasks on which open-vocabulary models achieve high accuracy without re-training them from scratch.
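The core operation is just a linear interpolation between two checkpoints. A toy sketch, assuming the weights are stored as name-to-list-of-floats dictionaries (a stand-in for real tensors) and treating the mixing coefficient alpha as a plain parameter; how alpha gets picked is left out here.

```python
from typing import Dict, List

Weights = Dict[str, List[float]]


def interpolate_weights(zero_shot: Weights, fine_tuned: Weights, alpha: float) -> Weights:
    """Return (1 - alpha) * zero_shot + alpha * fine_tuned, parameter by parameter."""
    if zero_shot.keys() != fine_tuned.keys():
        raise ValueError("both checkpoints must have the same parameter names")
    return {
        name: [
            (1.0 - alpha) * z + alpha * f
            for z, f in zip(zero_shot[name], fine_tuned[name])
        ]
        for name in zero_shot
    }


# Toy checkpoints with made-up numbers (not real CLIP weights).
theta_zs = {"layer.weight": [0.0, 1.0], "layer.bias": [0.5]}
theta_ft = {"layer.weight": [1.0, 3.0], "layer.bias": [1.5]}
patched = interpolate_weights(theta_zs, theta_ft, alpha=0.5)
```

With alpha = 0 this returns the zero-shot model unchanged and with alpha = 1 the fine-tuned one, so the interesting values are in between.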

Alex Jones and the Lie Economy

Discerning audiences who stumble on Jones’ show turn him off, but his message excites the credulous who, if they don’t fully subscribe to the man’s views, want to hear more of the same. Lies are almost always more exciting and exploitable than dull truths. Having culled the impressionable from the doubting and boosted their pulse rate, he turns them over to his merchandising wing where he sells survivalist gear and health supplements like Brain Force Ultra, Winter Sun Plus Vitamin D and a variety of “Superblue Silver” products (immune gargle, toothpaste and wound dressing) that Jones claimed could mitigate Covid. It’s not incidental that the products he hawks are presented as the fix for coming apocalyptic perils predicted on his shows. Citing court filings submitted by Jones’ attorneys in discovery, HuffPost reports that InfoWars collected $165 million in sales of these products from September 2015 to the end of 2018.

GPT-Agents

Continue Chirp submission
See if Top2Vec works, and if it can tell the difference between ivermectin and Paxlovid posts
3:30 Meeting
IUI 2023
- Full papers
- Demos

SBIRs

8:30 SEG staffing changes
9:00 Sprint planning
- Chirp
- MORS
- Quarterly Report
- RCSNN/3D graphics

Book

University of Toronto press?
Started on the Strategy and Tactics in Online Conflict proposal

Phil 8.15.2022

Book

Rejection from Columbia
Looked at how to hire a copy editor a bit. Found this and this
Need to continue submissions, and then start followups.
Submitted to McGill-Queen’s. It’s a Canadian school and I used the deep bias chapter which has the indigenous school fiasco

GPT Agents

Working on Chirp submission – finished the (a?) video and edited it down to three minutes. If I have more time I’ll redo it
Tweaked the KeywordExplorer UI a bit

SBIRs

Working on quarterly report

Phil 8.12.2022

Baseball tix!

Social Simulacra: Creating Populated Prototypes for Social Computing Systems

Social computing prototypes probe the social behaviors that may arise in an envisioned system design. This prototyping practice is currently limited to recruiting small groups of people. Unfortunately, many challenges do not arise until a system is populated at a larger scale. Can a designer understand how a social system might behave when populated, and make adjustments to the design before the system falls prey to such challenges? We introduce social simulacra, a prototyping technique that generates a breadth of realistic social interactions that may emerge when a social computing system is populated. Social simulacra take as input the designer’s description of a community’s design — goal, rules, and member personas — and produce as output an instance of that design with simulated behavior, including posts, replies, and anti-social behaviors. We demonstrate that social simulacra shift the behaviors that they generate appropriately in response to design changes, and that they enable exploration of “what if?” scenarios where community members or moderators intervene. To power social simulacra, we contribute techniques for prompting a large language model to generate thousands of distinct community members and their social interactions with each other; these techniques are enabled by the observation that large language models’ training data already includes a wide variety of positive and negative behavior on social media platforms. In evaluations, we show that participants are often unable to distinguish social simulacra from actual community behavior and that social computing designers successfully refine their social computing designs when using social simulacra.

SBIRs

Submit an abstract by 19 August for the opportunity to participate in MORS’ one-of-a-kind event held at the new IDA Center from 27-29 September! With high-level speakers including Dr. Baruch Fischhoff, Dr. Kristen Kulinowski, Dr. Michael Ford and Dr. Ryan Barrett, the Emerging Techniques Forum (ETF) is one you will not want to miss this year. All abstracts must be submitted in an unclassified format and be 1,500 characters or fewer (including spaces), without images or videos. If you are submitting an abstract for the classified session, indicate the classification level at the time of submission.
- Mostly done. Need some additional paperwork filled out. Sent that off as well
Prep slides for Sprint review and finish off tasks – done

GPT Agents

Deploy updated versions for Chirp
Test and validate balanced pull
Run balanced and proportional 10,000 tweet pulls for ivermectin and Paxlovid
Try running Top2Vec on tweets to see what the topic spaces look like
Try to get some threads in those two spaces and use those to show trajectories through topics
If there are enough intersecting trajectories, then create narrative embedding space
Had a good talk with Aaron yesterday about his discord group and how that could be a nice source of maps.
Submit KeywordExplorer to Chirp Developer Challenge by Aug 19
- Content discovery apps
- Include an App built with the required developer tools and meets the above Project Requirements.
- Include a text description that should explain the features and functionality of your App.
- Include a description of which category you are submitting to.
- Include a link to a fully deployed app.
- Provide Twitter handle associated with the developer account.
- Include a demonstration video of your App. The video portion of the submission:

Book

Submitted to U Columbia Press

Phil 8.11.2022

We asked some of the philosophers we respect most who have interests in these areas to help us make progress on them. We leaned especially on the early career researchers who are doing much of the running on this topic (Canadian Journal of Philosophy).

https://twitter.com/sethlazar/status/1557478815978311680

SBIRs

More writing
Work on RCSNN
- Need to start testing that it draws and runs correctly
- Had a few bugs to fix but it’s working!

Submit an abstract by 19 August for the opportunity to participate in MORS’ one-of-a-kind event held at the new IDA Center from 27-29 September! With high-level speakers including Dr. Baruch Fischhoff, Dr. Kristen Kulinowski, Dr. Michael Ford and Dr. Ryan Barrett, the Emerging Techniques Forum (ETF) is one you will not want to miss this year. All abstracts must be submitted in an unclassified format and be 1,500 characters or fewer (including spaces), without images or videos. If you are submitting an abstract for the classified session, indicate the classification level at the time of submission.

GPT Agents

Test and validate balanced pull
Run balanced and proportional 10,000 tweet pulls for ivermectin and Paxlovid
Try running Top2Vec on tweets to see what the topic spaces look like
Try to get some threads in those two spaces and use those to show trajectories through topics
If there are enough intersecting trajectories, then create narrative embedding space
Had a good talk with Aaron yesterday about his discord group and how that could be a nice source of maps.

Book

Submitted to U Illinois Press

Phil 8.10.2022

Some good sources on belief research referenced in this article:

The Strength of Our Political Loyalties Changes Our Actual Beliefs

Kristoffer Nimark, an economist at Cornell, and Savitar Sundaresan, of Imperial College London, describe belief polarization this way: “The beliefs of ex ante identical agents over time can cluster in two distinct groups at opposite ends of the belief space.” (Inattention and belief polarization)

SBIRs

More writing
Work on RCSNN – Made some good progress. The generated code is cleaner and easier to read. Need to start testing that it draws and runs correctly

GPT Agents

Test and validate balanced pull
Run balanced and proportional 10,000 tweet pulls for ivermectin and Paxlovid
Try running Top2Vec on tweets to see what the topic spaces look like
Try to get some threads in those two spaces and use those to show trajectories through topics
If there are enough intersecting trajectories, then create narrative embedding space

Phil 8.9.2022

GPT Agents

Working on improved one day download
3:30 meeting

SBIRs

More report
No response yet from Dr. J
MDA contracts today – will see how ATO is working
More tweaking RCSNN for multiple instances of single class

Phil 8.8.2022

For those of you keeping track of such things, I’m still testing positive. And I largely feel fine, with what may(?) be the odd relapse? I had what felt like a cold that came on Thursday of last week and went through Friday and a bit of Saturday. No fever, though my blood pressure is up a bit and my temperature is reliably up a fraction of a degree from where it normally is – like 97.4F vs. 96.9F. I’ll be curious if things go back to their previous levels after I start testing negative.

Book

Start on the remaining proposals
- Sent a letter to University of California Press
Sent Gaia Vince a note over the weekend

GPT Agents

Get balanced working with clamping. Got most of the code working in a scratch. Need to migrate to the main branch
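One possible reading of "balanced with clamping", sketched out: split the requested total evenly across keywords, clamp each keyword to the number of tweets actually available, and hand any unused share to the keywords that still have room. The function name, the dictionary input, and the redistribution rule are all assumptions for illustration; this is not the actual KeywordExplorer code.

```python
from typing import Dict


def balanced_allocation(available: Dict[str, int], total: int) -> Dict[str, int]:
    """Evenly split total across keywords, clamped to what each keyword has."""
    alloc = {k: 0 for k in available}
    remaining = total
    # Keywords that can still accept more tweets.
    open_keys = [k for k in available if available[k] > 0]
    while remaining > 0 and open_keys:
        share = max(1, remaining // len(open_keys))
        for k in list(open_keys):
            # Clamp to the even share, the keyword's headroom, and what is left.
            take = min(share, available[k] - alloc[k], remaining)
            alloc[k] += take
            remaining -= take
            if alloc[k] == available[k]:
                open_keys.remove(k)
            if remaining == 0:
                break
    return alloc


# Made-up counts: one busy keyword and one sparse keyword.
example = balanced_allocation({"ivermectin": 50000, "paxlovid": 3000}, 10000)
```

In the example the sparse keyword is clamped at its 3,000 available tweets and the busy keyword absorbs the leftover, so the pull still reaches the requested 10,000.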

SBIRs

Sent Dr. J a note on Friday. No response yet
Work on the report and start to frame up the slide deck
I realize that I can change the module creation to produce multiple instances of the same child class in the generated bdmon code. It should be a minor change. Need to do that and verify that everything still works with the UI

Phil 8.4.2022

Book

More submission. Or is it prostration?

GPT Agents

Add an option to save or update an experiment as well
Test corpus size reached code so that we don’t pull more tweets per item than we should. Written but not tested

SBIRs

Work on the quarterly report
Reach out to Dr. J to schedule presentation? Still need to do this
RCSNN meeting with Aaron
Standup

Phil 8.3.2022

Still testing positive – it’s been 9-10 days. I feel fine though. And it turns out that my health insurance doesn’t cover antigen tests because of course it doesn’t

Book

Woke up early thinking about writing the next proposal for the MC Press. To do it right is going to be complex. Princeton wanted stuff that I already had in the template, so I sent that in instead

GPT Agents

Adding an experiment input file that will take a txt file with entries separated by CRs – done. Need to add an option to save or update an experiment as well
Add corpus size reached code so that we don’t pull more tweets per item than we should. Written but not tested
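A small sketch of both pieces, assuming the input file is plain text with one entry per line and that the pull loop asks for a fixed page size each time. The function names and parameters are hypothetical, not the ones in the actual code.

```python
from typing import List


def parse_entries(text: str) -> List[str]:
    """Split on any line ending (CR, LF, or CRLF) and drop blank entries."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def remaining_to_pull(corpus_size: int, pulled_so_far: int, page_size: int) -> int:
    """Number of items to request next without going past corpus_size."""
    return max(0, min(page_size, corpus_size - pulled_so_far))


entries = parse_entries("ivermectin\rpaxlovid\r\n\r\nmolnupiravir\n")
```

Using splitlines() rather than split("\n") means files saved with old Mac or Windows line endings parse the same way, and a return value of 0 from remaining_to_pull is the signal to stop pulling for that item.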

SBIRs

Work on the quarterly report – had a good meeting with Loren
Reach out to Dr. J to schedule presentation? Still need to do this
Is there still a JSC meeting? Nope. Did make a wishlist for data

Phil 8.2.2022

A Deeper Understanding of Deep Learning

Recent research has clarified that learning systems operate in an entirely different regime when they are highly overparameterized, such that more parameters let them generalize better. Moreover, this property is shared not just by neural networks but by more comprehensible methods, which makes more systematic analysis possible.

Book

Sent the proposal off to Yale. Thirteen houses to go and then I don’t know what to do. Probably ping Andreea? Also try to reach out to Gaia Vince
MIT next. Their first cull is a letter that’s a condensed version of the proposal – done

GPT Agents

More Tooltips
Meeting today. Good meeting! We’re going to look at drugs as a proxy. Easier than slang

SBIRs

Sprint planning. I think this is going to be a case of writing the quarterly report and getting the presentation ready and scheduled. Maybe a little coding around the borders
MDA catchup with Ron and Rukan. Need to give them writing assignments. Also Loren

Phil 8.1.2022

If you listen closely, you can feel fall coming

So I went to Spain on a two-week biking vacation with some riding buddies, which was a lot of fun (HOT – We went in the middle of the 2022 European Heat Wave). Getting home sucked though. The flight was delayed 14 hours, and I stood in many lines with an Airbus A300(?) full of people for 5 hours or so, and managed to catch some version of Covid. That was a cough, and then a fever, which got up to 100.5F.

I was able to do a telehealth visit and get a course of Paxlovid, which stopped the fever cold in about 8 hours. It’s a multi-day regimen, and your mouth tastes like metal, but on the whole, one of those remarkable medical feats.

However…

There is a thing known as Paxlovid rebound, where you test positive for Covid, even though you have no symptoms. That’s me at the moment. Eight days after initial symptoms, I’m back to testing positive. Which means at this point, the post-vacation headaches are now half the length of the actual vacation. And I still haven’t applied for the hotel refund from the airline.

I don’t have any lessons to offer, other than saying that international travel is a fraught thing. In the end, a little over half my group wound up with Covid. I’m just not sure that this kind of trip is worth it at this time.

Book

Submit to MIT press

GPT Agents

Look at the code and the TODO’s and figure out what’s next. There don’t seem to be any other than to add tooltips? This did not turn out to be straightforward, but as usual, there is a Stack Overflow solution
- Added tooltips to TweetCountsExplorer

SBIRs

Sprint review
Figure out stories for next sprint (Presentation for Dr. Asbury? Get IDE working with the Lambda box? Get more data from Loren, RCSNN)
Read this and see if it helps anything: Enhancing Backpropagation via Local Loss Optimization

Phil 7.28.2022

Relapsed a bit. Taking it easy today

When Maps Become the World

When Maps Become the World shows us how the scientific theories, models, and concepts we use to intervene in the world function as maps, and explores the consequences of this, both good and bad. We increasingly understand the world around us in terms of models, to the extent that we often take the models for reality. Winther explains how in time, our historical representations in science, in cartography, and in our stories about ourselves replace individual memories and become dominant social narratives—they become reality, and they can remake the world.

Book

Working on the UCP proposal – done!
Finding the next possible publisher

GPT Agents

Top2Vec is an algorithm for topic modeling and semantic search. It automatically detects topics present in text and generates jointly embedded topic, document and word vectors. Once you train the Top2Vec model you can:
- Get number of detected topics.
- Get topics.
- Get topic sizes.
- Get hierarchical topics.
- Search topics by keywords.
- Search documents by topic.
- Search documents by keywords.
- Find similar words.
- Find similar documents.
- Expose model with RESTful-Top2Vec
See the paper for more details on how it works.

Phil 7.27.2022

Feeling MUCH better. I took my first dose of Paxlovid around 5:00pm yesterday, with a fever of over 100F. Around 3:00am this morning the fever broke, and I pretty much feel back to normal, with a lingering cough. Might even go for an easy bike ride today!

Book

Figure out where I am with the deliverables and start going through the matrix to send out packages

SBIRs

Finish training
Meet with Aaron to get caught up
- JSC – paper was good. More progress to come?
- MDA – Lambda box is running – RayTune. Manage running and distribution across multiple GPUs
  - ATO? We need to get the files now?
  - Chat with James? He does want a presentation. Touch base with Clay for schedule
  - Is the DB running on the Lambda box?
  - ONE GUI is running
  - Local jobs run remotely using the IDE
  - SEG – a little behind on trajectory and FOM data. Still need to look at distinctly different holdout data
    - Need to introduce regularization
    - Attention with deep networks?
    - Encoder-decoder?
- RCSNN – Aaron is getting MiniAlphaStar working.
  - Starcraft II AI community is enormous.
- Tech conference – register today (done). Check email. Hotel?
  - Write abstract for teleoperation?
- MDBE – Working on new scenario for land, air and sea
- Reference implementation for SimAccel. COTS is not set up for batch. Steve is writing his own version

viztales

Dimension reduction, State, Orientation, and Speed
