Author Archives: pgfeldman

Phil 2.5.21

They Stormed the Capitol. Their Apps Tracked Them

While there were no names or phone numbers in the data, we were once again able to connect dozens of devices to their owners, tying anonymous locations back to names, home addresses, social networks and phone numbers of people in attendance. In one instance, three members of a single family were tracked in the data.

Book

2:00 Meeting with Michelle

GOES

Run pure pitch/roll/yaw maneuvers
Yaw – really nice

https://viztales.com/wp-content/uploads/2021/02/image-4.png

Roll – WTaF?

https://viztales.com/wp-content/uploads/2021/02/image-5.png

Pitch – ok?

https://viztales.com/wp-content/uploads/2021/02/image-6.png

11:00 Meeting with Vadim

GPT Agents

Install Ecco and try the intro (here in the video, also ranked choices, and token probability) locally, then message him to see if he’d like to discuss maps.
Have the code running locally (the NVIDIA update worked too!):

Messaged Jay Alamar on twitter. Working on setting up a video chat
Woohoo!

The IJCAI-21 summary reject phase has now ended and we are pleased to inform you that your submission #1558 "Navigating Human Language Models with Synthetic Agents" will enter the full-paper review phase.

3:30 Meeting

Phil 2.4.21

Book

Tweak the conclusions to add flocking and stampeding – closer to really done?

GOES

Finish adding script management – done!
There are issues though:

Adding a ManeuverCreator class that will produce pure pitch, roll, and yaw scripts
Status report – done!

JuryRoom

5:00 Meeting with Priyank

Phil 2.3.21

For Sunday: https://spicysouthernkitchen.com/honey-chipotle-chicken-crispers/

Book

Finished the zeroth? draft?
Not quite, still working on some TODOs

GOES

Verified that everything works – Yay!

, Bringing in the scriptreader

2:00 Meeting

NESDIS

Training – done!

MDS

10:00 Meeting

JuryRoom

5:00 Meeting

Phil 2.2.21

Can Small and Synthetic Benchmarks Drive Modeling Innovation? A Retrospective Study of Question Answering Modeling Approaches

Datasets are not only resources for training accurate, deployable systems, but are also benchmarks for developing new modeling approaches. While large, natural datasets are necessary for training accurate systems, are they necessary for driving modeling innovation? For example, while the popular SQuAD question answering benchmark has driven the development of new modeling approaches, could synthetic or smaller benchmarks have led to similar innovations?
This counterfactual question is impossible to answer, but we can study a necessary condition: the ability for a benchmark to recapitulate findings made on SQuAD. We conduct a retrospective study of 20 SQuAD modeling approaches, investigating how well 32 existing and synthesized benchmarks concur with SQuAD — i.e., do they rank the approaches similarly? We carefully construct small, targeted synthetic benchmarks that do not resemble natural language, yet have high concurrence with SQuAD, demonstrating that naturalness and size are not necessary for reflecting historical modeling improvements on SQuAD. Our results raise the intriguing possibility that small and carefully designed synthetic benchmarks may be useful for driving the development of new modeling approaches.

Book

Finished the Charles Dawson and Alfred Wegoner bits. Working on play

GOES

Finally got the sim working again! Now back to getting the scriptreader in

NESDIS

Got my account working!

GPT Agents

Meeting and working on probes

Phil 2.1.21

Book

Working on conclusions

GPT-2

Sent Antonio a note on research directions

GOES

Verify yaw flip works, and then add script reading

Phil 1.30.21

The monthly tolls:

https://public.flourish.studio/visualisation/4504138/

https://public.flourish.studio/visualisation/4303726/

Note that Georgia is more than twice as bad as the UK. About one out of every 30 people has died? Can that be right?

Phil 1.29.21

Looks like snow is coming!

Book:

More conclusions.
2:00 Meeting with Michelle

GOES

11:00 meeting with Vadim
I got to pull out my Game Programming Gems #2 for something that wasn’t really on the internet!
Status report

MDS

10:00 meeting with Clay

GPT Agents

3:30 Meeting

Phil 1.28.21

Called Jim Donnies. Best to drop off on Saturday between 9:00 – 12:00

Chocolate! – done!

Shopping

Checked by reality, some QAnon supporters seek a way out

More than a week after Donald Trump departed the White House, shattering their hopes that he would expose the worldwide cabal, some QAnon adherents have concocted ever more elaborate stories to keep their faith alive. But others like Smith are turning to therapy and online support groups to talk about the damage done when beliefs collide with reality.

How to Signal Trust in a Google Search

This study examined the problem of news distrust in a new way. We tested whether trust or distrust in a news outlet can be cued at the “domain level” — alongside search information about the news outlet — rather than beside or within a specific story. To do this, we used information about the news outlet — signals — that could be shown in a sidebar called a Knowledge Panel⁷ when people do a Google search for that news outlet. We conducted one experiment in the U.S. and Germany, and another in the U.S., Germany, and Brazil. We used multiple countries in order to determine how trust signals work across different cultures with varying levels of news trust.⁸ The Google News Initiative funded this research.

Impeachment and Deplatforming Aren’t Enough to Move Forward

In our forthcoming book, You Are Here, Ryan Milner and I explain the limitations of focusing on obvious sources of harm within the information ecosystem. To do this, we use biomass pyramids—diagrams that visually represent the cumulative weight and population size of various organisms within the same ecosystem. Apex predators—the lions, tigers, and bears—are at the very top of the pyramid. There are fewer of them, and as a result their cumulative weight is often relatively low, but they pose outsized threats to the rest of the ecosystem. At the same time, they are utterly dependent on all the lower strata. Take away their prey, take away their prey’s prey, take away the ecological conditions that sustain the biomass, and the lions and tigers and bears don’t have a chance.

Book:

More conclusions. I am not going in the direction that I thought when I started writing.

GOES

Verifying that everything still works after checking out Vadim’s changes. Nope, things are very wrong.
I think I fixed the physics step by making it the calculated time, but the ReactionWheelController needs to be updated to take the scaled_dclock values
Update TopController – not yet, I guess
2:00 customer meeting

MDS

Ok, back to figuring out intersections while I wait for Vadim

Phil 1.27.21

Ok, so we’re still in the middle of a pandemic, it looks like around 400 people might be charged with sedition for the Jan 6 attack on the Capitol, and the GOP doesn’t want to have anything to do with impeachment, but the crazy seems to be much lower.

Vaccines are rolling out, and we’ll probably be at closer to 200M vaccinations by April. The government appears to be doing things again. It feels like we are past the turning point in whatever metaphor you’d like to use here, and now entering the mopping up phase.

The Fairness Doctrine sounds a lot better than it actually was

What America needs instead is a creative, comprehensive effort by both the private sector and the government to disincentivize conspiracies and misinformation on the many platforms on which they flourish. Some social media companies have begun this work, clearing out QAnon sites and banning some far-right and White power users and communities who pose a threat. That work needs to continue, with careful attention to the biggest offenders who game algorithms and media structures to spread misinformation. But sources of misinformation also need to be demonetized, whether they are YouTube channels or national cable networks, and algorithms tweaked to slow down the spread of extreme content.

Book

More conclusions. I’m tempted to look at antisemitism through the dominance hierarchy lens. To show that if you truly feel that you are at the top of the hierarchy (just under your God), and use the Bible as the scaffolding for that understanding, then the fact that the same bible says that the Jews are chosen by God must really require some mental gymnastics. And for some, those gymnastics lead to genocide, since once the Jews are gone, then there is no one between you and God, and the rightful order is restored.
I think this is related to slavery and racism in the US. To be able to treat people as property requires that they be less than people, otherwise dominance makes no sense. The hierarchy of God -> slave holder -> slave justifies any dominance behavior. And in the end the mental gymnastics that are required for that are the same as those required by anti-Semites. These are both co-created fictional social realities that exist in the communities that believe themselves righteously superior to those they oppress. And as we now understand, that process of creating hierarchical social realities can often stampede to the worst possible places.

NOAA

SEVEN trainings by Feb 1! But I can’t get to my email, so that’s going nowhere

GOES

11:00 Meeting with Vadim. He has most of the changes in. Waiting for some final work and then I’ll start rolling in the new TopController

JuryRoom

5:00 Meeting

Phil 1.26.21

This is looking a lot like maps!

Last paper on @PhysRevX!

Survey data by @electionstudies shows that extreme opinions wrt different topics can be correlated. We propose a model that reproduces these polarized, ideological opinions, without assuming apriori such correlations or preexisting social structures 1/3 pic.twitter.com/VbJ08ibPTP
— Michele Starnini (@m_starnini) January 20, 2021

Paper at https://journals.aps.org/prx/abstract/10.1103/PhysRevX.11.011012

Book

Starting conclusions. Going at it from explore/exploit, and why it’s important to have primordial movement, which is the domain of youth

IRAD

Talked to Aaron about using the time for a state switch as part of the input vector, or alternatively doing something like a waterfall CNN-style input to account for time. Didn’t seem to get much traction.

GOES

10:00 Meeting with Vadim. I think we worked through the issues. 11:00 meeting tomorrow to discuss
Something’s wrong with ScriptReaderScratch – fixing

GPT Agents

3:30 meeting – talked about jalammar.github.io/hidden-states
Set up a spreadsheet for the paper notes
Twitter is opening up its full tweet archive to academic researchers for free

JuryRoom

5:30 HW review. Suggested that we needed a usability study for the experimenter’s as well

Phil 1.25.21

3:00 Dentist

JuryRoom

Finished doing Priyank’s analysis

MDS

Lots more math!
Working on parametric solutions to intersecting lines

GOES

11:00 All hands
2:00 Meeting with Vadim. Nope, now tomorrow at 10:00

Phil 1.22.21

VoterFraud2020

We are making publicly available VoterFraud2020, a multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets from 2.6M users that includes key phrases and hashtags related to voter fraud claims between October 23rd and December 16th. The dataset also includes the full set of links and YouTube videos shared in these tweets, with data about their spread in different Twitter sub-communities. Key takeaways from our initial analysis of the data are listed below.
The methodology for our data collection including the tracked keywords is detailed in the paper, “VoterFraud2020: a Multi-modal Dataset of Election Fraud Claims on Twitter”, available from arXiv.

Book

Continuing with Diversity injection – done!
2:00 Meeting with Michelle

MDS

Found another post on solving ballistics, with worked examples:
Going to try this with numpy.roots, which work exactly the way you would want them to do.
One shortcut did occur to me. I just need to determine when velocity is zero and use 2t to figure out what the range will be. I can then use that knowledge to place the source.
A little too cooked to work through the example

JuryRoom

Priyank’s task

GPT agents

Install Ecco and try the intro (here in the video, also ranked choices, and token probability) locally, then message him to see if he’d like to discuss maps
Need to put together the embedding files. It’s working! I need to clean the text and play with queries – done!
3:30 Meeting – nope, it was yesterday

ML Group

Add Ecco to presentation list – done

Phil 1.21.21

https://twitter.com/Atul_Gawande/status/1352238602156187648

Donate to MD food bank today

More work from Jay Alammar and language model saliency

Talk on YouTube
He’s put together a website: Ecco x .ix

Book

Continuing with Diversity injection

NESDIS

Training – done!

MDS

Adding ballistics (from NASA, at a nice intro level)
Working nicely:

GPT agents

Finish watching Alammar video (done), install and try the intro locally, then message him to see if he’d like to discuss maps
Need to put together the embedding files. It’s working! I need to clean the text and play with queries – done!

https://viztales.com/wp-content/uploads/2021/01/image-15.png

There does seem to be a bug with getting the search to work

Phil 1.20.21

Inauguration day!

Book

Not much time to work on it this morning, but added some text about Stephens’ neural coupling. Tomorrow I’ll talk about the agents a bit and in particular how increasing dimensions makes it harder to have a stampede

GPT Agents

Antonio submitted the paper!
Need to generate embedding files for the Tensorflow embedding projector

MDS

Started viewer. It’s sooo cute!

GOES

Working on the interpolation begin/end logic – done!

Phil 1.19.21

Feeling like the inauguration will go smoothly, but holding my breath anyway

Fixing disinformation won’t save us – Ethan Zuckerman

There have been countless fact-checking and other efforts designed to rid social media of misinformation. They’re not going to work until the party and the major ideological amplifiers start explicitly renouncing these points of view. The signs are not good – while Fox News was willing to declare that Joe Biden had won the election, they are still providing platforms for people denying the facts of the victory. And a majority of Republican representatives voted to overturn a democratic election. Until there are consequences for perpetuating those falsehoods, don’t count on changes to the media to solve this problem

The end of the Trump-Fox feedback loop

Twitter’s January 8 decision to permanently suspend Trump’s account closed a rare window into a president’s mindset and policymaking that we are unlikely to ever see again. For the past four years, I documented the sources of the president’s grievances and obsessions, matching Trump’s tweets to the television segments he was watching. The president’s TV addiction inspired at least 1,375 tweets dating back to September 1, 2018. The vast majority came in response to his favorite programs on the pro-Trump Fox News and Fox Business networks.

But if there ever was a coda for the Trump years, this has got to be it:

https://twitter.com/johnastoehr/status/1351607351275528192

https://jalammar.github.io/hidden-states/

In this article, we will focus on the hidden state as it evolves from model layer to the next. By looking at the hidden states produced by every transformer decoder block, we aim to gleam information about how a language model arrived at a specific output token. This method is explored by Voita et al.[1]. Nostalgebraist [2] presents compelling visual treatments showcasing the evolution of token rankings, logit scores, and softmax probabilities for the evolving hidden state through the various layers of the model.
[1]: The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives
[2]: interpreting GPT: the logit lens

Book

Start on diversity injection section
Research note: Examining false beliefs about voter fraud in the wake of the 2020 Presidential Election
- The 2020 U.S. Presidential Election saw an unprecedented number of false claims alleging election fraud and arguing that Donald Trump was the actual winner of the election. Here we report a survey exploring belief in these false claims that was conducted three days after Biden was declared the winner. We find that a majority of Trump voters in our sample – particularly those who were more politically knowledgeable and more closely following election news – falsely believed that election fraud was widespread, and that Trump won the election. Thus, false beliefs about the election are not merely a fringe phenomenon. We also find that Trump conceding or losing his legal challenges would likely lead a majority of Trump voters to accept Biden’s victory as legitimate, although 40% said they would continue to view Biden as illegitimate regardless. Finally, we found that levels of partisan spite and endorsement of violence were equivalent between Trump and Biden voters.

MDS

Meeting with Aaron today to discuss nest steps and how to combine with his project?
Still need to be able to access the VPN – more paperwork. Wheee!

GOES

Continue with the new TopController
Reading in and stepping through the script. Now I need to slew through the points and return a done when the l2 dist is within a threshold

GPT Agents

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale
- If you use the Hugging Face Trainer, as of transformers v4.2.0 you have the experimental support for DeepSpeed’s and FairScale’s ZeRO features. The new --sharded_ddp and --deepspeed command line Trainer arguments provide FairScale and DeepSpeed integration respectively. Here is the full documentation.
Send data – done!
3:30 Meeting
Finish paper and submit?
Need to export embeddings to tensorboard embedding viewer. Here’s how: tensorflow.org/tensorboard/tensorboard_projector_plugin