Monthly Archives: March 2019

Phil 3.14.19

ASRC AIMS 7:00 – 4:00, PhD ML, 4:30 –

More embedding.
- ConceptVector: Text Visual Analytics via Interactive Lexicon Building using Word Embedding
- Make sure that there is a vector for the words in the posts. Nope!
  
  Gensim wants sentences as arrays of words. I was giving it the whole sentence string. Now things look much better. And there is an oddly linear artifact:
- Build a word vector average for each post
  - Weighted by word frequency or normalized
  - Return angle between vectors
  - Build post x post matrix
Possible venue for the AI paper? 2019 IEEE International Symposium on Technologies for Homeland Security
JAX autodiff cookbook (Colab intro) (all colab publications)
Shimei’s group
- TF Dev conference updates and links
  - TF 2.0 (Blog) (Keras tweet course)
  - TF probability blog post (Video)
  - Swift and why it matters
  - TF Hub
  - Coral
  - TF Mesh snippet (Lincoln 1)(Lincoln 2)
  - TensorFlow (2.0) Serving with Docker — an end-to-end example!

Phil 3.13.19

7:00 – 5:00 ASRC AIMS

Some good posts on NLP, Attention, and BERT
- How Self-Attention with Relative Position Representations works
- How the Embedding Layers in BERT Were Implemented

SAv3.13

Got the db reading in and creating PostAnalyzer objects for each user by channel
Need to also create a PostAnalyzer that contains the entire set of runs. Since that crosses DBs, I think the best way to do this is to create a method that lets me load additional data into an existing instance
- Added load_data() method to PostAnalyzer. Seems to be working
- The GUI code was getting ugly with the analytics, so I did some refactoring and now have an MVC architecture and am happier
Create the master embedding – done!!!! The number of points seems low (98), but I’ll look at that tomorrow.
Compare user average vectors in a user x user matrix
Compare post average vectors in a post x post matrix
Missed the JuryRoom Skype last night. Aaron was there though. Need to catch up
- Quick notes for JuryRoom:
  - The votes should be for a posted response, not a yes/no to the original question
  - Groups should be able stick together if they want
  - Topics should be “threadable” for groups, with defined and randomized order
Steve S. Is going to read the paper and make suggestions
Here’s how you import into postgres: .\pg_restore.exe -h localhost -p 5433 -U postgres -d GEMSEC_logs -v “D:/Development/A2P/GEMSEC_logs/greatdb.backup”
Aaron’s blog is up!

Click to see trajectories through fashion space (paper)

Phil 3.12.19

7:00 – 4:00 ASRC PhD

Regression with Probabilistic Layers in TensorFlow Probability
- At the 2019 TensorFlow Dev Summit, we announced Probabilistic Layers in TensorFlow Probability (TFP). Here, we demonstrate in more detail how to use TFP layers to manage the uncertainty inherent in regression predictions.
Calculate the cosine similarity between all posts and populate a matrix to view and analyze. See if BERT makes sense for this or start with Word2Vec?
- Looking at my Word2Vec notes as a place to get started, since all the embedding looks about the same
  - Put W2V in a class
  - Started reading the db and creating a postanalyzer for each user in each dungeon. Right now, that’s easy because the user names are the same. Fixed that to be a combination of the channel and the user
- TF Embedding models (https://tfhub.dev/s?module-type=text-embedding)
- Elmo model in TF (https://tfhub.dev/google/elmo/2)
  - How to use Elmo Embeddings (Word Vectors, Sentence Vectors)
- Elmo looks promising. Here’s a tutorial (https://github.com/PrashantRanjan09/Elmo-Tutorial)
Map can be a combination of height and color, sort of like the clustering work
More work on iConf slides/presentation
Does embedding make sense for log files?
- Three Things We Learned About Applying Word Vectors to Computer Logs
- Experience Report: Log Mining using Natural Language Processing and Application to Anomaly Detection
- DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning
- Wrote up a proposal for embedding to analyze log files

Phil 3.11.19

7:00 – 10:00 ASRC PhD. Fun, long day.

Understanding BERT Transformer: Attention isn’t all you need
Word Vectors and NLP Modeling from BoW to BERT
- Since the advent of word2vec, neural word embeddings have become a go to method for encapsulating distributional semantics in text applications. This series will review the strengths and weaknesses of using pre-trained word embeddings and demonstrate how to incorporate more complex semantic representation schemes such as Semantic Role Labeling, Abstract Meaning Representation and Semantic Dependency Parsing into your applications.
Artificial Intelligence and Global Security Initiative Research Agenda
- The Center for a New American Security’s Artificial Intelligence and Global Security Initiative explores these and other issues surrounding the AI revolution. Current AI technology is powerful, but also has a number of vulnerabilities, including susceptibility to spoofing (false data) and control problems. An arms race in AI where nations and other actors rush to use this technology for their advantage without any concern for safety would be harmful to everyone. It is vitally important for the technology and policy communities to come together to better understand the implications of the AI revolution for global security and how best to navigate the challenges ahead.
One more pass through Antonio’s paper this evening – done
Working on getting the Slack chats into the database. It turns out that there can be threaded discussions within channels: `thread_ts`, `reply_count`, `reply_users_count`, `latest_reply`, `reply_users`, `replies` are the variables. It’s not critical now, but it would be nice to read these in as well.
Added encoding=”utf8″ to the read statements
We are over 10,000 rows!
And it looks like the Google Keras team is going to run the dungeon
Starting on SequenceAnalyzer. Not bat progress for a day
Meeting with Wayne
- “Slaying Monsters for Science”
  - http://science.sciencemag.org/content/320/5883/1592.3
- A few from that infamous conference…
  - https://www.amazon.com/Life-Night-Elf-Priest-Anthropological/dp/0472050982/ref=sr_1_2?keywords=bonnie+nardi&qid=1552351901&s=gateway&sr=8-2
- and the role of immersive play in rethinking modern religious expression….

Phil 3.10.19

Learning to Speak and Act in a Fantasy Text Adventure Game

We introduce a large scale crowdsourced text adventure game as a research platform for studying grounded dialogue. In it, agents can perceive, emote, and act whilst conducting dialogue with other agents. Models and humans can both act as characters within the game. We describe the results of training state-of-the-art generative and retrieval models in this setting. We show that in addition to using past dialogue, these models are able to effectively use the state of the underlying world to condition their predictions. In particular, we show that grounding on the details of the local environment, including location descriptions, and the objects (and their affordances) and characters (and their previous actions) present within it allows better predictions of agent behavior and dialogue. We analyze the ingredients necessary for successful grounding in this setting, and how each of these factors relate to agents that can talk and act successfully.

New run in the dungeon. Exciting!

Finished my pass through Antonio’s paper

Zoe Keating (May 1) or Imogen Heap (May 3)?

Phil 3.9.19

Understanding China’s AI Strategy

In my interactions with Chinese government officials, they demonstrated remarkably keen understanding of the issues surrounding AI and international security. It is clear that China’s government views AI as a high strategic priority and is devoting the required resources to cultivate AI expertise and strategic thinking among its national security community. This includes knowledge of U.S. AI policy discussions. I believe it is vital that the U.S. policymaking community similarly prioritize cultivating expertise and understanding of AI developments in China.

Russian Trolls Shift Strategy to Disrupt U.S. Election in 2020

Russian internet trolls appear to be shifting strategy in their efforts to disrupt the 2020 U.S. elections, promoting politically divisive messages through phony social media accounts instead of creating propaganda themselves, cybersecurity experts say.

Backup phone

Work on SASO paper – started

Rachel’s dungeon run is tomorrow! Maybe cross 10,000 posts?

Look at using BERT and the full Word2Vec model for analyzing posts

The Promise of Hierarchical Reinforcement Learning

To really understand the need for a hierarchical structure in the learning algorithm and in order to make the bridge between RL and HRL, we need to remember what we are trying to solve: MDPs. HRL methods learn a policy made up of multiple layers, each of which is responsible for control at a different level of temporal abstraction. Indeed, the key innovation of the HRL is to extend the set of available actions so that the agent can now choose to perform not only elementary actions, but also macro-actions, i.e. sequences of lower-level actions. Hence, with actions that are extended over time, we must take into account the time elapsed between decision-making moments. Luckily, MDP planning and learning algorithms can easily be extended to accommodate HRL.

Phil 3.7.19

Day 2 of the TF Dev summit. Worth the money, though much less research-y and more implementation and production-y

Google Cloud has Fedramp certification, which it does see details here.

Live Transcribe

Coral: On Device Transfer learning (paper)

TF 2.0 API \changes and Behavior changes

Best practices (link: )
Declare variables at the beginning of the code
Keras Functional API
- The Keras functional API is the way to go for defining complex models, such as multi-output models, directed acyclic graphs, or models with shared layers.
Autograd can automatically differentiate native Python and Numpy code. It can handle a large subset of Python’s features, including loops, ifs, recursion and closures, and it can even take derivatives of derivatives of derivatives. It supports reverse-mode differentiation (a.k.a. backpropagation), which means it can efficiently take gradients of scalar-valued functions with respect to array-valued arguments, as well as forward-mode differentiation, and the two can be composed arbitrarily. The main intended application of Autograd is gradient-based optimization. For more information, check out the tutorial and the examples directory.
JAX is Autograd and XLA, brought together for high-performance machine learning research. With its updated version of Autograd, JAX can automatically differentiate native Python and NumPy functions. It can differentiate through loops, branches, recursion, and closures, and it can take derivatives of derivatives of derivatives. It supports reverse-mode differentiation (a.k.a. backpropagation) via grad as well as forward-mode differentiation, and the two can be composed arbitrarily to any order.
Effective TF 2.0: There are multiple changes in TensorFlow 2.0 to make TensorFlow users more productive. TensorFlow 2.0 removes redundant APIs, makes APIs more consistent (Unified RNNs, Unified Optimizers), and better integrates with the Python runtime with Eager execution.

Phil 3.6.19

5:00 – ASRC TL

Got a lot done on the BAA on the flight yesterday
Wrote up a description of LMN and CM for Eric V.
Reading more of the Handbook of Latent Semantic Analysis. It’s giving me some good ideas for calculating similarities of posts using Word2Vec and comparing the average vector for each post
Antonio got an extension to the 12th. Need to see what he’s up to. Wow, there’s a lot there now. Made some comments about what I’d like to see. I’ll pull down the document to read later
Continued to tweak the slides
TF Dev conference main sessions today. Breakouts tomorrow.

Phil 3.5.19

Travelling to the Tensorflow developer’s conference

Started on my iConf talk

More BAA while flying

Phil 3.4.19

7:00 – 5:00 ASRC

Build an interactive SequenceAnalyzer. The adjustments are
- Number of buckets
- Percentages for each analytic (percentages to keep/discard
- Selectable skip words that can be added to a list (in the db?)
Algorithm
1. Find the most common words across all groups, these are skip_words
2. Find the most common words along the entire series of posts per player and eliminate them
3. Find the most common/central words across all sequences and keep those as belief places
4. For each sequence by group, find the most common/central words after the belief places. These are the belief spaces.
5. Build an adjacency matrix of players, groups, places and spaces
6. Build submatrices for centrality calculations? This could be rather than finding the most common
7. Possible word2vec variations?
  1. It seems to me that I might be able to use direction cosines and dynamic time warping to calculate the similarity of posts and align them better than the overall scaling that I’m doing now. DM posts introducing a room should align perfectly, and then other scaling could happen between those areas of greatest alignment
Display
- Menu:
  - Save spreadsheet (includes config, included words, posts(?), trajectories)
  - load data
  - select database
  - select group within db
  - load/save config file
  - clear all
- Fields
  - percent for A1, A2, A3, A4
  - Centrality/Sum switch
  - BOW/TF-IDF switch
  - Word2vec switch?
- Textarea (areas? tabbed?)
  - Table with rows as sequence step. Columns are grouped by places, spaces, groups, and players
- Work on Antonio’s paper got a first draft on introduction and motivation
- BAA
  - Upload latex and references to laptop
- Haircut! Pack!
- Model-Based Reinforcement Learning for Atari
  - Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction — substantially more, in fact, than a human would need to learn the same games. How can people learn so quickly? Part of the answer may be that people can learn how the game works and predict which actions will lead to desirable outcomes. In this paper, we explore how video prediction models can similarly enable agents to solve Atari games with orders of magnitude fewer interactions than model-free methods. We describe Simulated Policy Learning (SimPLe), a complete model-based deep RL algorithm based on video prediction models and present a comparison of several model architectures, including a novel architecture that yields the best results in our setting. Our experiments evaluate SimPLe on a range of Atari games and achieve competitive results with only 100K interactions between the agent and the environment (400K frames), which corresponds to about two hours of real-time play.

Phil 3.3.19

Once more, icky weather makes me productive

Ingested all the runs into the db. We are at 7,246 posts
Reworking the 5 bucket analysis
Building better ignore files and rebuilding bucket spreadsheets. It tuns out that for tymora1, names took up 25% of the BOW, so I increased the fraction saved to the trimmed spreadsheets to 50%
Building bucket spreadsheets and saving the centrality vector
Here’s what I’ve got so far:
Trajectories:
First map:
Here it is annotated:
Some thoughts. I think this is still “zoomed out” too far. Changing the granularity should help some. I need to automate some of my tools though. The other issue is how I’m assembling my sequences.

Phil 3.2.19

Updating SheetToMap to take comma separated cell names. Lines 180 – 193. I think I’ll need an iterating compare function. Nope, wound up doing something simpler

for (String colName : colNames) {
    String curCells = tm.get(colName);
    String[] cellArray = curCells.split("\\|\\|"); <--- new!
    for(String curCell : cellArray) {
        addNode(curCell, rowName);
        if (prevCell != null && !curCell.equals(prevCell)) {
            String edgeName = curCell + "+" + prevCell;
            if (graph.getEdge(edgeName) == null) {
                try {
                    graph.addEdge(edgeName, curCell, prevCell);
                    System.out.println("adding edge [" + edgeName + "]");
                } catch (EdgeRejectedException e) {
                    System.out.println("didn't add edge [" + edgeName + "]");
                }
            }
        }
        prevCell = curCell;
    }

    //System.out.print(curCell + ", ");
    prevCell = cellArray[0];
    col++;
}

Updating GPM to generate comma separated cell names in trajectories

need to get the previous n cell names
Need to change the cellName val in FlockingBeliefCA to be a stack of tail length. Done.
Parsed the strings in SheetToMap. Each cell has a root name (the first) which connects to the roots of the previous cell. The root then links to the subsequent names in the chain of names that are separated by “||”
```
"cell_[4, 5]||cell_[4, 4]||cell_[4, 3]||cell_[4, 2]||cell_[4, 1]"
```
Seems to be working:

Phil 3.1.19

7:00 – ASRC

Got accepted to the TF dev conference. The flight out is expensive… Sent Eric V. a note asking for permission to go, but bought tix anyway given the short fuse
Downloaded the full slack data
Working on white paper. The single file was getting unwieldy, so I broke it up
Found Speeding up Parliamentary Decision Making for Cyber Counter-Attack, which argues for the possibility of pre-authorizing automated response
Up to six pages. IN the middle of the cyberdefense section

viztales

Dimension reduction, State, Orientation, and Speed