Category Archives: research

Phil 8.8.18

7:00 – 4:00 ASRC MKT

  • Oh, look, a new Tensorflow (1.10). Time to break things. I like the BigTable integration though.
  • Learning Meaning in Natural Language Processing — A Discussion
    • Last week a tweet by Jacob Andreas triggered a huge discussion on Twitter that many people have called the meaning/semantics mega-thread. Twitter is a great medium for having such a discussion, replying to any comment allows to revive the debate from the most promising point when it’s stuck in a dead-end. Unfortunately Twitter also makes the discussion very hard to read afterwards so I made three entry points to explore this fascinating mega-thread:

      1. a summary of the discussion that you will find below,
      2. an interactive view to explore the trees of tweets, and
      3. commented map to get an overview of the main points discussed:
  • The Current Best of Universal Word Embeddings and Sentence Embeddings
    • This post is thus a brief primer on the current state-of-the-art in Universal Word and Sentence Embeddings, detailing a few

      • strong/fast baselines: FastText, Bag-of-Words
      • state-of-the-art models: ELMo, Skip-Thoughts, Quick-Thoughts, InferSent, MILA/MSR’s General Purpose Sentence Representations & Google’s Universal Sentence Encoder.

      If you want some background on what happened before 2017 😀, I recommend the nice post on word embeddings that Sebastian wrote last year and his intro posts.

  • Treeverse is a browser extension for navigating burgeoning Twitter conversations. right_pane
  • Detecting computer-generated random responding in questionnaire-based data: A comparison of seven indices
    • With the development of online data collection and instruments such as Amazon’s Mechanical Turk (MTurk), the appearance of malicious software that generates responses to surveys in order to earn money represents a major issue, for both economic and scientific reasons. Indeed, even if paying one respondent to complete one questionnaire represents a very small cost, the multiplication of botnets providing invalid response sets may ultimately reduce study validity while increasing research costs. Several techniques have been proposed thus far to detect problematic human response sets, but little research has been undertaken to test the extent to which they actually detect nonhuman response sets. Thus, we proposed to conduct an empirical comparison of these indices. Assuming that most botnet programs are based on random uniform distributions of responses, we present and compare seven indices in this study to detect nonhuman response sets. A sample of 1,967 human respondents was mixed with different percentages (i.e., from 5% to 50%) of simulated random response sets. Three of the seven indices (i.e., response coherence, Mahalanobis distance, and person–total correlation) appear to be the best estimators for detecting nonhuman response sets. Given that two of those indices—Mahalanobis distance and person–total correlation—are calculated easily, every researcher working with online questionnaires could use them to screen for the presence of such invalid data.
  • Continuing to work on SASO slides – close to done. Got a lot of adversarial herding FB examples from the House Permanent Committee on Intelligence. Need to add them to the slide. Sobering.
  • And this looks like a FANTASTIC ride out of Trento: ridewithgps.com/routes/27552411
  • Fixed the border menu so that it’s a toggle group

Phil 8.7.18

8:00 – ASRC MKT

  • Looking for discussion transcripts.
  • Podcasts
    • Do you get your heart broken by the Nationals, Wizards, Caps and Redskins every single year but you still come back for more? The DMV Sports Roundtable is the podcast for you – Washington’s sports teams from the fans’ perspective – and plenty of college coverage too.
    • Join UCB Theatre veterans Cody Lindquist & Charlie Todd as they welcome a panel of NYC’s most hilarious comedians, journalists, and politicians to chug two beers on stage and discuss the politics of the week. It’s like Meet The Press, but funnier and with more alcohol. Theme song by Tyler Walker.
    • Rasslin Roundtable: Wrestling podcast centered around the latest PPV
    • TSN 1290 Roundtable: Kevin Olszewski hosts the Donvito Roundtable, airing weekdays from 11am-1pm CT on TSN 1290 Winnipeg. Daily discussion about the Winnipeg Jets, the NHL, and whatever else is on his mind!
    • The Game Design Round Table Focusing on both digital and tabletop gaming, The Game Design Round Table provides a forum for conversation about critical issues to game design.
    • Story Works Round Table Before you can be a successful author, you have to write a great story. Each week, co-hosts, Alida Winternheimer, author and writing coach at Word Essential, Kathryn Arnold, emerging writer, & Robert Scanlon, author of the Blood Empire series, have conversations about the craft of writing fiction. They bring diverse experiences and talents to the table from both the traditional and indie worlds. Our goal is for each episode to be a fun, lively discussion of some aspect of story craft that that enlightens, as well as entertains.
  • Some good pix of bike-share graveyards in China that would be good stampede pix from The Atlantic (set 1) (set 2) Bicycles of various bike-sharing services are seen in Shanghai.
  • Starting back on the SASO slides. Based on Wayne’s comments, I’m reworking the Stephens’ slide
    • Flashes of Insight: Whole-Brain Imaging of Neural Activity in the Zebrafish (video)(paper)(paper)

Phil 7.1.18

On vacation, but oddly enough, I’m back on my morning schedule, so here I am in Bormio, Italy at 4:30 am.

I forgot my HDMI adaptor for the laptop. Need to order one and have it delivered to Zurich – Hmmm. Can’t seem to get it delivered from Amazon to a hotel. Will have to buy in Zurich

Need to add Gamerfate to the lit review timeline to show where I started to get interested in the problem – tried it but didn’t like it. I’d have to redo the timeline and I’m not sure I have the excel file

Add vacation pictures to slides – done!

Some random thoughts

  • When using the belief space example of the table, note that if we sum up all the discussions about tables, we would be able to build a pretty god map of what matters to people with regards to tables
  • Manifold learning is what intelligent systems do as a way of determining relationships between things (see curse of dimensionality). As groups of individuals, we need to coordinate our manifold learning activities so that we can us the power of group cognition. When looking at how manifold learning schemes like t-sne and particularly embedding systems such as word2vec create their own unique embeddings, it becomes clear that our machines are not yet engaged in group cognition, except in the simplest way of re-using trained networks and copied hyperparameters. This is very prone to stampedes
  • In conversation at dinner, Mike M mentioned that he’d like a language app that is able to indicate the centrality of a term an order that list so that it’s possible to learn a language in a “prioritized” way that can be context-dependent. I think that LMN with a few tweaks could do that.

Continuing the Evolution of Cooperation. A thing that strikes me is that once a TIT FOR TAT successfully takes over, then it becomes computationally easier to ALWAYS COOPERATE. That could evolve to become dominant and be completely vulnerable to ALWAYS DEFECT

Phil 6.27.18

7:00 – 12:00 ASRC MKT

  • Print out documents! Done. Got passport drive too.
  • Need to write an extractor that lets the user navigate the xml file containing influences of selected agents. This could be a sample-by sample network. Maybe two modes?
    • Select an agent and see all the other agents come in and out of influcene
    • Select an number of agents and only watch the mutual influence.
    • There is an integrated JavaFX charts that I could use, or it could be an uploaded webapp? JavaFX would be easier in the short term, but a webapp would help more with JuryRoom…
    • Another option would be Python, since that’s where the LSTM code will live.
    • On the whole, two days before leaving on travel is probably the wrong time to start coding
  • Fixed a bug in the xml file generation
  • copied the new jar file onto the thumb drive
  • copied the xml file onto the thumb drive

12:00 – 4:00 ASRC A2P

  • Pomoting things to QA – done! Or at least, up to date with the excel files

Phil 6.26.18

7:00 – 5:00 ASRC MKT

  • Started back with the Evolution of Cooperation
  • Social loafing (Scholar results)
    • In social psychologysocial loafing is the phenomenon of a person exerting less effort to achieve a goal when they work in a group than when they work alone. This is seen as one of the main reasons groups are sometimes less productive than the combined performance of their members working as individuals, but should be distinguished from the accidental coordination problems that groups sometimes experience. Research on social loafing began with rope pulling experiments by Ringelmann, who found that members of a group tended to exert less effort in pulling a rope than did individuals alone. In more recent research, studies involving modern technology, such as online and distributed groups, have also shown clear evidence of social loafing. Many of the causes of social loafing stem from an individual feeling that his or her effort will not matter to the group.
  • NELA2017 contains almost every news article from 92 sources between April 2017 and October 2017, amounting to over 136K articles. This data set is the first release of NELA datasets. This version of the data set can be found on github and a full description and use cases can be found in our 2018 ICWSM paper.
  • Submitted “One Simple Trick” final to SASO
  • Updated ArXive
  • Fixed a bug that prevented population interactions in FlockingAgentManager.initializeAgents():
                // add to the global list
                allBoidsList.add(fs);
    
                // add a pointer to the global list to each shape
                fs.setFlockingShapeList(allBoidsList);
    
                // Add to the flock so that we can get flock headings
                List flock = flockListsMap.get(flockName);
                flock.add(fs);

    Seriously, what was I thinking?

  • Continued GUI tweaking. I think it looks pretty good, and it fits (mostly) on my laptop Version6.26.18
  • Verified that the influences record agents from different flocks and sources.
  • Copied all CI 2018 things I can think of onto the thumb drive

Phil 6.22.18

7:00 – 5:30 ASRC MKT

  • Twitter experiment on a fake Gary Indiana secession. IFTTT retweeting leads to interesting behavior.
  • Fixed FlockingShape casting by adding a customDrawStep(GraphicsContext gc) to the SmartShape base class that’s called from draw().
  • Add records to each agent that store a list of source and agent influences at each time sample. It should include the name of the item and the amount of influence. Probably save as an XML file, since it has too many dimensions. The file could then be used to create terms or spreadsheets.
    • Started on CAInfluence class which will be added to CA classes in an arrayList in BaseCA;
  • More file conversion with Bob – and everything worked great until I try one after Bob leaves. Ka-BOOM!
    • Installed all the packages to get everything to run in the debugger. Found what appears to be a perfectly good line “range” that causes the problem? Will start debugging on Wednesday.
  • Project MERCATOR proposal
  • Meeting with Sy

Phil 6.21.18

7:00 – 4:00 ASRC MKT

  • Add an attractor scalar for agents that’s normally zero. A vector to each agent within the SIH is calculated and scaled by the attractor scalar. That vector is then added to the direction vector to the agent – done
  • Remove the heading influence based on site – done
  • Add a white circle to the center of the agent that is the size of the attraction scalar. Done
  • Add attraction radius slider that is independent of the SIH. -done
  • Add a ‘site trajectory’ to the spreadsheet that will have the site lists (and their percentage?)
  • There is now an opportunity for a poster and a demo at SASO
  • Add stories, lists and maps to implication slides – done
  • Got all my connections set up
  • Successfully converted and deployed cosmos-2
  • Voted!

Phil 6.20.18

7:00 – 9:00 2:00 – 5:00 ASRC MKT

  • Redo doodle for all of August – done
  • Schooling Fish May Offer Insights Into Networked Neurons
    • Iain Couzin is deciphering the rules that govern group behavior. The results might provide a fresh perspective on how networks of neurons work together.
  • City arts and lectures: The New Science Of Psychedelics With Michael Pollan
    • Psychedelics reduce the section of the brain that have to do with the sense of self. Pollan thinks that this also happens with certain types of rhythmic music and in crowd situations. This could be related to stampedes and flocking.
    • LSD May Chip Away at the Brain’s “Sense of Self” Network
      • Brain imaging suggests LSD’s consciousness-altering traits may work by hindering some brain networks and boosting overall connectivity
  • Add an attractor scalar for agents that’s normally zero. A vector to each agent within the SIH is calculated and scaled by the attractor scalar. That vector is then added to the direction vector to the agent – done?
  • Remove the heading influence based on site – done
  • Add a white circle to the center of the agent that is the size of the attraction scalar. Done
  • Add a ‘site trajectory’ to the spreadsheet that will have the site lists (and their percentage?)
  • Worked on A2P white paper with Aaron.
  • Worked on a response to Dr. Li’s response

ASRC IRAD 9:00 – 2:00

  • Mind meld with Bob
    • Revisit Yarn
    • Excel stuff?
    • Connect to AWS using bastion. Look in FoxyProxy how to. I need certs
    • Drop on rabbit to deploy to CI and QA and NESDIS  ONE (production)
    • Don’t want sensitive information in Git. We use sharepoint instead
    • Notes and screenshots in document.

Phil 5.17.18

7:00 – 4:00 ASRC MKT

  • How artificial intelligence is changing science – This page contains pointers to a bunch of interesting projects:
  • Multi-view Discriminative Learning via Joint Non-negative Matrix Factorization
    • Multi-view learning attempts to generate a classifier with a better performance by exploiting relationship among multiple views. Existing approaches often focus on learning the consistency and/or complementarity among different views. However, not all consistent or complementary information is useful for learning, instead, only class-specific discriminative information is essential. In this paper, we propose a new robust multi-view learning algorithm, called DICS, by exploring the Discriminative and non-discriminative Information existing in Common and view-Specific parts among different views via joint non-negative matrix factorization. The basic idea is to learn a latent common subspace and view-specific subspaces, and more importantly, discriminative and non-discriminative information from all subspaces are further extracted to support a better classification. Empirical extensive experiments on seven real-world data sets have demonstrated the effectiveness of DICS, and show its superiority over many state-of-the-art algorithms.
  • Add Nomadic, Flocking, and Stampede to terms. And a bunch more
  • Slides
  • Embedding navigation
    • Extend SmartShape to SourceShape. It should be a stripped down version of FlockingShape
    • Extend BaseCA to SourceCA, again, it should be a stripped down version of FlockingBeliefCA
    • Add a sourceShapeList for FlockingAgentManager that then passes that to the FlockingShapes
  • And it’s working! Well, drawing. Next is the interactions: Influence
  • Finally went and joined the IEEE

Phil 5.15.18

7:00 – 4:00 ASRC MKT

Phil 5.14.18

7:00 – 3:00 ASRC MKT

    • Working on Zurich Travel. Ricardo is getting tix, and I got a response back from the conference on an extended stay
    • Continue with slides
    • See if there is a binary embedding reader in Java? Nope. Maybe in ml4j, but it’s easier to just write out the file in the format that I want
    • Done with the writer: Vim
  • Fika
  • Finished Simulacra and Simulation. So very, very French. From my perspective, there are so many different lines of thought coming out of the work that I can’t nail down anything definitive.
  • Started The Evolution of Cooperation

Phil 5.8.18

7:00 – 5:00 ASRC MKT

5:00 – 8:00 ASRC Tech Conference