Author Archives: pgfeldman

Phil 11.17.16

7:00 – 10:00, 10:30 – 5:30 ASRC

Tenure review meeting at 10:00? Show up and see, I guess
Continuing Opinion Dynamics With Decaying Confidence: Application to Community Detection in Graphs. Details here.
We prove, under some conditions, the existence of a solution to the system dynamics, convergence to clusters, and a non-trivial lower bound on the distance between clusters. Huh. So bubbles must exist at a certain minimum information distance from each other??
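To get a feel for the mechanism, here is a minimal sketch of bounded-confidence opinion dynamics where the confidence bound shrinks geometrically over time. It is a simplified illustration, not the paper's exact formulation: the path graph, the starting opinions, the neighbor-averaging rule, and the values of `R`, `rho`, and `alpha` are all assumptions made for the example.

```python
# Simplified sketch of opinion dynamics with a decaying confidence bound.
# Assumed for illustration: the update rule, the graph, and all parameter values.

def step(opinions, edges, bound, alpha):
    """One synchronous update: each agent moves toward the graph neighbors
    whose opinion lies within `bound` of its own."""
    new = list(opinions)
    for i, x_i in enumerate(opinions):
        close = [opinions[j] for j in edges[i] if abs(opinions[j] - x_i) <= bound]
        if close:
            mean_gap = sum(x_j - x_i for x_j in close) / len(close)
            new[i] = x_i + alpha * mean_gap
    return new

def simulate(opinions, edges, R=0.3, rho=0.9, alpha=0.5, steps=200):
    """Run the dynamics with confidence bound R * rho**t at time t."""
    for t in range(steps):
        opinions = step(opinions, edges, R * rho ** t, alpha)
    return opinions

def clusters(opinions, tol=1e-3):
    """Group sorted final opinions that lie within `tol` of each other."""
    groups = []
    for x in sorted(opinions):
        if groups and x - groups[-1][-1] <= tol:
            groups[-1].append(x)
        else:
            groups.append([x])
    return groups

# Path graph of six agents: two tight groups separated by a large gap.
edges = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}
start = [0.0, 0.1, 0.2, 0.8, 0.9, 1.0]
final = simulate(start, edges)
groups = clusters(final)
print(len(groups), [round(g[0], 3) for g in groups])
```

In this toy run agents 2 and 3 are connected in the graph but start farther apart than the bound, so the two groups never merge and the gap between them persists.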
Inverse matrices are needed because we can’t divide by a matrix, but we can multiply by its inverse (the matrix analogue of a reciprocal), provided the matrix is invertible.
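A small worked example of "multiplying by the inverse instead of dividing": solving A x = b as x = A⁻¹ b for a 2×2 matrix. The matrix, the vector, and the helper names `inverse_2x2` and `mat_vec` are made up for illustration.

```python
from fractions import Fraction

def inverse_2x2(m):
    """Inverse of a 2x2 matrix via the adjugate divided by the determinant."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular, so it has no inverse")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]

def mat_vec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

A = [[2, 1], [5, 3]]   # illustrative values, determinant is 1
b = [1, 2]
x = mat_vec(inverse_2x2(A), b)   # "divide" b by A by multiplying with A's inverse
print(x)
```

Multiplying `A` by the resulting `x` gives back `b`, which is the check that the "division" worked.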
Lemma – a subsidiary or intermediate theorem in an argument or proof.
(From Wikipedia) In measure theory, the Lebesgue measure, named after French mathematician Henri Lebesgue, is the standard way of assigning a measure to subsets of n-dimensional Euclidean space. For n = 1, 2, or 3, it coincides with the standard measure of length, area, or volume. In general, it is also called n-dimensional volume, n-volume, or simply volume. It is used throughout real analysis, in particular to define Lebesgue integration. Sets that can be assigned a Lebesgue measure are called Lebesgue measurable; the measure of the Lebesgue measurable set A is denoted by λ(A).
The backslash is the set theory equivalent of subtraction (set difference), i.e., $A \setminus B = \{a \in A \mid a \notin B\}$.
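The same idea in code, as a quick sanity check: Python's `-` operator on sets is the set difference, and it matches the definition written out as a comprehension. The two sets are arbitrary example values.

```python
A = {1, 2, 3, 4}
B = {3, 4, 5}

difference = A - B                            # built-in set difference
by_definition = {a for a in A if a not in B}  # elements of A that are not in B

print(difference == by_definition)
```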
Group doc on how to fix fake news
Back to working through NMF.
Looks like we’ll watch videos tomorrow morning

Phil 11.16.16

7:00 – 4:00 ASRC

Continuing Opinion Dynamics With Decaying Confidence: Application to Community Detection in Graphs. My notes are here.
Clean data and apis from propublica
Ulrich Krause
Professor of Mathematics, Bremen University
Positive dynamical systems, opinion dynamics, algebra
- Opinion dynamics and bounded confidence models, analysis, and simulation – looks incredibly clear and helpful.
- Opinion dynamics under the influence of radical groups, charismatic leaders, and other constant signals: A simple unifying model – derived from the above.
From Is there negative social influence? Disentangling effects of dissimilarity and disliking on opinion shifts
- This finding implies for models of opinion dynamics that a complex non-linear social influence function might be unnecessary to characterize the relationship between similarity and opinion change. Our results suggest that not only for the sake of simplicity, but also for the sake of realism, model builders should be cautioned against resorting too readily to a more complex assumption than a simple linear influence function.
I need to add recursion to the QueryComponent (content list and child list) and work through the combinations that way and get rid of the lists. Done! Had an issue with the BufferedWriter not flushing.
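A rough sketch of the recursive idea, written in Python for brevity rather than in the project's own code. Only the name `QueryComponent` and its two lists come from the notes above; the `combinations` method, the rule of taking one term per node plus one combination per child, and the sample terms are assumptions for illustration.

```python
from itertools import product

class QueryComponent:
    """A node holding a content list and a child list (hypothetical layout)."""

    def __init__(self, content, children=None):
        self.content = list(content)
        self.children = list(children or [])

    def combinations(self):
        """Return every query string: one term from this node followed by
        one combination from each child, built recursively."""
        child_sets = [child.combinations() for child in self.children]
        results = []
        for term in self.content:
            # product() of zero child sets yields one empty tuple, so leaves
            # simply return their own terms.
            for picks in product(*child_sets):
                results.append(" ".join((term,) + picks))
        return results

# Made-up sample terms, not the real crawl payload.
root = QueryComponent(
    ["doctor", "physician"],
    [QueryComponent(["malpractice", "license"]), QueryComponent(["Maryland"])],
)
queries = root.combinations()
for q in queries:
    print(q)
```

On the flushing issue: in Java, closing a `BufferedWriter` (for example with try-with-resources) flushes it, which avoids losing the tail of the output.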
BRC kickoff meeting
Made a new arff for Aaron of the BRC doctor data I tagged. Should be enough for a starting junk filter.
Finished the Utopian paper. Need to get up to speed on NMF.

Phil 11.15.16

7:00 – 3:30 ASRC

Continuing Opinion Dynamics With Decaying Confidence: Application to Community Detection in Graphs. My notes are here.
Working on getting the crawl payload builder working. I got messed up with permutations. Tomorrow I need to add recursion to the QueryComponent (content list and child list), work through the combinations that way, and get rid of the lists.

Phil 11.14.16

7:00 – 5:00 ASRC

Pick up printer paper!
My intuition is that there is a form of information ‘flocking behavior’ with respect to information space. There wouldn’t be quite the same physics as birds or fish in motion, but there do seem to be rules.

Surprisingly, when I started to look at the literature, many of my hits came back from swarm robotics, for example Stable social foraging swarms in a noisy Environment. This is particularly interesting since information search behavior has long been equated with foraging behavior.
The Max Planck Department of Collective Behaviour: “If it’s collective, and a great system for asking questions, then it is of interest to us.”
So, reading up on flocking.
- Found Stable social foraging swarms in a noisy Environment
- On Krause’s multi-agent consensus model with state-dependent connectivity
- Opinion dynamics with decaying confidence: Application to community detection in graphs <- going to start with this one.
Add Wayne to my resume
Quick meetings with Shimei and Aaron

Phil 11.11.16

8:00 – 12:00 – UMBC

Finished the IUI reviews
Doing Shimei’s review
Setting up meeting with Christelle Viauroux
Too frazzled to do coding. Reading Last Place on Earth.

Phil 11.10.16

7:00 – 4:30 ASRC

Had some thoughts last night about how flocking at different scales in Hilbert space might work. Flocks built upon flocks. There is some equivalent of mass and velocity, where mass might be influence (positive and negative attraction). Velocity is related to how fast beliefs change.
Also thought about maps some more, weather maps in particular. A weather map maintains a coordinate frame, even though nothing in that frame is stable. Something like this, with a sense of history (playback of the last X years) could provide an interesting framework for visualization.
Continuing Novelty Learning via Collaborative Proximity Filtering review. Done! Need to submit both now.
Adding StrVec to the ARFF outputs – done
Starting this tutorial on Nonnegative Matrix Factorization
- These slides are also very nice
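As a note to self on what NMF does, here is a minimal pure-Python sketch that factors a nonnegative matrix V into W·H using the standard Lee and Seung multiplicative updates for the Frobenius-norm objective. It is not taken from the tutorial or the slides; the tiny matrix, the rank, the iteration count, and all function names are assumptions for illustration.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def frobenius_error(v, w, h):
    """Frobenius norm of V - W H."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0]))) ** 0.5

def nmf(v, rank, iterations=200, seed=0, eps=1e-9):
    """Factor V (rows x cols) into W (rows x rank) and H (rank x cols)."""
    rng = random.Random(seed)
    rows, cols = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(rows)]
    h = [[rng.random() + 0.1 for _ in range(cols)] for _ in range(rank)]
    for _ in range(iterations):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(cols)]
             for k in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(rows)]
    return w, h

# Toy term-document matrix with two obvious blocks ("topics").
V = [[2, 4, 0, 0],
     [1, 2, 0, 0],
     [0, 0, 3, 1],
     [0, 0, 6, 2]]

error_before = frobenius_error(V, *nmf(V, rank=2, iterations=0))
W, H = nmf(V, rank=2)
error_after = frobenius_error(V, W, H)
print(error_before, error_after)
```

Because the updates only multiply by nonnegative ratios, W and H stay nonnegative throughout, which is what makes the factors readable as additive parts or topics.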
Working on building JSON files for loading CI
Meeting about Healthdatapalooza

Phil 11.9.16

7:00 – 5:00 ASRC

President-elect Trump. Wow. Just wow.
Starting Novelty Learning via Collaborative Proximity Filtering review
Working with Aaron to get the java version of the classifier working
LibRec (http://www.librec.net) is a Java library for recommender systems (Java version 1.7 or higher required). It implements a suite of state-of-the-art recommendation algorithms. It consists of three major components: Generic Interfaces, Data Structures and Recommendation Algorithms. This should save a *lot* of work. Remember to thank and cite.
The forces that drove this election’s media failure are likely to get worse – Lots of stuff on echo chambers and social media

Phil 11.8.16

7:00 – 6:30 ASRC

Get mileage from Porsche!
Start conference spreadsheet – locations, deadlines, domain, etc
- CHIIR
- IUI
- Journalism Conference
- Simulation Conference
Meeting with Don at 4:30
- Meta Particles -> meta organism = flock
- Homotopy type theory
- Christelle Viauroux
Continuing review of Novelty Learning via Collaborative Proximity Filtering
- A Painless Q-Learning Tutorial
- The problem with the paper is that the authors do not consider that they are really identifying two different populations of users rather than the behavior changes in a single population. I think that if the paper addressed this possibility, it could contribute much more.
Visualizing Data using t-SNE
Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization
An Interactive Visual Testbed System for Dimension Reduction and Clustering of Large-scale High-dimensional Data
Orthogonality and orthography: introducing measured distance into semantic space
Utopian: User-driven topic modeling based on interactive nonnegative matrix factorization
From frequency to meaning: Vector space models of semantics
Taste Over Time: The Temporal Dynamics of User Preferences.
GA Tech – Jigsaw

Phil 11.7.16

6:30 – 3:00 ASRC

Notes from Aaron to discuss today:
- http://karpathy.github.io/2015/05/21/rnn-effectiveness/?branch_used=true Great article on RNN. Sample code available too.
- Slider based decisions for clustering topic models where we weight similarity contributions individually, including entities (who the document is about via NLP extraction), BOW comparison, TF-IDF LS comparison, etc. The clusters change based off the combined contribution of each vector of attractors.
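One way the slider idea could work, as a hedged sketch: each slider is a weight on one similarity measure, and the combined score between two documents is the weighted average. The feature names, the scores, and the function `combined_similarity` are made up for illustration and are not from any existing code.

```python
def combined_similarity(scores, weights):
    """Weighted average of per-feature similarity scores in [0, 1].
    `weights` holds the slider positions, one per feature."""
    total = sum(weights.values())
    if total == 0:
        raise ValueError("at least one slider must be above zero")
    return sum(weights[name] * scores[name] for name in weights) / total

# Hypothetical similarity scores for one pair of documents.
pair_scores = {"entities": 0.9, "bow": 0.4, "tfidf": 0.2}

# Sliders pushed toward entities: the pair looks very similar.
entity_heavy = combined_similarity(
    pair_scores, {"entities": 1.0, "bow": 0.0, "tfidf": 0.0})

# Sliders pushed toward text comparisons: the same pair looks dissimilar.
text_heavy = combined_similarity(
    pair_scores, {"entities": 0.0, "bow": 0.5, "tfidf": 0.5})

print(entity_heavy, text_heavy)
```

Feeding the combined score into any distance-based clustering step would make the clusters shift as the sliders move, which is the behavior described above.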
Starting review of Novelty Learning via Collaborative Proximity Filtering
- Gotta read this!!! Taste Over Time – the Temporal Dynamics of User Preferences
- and Mixtape – Direction-based Navigation in Large Media Collections
- MAchine Learning for LanguagE Toolkit (MALLET)
LingPipe is a tool kit for processing text using computational linguistics. LingPipe is used to do tasks like:
- Find the names of people, organizations or locations in news
- Automatically classify Twitter search results into categories
- Suggest correct spellings of queries
GATE is open source software capable of solving almost any text processing problem
Semantic Vectors creates semantic WordSpace models from free natural language text. Such models are designed to represent words and documents in terms of underlying concepts. They can be used for many semantic (concept-aware) matching tasks such as automatic thesaurus generation, knowledge representation, and concept matching.
LSA-based essay grading – could be good for document classification/spam detection

Phil 11.4.16

6:45 – 3:00 ASRC

Nervous enough about the election to move 1/3 of my retirement into long term treasuries.
Writing up review of Topic-Relevance Map – Visualization for Improving Search Result Comprehension for IUI 2017. Done!
Got similarity distance working on retrieved documents using a config file

Phil 11.3.16

7:00 – 3:00 ASRC

Continuing to review Topic-Relevance Map – Visualization for Improving Search Result Comprehension for IUI 2017. I think parts of this are effectively plagiarized. Need to talk to someone about this
How Google Shapes the News You See About the Candidates
The Linear Algebra Behind Search Engines – online tutorial from 2005, which means there’s a chance I can follow.
Matrices, Vector Spaces, and Information Retrieval. Good examples. Looks readable. More on this later.
Jaegul Choo
A human-machine collaborative system for identifying rumors on twitter

Phil 11.2.16

7:00 – 4:00 ASRC

Reviewing Topic-Relevance Map – Visualization for Improving Search Result Comprehension for IUI 2017
This looks important: Directing exploratory search: reinforcement learning from user interactions with keywords
More (work) proposal writing. Done with first draft.

Phil 11.1.16

7:00 – 5:00 ASRC

Playing around with using my dissertation to search from. Interesting and different results for non-equalized and equalized docs, and for single counts.
- baseline: information model behavior agent result pattern search system between
- equalized docs: information search behavior design system result source document provide
- single counts: information result provide system between search process example behavior
- equalized + single: information result provide between search system process approach design
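A toy sketch of why the four settings can rank terms differently. It assumes that "equalized docs" means each document's counts are scaled by its length so long documents don't dominate, and that "single counts" means a term counts at most once per document. Both readings, the function `top_terms`, and the sample documents are illustrative assumptions, not the tool's actual behavior.

```python
from collections import Counter

def top_terms(docs, equalize=False, single_counts=False, k=3):
    """Rank terms across documents under the assumed weighting options."""
    totals = Counter()
    for doc in docs:
        words = doc.lower().split()
        if single_counts:
            # Each distinct term counts once per document (order preserved).
            counts = Counter(dict.fromkeys(words, 1))
        else:
            counts = Counter(words)
        # Equalizing gives every document the same total weight of 1.
        scale = 1.0 / sum(counts.values()) if equalize else 1.0
        for word, n in counts.items():
            totals[word] += n * scale
    return [word for word, _ in totals.most_common(k)]

# Made-up documents: one long document repeats "search" many times.
docs = [
    "search search search search search search agent",
    "agent model",
    "agent system",
]

baseline = top_terms(docs, k=1)
equalized = top_terms(docs, equalize=True, k=1)
single = top_terms(docs, single_counts=True, k=1)
print(baseline, equalized, single)
```

In the baseline the long document's repeated term wins; under either adjustment the term that appears across all documents comes out on top.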
Finishing survey and sending out – done!
Tested the new CSE
Back to Vector space models of semantics
Worked on proposal with Aaron.

Phil 10.31.16

7:00 – 5:00 ASRC

Working on survey
Reading Vector space models of semantics. Fifty pages. Ow!
Useful neural net blog
Meeting with Wayne
- Add an open comments section “is there anything that you’d like to add”
- UI+AI to determine online trustworthiness
- Ran my proposal (with equalized docs – better!) through LMN. More words in the query mean more specificity, so I went up to 9.
  - The Scholar result is here, set for no patents.
  - Scholar, with ‘single counts’ is here.
  - The standard result, which is also interesting is here.
  - Standard, with single counts (better?) is here.
- Need to start on a spreadsheet of venues. (Get these off the laptop)
  - ICWSM 2017 – Abstracts Jan 6, full papers Jan 13
  - ICPSR is a site containing lots of data. Looking for qualitative corpora. Using ‘interview’ seems to bring up a lot, but you need to log in to use.

Phil 10.28.16

7:00 – 5:00 ASRC

Listening to BBC Business Daily this morning on trusting algorithms.
- This site: Computational Legal Studies. Seems very relevant for all kinds of reasons. Does Aaron Massey know about them?
  - Daniel Martin Katz: “Research Interests include legal futurism, legal informatics, law & entrepreneurship, quantitative modeling of litigation and jurisprudence, quantitative finance, computational legal studies, big data and the law, economics of the legal profession, positive legal theory, technology aided access to justice, legal complexity and the overall impact of information technology, analytics and automation on the future of the legal profession.”
  - Jon Zelner: “My research is focused on using spatial and social network analysis to prevent infectious diseases, with a focus on tuberculosis and diarrheal disease, and to understand social and epidemiological systems characterized by complex spatiotemporal dynamics.”
- Cathy O’Neil, author of Weapons of Math Destruction. MathBabe.
Interview with Judea Pearl on Bayesian computation: “We are losing the transparency now with deep learning. I am talking to users who say ‘it works well’ but they don’t know why. Once you unleash it, it has its own dynamics, it does its own repair and its own optimization, and it gives you the right results most of the time. But when it doesn’t, you don’t have a clue as to what went wrong and what should be fixed.”
Working on survey questions. Reading Internet, Mail, and Mixed Mode Surveys, chapter 6.
Need to add a section on ‘Transparent’ Cognitive Computing.
Took a look at Theresa’s slides. Need to call out some ML/AI word salad
Working on adding a key. Slow UI. Done, I think…
Add credit card to semrush
Meeting with Shimei. We started talking about recommender systems, but wound up talking about neural word/sentence/paragraph/topic embedding. Trained against something like Wikipedia and its outbound links, this might be a good way of building a knowledge coordinate frame in which velocity and position can be determined.
- This looks like a good introduction: From frequency to meaning: Vector space models of semantics
- And this looks good too. Check references!

