Phil 1.3.17

First, Call about the House ethics bill

7:00 – 8:00 research

For charting, calculate distances and direction cosine from average center. Export that to excel. We’ll need a sampling interval.
Adding class FlockRecorder

8:30 – 4:00 BRC

Added method to nmf so that matrices with zeros don’t have to be sparse
Working on clustering items in a Labeled2DMatrix. Done as per this page
Had some issues getting the test to work. I may have been trying to be too general. With restructured tests from the GP code, everything seems to be working fine.
Next step is to put into the GUI and figure out some way to show it.

Phil 1.2.17

9:00 – 6:00 Research

Create WeightWidget to support loading from files
Added interactive setting of influence distance
Set color based on cluster Done
Move localBehavior to FlockingBeiliefCA. pass in array of other beliefs and maybe the influence diameter. Done
Make FlockingShape clusterable based on the Euclidian distance to other Shape’s beliefs, Done!
Chasing down NaN errors. Looks to be mostly length = zero and div/zero errors. Occasional. Grr.
For charting, calculate distances and direction cosine from average center. Export that to excel. We’ll need a sampling interval.
Start thinking about network connections
Notes for tomorrow:
- Make the items in the term/document table clusterable
- Make it so zero can indicate empty or a value

Phil 12.30.16

7:00 – 8:00, 8:30 – 5:00 Research

Cool thing. Maybe for Ravi?
Moving speed to FlockingBeiliefCA and adding slewRate – done. Added an update of the global belief to the agent manager behavior loop.
Calculate average heading
Calculate average center
Add decay based on distance?
- Sorted list of boids by distance
- Within a sphere of influence? the vector to the target is scaled by something like 1/distance. Done! looks great!
TODO:
- Move localBehavior to FlockingBeiliefCA. pass in array of other beliefs and maybe the influence diameter
- Make FlockingShape clusterable based on norms() to other Shape’s beliefs
- For charting, calculate distances and direction cosine from average center. Export that to excel. We’ll need a sampling interval.
- Start thinking about network connections

Phil 12.29.16

7:00 – 8:00, 8:30 -3:30 Research

But first, we update IntelliJ…
The goal today is to get boids to set up the methods that control behavior:
- Alignment: steer towards the average heading of local flockmates
- Cohesion: steer to move toward the average position of local flockmates
- Separation: steer to avoid crowding local flockmates
  - I’m not sure about this one. In the case of a belief system, there really isn’t a need for collision avoidance.
Finishing ParticleBelief.interpolateHeading().
Changed the vector so that it is always unit, and added speed variable for the particle belief

boids_12-29-16

Phil 12.28.16

7:00 – 8:00, 8:30 – 4:00 Research

Thom Lieb found this for me today: Reuters Tracer – A Large Scale System of Detecting Verifying Real-Time News Events from Twitter. Downloaded. A must read.
Need to renew my cert for philfeldman.com
Realized last night that I should be able to maintain angle and length for my rotation through ‘n’ dimensions between two vectors.
- Get the angle between the two vectors (dot product)
- Get the chord between the two normalized vectors
- Now I have a 2D problem where I project the angle I want to rotate on the chord line. That gives me a new vector that I just need to normalise.
- Solving simultaneous equations with matrices
- The only condition where I see this failing is a 180 degree rotation.
- Need to verify that this will work. It took forever for stupid math reasons but it does work.
- Reworking ParticleBelief.interpolateHeading()
No pretty screenshots for today. Instead, here’s a murmuration of starlings

Magic Cloud from Marco Campazas on Vimeo.

Phil 12.27.16

Phil 7:15 – 8:15, 8:45 -4:15 Research

Adding start/stop Done. Should probably add a slider to adjust time
Implementing boids as per Craig Reynolds page
- The flocking shape needs an orientation. Since this orientation can be any number of dimensions, where each statement is a dimension, I’m creating a ParticleBelief to contain ParticleStatements.
- Getting the heading and rebuilding shape as boid triangles
- Drawing triangles now. Using the Javafx examples here. Got angle working
- Adding individual flocks
- Working on an angle interpolation that will work for any number of dimensions. Going to start with interpolation and renormalization, since the steps should be incremental. I think it’s done. Will test tomorrow. Need to add the parts to the dprint?

boids_12-27-16

Phil 12.26.16

7:00 – 12:00 Research

Winterize lawn mower – done!
Look into hopkins computational and regular ecology
Downloaded the new version of AtlasTi. It’s on OneDrive
The goal is to start the development of the flocking app. I’m going to start with a new FlockingGui. Done, now switching all the underlying pieces over. I think done for the day is when the agents are moving based on their belief statement value of xpos and ypos.
Update Java – no need
Update IntelliJ – downloaded locally and installed. Odd problem with FILE_PATH not being set in Settings->Appearance & Behavior -> Path Variables. This was just for JavaUtils and seems fine in GroupPolarizationModel. Anyway, everything is compiling and running.
Update TortoiseSVN – no need
Update WinSCP -no need
Copy over java code – done
Ok, the first thing to do is have an agent that takes its position from an xMapping() and yMapping() function. To begin with, this will be taking an xpos and ypos statement that vary as a function of time.
Currently the movement is calculated in the shape, since for force clustering the beliefs/statements don’t know their ‘positions’. This approach is different in that the beliefs are a measure of position in information space. Working through the changes

Phil 12.23.16

7:00 – 8:00 Research

Assembling all the Sociophysics readings into one posting on phlog.

8:30 – 4:00 BRC

Reading in a spreadsheet to GPM -done. Ok results, not great clustering
Tried adjusting the threshold and adding antibelief, but other than helping the refresh rate, no joy. I think I need a different distance/similarity calculation
- Hops. Every agent is connected over the network by going through flag nodes. We could literally draw the node/agent network, and count the hops in an adjacency matrix
- N-dimensional cartesian. As I recall, this is close to what I have already. It’s closely related to n-dimensional flocking, so I’m going to get that running anyway so that I can measure distance between/within flocks
- Cosine similarity. I think that this is a good approach since it decomposes the dimensions in a useful way, particularly for sparse matrices.
- There is similarity there, but not enough distance to make anything emerge. Pretty pix though.

Phil 12.22.16

7:00 – 8:00 Research

Met with Wayne yesterday evening. We’re going to take a look at science team data text to see how it compares with the overall coding by humans. Verified that the data is all available
Interesting stuff on NPR this morning on Russia: The information space opens wide asymmetrical possibilities for reducing the fighting potential of the enemy. In North Africa, we witnessed the use of technologies for influencing state structures and the population with the help of information networks. It is necessary to perfect activities in the information space, including the defense of our own objects [objectives].
- Does Russia Have a Gerasimov Doctrine?
- The Value of Science is in the Foresight – NATO analysis
Continuing with Sociophysics
- Chapter 8: Endnote
  - Definition of consensus in an opinion model – the emergence of long-range order.
  - Looking for phase changes from heterogeneous to homogeneous or clustered states is important. Finding what parameters are causal and the values is considered a publishable result. Canonical types of transitions, such as the percolation threshold are discussed in the appendices.

BRC 8:30 – 4:30

Verify that the META_INF file in src isn’t screwing jar file creation. Deleted, with the same behavior. Sigh
Add fields for renaming columns. Will probably have to save the data out as XML to keep the relationship/mapping?
Find the code that strips off the common leading text (in GoogleCSE2?) Done
Started to work on clustering with Moby Dick and brought Aaron into the conversation to think about clustering issues – how to make like items gather together with other like items. NMF kind of does this by filling in latent values, but the question is where to cluster on
Finally read in the integrity data and it did not look good. I realize now that a matrix made up completely of zeros and ones will not be handled well by NMF since it will try to make all the cells one based on the models’s mechanism of treating zeroes as empty cells.
After talking to Aaron about it for a while, I think the better way to cluster will be based on the Group Polarization model. Need to be able to bring in that spreadsheet and then write out a report. Also, look at the high-dimension flocking.

Phil 12.21.16

7:00 – 8:00 Research

Continuing with Sociophysics
- Chapter 7: of flocks, flows and transports [page 189]
- Phase diagram of a Schelling segregation model (L Gauvin, J Vannimenus, JP Nadal – The European Physical Journal B, 2009). I’m beginning to think that the model could be a combination of a flocking and segregation model. That could be really interesting. I also seem to get nothing when I do a Scholar search on “flocking and segregation agent simulation”
  - Satisfaction criteria – when the number of unlike agents is less than a fixed proportion F. As F gets larger there is an abrupt transition to a segregated state.
  - Definition of segregation coefficient – the weighted average (normalized) of all cluster sizes averaged over all configurations. When only two clusters survive, n(c) = N/2
- Migration in a small world: A network approach to modeling immigration processes (B Fotouhi, MG Rabbat – Communication, Control, and Computing, 2012 – ieeexplore.ieee.org)
- Chapter 8: Endnote [page 202]
  - Frustration in Complexity (2008 – Philippe Binder)- The common thread between all complex systems may not be cooperation but rather the irresolvable coexistence of opposing tendencies.
- Meeting with Wayne at 5:00.

8:30 – 4:00 ASRC

Anomaly Detection in Temporal Graph Data: An Iterative Tensor Decomposition and Masking Approach. Might have a bunch of applications, including tensor rather than matrix factoring.
Need to get equalize docs to work. Should be straightforward – done
Make k adjustable that maxes out on the minimum of the row/column dimension – done
Added reset
Still can’t build an executable file that runs. Getting the “JNI error” message. This seems to be the StackOverflow to deal with it. Need to find the jar that is signed and not include it. http://stackoverflow.com/questions/34855649/invalid-signature-file-digest-for-manifest-main-attributes-exception-while-tryin
Screenshot for today:
The spreadsheet it generated: clustering_12_21_16-15_53_09

Phil 12.20.16

7:00 – 8:00 School

Looks like I need to keep track of my hours better for next year. Getting started now
Continuing with Sociophysics
- Chapter 7: of flocks, flows and transports [page 179]
  - Another component to include would be a Levy Flight (truncated?). That could account for cases where a leader makes a big jump and then the crowd follows with some ejection for those who can’t/won’t keep up.
  - Power law distribution of weight and max step size in the creation of the population
  - Thomas Schelling Segregation Model
    - Dynamic models of segregation Segregation = polarizing in info space?
    - A physical analogue of the Schelling model (2006 Dejan Vinković & Alan Kirman )

8:30 – 4:30 ASRC

Sorting TableColumn(s)- done. The trick is to change the name and the cellValueFactory:
```
tc.setText(colName);
tc.setCellValueFactory(new MapValueFactory<>(colName));
```
Sorting rows is more straightforward. Just got a list of the sorted row sums and reordered based on that.
Need to update the value in the ‘selected field’ textarea.
Verify that the cells are tracking
Add a tab so that it’s possible to switch between the original and the product matrices
Add an Edit/recalculate capability
- Tweak original matrix
- Adjust k
- Column clustering/renaming

Phil 12.19.16

7:15 – 4:15 ASRC

Continuing with Sociophysics
- Chapter 7: of flocks, flows and transports [page 179]
- Boids (Flocks, herds and schools: A distributed behavioral model – Craig Reynolds):
  - Try to avoid collisions with other boids (repulsion)
  - Attempt to match velocity with neighboring boids
  - attempt to stay close to nearby boids
- If the collision avoidance is taken out and the number of dimensions increased, then this could be the model. Rather than the flock converging around a position, look at the distances between the individuals using DBSCAN and cluster.
- Density and noise need to be independent variables and saved on runs. This would also be true in information space. You can have high organization in high density, low noise states. Thinking about that, this also implies one of the emergent properties of an information bubble is the low noise. Even though the environment may be very noisy, the bubble isn’t.
- As with the other social models, individuals can have weight. That way the flock can have leaders and followers. (See Misinformed leaders lose influence over pigeon flocks to inform the model)
- Also, I like the idea of a social network being built from belief proximity, which raises the cost for switching to another flock, even if they are nearby. It could be that once a social network forms that anti-belief repulsion starts to play a role.
BRC
- Updating intellij and Java.
- Intellij failed to patch. Odd. Tried again and it worked.
- Working on getting tables to update
  - Clear() – done
  - Load – done
  - Select row and modify – done
  - Working on columns and cells
  - Need to sort by row and column. Do this as part of the update() process

Phil 12.18.16

1:30 – 4:00 School. Rain, rain, rain.

Continuing with Sociophysics
- 6.5 Is it really a small world? Searching post Milgram
  - 6.5.8 Funneling properties.
    R G B A D E F
    
    Agent1 0.1 0.7
    
    Agent2 0.3 0.2 0.6
    
    Agent3 0.4 1.0 0.2
    
    Agent4 0.3 0.4 0.5
    R G B A D E F Color Notes
    
    Agent1 0.1 0.7 0.8
    
    Agent2 0.3 0.2 0.6 0.3 0.8
    
    Agent3 0.4 1.0 0.2 0.4 1.2
    
    Agent4 0.3 0.4 0.5 1.2
- Knowing a network by walking on it: emergence of scaling (Alexei Vázquez) Looks like an interesting guy with a wide range of publications.

	R	G	B	A	D	E	F	Color	Notes
Agent1	0.1	0.7						0.8
Agent2	0.3			0.2			0.6	0.3	0.8
Agent3			0.4		1.0	0.2		0.4	1.2
Agent4				0.3	0.4	0.5			1.2

Phil 12.16.16

Phil 7:00 – 4:00 ASRC

Continuing with Sociophysics
- Social Phenomena on complex networks
  - Opinion and community formation in coevolving networks (Gerardo Iñiguez González)
    - Abstract: In human societies opinion formation is mediated by social interactions, consequently taking place on a network of relationships and at the same time influencing the structure of the network and its evolution. To investigate this coevolution of opinions and social interaction structure we develop a dynamic agent-based network model, by taking into account short range interactions like discussions between individuals, long range interactions like a sense for overall mood modulated by the attitudes of individuals, and external field corresponding to outside influence. Moreover, individual biases can be naturally taken into account. In addition the model includes the opinion dependent link-rewiring scheme to describe network topology coevolution with a slower time scale than that of the opinion formation. With this model comprehensive numerical simulations and mean field calculations have been carried out and they show the importance of the separation between fast and slow time scales resulting in the network to organize as well-connected small communities of agents with the same opinion.
  - Citing paper: Effects of deception in social networks (Gerardo Iñiguez González)<— Important???
    - Abstract: Honesty plays a crucial role in any situation where organisms exchange information or resources. Dishonesty can thus be expected to have damaging effects on social coherence if agents cannot trust the information or goods they receive. However, a distinction is often drawn between prosocial lies (‘white’ lies) and antisocial lying (i.e. deception for personal gain), with the former being considered much less destructive than the latter. We use an agent-based model to show that antisocial lying causes social networks to become increasingly fragmented. Antisocial dishonesty thus places strong constraints on the size and cohesion of social communities, providing a major hurdle that organisms have to overcome (e.g. by evolving counter-deception strategies) in order to evolve large, socially cohesive communities. In contrast, white lies can prove to be beneficial in smoothing the flow of interactions and facilitating a larger, more integrated network. Our results demonstrate that these group-level effects can arise as emergent properties of interactions at the dyadic level. The balance between prosocial and antisocial lies may set constraints on the structure of social networks, and hence the shape of society as a whole.
- 6.5 Is it really a small world? Searching post Milgram
  - In the introduction to this section [page 168], the authors say a very interesting thing: “Although the network may have the small world property, searches are usually done locally: the individual may not know the global structure of the network that would help them find the shortest path to the target node“. I think that they are talking about social networks explicitly here, but the same concept applies to an information network. This is a network description of the information horizon problem. You can’t find what you can’t see, at least in a broad outline.
  - Also this: “Searching can regarded as a learning process; repeating the search several times can avoid infinite loops and lead to better solutions”
    - Learning in Real-Time Search: A Unifying Framework (Vadim Bulitko & Greg Lee)
- Sprint Planning Meeting
  - Clustering is my only task for the sprint
- Writing out factored matrix
- Put together test case term/doc spreadsheet. There are several test cases with and without randomly generated zeros. The goal is to determine the best way to cluster docs. Do we use the product mat? The factor mats? Only one way to find out.
- Working on sorting the L2mat by row sum or column sum

Phil 12.15.16

7:00 – 5:30 ASRC

This came across my Twitter feed this morning: Indivisible: A Practical Guide for Resisting the Trump Agenda. It’s written by Hill staffers (Apparently), and says something interesting about the Tea Party:
- They were locally focused. The Tea Party started as an organic movement built on small local groups of dedicated conservatives. Yes, they received some support/coordination from above, but fundamentally all the hubbub was caused by a relatively small number of conservatives working together. To summarize:
  - Groups started as disaffected conservatives talking to each other online. In response to the 2008 bank bailouts and Obama’s election, groups began forming to discuss their anger and what could be done. They eventually realized that the locally-based discussion groups themselves could be a powerful tool.
  - Groups were small, local, and dedicated. Local Tea Party groups could be fewer than 10 people, but they were highly localized and dedicated significant personal time and resources. Members communicated with each other regularly, tracked developments in Washington, and coordinated advocacy efforts together.
  - Groups were relatively small in number. The Tea Party was not hundreds of thousands of people spending every waking hour focused on advocacy. Rather, the efforts were somewhat modest. Only 1 in 5 self-identified Tea Partiers contributed money or attended events. On any given day in 2009 or 2010, only twenty local events–meetings, trainings, townhalls, etc–were scheduled nationwide. In short, a relatively small number of groups were having a big impact on the national debate.
- In reading this, I hear several things
  - Group polarization can start or be furthered by small groups. Indeed, it’s the embodiment of the Margaret Mead quote: Never doubt that a small group of thoughtful, committed citizens can change the world; indeed, it’s the only thing that ever has. This also jibes with Sunstein’s statement that small, polarized groups can act like incubators.
  - Though polarized, the members “tracked developments in Washington”, so they were informed. I’d really like to know their sources of information.
  - Scattered groups that are loosely coupled may have a bigger impact than a large single group.
Continuing with Sociophysics
- Social Phenomena on complex networks
- Dynamical Processes on Complex Networks. Got the Kindle edition so now I can search! Interesting section: 10.6 Coevolution of opinions and network
- Similar chapter in this book – Social Phenomena on coevolutionary networks [pg 166]. One of the interesting things here is the use of the iterated prisoner’s dilemma. On a network, the agents typically calculate and aggregate payoff and imitate the strategy of the neighbor with the best payoff. In the coevolutionary model, an agent can cut off the link to a defector with a probability. This seems a bit like polarization, where the group severs ties with entities with sufficiently divergent views (and individuals leave when the group becomes too extreme)
- Coevolution of agents and networks: Opinion spreading and community disconnection Abstract: We study a stochastic model for the coevolution of a process of opinion formation in a population of agents and the network which underlies their interaction. Interaction links can break when agents fail to reach an opinion agreement. The structure of the network and the distribution of opinions over the population evolve towards a state where the population is divided into disconnected communities whose agents share the same opinion. The statistical properties of this final state vary considerably as the model parameters are changed. Community sizes and their internal connectivity are the quantities used to characterize such variations.
  - Follow on 2006 paper: Opinion spreading and agent segregation on evolving networks
  - This also looks good: Consensus formation on adaptive networks From the abstract: The investigation of a variant of the model reveals that the scenarios of transitions between consensus and polarized states are more robust on adaptive networks.
adam g. dunn
clinical epidemiology and medical informatics. Looking for PhD’s working on misinformation
Sprint grooming
Got the data for the npi_raw_integrity_ci_tbl_eligibility_chiroacup_polyrx table. Turning into pivot tables.
Can now read a single matrix into NmFModelGui, factor, and build the product matrix.

viztales

Dimension reduction, State, Orientation, and Speed

Phil 1.3.17

Phil 1.2.17

Phil 12.30.16

Phil 12.29.16

Phil 12.28.16

Phil 12.27.16

Phil 12.26.16

Phil 12.23.16

Phil 12.22.16

Phil 12.21.16

Phil 12.20.16

Phil 12.19.16

Phil 12.18.16

Phil 12.16.16

Phil 12.15.16