Category Archives: thesis

Phil 12.1.15

7:30 – 5:00

Learning: Identification Trees, Disorder
- Trees of tests
- Identification Tree (Not a decision tree!)
- Measuring Disorder – lowest disorder is best test
  - Disorder(set of binaries) = -(positive/total*log2(positive/total)) – (neg/total*log2(neg/total))
    - is the base log related to the base of the set?
    - Add up the disorder of each result in the test to determine the disorder of the test normalized by the number of samples. Lowest disorder is winner
Bringing in my machine learning, pattern recognition and stats books.
Bringing in my big laptop
Setting up dev environment.
- Using the new IDEA 15.x, which seems to be OK for the typescript, will check PHP tomorrow.
- Installed grunt (grunt-global, then grunt-local from the makefiles)
- installed typescript (npm i -g typescript)
- Installed gnuWin32 , which has makefile and touch support, along with all the important DLLs. It turns out that there is also gnuWin64. Will use that next time
- Fixed bugs that didn’t get caught before. Older compiler?
  - commented out the waa.d.ts from the three.d.ts definitelytyped file
  - deleted the { antialias: boolean; alpha: boolean; } args from the CanvasRenderer call in classes/WebGlCanvasClasses
  - added title?:string and assoc_name?:string to IPostObject in RssController
  - had to add the experiments/wglcharts2 folder to the xampp Apache htdocs
  - added word?:string to IPostObj in RssAppDirectives
  - added word_type_name?:string to IPostObj in RssAppDirectives
  - fixed the font calls in WebGl3dCharts IComponentConfig.
- Since these issues really shouldn’t have happened, I’m going to verify that they are not in my home dev environment before checking in.
And the new computer arrived, so I get to do some of the install tomorrow.

Phil 11.26.15

7:00 – Leave

Constraints: Visual Object Recognition
- to see if to signals match, a maximising function that integrates the area under the signal with respect to offsets (translation and rotation) is very good, even with noise.
Dictionary
- Add ‘Help Choose Doctor’, ‘Help Choose Investments’, ‘Help Choose Healthcare Plan’, ‘Navigate News’ and ‘Help Find CHI Paper’ dictionaries. At this point they can be empty. We’ll talk about them in the paper.
- Added ‘archive’ to dictionary, because we’ll need temporary dicts associated with users like networks.
- Deploy new system. Done!
  - Reloaded the DB
  - Copied over the server code
  - Ran the simpleTests() for AlchemyDictText. That adds network[5] with tests against the words that are in my manual resume dictionary. Then network[2] is added with no dictionary.
  - Commented out simpleTests for AlchemyDictText
  - copied over all the new client code
  - Ran the client and verified that all the networks and dictionaries were there as they were supposed to be.
  - Loaded network[2] ‘Using extracted dict’
  - Selected the empty dictionary[2] ‘Phil’s extracted resume dict’
  - Ran Extract from Network, which is faster on Dreamhost! That populated the dictionary.
  - Deleted the entry for ‘3’
  - Ran Attach to Network. Also fast 🙂
And now time for ThanksGiving. On a really good note!

Phil 11.25.15

7:00 – 1:00 Leave

Constraints: Search, Domain Reduction
- Order from most constrained to least.
- For a constrained problem, check over and under allocations to see where the gap between fast failure and fast completion lie.
- Only recurse through neighbors where domain (choices) have been reduced to 1.
Dictionary
- Add an optional ‘source_text’ field to the tn_dictionaries table so that user added words can be compared to the text. Done. There is the issue that the dictionary could be used against a different corpus, at which point this would be little more than a creation artifact
- Add a ‘source_count’ to the tn_dictionary_entries table that is shown in the directive. Defaults to zero? Done. Same issue as above, when compared to a new corpus, do we recompute the counts?
- Wire up Attach Dictionary to Network
  - Working on AlchemyDictReflect that will place keywords in the tn_items table and connect them in the tn_associations table.
  - Had to add a few helper methods in networkDbIo.php to handle the modifying of the network tables, since alchemyNLPbase doesn’t extend baseBdIo. Not the cleanest thing I’ve ever done, but not *horrible*.
  - Done and working! Need to deploy.

Phil 11.20.15

7:00 – Leave

Search: Optimal, Branch and Bound, A*
- Dead Horse principle.
- Admissible vs. consistency heuristics.
Dictionaries
- Wire up Add – Done. Also found a bug in the PHP code that was screwing up parent return. So that took a while…
- Wire up Delete – Done. That went much better!
- Wire up New Dictionary
- Wire up Extract
- Wire up Attach

Phil 11.19.15

7:00 – 5:00 Leave

Reasoning: Goal Trees and Rule-Based Expert Systems
- There, now I’m back in order.
- H. Simon – The complexity of the behavior is max(cplx(prgm), cplx(env))
- More Genesis – Elaboration graphs
- Genesis judges similarity in multiple ways: (this presentation, page 25)
  - Using word vectors
  - Using concept vectors: seeing similarities not evident in the words.
- Genesis aligns similar stories for analogical reasoning (Needleman-Wunch algorithm, which is a way of comparing string similarity using matrices)
IRB renewal – ask Wayne
In a fit of orderliness, created shortcuts to the cmd windows that the makefiles run in
Dictionaries
- Don’t forget to move the text-extraction calls to the methods that need it and see if that speeds up the others. Done. Much faster. PHP is dumb. Or needs a preloader/compiler
- Cleaned up the loading and display code a bit. Need to add a tree view (which looks like it can be done in pure CSS), but that can wait.
- Adding manual entry
  - Finished the form
  - Clear for entry and separate parent is done
  - addition
  - deletion
  - modification (adding parents in particular)
- Adding text extraction

Phil 11.18.15

7:00 – 5:00 Leave

Search: Depth-First, Hill Climbing, Beam
- Looks like I missed one. Will go back tomorrow.
- Patrick Winston is the instructor, and he showed a version of Genesis (around 42 minutes in), which maps stories into graphs and then searches through them at different levels of abstraction. It is capable of providing relevant answers to questions. Here’s a white paper. What I think is interesting here is the following:
  - How they describe their hierarchy (revenge, tit-for-tat, etc). This is built, not computed based on what looks to be a rules engine.
  - How text in a story can be looked at as a graph.
  - And that’s without reading the paper!
Working on dictionary directive
- Ramifying the dictionary choice through the controller. It should be set initially when the network loads (done), but should be changed if a new one is selected.
- There does need to be a second step where the DictPull is run and the text of the items is compared against the selected dictionary and KEYWORDS items are added to the network. This is when the network’s dictionary index should be changed.
- Downloading the dictionary.
- Need to upload a new word
12:00 workshop on Grant Writing in ITE 459
4:00 meeting with Dr. Pan.

Phil 11.17.15

7:00 – 4:00 leave

Reasoning: Goal Trees and Problem Solving (why is this pertinent to me?)
- Apply all safe transforms
- Apply heuristics
- AND nodes and OR nodes (AND/OR, problem reduction, or goal tree)
Realized that I have a redundant user_id in tn_dictionary_entries. This could be used to allow non-owners to add words to the dictionary. Which means that dictionaries could be shared. On the whole, I think that’s a good idea. Adding the change to the dictionary code.
Ok, back to text extraction (Is this a safe transform?)
Added a _buildExtractor() private method and imported all the extractor parts.
It does take a long time. Just to load? I’d like to try profiling…
Wow. Extracting, loading into the database, getting the JSON output and deleting the dictionary all work!
I think the next step is to either get some definitions or start building the directive. After my lunchtime ride, I decided that the directive is probably the best thing to do next. Adding user functionality is a good way of ensuring that the server functionality makes sense.
Added ‘get_user_dictionaries‘ to rssPull.php. We’ll start with that.
Got the skeleton of a directive up and retrieving the dictionary list.

Phil 11.16.15

7:00 – 6:00 Leave

Found a new programmer resource that looks good – I Programmer. They pointed me to an article about Babel, which compiles JavaScript to… other things. It might even be able to monkeypatch modern JS to run on old browsers. Need to test one of these years. It’s based on plugins which really means that it can map from one thing to anything else. My only issue is that it could break debugging unless there is a mapping file like typescript has.
Discovered another communication app – Telegram. ISIS used it to announce Paris?
Noon – Thad Starner in ITE 459. Very interesting. Met Aaron Massey, who might be good on the Committee.
I’ve been reading Tefko Saracevic‘s paper RELEVANCE: A Review of and a Framework for the thinking on the Notion in Information Science. It’s full of really nice stuff, from a time when you couldn’t just throw processing power at problems and brute force an answer. It’s clarified my thinking about the client word-based network:
- Search engines are pragmatic relevance engines (i.e topic-relatedness, quality, novelty, importance, credibility, etc.). The networks that they produce try to correlate knowledge at the ‘source’ – basically ‘in the world’
- We, as individuals are pertinence/situational relevance machines (Wilson’s concerns, preferences and stock of knowledge). Our internal knowledge graph represents our view of the world. We are the ‘destination’ for information.
  - “Situational Relevance is relevance to a particular individual’s situation – but to the situation as he sees it, not as others see it, nor as it really is.”
  - The ‘shape’ of our internal knowledge graph, the sources of information that we lean more heavily on, the weights that we give to certain words (or possibly concepts) may be able to determine whether we are dependably credible or dependably counter-credible.
- By enabling client-side weighting, we let users adjust components of a relevant search so that it becomes pertinent to us.
- The information that we produce in this process (dictionaries, weights, etc) can be stored so that a well-structured record of what is pertinent to individuals (and more importantly, groups of individuals) becomes part of the world knowledge. Correlations with respect to internal credibility may then in turn be able to infer the credibility (or lack of) of information in the world.
Getting back to dictionary integration.
- Re-upped my IntelliJ subscription for another two years
- Updated files and DB. All seems to work
- DbDictionary.removeDictionary returns a fail JSON message. Fixing. Fixed!
- Adding ability to update an entry – done.
- Finishing CreateDictionary. Finished and tested
- Adding DeleteDictionary. Finished and tested
- Adding ModifyDictionary. Finished and tested
- Adding term extraction. Started poking, but that’s it. More tomorrow.

viztales

Dimension reduction, State, Orientation, and Speed

Category Archives: thesis

Phil 12.1.15

Phil 11.26.15

Phil 11.25.15

Phil 11.20.15

Phil 11.19.15

Phil 11.18.15

Phil 11.17.15

Phil 11.16.15