7:00 – ASRC MKT
- Starting on the SASO slides. Found my diversity injection slide story:
- Max Hawkins
- (From NPR’s Invisibilia) “I just started thinking about these loops that we get into,” he says. “And about how the structure of your life … completely determines what happens in it.” Max’s once beautiful routine suddenly seemed unfulfilling. He felt like he was growing closer to people in his own bubble and becoming isolated from those outside of it. “There was something … that just made me feel trapped,” he says. “Like I was reading a story that I’d read before or I was playing out someone else’s script.” As any computer developer would do, Max turned to technology to craft his way out — a series of randomization applications.
- Reading Review: Totalitarianism: The Revised Standard Version
- …they have chosen to identify totalitarianism in terms of a set of six interrelated traits or characteristics-Fried- rich’s oft-referred-to “totalitarian syndrome” (9-io).25 The syndrome includes an official ideology (orientation), a single party typically led by one man (dimension reduction), a terroristic police (herding), a communications monopoly (social influence horizon), a weapons monopoly (??) and a centrally directed economy (dimension reduction)
- Continued to spin up on LSTM effort. Got my dev environment COMPLETELY up to date. Continued with Deep learning & Keras
3:00 – 5:00 Fika & meeting with Wayne
- Worked on the slides for PhD status. I realize that this is actually a good time to have demos with conclusions.
- Talked about options if IRAD falls through
- Need to think about what are the best ways for the work to have impact
“There was no colusion“…”
Anyone involved in that meddling to justice.“
Premises for Data Science Magical Realism
- What follows are some premises for data science magical realism stories based (very, very loosely) on experiences I’ve had or heard about — premises, that is, for stories about impossible, absurd, magical things happening to data scientists in ordinary data science situations. Enjoy!
- More from David Masad
Program Synthesis in 2017-18
- A high-level overview of the recent ideas and representative papers in program synthesis as of mid-2018.
- Alex (Oleksandr) Polozov, a researcher in the Deep Procedural Intelligence group at Microsoft Research AI, Redmond. I work on neural program synthesis from input-output examples and natural language, intersections of machine learning and software engineering, and neuro-symbolic architectures. I am particularly interested in combining neural and symbolic techniques to tackle the next generation of AI problems, including program synthesis, planning, and reasoning.
UMAP Uniform Manifold Approximation and Projection for Dimension Reduction | SciPy 2018 |(video) (paper)
- UMAP (Uniform Manifold Approximation and Projection) is a novel manifold learning technique for dimension reduction. UMAP is constructed from a theoretical framework based in Riemannian geometry and algebraic topology. The result is a practical scalable algorithm that applies to real world data. The UMAP algorithm is competitive with t-SNE for visualization quality, and arguably preserves more of the global structure with superior run time performance. Furthermore, UMAP as described has no computational restrictions on embedding dimension, making it viable as a general purpose dimension reduction technique for machine learning.
- This could be nice for building maps
7:00 – 5:00 ASRC MKT
- Progress on getting my keys back!
- Got everyone’s response on the Doodle, but only 4 of the 5 line up…
- Finish first pass through PhD review slides
- Start SASO slides and poster?
- Continue with exporting terms from the sim and importing them into python. One of the things that will matter is the tagging of the data with the seed terms from the sim as well as the cell name so that reconstructions can be compared for accuracy.
- Added the cell location to each <sampleData> so that there can be some kind of tagging/ground truth about the maps we’re inferring.
- Working on iterating through the etree hierarchy. I can now read in the file, parse it and get elements that I’m looking for.
- Tomorrow will be pulling the seed words out of the code in an ordered list. Generated sentences will need to be timestamped to that conversations can be reconstructed. That being said, it could be interesting to take seed words out of a generated sentence and add them to the embedding seed words. Something to think about.
I wrote up some thoughts about Trump’s press conference with Putin.
7:00 – 4:30 ASRC MKT
- Still can’t connect to the Service center (Betriebsdienst Zentrum) at Zurich U. Tried pinging the conference organizer, who appears to be based on the campus – done. And some progress!
- Travel report for SASO – done
- Hotel in Trento – wait till tomorrow.
- Ping Aaron M. about Doodle – Done
- Set up meeting with Don – done
- Start on slides – started
Vacation is over. Here are some pix
7:00 – 3:00 ASRC MKT
- No problem logging into timesheet or email from the US. Odd.
- Expense Report. Bring Receipts!
- Call Zurich about keys – called. No one there today, call tomorrow before 9:00 +41 44 634 03 09
- Get hotel in Trento
3:00 – 6:00 Fika, then meeting with Wayne
- Schedule a meeting with Don to discuss LSTM agent text, and composer/choreographer for Dance my PhD
- Put together a proposal for the mid-PhD that includes
- Current work
- LSTM next step
- The Wayne Problem
- Keep the committee as is (defend summer of 2019)
- Adjust committee (who becomes co-chair?)
- What to do about JuryRoom
- Make it post-PhD work
- Build an instantiation of the theory, but don’t do anything with it (unpublishable, but next steps would be)
- Build a low-fi version of the website for lab testing
- Build a 1,000 – 10,000 user version (MySQL, PHP, Angular)
- Build a 10,000 – 1,000,000 user version
- Build a fully scaled version
- Lazy, not biased: Susceptibility to partisan fake news is better explained by lack of reasoning than by motivated reasoning
- Gordon Pennycook
- David Rand
- Why do people believe blatantly inaccurate news headlines (“fake news”)? Do we use our reasoning abilities to convince ourselves that statements that align with our ideology are true, or does reasoning allow us to effectively differentiate fake from real regardless of political ideology? Here we test these competing accounts in two studies (total N = 3446 Mechanical Turk workers) by using the Cognitive Reflection Test (CRT) as a measure of the propensity to engage in analytical reasoning. We find that CRT performance is negatively correlated with the perceived accuracy of fake news, and positively correlated with the ability to discern fake news from real news – even for headlines that align with individuals’ political ideology. Moreover, overall discernment was actually better for ideologically aligned headlines than for misaligned headlines. Finally, a headline-level analysis finds that CRT is negatively correlated with perceived accuracy of relatively implausible (primarily fake) headlines, and positively correlated with perceived accuracy of relatively plausible (primarily real) headlines. In contrast, the correlation between CRT and perceived accuracy is unrelated to how closely the headline aligns with the participant’s ideology. Thus, we conclude that analytic thinking is used to assess the plausibility of headlines, regardless of whether the stories are consistent or inconsistent with one’s political ideology. Our findings therefore suggest that susceptibility to fake news is driven more by lazy thinking than it is by partisan bias per se – a finding that opens potential avenues for fighting fake news.
From Alessandro Bozzon (Scholar):
- I am Assistant Professor with the Web Information Systemsgroup, at the Delft University of Technology. I am Research Fellow at the AMS Amsterdam Institute for Advanced Metropolitan Solutions, and a Faculty Fellow with the IBM Benelux Center of Advanced Studies.
My research lies at the intersection of crowdsourcing, user modeling, and web information retrieval. I study and build novel Social Data science methods and tools that combine the cognitive and reasoning abilities of individuals and crowds, with the computational powers of machines, and the value of big amounts of heterogeneous data.
I am currently active in three investigation lines related to Social Data Science: Intelligent Cities (SocialGlass; Crowdsourced Knowledge Creation in Online Social Communities (SEALINCMedia COMMIT/, StackOverflow); and Enterprise Crowdsourcing (with IBM Benelux CAS).
- Modeling CrowdSourcing Scenarios in Socially-Enabled Human Computation Applications
- User models have been defined since the 1980s, mainly for the purpose of building context-based, user-adaptive applications. However, the advent of social networked media, serious games, and crowdsourcing/human computation platforms calls for a more pervasive notion of user model, capable of representing the multiple facets of social users and performers, including their social ties, interests, capabilities, activity history, and topical affinities. In this paper, we define a comprehensive model able to cater for all the aspects relevant for applications involving social networks and human computation; we capitalize on existing social user models and content description models, enhancing them with novel models for human computation and gaming activities representation. Finally, we report on our experiences in adopting the proposed model in the design and implementation of three socially enabled human computation platforms.
- Sparrows and Owls: Characterisation of Expert Behaviour in StackOverflow
- Question Answering platforms are becoming an important repository of crowd-generated knowledge. In these systems a relatively small subset of users is responsible for the majority of the contributions, and ultimately, for the success of the Q/A system itself. However, due to built-in incentivization mechanisms, standard expert identification methods often misclassify very active users for knowledgable ones, and misjudge activeness for expertise. This paper contributes a novel metric for expert identification, which provides a better characterisation of users’ expertise by focusing on the quality of their contributions. We identify two classes of relevant users, namely sparrows and owls, and we describe several behavioural properties in the context of the StackOverflow Q/A system. Our results contribute new insights to the study of expert behaviour in Q/A platforms, that are relevant to a variety of contexts and applications.